Monday, May 4, 2026

Choose the right storage tier for your needs in Amazon OpenSearch Service


Amazon OpenSearch Service (successor to Amazon Elasticsearch Service) enables organizations to perform interactive log analytics, real-time application monitoring, website search, and more. OpenSearch is an open-source, distributed search and analytics suite derived from Elasticsearch. Amazon OpenSearch Service offers the latest versions of OpenSearch, support for 19 versions of Elasticsearch (versions 1.5 to 7.10), and visualization capabilities powered by OpenSearch Dashboards and Kibana (versions 1.5 to 7.10).

In this post, we present the three storage tiers of Amazon OpenSearch Service (hot, UltraWarm, and cold storage) and discuss how to choose the right storage tier for your needs. This post can help you understand how these storage tiers work together and what the trade-offs are for each one. To choose a storage tier for your use case, you need to consider the performance, latency, and cost of each tier.

Amazon OpenSearch Service storage tiers overview

Amazon OpenSearch Service has three storage tiers: hot, UltraWarm, and cold. The following diagram illustrates these three storage tiers.

Hot storage

Hot storage for Amazon OpenSearch Service is used for indexing and updating, while providing fast access to data. Standard data nodes use hot storage, which takes the form of instance store or Amazon Elastic Block Store (Amazon EBS) volumes attached to each node. Hot storage provides the fastest possible performance for indexing and searching new data.

You get the lowest latency for reading data in the hot tier, so you should use the hot tier to store frequently accessed data driving real-time analysis and dashboards. As your data ages, you access it less frequently and can tolerate higher latency, so keeping data in the hot tier is no longer cost-efficient.

If you want low latency and fast access to the data, hot storage is a good choice for you.

UltraWarm storage

UltraWarm nodes use Amazon Simple Storage Service (Amazon S3) with related caching solutions to improve performance. UltraWarm offers significantly lower costs per GiB for read-only data that you query less frequently and that doesn’t need the same performance as hot storage. Although you can’t modify the data while it is in UltraWarm, you can move the data to the hot storage tier for edits before moving it back.
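The round trip between tiers can be sketched as a pair of REST calls. The following is a minimal sketch that only builds the HTTP method and path. The `_ultrawarm/migration/...` paths reflect our understanding of the Amazon OpenSearch Service documentation and should be verified against the current guide before use. The index name `my-index` and the helper names are hypothetical, and request signing and sending are deliberately left out.

```python
from typing import Tuple

# Tiers an index can be migrated to with these calls (assumption: paths as
# documented for Amazon OpenSearch Service UltraWarm; verify before use).
VALID_TARGETS = ("warm", "hot")


def migration_request(index: str, target: str) -> Tuple[str, str]:
    """Return (HTTP method, path) to migrate an index to the given tier."""
    if target not in VALID_TARGETS:
        raise ValueError(f"target must be one of {VALID_TARGETS}, got {target!r}")
    return "POST", f"_ultrawarm/migration/{index}/_{target}"


def migration_status_request(index: str) -> Tuple[str, str]:
    """Return (HTTP method, path) to check the progress of a migration."""
    return "GET", f"_ultrawarm/migration/{index}/_status"


# Hypothetical round trip: bring the index back to hot for edits, check the
# migration status, then return it to UltraWarm.
for method, path in (
    migration_request("my-index", "hot"),
    migration_status_request("my-index"),
    migration_request("my-index", "warm"),
):
    print(method, path)
```

Keeping path construction separate from the HTTP client makes it easy to plug in whatever signed-request library your domain requires.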

When calculating UltraWarm storage requirements, you consider only the size of the primary shards. When you query for the list of shards in UltraWarm, you still see the primary and replicas listed. Both shards are stubs for the same, single copy of the data, which is in Amazon S3. The durability of data in Amazon S3 removes the need for replicas, and Amazon S3 abstracts away any operating system or service considerations. In the hot tier, accounting for one replica, 20 GB of index uses 40 GB of storage. In the UltraWarm tier, it’s billed at 20 GB.
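The 20 GB comparison can be written out as a small calculation. This is an illustrative sketch only; the function names are ours and are not part of any AWS tooling.

```python
def hot_storage_gb(primary_gb: float, replicas: int) -> float:
    """Hot tier: every copy (primary plus each replica) occupies storage."""
    return primary_gb * (replicas + 1)


def ultrawarm_billed_gb(primary_gb: float) -> float:
    """UltraWarm: only the single copy held in Amazon S3 is billed."""
    return primary_gb


index_gb = 20
print("hot, one replica:", hot_storage_gb(index_gb, replicas=1), "GB")
print("UltraWarm:", ultrawarm_billed_gb(index_gb), "GB")
```

With one replica the hot footprint is twice the index size, while the UltraWarm figure stays at the primary size regardless of how many replicas the index had.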

The UltraWarm tier acts like a caching layer on top of the data in Amazon S3. UltraWarm moves data from Amazon S3 onto the UltraWarm nodes on demand, which speeds up access for subsequent queries on that data. For that reason, UltraWarm works best for use cases that access the same, small slice of data multiple times. You can add or remove UltraWarm nodes to increase or decrease the amount of cache against your data in Amazon S3 to optimize your cost per GB. To dial in your cost, be sure to test using a representative dataset. To monitor performance, use the WarmCPUUtilization and WarmJVMMemoryPressure metrics. See UltraWarm metrics for a complete list of metrics.

The combined CPU cores and RAM allocated to UltraWarm nodes affect performance for simultaneous searches across shards. We recommend deploying enough UltraWarm instances so that you store no more than 400 shards per ultrawarm1.medium.search node and 1,000 shards per ultrawarm1.large.search node (including both primaries and replicas). We recommend a maximum shard size of 50 GB for both hot and warm tiers. When you query UltraWarm, each shard uses a CPU and moves data from Amazon S3 to local storage. Running single or concurrent queries that access many indexes can overwhelm the CPU and local disk resources. This can cause longer latencies through inefficient use of local storage, and even cause cluster failures.
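As a rough check against these shard limits, the sketch below computes the smallest node count that keeps the per-node shard count within the recommendation. It covers the shard-count guideline only, not storage or CPU sizing, and the example shard count of 2,500 is hypothetical.

```python
import math

# Recommended maximum shards per node (primaries and replicas), from the text.
MAX_SHARDS_PER_NODE = {
    "ultrawarm1.medium.search": 400,
    "ultrawarm1.large.search": 1000,
}


def min_nodes_for_shards(total_shards: int, instance_type: str) -> int:
    """Smallest node count that stays within the per-node shard guideline."""
    limit = MAX_SHARDS_PER_NODE[instance_type]
    return math.ceil(total_shards / limit)


# Hypothetical warm tier holding 2,500 shards in total.
for instance_type in MAX_SHARDS_PER_NODE:
    print(instance_type, min_nodes_for_shards(2500, instance_type))
```

The result is a lower bound from shard count alone; the storage each node can address may call for more nodes.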

UltraWarm storage requires OpenSearch 1.0 or later, or Elasticsearch version 6.8 or later.

If you have large amounts of read-only data and want to balance cost and performance, use UltraWarm for your infrequently accessed, older data.

Cold storage

Cold storage is optimized to store infrequently accessed or historical data at $0.024 per GB per month. When you use cold storage, you detach your indexes from the UltraWarm tier, making them inaccessible. You can reattach these indexes in a few seconds when you need to query that data. Cold storage is a great fit for scenarios in which a low ROI necessitates an archive or delete action on historical data, or if you need to conduct research or perform forensic analysis on older data with Amazon OpenSearch Service.

Cold storage doesn’t have specific instance types because it doesn’t have any compute capacity attached to it. You can store any amount of data in cold storage.

Cold storage requires OpenSearch 1.0 or later, or Elasticsearch version 7.9 or later, and UltraWarm.

Manage storage tiers in OpenSearch Dashboards

OpenSearch Dashboards installed on your Amazon OpenSearch Service domain provides a useful UI for managing indexes in different storage tiers on your domain. From the OpenSearch Dashboards main menu, you can view all indexes in hot, UltraWarm, and cold storage. You can also see the indexes managed by Index State Management (ISM) policies. OpenSearch Dashboards enables you to migrate indexes between UltraWarm and cold storage, and monitor index migration status, without using the AWS Command Line Interface (AWS CLI) or configuration API. For more information on OpenSearch Dashboards, see Using OpenSearch Dashboards with Amazon OpenSearch Service.

Cost considerations

The hot tier requires you to pay for what is provisioned, which includes the hourly rate for the instance type. Storage is either Amazon EBS or a local SSD instance store. For Amazon EBS-only instance types, additional EBS volume pricing applies. You pay for the amount of storage you deploy.

Like the standard data node types, UltraWarm nodes are charged at an hourly rate per node, but for storage you only pay for the data actually stored in Amazon S3. For example, although the instance type ultrawarm1.large.elasticsearch provides up to 20 TiB of addressable storage on Amazon S3, if you only store 2 TiB of data, you’re only billed for 2 TiB. For more information, see Pricing for Amazon OpenSearch Service.

Cold storage doesn’t incur compute costs, and like UltraWarm, you’re only billed for the amount of data stored in Amazon S3. There are no additional transfer charges when moving data between cold and UltraWarm storage.

Example use case

Let’s look at an example with 1 TB of source data per day, retained for 7 days hot, 83 days warm, and 365 days cold. For more information on sizing the cluster, see Sizing Amazon OpenSearch Service domains.

For hot storage, you can make a baseline estimate with the following calculation: storage needed = (daily source data in bytes * 1.25) * (number_of_replicas + 1) * number of days of retention. Following the best practice of two replicas, we use two replicas here. The minimum storage requirement to retain 7 TB of data on the hot tier is (7 TB * 1.25) * (2 + 1) = 26.25 TB. For this amount of storage, we need 6x R6g.4xlarge.search instances given the Amazon EBS size limit.
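The baseline formula translates directly into a short function. This is a sketch of the arithmetic only, working in TB rather than bytes for readability. It does not model the per-instance Amazon EBS size limit, so it does not derive the instance count.

```python
def hot_storage_tb(
    daily_source_tb: float,
    replicas: int,
    retention_days: int,
    overhead: float = 1.25,  # indexing overhead factor from the formula above
) -> float:
    """Baseline hot-tier storage: source * overhead * copies * retention."""
    return daily_source_tb * overhead * (replicas + 1) * retention_days


# Example from the text: 1 TB per day, two replicas, 7 days of retention.
print(hot_storage_tb(daily_source_tb=1, replicas=2, retention_days=7), "TB")
```

Changing `replicas` or `retention_days` shows how quickly the hot tier grows, which is the main reason to age data out to UltraWarm.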

We also need to verify the CPU side. We need 25 primary shards: (1 TB * 1.25) / 50 GB = 25. With two replicas, that gives a total of 75 active shards. At 1.5 vCPU per active shard, the total vCPU needed is 75 * 1.5 = 112.5 vCPU. This means 8x R6g.4xlarge.search instances (16 vCPU each). This also requires three dedicated c6g.xlarge.search master nodes.
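The CPU check can be reproduced step by step. The sketch below follows the text's arithmetic, which treats 1 TB as 1,000 GB for the shard count. The 16 vCPU per R6g.4xlarge instance is our assumption based on the published instance specification, and the function names are illustrative.

```python
import math

VCPU_PER_ACTIVE_SHARD = 1.5  # ratio used in the text
VCPU_PER_INSTANCE = 16       # assumption: vCPUs on an R6g.4xlarge instance


def primary_shards(daily_source_gb: float, overhead: float = 1.25,
                   max_shard_gb: float = 50) -> int:
    """Primary shards needed so that no shard exceeds max_shard_gb."""
    return math.ceil(daily_source_gb * overhead / max_shard_gb)


def instances_for_cpu(primaries: int, replicas: int) -> int:
    """Instances needed to supply the vCPUs for all active shards."""
    active_shards = primaries * (replicas + 1)
    vcpus_needed = active_shards * VCPU_PER_ACTIVE_SHARD
    return math.ceil(vcpus_needed / VCPU_PER_INSTANCE)


primaries = primary_shards(1000)  # 1 TB per day, taken as 1,000 GB
print("primary shards:", primaries)
print("instances:", instances_for_cpu(primaries, replicas=2))
```

Because the CPU-driven count is higher than the storage-driven count, the larger of the two determines the cluster size.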

When calculating UltraWarm storage requirements, you consider only the size of the primary shards, because that’s the amount of data stored in Amazon S3. For this example, the total primary shard size for warm storage is 83 * 1.25 = 103.75 TB. Each ultrawarm1.large.search instance has 16 CPU cores and can address up to 20 TiB of storage on Amazon S3. A minimum of six ultrawarm1.large.search nodes is recommended. You’re charged for the actual storage, which is 103.75 TB.
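The warm-tier sizing follows the same pattern. This sketch mirrors the text in treating the 20 TiB addressable limit and the TB figures as the same unit, which is a simplification; the function names are illustrative.

```python
import math

ADDRESSABLE_TB_PER_LARGE_NODE = 20  # per ultrawarm1.large.search, from the text


def warm_primary_tb(daily_source_tb: float, warm_days: int,
                    overhead: float = 1.25) -> float:
    """Primary-shard data held in UltraWarm; replicas are not counted."""
    return daily_source_tb * overhead * warm_days


def min_warm_nodes(primary_tb: float) -> int:
    """Smallest number of large UltraWarm nodes that can address the data."""
    return math.ceil(primary_tb / ADDRESSABLE_TB_PER_LARGE_NODE)


warm_tb = warm_primary_tb(daily_source_tb=1, warm_days=83)
print("warm primary data:", warm_tb, "TB")
print("ultrawarm1.large.search nodes:", min_warm_nodes(warm_tb))
```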

For cold storage, you only pay for the cost of storing 365 * 1.25 = 456.25 TB on Amazon S3. The following table contains a breakdown of the monthly costs (USD) you’re likely to incur. This assumes a 1-year Reserved Instance for the cluster instances with no upfront payment in the US East (N. Virginia) Region.

Cost type | Pricing | Usage | Cost per month
--- | --- | --- | ---
Instance usage | R6g.4xlarge.search = $0.924 per hour | 8 instances * 730 hours in a month = 5,840 hours | 5,840 hours * $0.924 = $5,396.16
Instance usage | c6g.xlarge.search = $0.156 per hour | 3 instances (master nodes) * 730 hours in a month = 2,190 hours | 2,190 hours * $0.156 = $341.64
Instance usage | ultrawarm1.large.search = $2.68 per hour | 6 instances * 730 hours = 4,380 hours | 4,380 hours * $2.68 = $11,738.40
Storage cost | Hot storage cost (Amazon EBS): EBS General Purpose SSD (gp3) = $0.08 per GB per month | 7 days hot = 26.25 TB | 26,880 GB * $0.08 = $2,150.40
Storage cost | UltraWarm managed storage cost = $0.024 per GB per month | 83 days warm = 103.75 TB per month | 106,240 GB * $0.024 = $2,549.76
Storage cost | Cold storage cost on Amazon S3 = $0.022 per GB per month | 365 days cold = 456.25 TB per month | 467,200 GB * $0.022 = $10,278.40

The total monthly cost is $32,454.76. The hot tier costs $7,888.20, UltraWarm costs $14,288.16, and cold storage costs $10,278.40. UltraWarm allows 83 days of additional retention for slightly more cost than the hot tier, which only provides 7 days. For nearly the same cost as the hot tier, the cold tier stores the primary shards for up to 1 year.
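The per-tier totals can be recomputed from the rates in the table. The sketch below uses the table's own conventions (730 hours per month, 1 TB = 1,024 GB) and the rates quoted there. Prices change over time, so treat the rates as example inputs rather than current pricing.

```python
HOURS_PER_MONTH = 730
GB_PER_TB = 1024


def instance_cost(count: int, hourly_rate: float) -> float:
    """Monthly cost of `count` instances at the given hourly rate."""
    return count * HOURS_PER_MONTH * hourly_rate


def storage_cost(size_tb: float, rate_per_gb_month: float) -> float:
    """Monthly cost of storing `size_tb` at the given per-GB rate."""
    return size_tb * GB_PER_TB * rate_per_gb_month


# Hot tier: data nodes, dedicated master nodes, and EBS storage.
hot = instance_cost(8, 0.924) + instance_cost(3, 0.156) + storage_cost(26.25, 0.08)
# UltraWarm tier: warm nodes plus managed storage for primary shards only.
warm = instance_cost(6, 2.68) + storage_cost(103.75, 0.024)
# Cold tier: storage only, no compute.
cold = storage_cost(456.25, 0.022)
total = hot + warm + cold

for name, value in (("hot", hot), ("UltraWarm", warm), ("cold", cold), ("total", total)):
    print(f"{name}: ${value:,.2f}")
```

Swapping in your own retention periods and current regional prices gives a quick first estimate before using the AWS pricing tools.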

Conclusion

Amazon OpenSearch Service supports three integrated storage tiers: hot, UltraWarm, and cold storage. Based on your data retention, query latency, and budgeting requirements, you can choose the best strategy to balance cost and performance. You can also migrate data between different storage tiers. To start using these storage tiers, sign in to the AWS Management Console, or use the AWS SDK or AWS CLI, and enable the corresponding storage tier.


About the Authors

Changbin Gong is a Senior Solutions Architect at Amazon Web Services (AWS). He engages with customers to create innovative solutions that address customer business problems and accelerate the adoption of AWS services. In his spare time, Changbin enjoys reading, running, and traveling.

Rich Giuli is a Principal Solutions Architect at Amazon Web Services (AWS). He works within a specialized group helping ISVs accelerate adoption of cloud services. Outside of work, Rich enjoys running and playing guitar.
