Elasticsearch Serverless
Pay only for what you use, with no infrastructure hassle. Discover the art of the possible with AI search, RAG-ready tools, and data analytics capabilities.
Pricing details

Item | Unit | Price |
---|---|---|
Ingest* | Per VCU per hour | As low as $0.14 |
Search* | Per VCU per hour | As low as $0.09 |
Machine Learning | Per VCU per hour | As low as $0.07 |
Storage & Retention | Per GB retained per month | As low as $0.047 |
Egress† | Per GB transferred per month | As low as $0.05 |
Elastic Managed Large Language Model (LLM) for AI Playground and AI Assistant | Per million input tokens | $4.50 |
Elastic Managed Large Language Model (LLM) for AI Playground and AI Assistant | Per million output tokens | $21 |
Elastic Inference Service: ELSER on GPU | Per million output tokens | As low as $0.08 |

†Vector profiles receive 50 GB free.
*These prices take effect December 1, 2024.
Ingest and retention metering is based on the uncompressed, normalized, fully enriched data volume that you ingest into your serverless project. Metered volumes will be much higher than the "raw" or compressed data size "on the wire."
Support package
Limited support is included with a Standard subscription; all other support pricing is based on a percentage of your consumption. For more information on what's included in each support level, please go to elastic.co/support. (A short sketch of how the percentage applies follows the table.)
Elastic Cloud organization subscription level* | Standard | Gold | Platinum | Enterprise |
---|---|---|---|---|
Support level | Limited | Base | Enhanced | Premium |
% of charge | Included | 5% | 10% | 15% |
*Subscription level is selected during sign up
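As an illustration of how the support percentage applies to your consumption, here is a minimal sketch using the tier percentages from the table above; the consumption figure is a made-up example:

```python
# Illustrative only: the support charge is a percentage of your consumption,
# using the tier percentages from the table above.
SUPPORT_RATES = {"Standard": 0.00, "Gold": 0.05, "Platinum": 0.10, "Enterprise": 0.15}

def support_charge(monthly_consumption_usd: float, subscription: str) -> float:
    """Support charge added on top of the consumption bill for the month."""
    return monthly_consumption_usd * SUPPORT_RATES[subscription]

# Example: $1,000 of monthly consumption on a Platinum subscription (hypothetical figure).
print(support_charge(1000.0, "Platinum"))  # 100.0
```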
Elasticsearch Serverless pricing components
Elasticsearch Serverless charges separately for compute (VCUs, each with 1 GB of RAM) and storage (GB), offering scalable, performance-driven pricing to meet your latency and throughput goals.
Virtual Compute Unit (VCU)
There are three specialized VCU types, each dedicated to specific tasks (a rough cost sketch follows this list):
- Ingest VCUs: Handle data indexing into the Search AI Lake.
- Search VCUs: Handle user-driven searches, alerting rules, aggregations, transforms, and geospatial queries against data in the Search AI Lake.
- Machine Learning VCUs: Manage inference, ELSER workloads, and machine learning jobs.
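To make the VCU arithmetic concrete, here is a minimal sketch, assuming the "as low as" hourly rates listed above and hypothetical VCU-hour totals; actual consumption depends on autoscaling and your workload:

```python
# Illustrative only: compute cost = VCU-hours consumed x hourly rate per VCU type.
# Rates are the "as low as" list prices above (USD per VCU per hour).
VCU_RATES = {"ingest": 0.14, "search": 0.09, "ml": 0.07}

def compute_cost(vcu_hours: dict[str, float]) -> float:
    """Sum the compute charge across VCU types for a billing period."""
    return sum(VCU_RATES[kind] * hours for kind, hours in vcu_hours.items())

# Hypothetical month: 50 ingest VCU-hours, 400 search VCU-hours, 20 ML VCU-hours.
print(round(compute_cost({"ingest": 50, "search": 400, "ml": 20}), 2))  # 44.4
```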
Token usage
- Elastic Managed Large Language Model usage, charged per million input and output tokens: Leverage AI-powered search as a service without deploying a large language model (LLM) in your project (a token-cost sketch follows this list).
- ELSER usage, charged per million tokens: Leverage ELSER on GPU for semantic search use cases.
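To make the token pricing concrete, here is a minimal sketch of the arithmetic for the Elastic Managed LLM, using the per-million-token rates listed above; the token counts are made up:

```python
# Illustrative only: Elastic Managed LLM usage is billed per million tokens.
INPUT_RATE = 4.50   # USD per million input tokens
OUTPUT_RATE = 21.0  # USD per million output tokens

def llm_token_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost of LLM usage given total input and output token counts."""
    return (input_tokens / 1_000_000) * INPUT_RATE + (output_tokens / 1_000_000) * OUTPUT_RATE

# Hypothetical workload: 2M input tokens and 0.5M output tokens.
print(llm_token_cost(2_000_000, 500_000))  # 19.5
```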
Adaptive resource provisioning
Ingest and ML compute resources automatically scale to meet workload demands.
Search compute resources dynamically adjust to workloads, ensuring consistent performance and responsiveness. With flexible Search Power settings, you have control over resource allocations to meet your performance needs.
Storage & retention
Elasticsearch Serverless uses object stores for persistent storage in the Search AI Lake.
All data, regardless of type, recency, and frequency of use, is accessible from the Search AI Lake. The size of the Search AI Lake can be controlled with manual or managed data retention policies.
Storage is measured in GB.
Configurations
Two infrastructure configurations are available for Elasticsearch Serverless: general purpose and vector (API only).
The general purpose option is used by default for all new projects and is appropriate for most use cases.
The vector option allocates more VCUs to your project for higher performance, but it also incurs additional costs due to the higher VCU allocation. This option is only recommended for projects using high-dimensionality dense_vector field mappings that are not quantized with int4 or int8.
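For context on what quantized versus uncompressed vectors look like in practice, here is a minimal sketch using the Python Elasticsearch client; the endpoint, API key, index names, and dimensions are placeholders, and the index_options values shown are standard dense_vector settings rather than anything specific to serverless profiles:

```python
# Illustrative only: two dense_vector mappings, one with int8 quantization
# (a fit for the general purpose configuration) and one storing uncompressed
# floats (the case the vector configuration targets). All names are placeholders.
from elasticsearch import Elasticsearch

client = Elasticsearch("https://<project-endpoint>", api_key="<api-key>")

# int8_hnsw quantizes vectors at index time, reducing the RAM needed to search them.
client.indices.create(
    index="quantized-vectors",
    mappings={
        "properties": {
            "embedding": {
                "type": "dense_vector",
                "dims": 768,
                "index": True,
                "similarity": "cosine",
                "index_options": {"type": "int8_hnsw"},
            }
        }
    },
)

# Plain hnsw keeps full-precision floats, which is where the larger RAM
# allocation of the vector configuration pays off.
client.indices.create(
    index="uncompressed-vectors",
    mappings={
        "properties": {
            "embedding": {
                "type": "dense_vector",
                "dims": 768,
                "index": True,
                "similarity": "cosine",
                "index_options": {"type": "hnsw"},
            }
        }
    },
)
```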
Frequently asked questions
Serverless projects use the core components of the Elastic Stack, such as Elasticsearch and Kibana, and are based on Elastic's Search AI Lake architecture that decouples compute and storage. Search and indexing operations are separated, which offers flexibility for scaling your workloads while ensuring a high level of performance.
Enjoy the following benefits with Elasticsearch Serverless:
- Management free. Elastic manages the underlying Elastic cluster, so you can focus on your data. With serverless projects, Elastic is responsible for automatic upgrades, data backups, and business continuity.
- Autoscaled. To meet your performance requirements, the system automatically adjusts to your workloads.
- Optimized data storage. Your data is stored in the Search Lake of your project, which serves as cost-efficient and performant storage. A high-performance layer is available on top of the Search Lake for your most queried data.
- Pay for the performance you need. Pay for ingest, search, and ML resources separately as needed by the workloads you run.
Elastic Cloud is a powerful platform that accommodates many computing needs. Serverless projects are purpose-built for use cases while providing a fully managed, autoscaled experience. This specialization and operational model is what sets serverless apart today.
Elasticsearch Serverless is currently available in select cloud provider regions, with some features still to come. We are fully invested in expanding our serverless offering to more regions and cloud providers. We recommend checking the documentation for technical details such as security, compliance, and availability.
It's simple to get started on Elasticsearch Serverless:
- Create an Elasticsearch Serverless project in the Cloud Console.
- Choose the use case-optimized project type that is most fitting for your needs.
- Get started with your use-case optimized project experience.
We recommend sending data directly from your application or using Connector clients. To move data from an existing Elasticsearch instance, we recommend using Logstash to migrate large volumes.
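As an example of the first path, sending data directly from an application, here is a minimal sketch using the Python Elasticsearch client's bulk helper; the endpoint, API key, index name, and documents are placeholders:

```python
# Illustrative only: bulk-indexing documents from an application into a project.
# Endpoint, API key, index name, and documents are placeholders.
from elasticsearch import Elasticsearch, helpers

client = Elasticsearch("https://<project-endpoint>", api_key="<api-key>")

docs = [
    {"title": "First document", "body": "hello serverless"},
    {"title": "Second document", "body": "search ai lake"},
]

# helpers.bulk batches the index requests and returns (successes, errors).
actions = ({"_index": "my-app-data", "_source": doc} for doc in docs)
successes, errors = helpers.bulk(client, actions)
print(successes, errors)
```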
Search Power settings let you manage the compute resources that drive search performance (throughput and latency) and control costs. There are three Search Power settings for Elasticsearch Serverless projects; Performant is the default and provides a fast search experience for data of all sizes. You can choose any one of the following settings:
- On-demand: Autoscales based on data and search load, with a lower minimum baseline for resource use. This flexibility results in more variable query latency and reduced maximum throughput.
- Performant: Delivers consistently low latency and autoscales to accommodate moderately high query throughput.
- High-throughput: Optimized for high-throughput scenarios, autoscaling to maintain query latency even at very high query volumes.
In Elasticsearch Serverless, you pay for the resources used to handle your workloads and performance needs. The examples below give you an idea of what you could pay and how to think about costs.
Example 1 - Dev environment with 2GB searchable data, 1% ingest utilization (15 minutes per day), 8% search utilization (2 hours per day)
- On-demand: $24/month
- Performant: $27/month
Example 2 - Prod environment with 20GB searchable data, 5% ingest utilization (1 hour per day), 33% search utilization (8 hours per day)
- On-demand: $190/month
- Performant: $210/month
*The pricing estimates provided in the examples are for illustrative purposes only. Actual costs may vary based on factors such as data type, query complexity, traffic patterns, usage duration, and specific configurations. These estimates are intended to help you understand potential pricing scenarios but should not be relied upon as a final cost. For precise cost calculations, we recommend monitoring your usage.
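If you want to sketch an estimate of your own along these lines, a rough back-of-envelope approach is to combine compute hours and retained storage at the list prices above. The sketch below is illustrative only: the baseline VCU counts, utilization figures, and the simple metering model are all assumptions rather than Elastic's actual billing logic, so use it for intuition rather than budgeting.

```python
# Illustrative only: a back-of-envelope monthly estimate combining compute and
# storage at the list prices above. VCU counts, utilization, and the metering
# model itself are simplifying assumptions, not Elastic's actual billing logic.
HOURS_PER_MONTH = 730

RATES = {
    "ingest_vcu_hour": 0.14,
    "search_vcu_hour": 0.09,
    "storage_gb_month": 0.047,
}

def estimate_monthly_cost(
    ingest_vcus: float,         # assumed average ingest VCUs while ingesting
    ingest_utilization: float,  # fraction of the month spent ingesting
    search_vcus: float,         # assumed average search VCUs while searching
    search_utilization: float,  # fraction of the month spent searching
    storage_gb: float,          # GB retained in the Search AI Lake
) -> float:
    compute = (
        ingest_vcus * ingest_utilization * HOURS_PER_MONTH * RATES["ingest_vcu_hour"]
        + search_vcus * search_utilization * HOURS_PER_MONTH * RATES["search_vcu_hour"]
    )
    storage = storage_gb * RATES["storage_gb_month"]
    return round(compute + storage, 2)

# Hypothetical project: 2 ingest VCUs at 5% utilization, 4 search VCUs at 33%
# utilization, 20 GB retained. All numbers are made up for illustration.
print(estimate_monthly_cost(2, 0.05, 4, 0.33, 20))  # ~97.88
```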
The General Purpose profile offers great performance for the price and suits most search use cases. It is the right profile for full-text search, semantic search using ELSER or sparse vector embeddings, sparse vectors, and dense vectors using compression such as BBQ (the default on serverless). We recommend the General Purpose profile for most search use cases.
We recommend using the Vector Optimized profile only for uncompressed dense vectors when you want better performance. Though the per-VCU cost is the same for the General Purpose and Vector Optimized profiles, the Vector Optimized profile provides a larger amount of RAM for searchable data. This leads to higher VCU consumption, so it is more expensive, while delivering significantly better performance for uncompressed vector data.