What is provisioned concurrency in AWS Lambda?
Answer
Provisioned Concurrency is an AWS Lambda feature that pre-initializes a specified number of execution environments and keeps them warm indefinitely, eliminating cold starts for those environments. When a request comes in, it's immediately handled by a pre-initialized instance without any initialization delay. Configuration: you allocate provisioned concurrency at the function alias or version level — aws lambda put-provisioned-concurrency-config --function-name myFunc --qualifier LIVE --provisioned-concurrent-executions 10. Pricing: provisioned concurrency is billed at ~$0.000015/GB-second for the pre-warmed capacity, plus the standard execution cost. Use cases: latency-sensitive APIs (payment processing, real-time recommendations), JVM/Java functions with notoriously long cold starts (2-5 seconds), high-traffic periods (auto-scale provisioned concurrency with Application Auto Scaling based on schedule or metric). Caveat: provisioned concurrency doesn't help beyond its configured count — requests above that level still experience cold starts.
Previous
How do you manage secrets in serverless applications?
Next
How do you handle errors and retries in serverless functions?
More Serverless Architecture Questions
View all →- Intermediate How do you reduce cold start latency in serverless functions?
- Intermediate What is AWS Step Functions and when would you use it?
- Intermediate How do you handle state in serverless applications?
- Intermediate What is the Serverless Framework and how does it work?
- Intermediate How do you implement authentication in a serverless API?