The new product provides cost certainty for customers: they will know upfront how long the job will run, how many GPUs they will use, and how much it will cost. Pricing for access to these resources will be dynamic, varying with supply and demand. The service is available starting today in the AWS US East (Ohio) region.
Key takeaways:
- AWS has launched Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML, a service that allows customers to reserve GPU instances for a specific amount of time.
- The service provides access to NVIDIA H100 Tensor Core GPU instances in clusters of one to 64 instances, each with 8 GPUs, which can be reserved for up to 14 days.
- Pricing for this service will be dynamic, varying with supply and demand, but users will know the total cost upfront, which provides cost certainty.
- The new feature is available starting today in the AWS US East (Ohio) region.
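
To make the figures above concrete, here is a minimal Python sketch of the arithmetic behind a reservation. The limits (64 instances, 8 GPUs per instance, 14 days) come from the takeaways; the function name `reservation_summary`, the whole-day granularity, and the hourly price are illustrative assumptions, not AWS's API or published rates.

```python
GPUS_PER_INSTANCE = 8   # from the announcement
MAX_INSTANCES = 64      # from the announcement
MAX_DAYS = 14           # from the announcement


def reservation_summary(instances, days, price_per_instance_hour):
    """Return GPU count, instance-hours, and total upfront cost.

    `price_per_instance_hour` is a hypothetical figure; real prices
    are dynamic and are quoted when the reservation is made.
    Whole-day durations are assumed here for simplicity.
    """
    if not 1 <= instances <= MAX_INSTANCES:
        raise ValueError("instances must be between 1 and 64")
    if not 0 < days <= MAX_DAYS:
        raise ValueError("days must be greater than 0 and at most 14")

    instance_hours = instances * days * 24
    return {
        "gpus": instances * GPUS_PER_INSTANCE,
        "instance_hours": instance_hours,
        "total_cost": instance_hours * price_per_instance_hour,
    }


# Largest possible block at a made-up price of 10.0 per instance-hour.
summary = reservation_summary(instances=64, days=14, price_per_instance_hour=10.0)
print(summary)
```

Under these assumptions, the largest block spans 512 GPUs, and the total cost is fixed once the hourly price is known at reservation time.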