Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

New AWS Service Lets Customers Rent Nvidia GPUs For Quick AI Projects - Slashdot

Nov 01, 2023 - slashdot.org
Amazon Web Services (AWS) has launched Amazon Elastic Compute Cloud (EC2) Capacity Blocks for Machine Learning (ML), allowing customers to rent access to NVIDIA H100 Tensor Core GPUs for a specific amount of time. This service is designed to assist companies running large language models that require GPUs, which are often expensive and in short supply. Customers can reserve GPU instances in clusters of one to 64, with eight GPUs per instance, for up to 14 days in one-day increments, up to eight weeks in advance.

The new product provides cost certainty, as customers will know upfront the duration of the job, the number of GPUs they will use, and the total cost. The service displays the total cost for the timeframe and resources, which users can adjust according to their budget and resource needs before purchasing. The feature is now generally available in the AWS US East (Ohio) region.

Key takeaways:

  • AWS has launched Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML to allow customers to rent GPUs for a defined amount of time.
  • The product provides access to NVIDIA H100 Tensor Core GPUs instances in cluster sizes of one to 64 instances with 8 GPUs per instance.
  • Customers can reserve time for up to 14 days in 1-day increments, up to 8 weeks in advance, and the instances will shut down automatically when the timeframe is over.
  • The new feature is generally available starting today in the AWS US East (Ohio) region.
View Full Article

Comments (0)

Be the first to comment!