Trainium3 UltraServers Now Available: Enabling Customers to Train and Deploy AI Models Faster at Lower Cost

(Neutral)

(Very Positive)

Key Terms

pflops technical

pflops (short for petaflops) is a measure of computing speed equal to one quadrillion (10^15) floating-point calculations per second, used to describe how many complex math operations a computer or chip can do each second. Investors use it like a horsepower or speedometer: higher pflops indicate greater raw capability for tasks such as artificial intelligence training, scientific simulations, or large-scale data processing, but it doesn’t alone guarantee efficiency, cost-effectiveness, or real-world performance.

gpu technical

A GPU (graphics processing unit) is a specialized computer chip designed to handle many calculations at once, originally for rendering images and video but now widely used for tasks like artificial intelligence, data analysis and high-performance computing. Investors watch GPU demand and prices because strong sales often signal growth for chip makers and their customers, affect profit margins and capital spending, and can forecast wider trends in gaming, AI adoption and cloud services.

See more from StockTitan in Google Search and AI answers. Adds StockTitan as a preferred source · opens Google

Add on Google

12/02/2025 - 01:30 PM

Amazon EC2 Trn3 UltraServers powered by AWS's first 3nm AI chip help organizations of all sizes run their most ambitious AI training and inference workloads

LAS VEGAS--(BUSINESS WIRE)-- At AWS re:Invent, Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the general availability of Trainium3 UltraServers powered by the new Trainium3 chip.

Key takeaways

Trainium3 UltraServers deliver high performance for AI workloads with up to 4.4x more compute performance, 4x greater energy efficiency, and almost 4x more memory bandwidth than Trainium2 UltraServers—enabling faster AI development with lower operational costs.
Trn3 UltraServers scale up to 144 Trainium3 chips, delivering up to 362 FP8 PFLOPs with 4x lower latency to train larger models faster and serve inference at scale.
Customers including Anthropic, Karakuri, Metagenomi, NetoAI, Ricoh, and Splash Music are reducing training and inference costs by up to 50% with Trainium, while Decart is achieving 4x faster inference for real-time generative video at half the cost of GPUs, and Amazon Bedrock is already serving production workloads on Trainium3.

To get the full story on AWS Trainium3 UltraServers, visit About Amazon.

About AWS

Amazon Web Services (AWS) is guided by customer obsession, pace of innovation, commitment to operational excellence, and long-term thinking. By democratizing technology for nearly two decades and making cloud computing and generative AI accessible to organizations of every size and industry, AWS has built one of the fastest-growing enterprise technology businesses in history. Millions of customers trust AWS to accelerate innovation, transform their businesses, and shape the future. With the most comprehensive AI capabilities and global infrastructure footprint, AWS empowers builders to turn big ideas into reality. Learn more at aws.amazon.com and follow @AWSNewsroom.

View source version on businesswire.com: https://www.businesswire.com/news/home/20251201046353/en/

Amazon.com, Inc.
Media Hotline
Amazon-pr@amazon.com
www.amazon.com/pr

Source: Amazon.com, Inc.

Trainium3 UltraServers Now Available: Enabling Customers to Train and Deploy AI Models Faster at Lower Cost

Key Terms

Related Articles