STOCK TITAN

Penguin Solutions Selected by Deepgram to Enable Deployment of Optimized AI Inference Infrastructure for Enterprise Voice AI

Rhea-AI Impact
(Moderate)
Rhea-AI Sentiment
(Very Positive)
Tags
AI

Key Terms

text-to-speech (tts) technical
Text-to-speech (TTS) is technology that converts written words into spoken audio, allowing press releases, earnings summaries, or other documents to be listened to instead of read. For investors it matters because TTS expands access and distribution—like turning a written report into a short podcast—making it easier to consume updates on the go, reach larger or non-reading audiences, and automate voice delivery for timely alerts or regulatory disclosures.
voice agent technical
A voice agent is an AI-driven system that listens to spoken requests, understands natural language, and responds or carries out tasks by voice—think of it as an automated employee you can talk to on a phone or smart speaker. For investors, voice agents matter because they can reshape how customers buy products and get support, cut operating costs, generate user data and privacy risk, and create new service features that may affect a company’s growth and valuation.
gpu technical
A GPU (graphics processing unit) is a specialized computer chip designed to handle many calculations at once, originally for rendering images and video but now widely used for tasks like artificial intelligence, data analysis and high-performance computing. Investors watch GPU demand and prices because strong sales often signal growth for chip makers and their customers, affect profit margins and capital spending, and can forecast wider trends in gaming, AI adoption and cloud services.
api-driven technical
API-driven describes a product, service, or business built around application programming interfaces (APIs) — digital doorways that let software systems talk, share data and automate tasks — rather than manual handoffs or one-off integrations. For investors it signals easier scaling, faster partnerships and lower per-unit operating costs because new customers or features can be added like snapping in modules; it also highlights dependence on technical partnerships and the need for robust cyber-security.
inference technical
Inference is the process of drawing a conclusion from available evidence or data, like a detective piecing together clues to form a likely story. For investors it matters because these judgments turn raw reports, test results, or market signals into expectations about future performance, risk, or regulatory outcomes—so how someone infers from the same facts can change investment decisions and valuation.
service level agreements (slas) technical
Service level agreements (SLAs) are written promises between a company and its customers that specify the quality and speed of a service—such as response time, availability, or fix times—and what happens if those promises aren’t met. For investors, SLAs matter because they quantify operational reliability and financial risk: strong SLAs can protect revenue and customer trust, while aggressive or missed SLAs can lead to penalties, lost clients, or higher costs, much like a warranty sets expectations for a product.

Strategic collaboration leverages Dell PowerEdge servers and NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs to deliver high-performance, low-latency voice experiences for mission-critical applications in healthcare and retail

FREMONT, Calif.--(BUSINESS WIRE)-- Penguin Solutions, Inc. (Nasdaq: PENG), the AI factory platform company, today announced a strategic collaboration with Deepgram and Dell Technologies to architect and deploy a fully optimized, production-ready infrastructure aligned to Deepgram’s demanding enterprise voice AI requirements. By leveraging its unique expertise in designing, building, deploying, and managing AI infrastructure with Dell PowerEdge servers and Dell PowerScale storage optimized for AI workloads, Penguin Solutions delivered an optimal solution to support and enhance Deepgram’s innovative Speech-to-Text (STT), Text-to-Speech (TTS), and Voice Agent capabilities, while ensuring maximum reliability and performance.

Penguin Solutions strategic collaboration with Deepgram and Dell Technologies on a fully-optimized, production-ready infrastructure delivers an optimal solution to support and enhance Deepgram’s innovative Speech-to-Text (STT), Text-to-Speech (TTS), and Voice Agent capabilities. The solution aligns to Deepgram’s demanding enterprise voice AI requirements while ensuring maximum reliability and performance.

Penguin Solutions strategic collaboration with Deepgram and Dell Technologies on a fully-optimized, production-ready infrastructure delivers an optimal solution to support and enhance Deepgram’s innovative Speech-to-Text (STT), Text-to-Speech (TTS), and Voice Agent capabilities. The solution aligns to Deepgram’s demanding enterprise voice AI requirements while ensuring maximum reliability and performance.

As enterprise adoption of generative AI accelerates, organizations must adhere to stricter service level agreements (SLAs), which require infrastructure that can ensure low latency and high concurrent usage. This Penguin-led deployment addresses these challenges by combining Deepgram’s innovative voice AI models with a purpose-built architectural design, a highly efficient deployment, and ongoing performance optimization.

"Modern AI workloads demand infrastructure that performs consistently and scales predictably under heavy loads, particularly for real-time inference applications like voice agents," said Joe Castillo, vice president of sales at Penguin Solutions. "By partnering with Deepgram and utilizing proven Dell AI infrastructure, Penguin Solutions is delivering a validated, scalable, end-to-end architecture. Our comprehensive framework equips Deepgram with the optimized infrastructure needed to reliably and accurately deliver complex voice AI capabilities in healthcare, retail, and other industries."

Drawing on its extensive experience with HPC and AI infrastructure, Penguin Solutions ensures that the underlying infrastructure meets the specific demands of Deepgram’s neural networks. The architecture also incorporates Dell PowerScale storage and Dell PowerEdge XE7745 servers with NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs, which provide efficient inferencing that enables data-intensive voice applications to operate seamlessly in real-time environments.

"Deepgram is focused on delivering voice AI capabilities that meet the demanding performance, scalability, and reliability requirements of enterprise environments - something only Deepgram brings to the market today," said Abe Pursell, vice president of partnerships and business development at Deepgram. "The infrastructure behind our platform has to be equally robust to support that level of innovation. Penguin Solutions demonstrated a deep understanding of our technical requirements, translating them into a sophisticated infrastructure environment that meets and exceeds expectations. This enables us to continue delivering the enterprise-class capabilities our customers rely on."

“AI-driven voice applications are transforming how organizations engage with customers and patients, but success depends on a resilient, high-performance infrastructure foundation,” said David Noy, vice president, unstructured data solutions product management at Dell Technologies. “Our collaboration with Penguin Solutions demonstrates how AI-optimized Dell PowerScale storage and Dell PowerEdge servers with NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs can accelerate enterprise AI adoption at scale. Together, we’re enabling Deepgram to deliver secure, low-latency voice AI experiences that power mission-critical innovation across healthcare and retail.”

The Deepgram-Penguin Solutions-Dell collaboration comprises a comprehensive approach for enterprises looking to modernize their customer and employee experiences. With Deepgram’s API-driven voice capabilities, Penguin Solutions’ AI services, and Dell’s powerful AI infrastructure, organizations can achieve highly accurate, real-time transcription and speech synthesis—all while maintaining strict data governance and control.

For those attending NVIDIA GTC AI Conference and Expo March 16-19, 2026, in San Jose, CA, learn more about this innovative collaboration at Dell’s Booth #721 on March 17 at 3:30 p.m. for the session “Powering Enterprise Voice AI: Deepgram's Agentic Solution” presented by Penguin, Deepgram and Dell. Attendees can also stop by Penguin Solutions’ booth #1031 to speak with an AI factory platform expert.

Penguin Solutions is a trademark or registered trademark of Penguin Solutions, Inc. or its affiliates. All other trademarks are the property of their respective owners.

About Penguin Solutions

The most transformative technological advancements are often the hardest to deploy and optimize. Penguin Solutions, the AI factory platform company, has the innovative technologies, skills, experience, and partnerships needed to turn your AI ambitions into reality.

In addition to our AI capabilities, Penguin Solutions offers memory and LED solutions serving a wide range of high-performance and specialized applications.

For more information, visit https://www.penguinsolutions.com.

PR Contact

Maureen O’Leary

Penguin Solutions

Corporate Communications

1-602-330-6846

pr@penguinsolutions.com

Source: Penguin Solutions, Inc.

Penguin Solutions Inc

NASDAQ:PENG

View PENG Stock Overview

PENG Rankings

PENG Latest News

PENG Latest SEC Filings

PENG Stock Data

921.91M
50.83M
Information Technology Services
Semiconductors & Related Devices
Link
United States
FREMONT