Advanced Computing in the Age of AI | Friday, April 26, 2024

Penguin Computing Announces Open Compute-Based Deep Learning Solution With NVIDIA Tesla GPU Accelerators 

WASHINGTON, D.C., Oct. 26 -- Penguin Computing, a provider of high performance, enterprise data center and cloud solutions, today announced availability of Open Compute-based Tundra Relion X1904GT and Tundra Relion 1930g servers, powered by NVIDIA Tesla P40 and P4 GPU accelerators, respectively, for deep learning training and inference requirements.

Tundra Relion X1904GT, which already supports NVIDIA Tesla P100 and M40, now adds Tesla P40 to its accelerated computing platform offering design innovation, allowing customers to choose from a variety of topologies to accelerate applications. The Tundra Relion X1904GT also incorporates NVIDIA NVLink high-speed interconnect technology for best peer-to-peer performance.

Tundra combines the operational benefits and efficiency of Open Computing with a high density and high performance architecture needed to meet the requirements of the most demanding compute intensive applications. Penguin Computing’s Tundra platform demonstrates the company’s leadership in providing flexible, open platforms for the high performance computing market.

“Pairing Tundra Relion X1904GT with our Tundra Relion 1930g, we now have a complete deep learning solution in Open Compute form factor that covers both training and inference requirements,” said William Wu, Director of Product Management at Penguin Computing. “With the ever evolving deep learning market, the X1904GT with its flexible PCI-E topologies eclipses the cookie cutter approach, providing a solution optimized for customers’ respective applications. Our collaboration with NVIDIA is combating the perennial need to overcome scaling challenges for deep learning and HPC.”

NVIDIA Tesla GPU accelerators enable real-time responsiveness for the most complex deep learning models. Responsiveness is critical to user adoption for services such as interactive speech, visual search and video recommendations. As models increase in accuracy and complexity, CPUs are no longer capable of delivering interactive user experience. The NVIDIA Tesla P40 and P4 deliver more than 40X faster inference performance with INT8 operations for real-time responsiveness, even for the most complex models. Tesla GPU accelerators also meet exploding higher throughput requirements based on ever-growing data needs, in the form of sensor logs, images, videos and transactional records.

“Penguin Computing’s Open Compute servers, powered by NVIDIA Pascal architecture Tesla accelerators, are leading-edge deep learning solutions for the data center,” said Roy Kim, Director of Accelerated Computing at NVIDIA. “Tesla P40 and P4 GPU accelerators deliver 40x faster throughput for training and 40x responsiveness for inferencing compared to what was previously possible.”

Visit Penguin Computing’s booth at NVIDIA’s GPU Technology Conference, October 26-27, in Washington, D.C.

About Penguin Computing

Penguin Computing is one of the largest private suppliers of enterprise and high performance computing solutions in North America and has built and operates the leading specialized public HPC cloud service Penguin Computing on Demand (POD). Penguin Computing pioneers the design, engineering, integration and delivering of solutions that are based on open architectures and comprise non-proprietary components from a variety of vendors. Penguin Computing is also one of a limited number of authorized Open Compute Project (OCP) solution providers leveraging this Facebook-led initiative to bring the most efficient open data center solutions to a broader market, and has announced the Tundra product line which applies the benefits of OCP to high performance computing. Penguin Computing has systems installed with more than 2,500 customers in 40 countries across eight major vertical markets. Visit www.penguincomputing.com to learn more about the company and follow @PenguinHPC on Twitter.


Source: Penguin Computing

EnterpriseAI