Penguin Solutions Selected as the Managed Services Partner for Voltage Park’s NVIDIA Clusters
Penguin Solutions, an SGH brand, has been chosen by Voltage Park to manage its large-scale NVIDIA-based clusters. The partnership aims to enhance the efficiency of Voltage Park's AI infrastructure, which includes 24,000 GPUs. Penguin will provide professional and managed services, integrating their OriginAI® solution and Scyld ClusterWare® software across four data centers. Voltage Park's environment features advanced NVIDIA H100 Tensor Core GPUs and high-performance networking to support extensive AI training workloads. This collaboration will offer customers flexible computing solutions through both short-term and long-term rental options.
- Voltage Park selects Penguin Solutions to manage 24,000 NVIDIA H100 GPUs, enhancing infrastructure efficiency.
- Penguin's OriginAI® solution and Scyld ClusterWare® software integrated across Voltage Park's four data centers.
- Advanced AI infrastructure with high-performance, low-latency NVIDIA InfiniBand Networking.
- Flexible computing access offered through exchange-based pricing and long-term rentals.
- None.
Insights
Penguin Solutions' partnership with Voltage Park brings several strategic advantages to the table. Penguin Solutions' ability to deploy and manage large-scale AI infrastructure is important in an era where high-performance computing (HPC) is at the forefront of technological advancement. This collaboration leverages NVIDIA's H100 Tensor Core GPUs, which are known for their superlative performance in AI training tasks.
Unified GPU Memory setups, like the one described – 640GB of GPU memory per node – are a significant feature. This extensive GPU memory will support intensive workloads, ensuring scalability and efficiency across interconnected clusters. Furthermore, the use of InfiniBand Networking for high-performance, low-latency interconnects helps to meet the demands of large-scale AI training processes.
This setup will likely enhance the scalability and performance of Voltage Park's offerings, making it an attractive option for clients who require robust AI infrastructure. For investors, this indicates that Penguin Solutions is well-positioned to capture a growing market share in the AI compute space.
Overall, this partnership enhances the technological prowess and market competitiveness of both companies.
From a financial perspective, this partnership between Penguin Solutions and Voltage Park signals a significant revenue opportunity for Penguin Solutions. Managing and optimizing 24,000 GPUs in a cutting-edge AI infrastructure can generate substantial service revenue. Additionally, the deployment of Penguin's Scyld ClusterWare software platform across four data centers provides a unified management interface, potentially reducing operational costs and improving service efficiency.
Penguin's OriginAI® solution – which follows a comprehensive 'Design. Build. Deploy. Manage.' methodology – is designed for rapid deployment and high-quality outcomes. This end-to-end service offering may attract more clients looking for efficient and reliable AI infrastructure management, contributing positively to Penguin's long-term growth.
Retail investors should note that such partnerships enhance recurring revenue streams through managed services contracts, positioning Penguin Solutions favorably in the market. However, they should also consider the initial deployment costs and the time required to realize full revenue benefits.
Large-scale AI cloud service provider is making 24,000 GPUs available for on-demand users via innovative exchange platform as well as for long term rentals
Voltage Park, a next-gen cloud company, selected Penguin Solutions as its managed services partner to manage and maximize the efficiency of their large AI infrastructure. Penguin has large scale infrastructure experience to ensure maximum GPU performance and cluster availability to meet the needs of Voltage Park customers. (Photo: Business Wire)
Voltage Park’s cloud environment is one of the more significant ML compute infrastructures in the world. Under the architecture being deployed, each compute node instance features eight NVIDIA H100 Tensor Core GPUs for a total of 640GB of GPU memory per node. A high-performance, low-latency fabric built with NVIDIA InfiniBand Networking ensures workloads can scale across clusters of interconnected systems, allowing multiple instances to act as one massive GPU to meet the performance requirements of advanced AI training. High-performance storage is also being integrated to provide a complete solution for AI supercomputing.
“Voltage Park was in search of help to manage and maximize the efficiency of their large AI infrastructure,” said Pete Manca, President of Penguin Solutions. “As a multi-tenancy cloud provider, they offer customers several choices for computing power consumption. They turned to Penguin as an expert partner with large scale infrastructure experience to ensure maximum GPU performance and cluster availability that meets the needs of their compute-hungry customers.”
Penguin, a certified NVIDIA DGX-ready Managed Services Provider, will integrate and ensure production readiness for the majority of Voltage Park’s 24,000 GPU deployment of NVIDIA H100s and next-generation 3.2 TB InfiniBand and Ethernet interconnects. The deployment process will follow Penguin’s OriginAI® solution, a proven ‘Design. Build. Deploy. Manage.’ methodology with an objective of achieving a rapid and high-quality result. Penguin is also providing professional services and managed services, and the entire platform which spans four data centers will be powered by Penguin’s Scyld ClusterWare® software platform.
“Penguin’s track record of successfully deploying and managing large AI factories was compelling, but it was their Scyld ClusterWare software coupled with their services offerings that were truly pivotal to our decision,” said Ozan Kaya, CEO Voltage Park. “We are excited to bring customers AI infrastructure through our unique combination of flexible, exchange-based pricing for short-term compute access alongside traditional long-term rentals. Penguin’s end-to-end ability to deliver, optimize, and support the complete environment for multi-tenancy is helping bring our vision to life.”
To learn more about how Penguin designs, builds, deploys, and manages AI and high performance computing infrastructure at scale, visit: https://www.penguinsolutions.com/.
Penguin Solutions, OriginAI and Scyld ClusterWare are trademarks or registered trademarks of SMART Global Holdings, Inc. All other trademarks and registered trademarks are the property of their respective owners.
About Voltage Park
Voltage Park is a values-driven enterprise on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities, to seed-stage startups and nonprofits. With 24,000 H100 GPUs spread across six geographically distinct data centers, Voltage Park is in a unique position to provide top-tier performance for training and fine tuning of large ML models.
Explore Voltage Park’s Exchange Product at https://exchange.voltagepark.com/. Stay connected and follow Voltage Park on LinkedIn.
About Penguin Solutions
Penguin Solutions accelerates customers’ digital transformation with the power of emerging technologies in AI, HPC, and accelerated computing with solutions, software and services that span the continuum of edge, core, and cloud. By designing highly-advanced infrastructure, machines, and networked systems, we enable the world’s most innovative enterprises and government institutions to build the autonomous future, drive discovery, and amplify human potential.
Stay connected and follow Penguin Solutions on LinkedIn, Twitter, YouTube, and Facebook.
Penguin Solutions is an SGH Brand.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240711220509/en/
Maureen O’Leary
Director, Communications, Penguin Solutions/SGH
pr@sghcorp.com
Source: SMART Global Holdings, Inc.
FAQ
What did Voltage Park announce about their partnership with Penguin Solutions?
How many GPUs will Penguin Solutions manage for Voltage Park?
What is the main benefit of Penguin Solutions managing Voltage Park's AI infrastructure?
What software will Penguin Solutions use for managing Voltage Park's infrastructure?