IBM Brings the Speed of Light to the Generative AI Era with Optics Breakthrough
IBM has unveiled breakthrough research in co-packaged optics (CPO) technology that could revolutionize data center operations for AI applications. The innovation enables connectivity at the speed of light through polymer optical waveguide (PWG), potentially replacing electrical interconnects in data centers.
Key benefits include:
- Over 5x power reduction compared to electrical interconnects
- Up to 5x faster AI model training
- 80x faster bandwidth for chip-to-chip communication
- 6x more optical fibers at silicon photonics chip edge
The technology has passed all required manufacturing stress tests and could significantly reduce energy consumption while accelerating AI processing. The research was conducted at IBM's facilities in Albany, NY and Bromont, Quebec.
IBM ha svelato una ricerca innovativa nella tecnologia dell'ottica co-imballata (CPO) che potrebbe rivoluzionare le operazioni dei data center per le applicazioni di intelligenza artificiale. L'innovazione consente la connessione alla velocità della luce tramite guide d'onda ottiche in polimero (PWG), potenzialmente sostituendo gli interconnessioni elettriche nei data center.
I principali vantaggi includono:
- Oltre 5 volte la riduzione del consumo energetico rispetto agli interconnessioni elettriche
- Fino a 5 volte più veloce l'addestramento dei modelli di IA
- 80 volte più ampia la larghezza di banda per la comunicazione chip-to-chip
- 6 volte più fibre ottiche al bordo del chip fotonico in silicio
La tecnologia ha superato tutti i test di stress di produzione richiesti e potrebbe ridurre significativamente il consumo energetico accelerando nel contempo l'elaborazione dell'IA. La ricerca è stata condotta presso le strutture IBM ad Albany, NY e Bromont, Quebec.
IBM ha presentado una investigación innovadora en tecnología de óptica empaquetada (CPO) que podría revolucionar las operaciones de los centros de datos para aplicaciones de IA. La innovación permite la conectividad a la velocidad de la luz a través de guías de onda ópticas de polímero (PWG), lo que podría reemplazar las interconexiones eléctricas en los centros de datos.
Los principales beneficios incluyen:
- Más de 5 veces la reducción de energía en comparación con las interconexiones eléctricas
- Hasta 5 veces más rápida la formación de modelos de IA
- 80 veces más ancho de banda para la comunicación entre chips
- 6 veces más fibras ópticas en el borde del chip de fotónica de silicio
La tecnología ha superado todas las pruebas de resistencia de fabricación requeridas y podría reducir significativamente el consumo de energía mientras acelera el procesamiento de IA. La investigación se llevó a cabo en las instalaciones de IBM en Albany, NY y Bromont, Quebec.
IBM은 인공지능 응용 프로그램을 위한 데이터 센터 운영을 혁신할 수 있는 패키지식 광학(CPO) 기술에 대한 획기적인 연구 결과를 발표했습니다. 이 혁신은 폴리머 광도파로(PWG)를 통해 빛의 속도로 연결할 수 있게 해주며, 데이터 센터의 전기 인터커넥트를 대체할 수 있습니다.
주요 이점은 다음과 같습니다:
- 전기 인터커넥트에 비해 에너지 소비를 5배 이상 줄임
- AI 모델 훈련이 최대 5배 더 빨라짐
- 칩 간 통신을 위한 대역폭이 80배 더 빨라짐
- 실리콘 포토닉스 칩 가장자리에 6배 더 많은 광섬유
이 기술은 모든 필수 제조 스트레스 테스트를 통과하였으며, AI 처리 속도를 높이면서 에너지 소비를 크게 줄일 수 있습니다. 연구는 뉴욕주 알바니와 퀘벡주 브로몽에 있는 IBM 시설에서 진행되었습니다.
IBM a dévoilé une recherche révolutionnaire dans la technologie des optiques empaquetées (CPO) qui pourrait transformer les opérations des centres de données pour les applications d'intelligence artificielle. L'innovation permet une connectivité à la vitesse de la lumière grâce à des guides d'onde optiques en polymère (PWG), remplaçant potentiellement les interconnexions électriques dans les centres de données.
Les principaux avantages incluent :
- Réduction de plus de 5 fois de la consommation d'énergie par rapport aux interconnexions électriques
- Entraînement de modèles d'IA jusqu'à 5 fois plus rapide
- Bande passante 80 fois plus rapide pour la communication entre puces
- 6 fois plus de fibres optiques au bord de la puce en photonique de silicium
La technologie a passé tous les tests de stress de fabrication requis et pourrait réduire considérablement la consommation d'énergie tout en accélérant le traitement de l'IA. La recherche a été réalisée dans les installations d'IBM à Albany, NY et à Bromont, Québec.
IBM hat bahnbrechende Forschung in der Co-Packaged Optics (CPO) Technologie vorgestellt, die die Betriebsabläufe in Rechenzentren für KI-Anwendungen revolutionieren könnte. Die Innovation ermöglicht die Verbindung mit Lichtgeschwindigkeit über polymeroptische Wellenleiter (PWG) und könnte elektrische Verbindungen in Rechenzentren ersetzen.
Die wichtigsten Vorteile sind:
- Über 5-fache Reduzierung des Energieverbrauchs im Vergleich zu elektrischen Verbindungen
- Bis zu 5-fach schnellere Schulung von KI-Modellen
- 80-fach höhere Bandbreite für die Kommunikation zwischen Chips
- 6-fache Anzahl an optischen Fasern am Rand des Silizium-Photonik-Chips
Die Technologie hat alle erforderlichen Fertigungs-Stresstests bestanden und könnte den Energieverbrauch erheblich senken, während sie gleichzeitig die KI-Verarbeitung beschleunigt. Die Forschung wurde in den Einrichtungen von IBM in Albany, NY und Bromont, Quebec durchgeführt.
- Over 5x reduction in energy consumption compared to current electrical interconnects
- Potential to accelerate AI model training by up to 5x
- 80x faster bandwidth for chip-to-chip communication
- 6x increase in optical fiber density at chip edge
- Successful stress testing for manufacturing viability
- None.
Insights
This breakthrough in co-packaged optics (CPO) represents a significant technological leap for data center infrastructure and AI computing. The 80x bandwidth improvement and 5x power reduction could revolutionize how data centers handle AI workloads. The ability to train Large Language Models up to 5x faster addresses one of the major bottlenecks in AI development - training time and costs.
The technology's success in stress testing and compatibility with standard assembly processes indicates strong commercial viability. The 50-micrometer pitch achievement, with potential for 18-micrometer pitch, demonstrates exceptional precision in optical interconnect manufacturing. This could significantly reduce the "idle GPU" problem that currently plagues AI training infrastructure.
The innovation's potential to save energy equivalent to 5,000 U.S. homes' annual power consumption per AI model trained is particularly noteworthy as data centers face increasing scrutiny over their environmental impact.
This development could significantly impact IBM's market position in the rapidly growing AI infrastructure sector. With the global AI hardware market expanding, this proprietary technology gives IBM a competitive edge against rivals like NVIDIA and Intel in data center solutions.
The timing is strategic, coinciding with the surge in generative AI adoption and increasing demands for more efficient data center operations. The location of development in Albany's NSTC EUV Accelerator and Bromont facility strengthens IBM's position in the critical U.S.-Canada semiconductor corridor.
The technology's potential to reduce AI training costs and energy consumption addresses two major pain points for cloud service providers and AI companies. This could drive significant commercial adoption and revenue growth in IBM's infrastructure segment, particularly as companies scale their AI operations.
New co-packaged optics innovation could replace electrical interconnects in data centers to offer significant improvements in speed and energy efficiency for AI and other computing applications
Today, fiber optic technology carries data at high speeds across long distances, managing nearly all the world's commerce and communications traffic with light instead of electricity. Although data centers use fiber optics for their external communications networks, racks in data centers still predominantly run communications on copper-based electrical wires. These wires connect GPU accelerators that may spend more than half of their time idle, waiting for data from other devices in a large, distributed training process which can incur significant expense and energy.
IBM researchers have demonstrated a way to bring optics' speed and capacity inside data centers. In a technical paper, IBM introduces a new CPO prototype module that can enable high-speed optical connectivity. This technology could significantly increase the bandwidth of data center communications, minimizing GPU downtime while drastically accelerating AI processing. This research innovation, as described, would enable:
- Lower costs for scaling generative AI through a more than 5x power reduction in energy consumption compared to mid-range electrical interconnects1, while extending the length of data center interconnect cables from one to hundreds of meters.
- Faster AI model training, enabling developers to train a Large Language Model (LLM) up to five times faster with CPO than with conventional electrical wiring. CPO could reduce the time it takes to train a standard LLM from three months to three weeks, with performance gains increasing by using larger models and more GPUs.2
- Dramatically increased energy efficiency for data centers, saving the energy equivalent of 5,000
U.S. homes' annual power consumption per AI model trained.3
"As generative AI demands more energy and processing power, the data center must evolve – and co-packaged optics can make these data centers future-proof," said Dario Gil, SVP and Director of Research at IBM. "With this breakthrough, tomorrow's chips will communicate much like how fiber optics cables carry data in and out of data centers, ushering in a new era of faster, more sustainable communications that can handle the AI workloads of the future."
Eighty times faster bandwidth than today's chip-to-chip communication
In recent years, advances in chip technology have densely packed transistors onto a chip; IBM's 2 nanometer node chip technology can contain more than 50 billion transistors. CPO technology aims to scale the interconnection density between accelerators by enabling chipmakers to add optical pathways connecting chips on an electronic module beyond the limits of today's electrical pathways. IBM's paper outlines how these new high bandwidth density optical structures, coupled with transmitting multiple wavelengths per optical channel, have the potential to boost bandwidth between chips as much as 80 times compared to electrical connections.
IBM's innovation, as described, would enable chipmakers to add six times as many optical fibers at the edge of a silicon photonics chip, called "beachfront density," compared to the current state-of-the-art CPO technology. Each fiber, about three times the width of a human hair, could span centimeters to hundreds of meters in length and transmit terabits of data per second. The IBM team assembled a high-density PWG at 50 micrometer pitch optical channels, adiabatically coupled to silicon photonics waveguides, using standard assembly packaging processes.
The paper additionally indicates that these CPO modules with PWG at 50 micrometer pitch are the first to pass all stress tests required for manufacturing. Components are subjected to high-humidity environments and temperatures ranging from -40°C to 125°C, as well as mechanical durability testing to confirm that optical interconnects can bend without breaking or losing data. Moreover, researchers have demonstrated PWG technology to an 18-micrometer pitch. Stacking four PWGs would allow for up to 128 channels for connectivity at that pitch.
IBM's continued leadership in semiconductor R&D
CPO technology enables a new pathway to meet AI's increasing performance demands, with the potential to replace off-module communications from electrical to optical. It continues IBM's history of leadership in semiconductor innovation, which also includes the first 2 nm node chip technology, the first implementation of 7 nm and 5 nm process technologies, Nanosheet transistors, vertical transistors (VTFET), single cell DRAM, and chemically amplified photoresists.
Researchers completed design, modeling, and simulation work for CPO in
About IBM
IBM is a leading provider of global hybrid cloud and AI, and consulting expertise. We help clients in more than 175 countries capitalize on insights from their data, streamline business processes, reduce costs and gain the competitive edge in their industries. More than 4,000 government and corporate entities in critical infrastructure areas such as financial services, telecommunications and healthcare rely on IBM's hybrid cloud platform and Red Hat OpenShift to affect their digital transformations quickly, efficiently and securely. IBM's breakthrough innovations in AI, quantum computing, industry-specific cloud solutions and consulting deliver open and flexible options to our clients. All of this is backed by IBM's long-standing commitment to trust, transparency, responsibility, inclusivity and service. Visit www.ibm.com for more information.
Media Contacts:
Bethany Hill McCarthy
IBM Research
bethany@ibm.com
Willa Hahn
IBM Research
willa.hahn@ibm.com
1 | A reduction from five to less than one picojoule per bit. |
2 | Figures based on training a 70 billion parameter LLM using industry-standard GPUs and interconnects. |
3 | Figures based on training a large LLM (such as GPT-4) using industry-standard GPUs and interconnects. |
View original content to download multimedia:https://www.prnewswire.com/news-releases/ibm-brings-the-speed-of-light-to-the-generative-ai-era-with-optics-breakthrough-302327699.html
SOURCE IBM
FAQ
What is IBM's new optical technology breakthrough announced in December 2024?
How much faster can IBM's new CPO technology train AI models?
What energy savings does IBM's new optical technology offer?