• Tech News
  • Fintech
  • Startup
  • Games
  • Ar & Vr
  • Reviews
  • How To
  • More
    • Mobile Tech
    • Pc & Laptop
    • Security
What's Hot

The Nothing Headphone (1) is totally bizarre in the best kind of way

July 1, 2025

Apple Drops MLS Season Pass to Half-Price

July 1, 2025

Apple’s Next MacBook Might Have More in Common With Your iPhone Than You Think

July 1, 2025
Facebook Twitter Instagram
  • Contact
  • Privacy Policy
  • Terms & Conditions
Facebook Twitter Instagram Pinterest VKontakte
Behind The ScreenBehind The Screen
  • Tech News
  • Fintech
  • Startup
  • Games
  • Ar & Vr
  • Reviews
  • How To
  • More
    • Mobile Tech
    • Pc & Laptop
    • Security
Behind The ScreenBehind The Screen
Home»Pc & Laptop»Enfabrica 3.2 Tbps ACF SuperNIC Chip: Boosting AI Data Center Performance And Reliability For Massive AI Cluster Scale
Pc & Laptop

Enfabrica 3.2 Tbps ACF SuperNIC Chip: Boosting AI Data Center Performance And Reliability For Massive AI Cluster Scale

November 20, 2024No Comments5 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Enfabrica 3.2 Tbps ACF SuperNIC Chip: Boosting AI Data Center Performance And Reliability For Massive AI Cluster Scale
Share
Facebook Twitter LinkedIn Pinterest Email

At Supercomputing 2024 (SC24), Enfabrica Company unveiled a milestone in AI knowledge middle networking: the Accelerated Compute Cloth (ACF) SuperNIC chip. This 3.2 Terabit-per-second (Tbps) Community Interface Card (NIC) SoC redefines large-scale AI and machine studying (ML) operations by enabling huge scalability, supporting clusters of over 500,000 GPUs. Enfabrica additionally raised $115 million in funding and is anticipated to launch its (ACF) SuperNIC chip in Q1 2025.

Addressing AI Networking Challenges

As AI fashions develop more and more massive and complicated, knowledge facilities face mounting pressures to attach massive numbers of specialised processing models, equivalent to GPUs. These GPUs are essential for high-speed computation in coaching and inference however are sometimes left idle on account of inefficient knowledge motion throughout current community architectures. The problem lies in successfully interconnecting hundreds of GPUs to make sure optimum knowledge switch with out bottlenecks or efficiency degradation.

Conventional networking approaches can hyperlink roughly 100,000 AI computing chips in a knowledge middle earlier than inefficiencies and slowdowns develop into important. In response to Enfabrica’s CEO, Rochan Sankar, the corporate’s new know-how helps as much as 500,000 chips in a single AI/ML system, enabling bigger and extra dependable AI mannequin computations. By overcoming the constraints of typical NIC designs, Enfabrica’s ACF SuperNIC maximizes GPU utilization and minimizes downtime.

Key Improvements within the ACF SuperNIC

The ACF SuperNIC boasts a number of industry-first options tailor-made to trendy AI knowledge middle wants:

  1. Excessive-Bandwidth, Multi-Port Connectivity: The ACF SuperNIC delivers multi-port 800-Gigabit Ethernet to GPU servers, quadrupling the bandwidth in comparison with different GPU-attached NICs. This setup offers unprecedented throughput and enhances multipath resiliency, guaranteeing strong communication throughout AI clusters.
  2. Environment friendly Two-Tier Community Design: With a high-radix configuration of 32 community ports and as much as 160 PCIe lanes, the ACF SuperNIC simplifies the general structure of AI knowledge facilities. This effectivity permits operators to assemble huge clusters utilizing fewer tiers, decreasing latency and enhancing knowledge switch effectivity throughout GPUs.
  3. Scaling Up and Scaling Out: The Enfabrica ACF SuperNIC, with its high-radix, high-bandwidth, and concurrent PCIe/Ethernet multipathing and knowledge mover capabilities, can uniquely scale up and scale out 4 to eight latest-generation GPUs per server system. This considerably will increase AI clusters’ efficiency, scale, and resiliency, guaranteeing optimum useful resource utilization and community effectivity.
  4. Built-in PCIe Interface: The chip helps 128 to 160 PCIe lanes, delivering speeds over 5 Tbps. This design permits a number of GPUs to hook up with a single CPU whereas sustaining high-speed communication with knowledge middle backbone switches. The result’s a extra environment friendly and versatile format that helps large-scale AI workloads.
  5. Resilient Message Multipathing (RMM): Enfabrica’s proprietary RMM know-how boosts the reliability of AI clusters. By mitigating the impression of community hyperlink failures or flaps, RMM prevents job stalls, guaranteeing smoother and extra environment friendly AI coaching processes. Sankar notes the significance of this function, particularly in massive setups the place hyperlinks to switches failures develop into frequent.
  6. Software program-Outlined RDMA Networking: This distinctive function empowers knowledge middle operators with full-stack programmability and debuggability, bringing the advantages of software-defined networking (SDN) into Distant Direct Reminiscence Entry (RDMA) setups. It permits customization of the transport layer, which might optimize cloud-scale community topologies with out sacrificing efficiency.
See also  Dell: PC supply chain has returned to normal, but chip shortage is still impacting server market

Enhanced Resiliency and Effectivity

Conventional techniques typically require one-to-one connections between GPUs and varied parts, equivalent to PCIe switches and RDMA NICs. Nevertheless, because the variety of GPUs in a system will increase, the danger of hyperlinks to switches failures grows, with potential disruptions occurring as typically as each 23 minutes in setups with over 100,000 GPUs, in keeping with Shankar. 

The ACF SuperNIC addresses this challenge by enabling a number of connections from GPUs to switches. This redundancy minimizes the impression of particular person part failures, boosting system uptime and reliability.

The SuperNIC additionally introduces the Collective Reminiscence Zoning function, which helps zero-copy knowledge transfers and optimizes host memory management. By decreasing latency and enhancing reminiscence effectivity, this know-how maximizes the floating-point operations per second (FLOPs) utilization of GPU server fleets.

Scalability and Operational Advantages

The ACF SuperNIC’s design just isn’t solely about scale but additionally about operational effectivity. It offers a software program stack that integrates with commonplace communication, current interfaces, and RDMA networking operations. This compatibility ensures environment friendly deployment throughout numerous AI compute environments composed of GPUs and accelerators (AI chips) from completely different distributors. Information middle operators profit from streamlined networking infrastructure, decreasing complexity and enhancing the pliability of their AI knowledge facilities.

Availability and Future Prospects

Enfabrica’s ACF SuperNIC will probably be obtainable in restricted portions in Q1 2025, with each the chips and pilot techniques now open for orders by Enfabrica and chosen companions. As AI fashions demand increased efficiency and bigger scales, Enfabrica’s modern method might play a pivotal position in shaping the subsequent era of AI knowledge facilities designed to assist Frontier AI models.

See also  USB-C Charging Speeds Could Determine iPhone’s Future in Europe

Filed in . Learn extra about AI (Synthetic Intelligence), Chip, generative AI, Semiconductors, Server, SoC and Supercomputer.

Source link

ACF boosting Center Chip cluster data Enfabrica massive performance reliability Scale SuperNIC Tbps
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Driver Booster Review: Fix Driver Problems in Seconds for Free

June 13, 2025

The Base iPhone 17 May Only Get Last Year’s A18 Chip

June 3, 2025

Master These New iPhone Control Center Features in iOS 18.5

May 30, 2025

Security Company Warns iPhone Users of New Massive Scale Chinese Hacking Threat

May 13, 2025
Add A Comment

Comments are closed.

Editors Picks

Ford recalls more than 100,000 Mavericks, Escapes, and Corsairs over fire risk

July 10, 2022

Tracking the rise of homeworking across the UK

July 19, 2022

Final Fantasy 14’s next big patch gives everyone their own idyllic island

July 4, 2022

Viva Wallet and Klarna enter Europe-wide partnership

November 14, 2022

Subscribe to Updates

Get the latest news and Updates from Behind The Scene about Tech, Startup and more.

Top Post

The Nothing Headphone (1) is totally bizarre in the best kind of way

Apple Drops MLS Season Pass to Half-Price

Apple’s Next MacBook Might Have More in Common With Your iPhone Than You Think

Behind The Screen
Facebook Twitter Instagram Pinterest Vimeo YouTube
  • Contact
  • Privacy Policy
  • Terms & Conditions
© 2025 behindthescreen.fr - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.