Asics

Custom AI ASICs: Broadcom, Meta and the model-factory shift

How bespoke silicon and supply pacts are reshaping procurement for large model builders

An open server chassis reveals internal hardware components against a backdrop of illuminated racks in a data center. © The GPU Trade Inc 2026

By Nikhil R. Jain May 23, 2026

A new Tom’s Hardware roundup lays out how custom AI ASICs and long-term supply agreements are changing the economics and strategy of big-model development. The piece maps deals from Broadcom’s TPU commitments to Meta’s MTIA roadmap and argues the market is moving toward bespoke ‘picks-and-shovels’ for model factories.

One recent headline: Broadcom told investors it will route roughly 3.5 gigawatts of Google TPU capacity to Anthropic beginning in 2027 and pledged to design and supply future TPU generations through 2031. Those are multi‑year, gigawatt‑scale commitments that read like power‑purchase agreements for compute.

Broadcom’s role has evolved well beyond connector or switch silicon. The company now translates hyperscaler architecture into manufacturable ASICs, manages packaging and helps coordinate foundry runs, giving it leverage as an XPU design-and‑supply partner. That operational scope is changing where risk and control sit in the supply chain.

For hyperscalers and model builders, the practical result is more predictable capacity and longer procurement horizons. Instead of short GPU‑spot buys, large AI shops can lock in bespoke trains of accelerators and planned rollouts tied to product roadmaps and data‑center buildouts. That alters capital planning, negotiation tactics and inventory strategy across the industry.

Meta’s announcement of a four‑chip MTIA roadmap — the MTIA 300 through MTIA 500 — shows the other side of this trend: home‑grown silicon tailored to inference at scale. Meta says those chips will deliver steep increases in HBM bandwidth and compute across successive generations and that the company plans a rapid cadence of new variants.

Meta’s public engineering posts also underline a co‑design approach: software, model architects and hardware teams iterate together to squeeze efficiency from low‑precision formats, attention kernels and memory balances. That makes MTIA less a single product than the centerpiece of an in‑house model factory optimized end‑to‑end.

The emerging buyer power and bespoke designs are pressuring component and chip suppliers to adapt. Vendors that can offer co‑development, advanced packaging, and large‑scale delivery are winning wider roles — from ASIC design into systems integration and logistics. For suppliers that stay commodity‑focused, margins and strategic relevance look at risk in the new environment.

Nvidia, for its part, has responded by offering semi‑custom rack and fabric solutions to keep its interconnect at the center of hyperscaler stacks while still enabling third‑party silicon to plug into that fabric. Partnerships and investments affecting Marvell, Mellanox‑style interconnects, and networking stacks show how incumbents are trying to square bespoke demand with proprietary ecosystems.

Those strategic moves reshape pricing dynamics. Long, multi‑year capacity deals dampen spot‑market volatility but create chunky, winner‑take‑most contracts that favor large suppliers or integrated partners. Hyperscalers gain bargaining power to extract better unit economics, but they also assume long‑term risk if workloads pivot or a supplier misses a node in the roadmap.

Tom’s Hardware’s roundup frames this as a shift from simply buying compute to buying a predictable delivery pipeline — a ‘model‑factory’ infrastructure. That pipeline includes ASIC roadmaps, packaging and co‑development ties that are becoming parallel to, or even replacing, traditional GPU procurement for certain workloads.

There are strategic implications beyond price. Bespoke ASIC stacks can accelerate inference costs at scale, reduce power per token, and let firms tune chips to proprietary network primitives. But they also concentrate failure modes: if a custom ASIC underperforms or a foundry delay hits, a cloud or social‑media giant can face capacity bottlenecks that are harder to remedy than adding a rack of off‑the‑shelf GPUs.

For chip suppliers and cloud builders the takeaway is straightforward. Expect a bifurcated market where hyperscalers and big AI firms continue to invest in bespoke, co‑designed silicon and multi‑year delivery pacts, while a broad middle tier still relies on commercial accelerators and more flexible spot purchasing. The winners will be firms that can offer both deep technical co‑design and predictable delivery at scale.