USDS$0.9998▲ 0.00%FIGR_HELOC$1.04▲ 0.37%XLM$0.1832▲ 2.44%LEO$9.80▲ 2.76%ZEC$552.79▲ 9.50%XAG$58.81▲ 0.06%SOL$78.02▲ 3.98%NATGAS$2.92▲ 0.52%BNB$579.18▲ 1.57%ETH$1,875.59▲ 5.14%HYPE$66.79▲ 5.45%RAIN$0.0147▲ 2.99%WBT$56.54▲ 2.83%TRX$0.3264▲ 0.55%BRENT$85.48▲ 0.89%DOGE$0.0741▲ 2.91%XAU$4,040.30▼ 0.51%WTI$79.75▲ 0.52%XRP$1.10▲ 3.40%BTC$64,734.00▲ 3.30%USDS$0.9998▲ 0.00%FIGR_HELOC$1.04▲ 0.37%XLM$0.1832▲ 2.44%LEO$9.80▲ 2.76%ZEC$552.79▲ 9.50%XAG$58.81▲ 0.06%SOL$78.02▲ 3.98%NATGAS$2.92▲ 0.52%BNB$579.18▲ 1.57%ETH$1,875.59▲ 5.14%HYPE$66.79▲ 5.45%RAIN$0.0147▲ 2.99%WBT$56.54▲ 2.83%TRX$0.3264▲ 0.55%BRENT$85.48▲ 0.89%DOGE$0.0741▲ 2.91%XAU$4,040.30▼ 0.51%WTI$79.75▲ 0.52%XRP$1.10▲ 3.40%BTC$64,734.00▲ 3.30%

Prices as of 04:57 UTC

Author: Rhys Donnelly

Marvell Technology AI Revenue Crossed $1 Billion in Q1 FY2027

Marvell Technology AI Revenue Crossed $1 Billion in Q1 FY2027

Marvell Technology reported in its Q1 FY2027 earnings (February through April 2026, results published June 3, 2026) that its AI revenue — comprising custom application-specific integrated circuit designs commissioned by cloud hyperscalers and electro-optical interconnect components for AI data center fabric — crossed $1 billion in a single quarter for the first time, reaching $1.1 billion and representing approximately 55 percent of Marvell’s total Q1 FY2027 revenue of $2.0 billion, which itself grew 62 percent year over year from $1.23 billion in Q1 FY2026. Marvell’s Q1 FY2027 investor filings show the data center segment reaching $1.6 billion in the quarter — up more than 80 percent year over year — with custom ASIC revenue constituting the dominant and fastest-growing component, driven by production ramp of AI inference and training chips designed by Marvell’s engineering teams under multi-year engagements with Amazon Web Services (Trainium and Inferentia custom silicon), Microsoft Azure (Azure Maia inference chip), and Google (optical interconnect components supporting TPU cluster networking). Marvell’s position in the custom AI silicon market distinguishes it structurally from the general-purpose GPU market that Nvidia dominates: Marvell designs chips to specification under long-term contracts with a small number of hyperscaler customers who want proprietary inference economics not available through merchant GPU procurement, accepting the 18-to-24-month design cycle and minimum volume commitment that custom ASIC development requires in exchange for per-chip economics optimised for their specific workload mix, data centre topology, and thermal envelope. The $1 billion quarterly AI revenue milestone — which Marvell CEO Matt Murphy guided toward at the company’s October 2025 analyst day when he raised the FY2027 AI revenue target to $4.5 billion from the prior $4 billion guidance — arrived one quarter earlier than the consensus analyst estimate of Q2 FY2027, reflecting stronger-than-anticipated production volume ramp across the Amazon Trainium3 and Azure Maia 2 programmes that each entered high-volume manufacturing in Q4 FY2026 and Q1 FY2027 respectively. Dell Technologies AI server revenue crossing $10 billion in FY2026 establishes the demand context for Marvell’s custom ASIC growth: as enterprises deploy AI server clusters at scale, the hyperscalers supplying the cloud compute that underpins enterprise AI inference demand are simultaneously investing in custom silicon to reduce per-inference cost below the level achievable with merchant GPUs, creating a parallel market for ASIC design capacity that Marvell and Broadcom currently supply in volume while Intel, Qualcomm, and Alchip compete for incremental design wins.

Marvell’s custom ASIC business model is architecturally different from both the merchant GPU market and the traditional semiconductor licensing model because Marvell retains manufacturing responsibility — sourcing wafers from TSMC at N3 and N4 nodes and delivering packaged silicon to the hyperscaler customer — while the customer owns the chip architecture and instruction set, which they developed internally with Marvell’s design services team co-engineering the physical implementation. This hybrid ownership model means Marvell carries the production yield risk (fabricating defective die reduces the revenue recognised per wafer purchased from TSMC) while the customer carries the architecture risk (a chip that performs below its design specification on the target workload becomes the customer’s problem, not Marvell’s); Marvell’s gross margin of approximately 61 percent on the ASIC revenue line reflects this risk allocation, with Marvell earning design services revenue on the front-end engineering phase and a per-unit margin on the back-end manufacturing volume that is lower than Nvidia’s approximately 78 percent product gross margin but justified by the contracted volume certainty — Marvell’s hyperscaler customers commit to multi-year purchase volumes of typically hundreds of millions of units before the chip enters production, eliminating the demand risk that affects merchant chip vendors. The interconnect component of Marvell’s AI revenue — particularly its PAM4 digital signal processor technology for 400G and 800G coherent optical transceivers used to connect AI clusters within and between data centres — benefits from a different dynamic than the ASIC programme: optical interconnect is a shared infrastructure component that every AI cluster requires regardless of which chip vendor supplies the compute, making Marvell’s COLORZ and Alaska coherent DSP families a cross-architectural revenue stream that grows with overall AI infrastructure deployment rather than with any single customer’s ASIC programme. IDC’s AI infrastructure market sizing projects the total AI server and networking market at $150 billion by 2028 with custom silicon growing at 35 percent compound annual growth rate through the period, a trajectory that validates Marvell’s three-year investment in design services headcount — approximately 12,000 engineers globally as of Q1 FY2027, up from 7,000 in FY2023 — required to execute simultaneous multi-chip design programmes at the complexity level that N3-node AI ASICs demand. ARM Holdings’ server market penetration through AWS Graviton provides a complementary lens on the same structural shift: as hyperscalers design increasing proportions of their own compute silicon on ARM architecture licensed from ARM Holdings, the physical implementation of those designs requires ASIC design services and foundry-adjacent manufacturing partnerships of exactly the kind that Marvell provides — making ARM’s royalty growth and Marvell’s ASIC revenue two correlated expressions of the same underlying trend of hyperscaler silicon internalisation.

What Marvell’s $4.5 Billion AI Revenue Target for FY2027 Signals About Custom Silicon Scale

Marvell’s revised FY2027 full-year AI revenue guidance of $4.5 billion — raised from $4.0 billion at the October 2025 analyst day and now trending toward a potential upward revision following the Q1 FY2027 beat — implies quarterly AI revenue of $1.1 to $1.25 billion through the remaining three quarters of FY2027 (May 2026 through January 2027), a trajectory that requires both continued production ramp on existing programmes and incremental revenue from design programmes that Marvell has disclosed are in active development without naming the end customer. The undisclosed programmes are a meaningful component of Marvell’s forward valuation because the company’s investor disclosures indicate it has secured design wins with at least two hyperscalers beyond its publicly discussed Amazon and Microsoft programmes — the commercial logic being that cloud operators who have invested in custom silicon design capability (Google with TPUs, Meta with MTIA, Amazon with Trainium, Microsoft with Maia) are unlikely to return to merchant GPU dependency for incremental workloads if their custom chip economics are favourable, driving a self-reinforcing cycle of ASIC investment that aggregates into increasing design services demand for Marvell. The risk concentration is correspondingly high: Marvell’s top two customers — Amazon and Microsoft — collectively represent approximately 65 percent of data centre segment revenue, meaning a programme delay, architecture change, or hyperscaler capital expenditure reduction at either company would materially impact Marvell’s AI revenue quarterly. Cisco’s AI networking and Nexus Hyperfabric revenue provides context on the network fabric layer that Marvell’s interconnect components ultimately terminate into: as AI cluster scale grows from hundreds to tens of thousands of accelerators, the switching and routing infrastructure connecting those accelerators becomes a proportionally larger share of total cluster cost, which benefits Marvell’s switching ASIC business (acquired through the Innovium purchase) in addition to its optical DSP revenue. Oracle Cloud’s AI infrastructure revenue represents the enterprise demand signal that makes hyperscaler custom silicon investment commercially rational: as enterprise AI workloads migrate to cloud — Oracle’s GPU cluster bookings representing a known portion of hyperscaler-type AI infrastructure demand outside the traditional big-three cloud providers — the aggregate compute demand that drives hyperscaler capacity investment (and therefore custom ASIC production volumes) remains higher than any single cloud provider’s own organic workload growth would justify, sustaining the volume commitments that underpin Marvell’s contracted revenue certainty.

What Marvell Technology’s $1 Billion AI Revenue Reveals About What Hyperscalers Are Actually Discovering in Custom Silicon

Marvell’s $1 billion AI revenue milestone is fundamentally a product discovery story — but the customers doing the discovering are hyperscalers, not end users, and the product being discovered is silicon architecture. When Google, Amazon, Microsoft, and Meta commission custom ASIC designs through Marvell’s engineering services, they are running large-scale product discovery experiments: what chip architecture delivers the inference performance-per-watt their specific AI workload actually needs, at the reliability and supply chain security that production infrastructure requires, without the pricing premium of a general-purpose GPU? Marvell’s revenue growth is evidence that these discovery experiments have produced enough positive outcomes to fund production volumes.

The product discovery insight that custom silicon reveals is that AI workloads are not homogeneous. A general-purpose GPU architecture is optimized for training and broadly useful for a range of inference tasks. But a hyperscaler running billions of inference requests daily on a specific model architecture with a known distribution of sequence lengths, memory access patterns, and batch sizes has a fundamentally different optimization target than a general-purpose GPU was designed to serve. Custom ASIC designs for this use case — where the chip is designed around the workload rather than the workload being adapted to the chip — can deliver significant efficiency gains on the specific performance dimensions the hyperscaler cares most about. This is not a new insight; one hyperscaler’s custom processor program demonstrated it a decade ago. The rest of the hyperscaler field is now discovering the same thing with their own specific workloads.

The product management question that Marvell’s milestone poses is about the next wave of custom silicon discovery: what workloads beyond hyperscaler training and inference could justify ASIC-level optimization at production volume? The candidates are enterprise AI inference at scale (large organizations running models on-premises with known workload characteristics), edge inference on consumer devices (where power consumption and heat constraints create a strong case for workload-specific silicon), and specialized AI applications in healthcare, manufacturing, and autonomous systems where the performance-per-watt constraint is extreme. Marvell’s $1 billion is evidence that the first wave of custom silicon discovery has been commercially validated. The second wave’s timeline depends on how fast adjacent markets discover their own workload-specific optimization gap — and on whether the engineering services model that worked for hyperscalers can scale to serve smaller-volume enterprise customers.

04/07/2026
The 2026 Memory Supercycle Reached Consumer Devices
The 2026 memory shortage stopped being an AI story this quarter. It is now a phone-and-laptop story, and that shift changes what it means for crypto. When memory scarcity lived inside the data center, decentralized infrastructure networks could tell a clean bull story: hardware is scarce, so idle capacity is valuable, so pay people in tokens to supply it. That story is now only half true. The same wafers being denied to your next phone are being denied to the storage and GPU nodes that DePIN networks depend on. Demand for decentralized capacity is rising. So is the cost of supplying it, and the second curve is steeper.

Here is the specific claim this piece defends: the memory supercycle is a genuine tailwind for on-chain demand and a genuine headwind for on-chain supply, and any DePIN token thesis that only priced in the first half is about to get repriced. We argued in early 2026 that the memory crunch handed DePIN its best demand argument yet. That was correct as far as it went. It did not go far enough.

The numbers that moved the story downmarket

Start with the price signal, because it is unambiguous. Gartner in April 2026 forecast that DRAM average prices would rise 125% across the year and NAND flash 234%, inside a semiconductor market it expects to clear $1.3 trillion in revenue. Those are not spot-market blips. They are annual averages, which means the pain compounds through every device that ships in the second half.

Deloitte’s 2026 semiconductor outlook put a floor under it: consumer memory such as DDR5 rose roughly fourfold between September and November 2025, with another 50% increase plausible in the first half of 2026. Deloitte also flagged the structural cause plainly. Roughly half of industry revenue in 2026 is expected to come from AI chips for data centers, and that is where the scarce wafers are going.

The mechanism is a wafer-allocation decision, not a manufacturing failure. High-bandwidth memory now consumes a disproportionate share of DRAM capacity because it is stacked, complex, and margin-rich. One industry teardown estimated HBM had taken around 23% of DRAM wafer supply, and every wafer committed to an HBM stack bound for an Nvidia GPU is a wafer that never becomes a smartphone module. Samsung, SK Hynix and Micron are the only three companies that can make this trade at scale, and they are making it in favor of the buyer that pays the most per bit.

The downstream damage is now measurable. IDC projected smartphone shipments to fall 12.9% in 2026 and PC shipments 11.3%, driven directly by memory cost forcing device makers to raise prices, cut specifications, or both. When a shortage starts deleting units from the consumer market rather than just raising cloud bills, it has crossed a line. This is no longer an enterprise procurement problem. It is a household one.

Why this looked like pure DePIN fuel

The optimistic read on all of this is easy to construct, and it is not wrong. Decentralized physical infrastructure networks exist to monetize hardware that would otherwise sit idle. Scarcity raises the value of any capacity you can bring online. If a hyperscaler cannot get enough memory or GPUs, the theory goes, some of that unmet demand should route to networks that aggregate consumer and prosumer hardware and pay for it in tokens.

There is real evidence the demand side is responding. Decentralized GPU and compute markets have spent 2026 pitching themselves as the release valve for a supply-constrained AI buildout, the same argument we traced when OpenAI’s $122 billion compute-financing round made the case for decentralized compute. Render Network routes GPU rendering and inference jobs to a distributed fleet. Akash Network runs a marketplace for underused cloud and GPU capacity. io.net aggregates GPUs into clusters for AI workloads. On the storage side, Filecoin and Arweave sell durable, verifiable capacity that competes on price with hyperscaler object storage. When centralized supply is rationed, a marketplace that can source capacity from thousands of independent operators has a real pitch.

Bitcoin miners tell the same story from the other direction. As we noted when Nvidia’s stock stayed flat while its chips got more essential, miners with power contracts and cooling already in place have become accidental AI-infrastructure landlords. Scarcity in the physical layer rewards whoever already owns physical capacity. That part of the DePIN thesis is intact.

The half nobody priced in

Now the uncomfortable side. DePIN networks do not manufacture hardware. They rent yours. Every storage provider on Filecoin, every GPU on Render or io.net, every node on a DePIN network is a machine somebody bought, and that machine is now more expensive to buy and to expand. The supercycle that raises demand for decentralized capacity simultaneously raises the capital cost of the operators who supply it.

Consider a Filecoin storage provider. Its economics depend on cost per terabyte of usable capacity, which is dominated by drives, but modern storage nodes also lean on substantial RAM for sealing and proof generation. When DDR5 prices double year over year, the marginal cost of adding sealing capacity climbs, and the token reward per terabyte has to cover a higher hardware bill to keep the operator solvent. The network can raise nothing by decree. It can only hope token price or storage demand rises fast enough to compensate. The memory supercycle is now widely expected to run through 2028, which means this is a multi-year squeeze on node economics, not a quarter of noise.

The GPU networks face the sharper version of the same problem. A Render or io.net operator competing to host inference needs current-generation accelerators, and those cards ship with HBM that is being allocated first to the buyers paying data-center prices. An independent operator trying to expand a fleet is bidding for the same scarce silicon as the hyperscalers, without the hyperscaler’s purchase agreements or priority in the queue. The network’s demand pitch is strongest precisely when its suppliers can least afford to grow. That is the contradiction the token models mostly ignored.

This is why the framing matters. A DePIN token that rallied on “scarcity is good for us” priced a demand curve and forgot a cost curve. If operator margins compress because hardware inflation outpaces token rewards, supply growth stalls, quality operators exit, and the network’s ability to actually absorb the overflow demand it was pitching gets thinner, not thicker. Scarcity is only bullish for a rental network if the network can keep attracting rentals faster than its landlords’ costs rise.

Which networks survive the squeeze, and which do not

The split is going to run along one line: whether a network’s rewards are indexed to real, rising demand or to a fixed emission schedule set when hardware was cheap.

Networks with demand-linked economics have a path through. If Akash or io.net can charge inference customers prices that rise with the underlying cost of compute, operators can pass hardware inflation through to buyers and stay solvent. The marketplace design does the work. Networks that pay operators from a fixed token emission calibrated to 2024 hardware costs do not have that lever. Their operators eat the inflation while rewards stay flat in token terms, and the only thing that saves them is a token price appreciating fast enough to cover the gap, which is a speculative bet, not an operating model.

Storage is more defensible than compute here, for a boring reason. Storage nodes are weighted toward drives and bandwidth, and while memory cost is climbing, the bill of materials is less exposed to HBM scarcity than a GPU node whose entire value proposition is the accelerator. Arweave’s endowment model, which pre-funds perpetual storage from an upfront fee, at least attempts to price durability against future hardware cost rather than assuming today’s prices hold. Whether the endowment math survives a sustained memory supercycle is a fair question, but it is at least the right question to be asking. A DePIN network that has never modeled a hardware-inflation scenario is flying blind through a repricing it cannot control.

What this means for the rest of 2026

The macro picture is a shortage that has moved from cloud invoices to consumer shelves, projected to persist for years, driven by a deliberate industry choice to serve AI demand first. For crypto, the honest read is that the memory supercycle is not a simple long on DePIN. It is a barbell. It strengthens the demand case for decentralized storage and compute while raising the cost base of every operator who supplies that capacity, and it rewards networks with demand-linked pricing while punishing networks running on cheap-hardware emission math.

The investors who did well out of the first phase treated the crunch as a one-sided tailwind. The ones who do well from here will read the balance sheet on both sides: is this network’s supply getting more expensive faster than its demand is getting more valuable? For a lot of DePIN tokens, the answer over the next eighteen months is going to be uncomfortable, and the ones honest enough to model it now are the ones worth holding through it.

Frequently asked questions

Is the 2026 memory shortage actually affecting consumer devices or just data centers? Both, and the consumer effect is the newer development. IDC projects smartphone shipments falling 12.9% and PC shipments falling 11.3% in 2026, driven by memory costs forcing device makers to raise prices or cut specifications. The root cause is that memory makers are directing scarce wafers toward high-bandwidth memory for AI accelerators, which pays more per bit than consumer memory. Gartner’s forecast of a 125% jump in DRAM prices and a 234% jump in NAND flash reflects an annual average, so the cost pressure compounds across every device shipping through the second half of the year rather than easing.

Does the memory crunch help or hurt DePIN tokens? It does both, which is the point. It helps the demand side, because scarce centralized capacity makes decentralized storage and compute marketplaces more attractive as a release valve. It hurts the supply side, because DePIN operators have to buy the same inflating hardware to run their nodes. A network whose rewards are linked to real demand can pass higher costs through to customers. A network paying operators from a fixed token emission set when hardware was cheap cannot, and its operator margins compress until token price or demand bails them out.

Which DePIN categories are most exposed to the hardware squeeze? GPU compute networks such as Render, io.net and Akash are the most exposed, because their value depends on current-generation accelerators that ship with the exact high-bandwidth memory being rationed first to data-center buyers. Storage networks such as Filecoin and Arweave are somewhat more insulated, because their bill of materials is weighted toward drives and bandwidth rather than HBM, though rising DRAM prices still raise the cost of sealing and proof generation. The safest position is a network with demand-linked pricing rather than fixed emissions.

How long is the memory supercycle expected to last? Current industry analysis expects the memory supercycle to run through 2028, driven by structural AI demand rather than a temporary inventory swing. That timeline matters for crypto because it turns a hardware-cost spike into a multi-year operating condition. DePIN networks and Bitcoin miners repurposing hardware for AI inference are planning around a sustained high-cost environment, not a transient shortage, which rewards operators who already own physical capacity and penalizes those who need to expand into an expensive market.

Are Bitcoin miners winners or losers in this environment? Miners with existing power contracts, cooling and physical footprint are relative winners, because scarcity rewards whoever already owns capacity. Many have repositioned as AI-infrastructure hosts, renting out data-center space and power to compute-hungry customers who cannot source their own. The catch is the same one facing DePIN operators: expanding into new accelerator capacity means bidding for scarce silicon against better-capitalized hyperscalers. The advantage belongs to what a miner already has, not to what it now wants to buy.

Sources
What the 2026 Memory Supercycle Reaching Consumer Devices Reveals About the Design Opportunity in On-Device Intelligence

From a human-centered computing perspective, the memory supercycle is not a semiconductor story. It is a design opportunity that has not yet been taken. When consumer devices carry sufficient DRAM and NAND flash to run large language models locally — without cloud round-trips, without API latency, without the privacy implications of sending personal context to a remote server — the design possibilities for contextually intelligent personal computing expand dramatically. But expanded memory capacity does not automatically produce better user experiences. The bottleneck the supercycle has not solved is the design layer: translating raw on-device processing capability into interfaces that people understand, trust, and find genuinely useful rather than just technically impressive.

The history of personal computing is full of capability inflection points where increased processing power did not translate into proportionally better user experiences because interface design lagged behind the hardware. The original Macintosh mattered not because its processor was faster than competing personal computers, but because its designers understood that the person using it was not a programmer and that the computer had to model the user’s mental model rather than the engineer’s implementation model. On-device AI in 2026 faces the same design challenge. The memory supercycle has delivered the hardware foundation. The interface design work — how to surface a locally running model’s context understanding in ways that feel natural rather than intrusive — is largely undone. The design gap is wider than the hardware gap was.

The most important design question on-device AI raises is not what the model can do but when it should act. A device that infers context from stored messages, calendar events, notes, and browsing history has the technical capability to surface highly relevant suggestions. But unsolicited relevance feels like surveillance when the design does not first establish trust. The design principle of discoverability — making capabilities visible without imposing them — becomes critical when the capability is contextual intelligence rather than a button that does one thing. The memory supercycle has moved the capability ceiling. Moving the design floor to match it is the work that determines whether 2026’s on-device AI hardware investment produces a generation of devices people find transformatively useful or merely technically capable.

What the Memory Supercycle Teaches About the Discipline of Saying No to Features That Aren’t Ready

The temptation, when hardware capability moves this fast, is to ship every feature the new capability ceiling makes technically possible. That temptation is exactly wrong, and it is the mistake that separates products people love from products people merely tolerate. The right response to a capability inflection is not to build everything the new ceiling allows. It is to figure out, with real discipline, which small number of things the new capability makes possible are actually worth building — and to say no to the rest, even when saying no means leaving obvious, marketable capability on the table.

Contextual intelligence, done right, requires a level of restraint that is uncomfortable for teams that have just been handed more compute headroom than they know what to do with. The instinct is to use the headroom for more features, more integrations, more surfaces where the AI shows up. The discipline is to ask, for every one of those possibilities, whether it makes the core experience simpler or whether it just makes the feature list longer. A device that surfaces contextual intelligence in five well-considered moments, each one earning the user’s trust through relevance and restraint, will outperform a device that surfaces contextual intelligence in fifty moments, most of which the user learns to ignore or actively resents.

The design floor problem identified in this article’s earlier analysis is really a focus problem wearing design clothes. Teams that have not decided what their AI-hardware product is fundamentally for will use expanded memory capacity to do more of everything, because doing more of everything feels like progress and is easier to justify in a roadmap review than saying no to features that are technically possible but not yet good. The devices that turn this capability ceiling into something people find transformatively useful, rather than merely technically impressive, will be the ones built by teams willing to ship less — and to make the few things they do ship extraordinary rather than merely comprehensive.
04/07/2026
Dell Technologies AI Server Revenue Crossed $10 Billion in FY2026

Dell Technologies AI Server Revenue Crossed $10 Billion in FY2026

Dell Technologies reported in its FY2026 full-year earnings (fiscal year ending January 30, 2026, results published February 26, 2026) that its AI-optimised server revenue — the subset of Infrastructure Solutions Group (ISG) server revenue attributable to configurations built specifically for AI training and inference workloads, including PowerEdge XE9680 and XE9680L (8-GPU and 4-GPU server configurations) and PowerEdge R760xa (AI inference-optimised 2U server) — crossed $10 billion for the fiscal year, reaching $10.3 billion and representing a 156 percent year-over-year increase from $4.0 billion in FY2025. Dell’s FY2026 annual investor filings show ISG total revenue for FY2026 reached $52.8 billion — up 36 percent from $38.9 billion in FY2025 — with AI server revenue representing 19.5 percent of total ISG revenue compared to 10.3 percent in FY2025, a penetration shift that Dell CEO Michael Dell described on the earnings call as “the fastest segment transition in our history.” Dell’s AI server backlog at the close of FY2026 (January 30, 2026) stood at approximately $9 billion in unfulfilled orders — meaning that the $10 billion in AI server revenue reported for FY2026 was constrained by manufacturing and component supply capacity rather than by customer demand. The backlog figure matters because it establishes that Dell’s AI server revenue growth trajectory in FY2027 (February 2026 – January 2027) is not demand-dependent: approximately $9 billion in orders are already contracted and awaiting fulfilment, providing revenue visibility through at least two additional fiscal quarters before new order flow needs to sustain the growth rate. Dell’s position in the AI server market is that of a hyperscale-volume OEM — a company that purchases Nvidia H100 and H200 GPU hardware, AMD Instinct MI300X GPU hardware, and custom networking components (typically InfiniBand or 400GbE Ethernet) and integrates them into purpose-built server chassis, rack-scale cooling systems, and pre-validated AI reference architectures — rather than a chip manufacturer (Nvidia), a cloud provider (AWS, Azure, GCP), or an infrastructure software company (Cisco). The OEM position is structurally important: Dell’s revenue scales with the total volume of AI server deployments across all enterprise customers and cloud providers who do not build their own servers (AWS, Google, and Microsoft build significant internal server capacity but also purchase commercial OEM systems for burst capacity and specific configurations), creating a demand base that is broader and more diversified than any single cloud provider’s AI infrastructure spend. Cisco’s AI networking revenue crossing $5 billion for East-West GPU traffic fabrics within AI data centres represents the adjacent infrastructure layer that Dell’s AI server deployments create demand for: each Dell PowerEdge XE9680 GPU server deployed in an enterprise AI cluster requires approximately 8 high-bandwidth network ports connecting its 8 Nvidia H200 GPUs to the AI fabric — meaning Dell’s AI server shipment volumes directly scale Cisco’s AI networking demand, with the two companies’ AI revenue growth rates correlated through the same underlying enterprise AI data centre build-out cycle.

Dell’s AI server competitive position is defined by two advantages that are distinct from the chip-level performance differentiation that Nvidia and AMD compete on: supply chain scale and enterprise integration services. Dell’s supply chain scale — as one of the largest purchasers of Nvidia GPU hardware globally, acquiring several hundred thousand GPU units annually across all product lines — gives Dell direct allocation visibility into Nvidia’s production volumes that smaller OEM competitors (Super Micro Computer, Hewlett Packard Enterprise) cannot consistently match, and allows Dell to offer enterprise customers committed delivery timelines for AI server orders with a certainty that requires direct Nvidia volume purchase agreements to maintain. The supply chain advantage became commercially decisive in 2024 and 2025, when Nvidia H100 supply remained constrained relative to demand: enterprises that ordered AI servers through Dell’s volume OEM channel received committed delivery schedules backed by Dell’s Nvidia allocation, while enterprises attempting to purchase GPU compute through spot markets or smaller OEM channels faced multi-quarter delivery uncertainty. Dell’s enterprise integration services advantage is the second structural differentiator: unlike cloud GPU rental (AWS P5 instances, Azure ND H100 v5, Google A3 Mega), Dell’s on-premises AI server deployment includes professional services for data centre site readiness assessment (power, cooling, network connectivity), ProSupport mission-critical maintenance contracts covering GPU hardware failures within a four-hour response SLA, and Dell’s AI Factory with Nvidia reference architecture validation — a pre-engineered configuration that specifies the exact hardware, firmware, and software stack required to run NVIDIA AI Enterprise software suite at production performance on Dell PowerEdge XE9680 hardware without enterprise IT teams needing to independently validate the configuration. The on-premises versus cloud comparison is central to Dell’s AI server market thesis: enterprises that can cost-justify GPU compute at the scale of 50 or more GPU equivalents operating continuously find that on-premises GPU infrastructure at Dell’s server pricing reaches total cost of ownership parity with equivalent cloud GPU instance pricing within 18 to 24 months of deployment, after which on-premises cost advantages grow as the hardware is amortised over a 5-to-7-year useful life while cloud instance pricing remains fixed or declines more slowly. IDC’s Q4 2025 Worldwide Server Tracker shows Dell holding 17.3 percent market share by revenue in the AI-optimised server category, second to Super Micro Computer’s 22.1 percent share, with the gap narrowing from 8.4 points in Q2 2025 to 4.8 points in Q4 2025 as Dell’s supply chain normalisation and AI Factory reference architecture traction improved Dell’s competitive win rate against Super Micro’s lower-cost but less integrated offerings. ARM Holdings’ royalty revenue growth from AI chip compute subsystems includes royalty contributions from the ARM-based CPUs (Dell’s PowerEdge XE9680 uses Intel Xeon CPUs, not ARM, but Dell’s broader PowerEdge server line increasingly includes ARM-based configurations through AWS Graviton3 and Qualcomm CLOUD AI 100 options) that provide the CPU management layer alongside GPU compute in heterogeneous AI server configurations — a dimension of the AI hardware supply chain that sits upstream of Dell’s OEM assembly but generates royalty revenue that correlates with Dell’s server unit shipment volumes.

What Dell’s AI Server Backlog Means for FY2027 Revenue Visibility

Dell’s $9 billion AI server backlog at FY2026 close is the most significant forward revenue indicator in the company’s recent history, and its composition — weighted toward large enterprise data centre refreshes and colocation provider deployments rather than hyperscaler orders — reveals the specific customer segment driving Dell’s AI server demand. Dell’s customer disclosures indicate that no single customer represents more than 10 percent of AI server backlog, confirming that the demand is distributed across hundreds of enterprise accounts rather than concentrated in a handful of hyperscaler relationships that could create exposure to hyperscaler capex cycle volatility. The distribution of demand across enterprise accounts is structurally valuable because enterprise data centre AI deployments have different procurement cycles than hyperscaler infrastructure spending: hyperscalers accelerate or defer data centre builds in 12-to-18-month cycles correlated with their own revenue growth, while enterprise organisations procure on 3-to-5-year IT infrastructure refresh cycles that are less correlated with quarterly technology sector sentiment. Dell’s guidance for FY2027 AI server revenue of $14 to $16 billion ($12 billion from backlog fulfilment at the current delivery rate, plus projected $2 to $4 billion in new orders during the fiscal year) implies continued growth even if new order intake slows significantly from FY2026’s pace — a revenue visibility profile that most hardware companies cannot match this far into an upcycle. The H200 to B200 GPU platform transition also benefits Dell’s FY2027 outlook: Nvidia’s Blackwell B200 GPU architecture, which began commercial availability in H2 FY2026 (second half of Dell’s FY2026, roughly August 2025 through January 2026), commands higher average selling prices than the H200 (B200-based PowerEdge XE9680 configurations are priced approximately 35 to 40 percent above equivalent H200 configurations), meaning Dell’s FY2027 AI server revenue per unit shipped will increase as the mix shifts toward Blackwell. Oracle Cloud’s AI infrastructure revenue doubling in FY2026 represents the cloud-side demand signal that Dell’s on-premises AI server business both competes with and complements: Oracle’s OCI AI infrastructure growth demonstrates the magnitude of total enterprise AI compute demand, some of which is served by Oracle’s cloud capacity (where Oracle itself is a major purchaser of Nvidia GPU servers, including Dell OEM configurations) and some of which is served by enterprises deploying on-premises Dell AI server capacity — with the key enterprise decision being whether workload sensitivity, regulatory compliance, and TCO economics favour cloud or on-premises AI compute at each organisation’s specific scale. The Wall Street Journal’s analysis of Dell’s FY2026 AI server results characterises the $10 billion milestone as the confirmation that AI infrastructure spending has moved permanently into traditional enterprise procurement channels rather than remaining concentrated in cloud providers — a structural shift that creates durable revenue opportunity for enterprise-focused OEMs like Dell across the AI infrastructure build-out cycle regardless of which cloud provider or foundation model company wins the AI application layer.

What Dell’s AI Server Revenue Reveals About Where Enterprise AI Infrastructure Has Pricing Power

Dell Technologies crossing $10 billion in AI server revenue in FY2026 is a milestone that needs disaggregating through competitive structure to understand what it actually reveals. AI servers are hardware incorporating NVIDIA GPUs that Dell configures, brands, and supports. The five-forces question is straightforward: where in this value chain does Dell hold pricing power, and where does it act as a margin pass-through?

Buyer power is the most structurally consequential force. Hyperscaler buyers — AWS, Azure, Google Cloud — negotiate AI server contracts at a scale that gives them significant leverage and often bypasses OEMs entirely through direct ODM relationships with Quanta, Foxconn, and similar manufacturers. Dell’s pricing power concentrates in the enterprise segment: large corporations building private AI infrastructure, financial services firms, healthcare systems, defense contractors. These buyers have fewer ODM alternatives, rely on Dell’s ProSupport service infrastructure, and make purchasing decisions through established IT procurement relationships. That segment is where the $10B is stickiest.

Supplier power is the force that constrains Dell’s upside. NVIDIA holds extraordinary leverage over GPU allocation and pricing. Dell functions as a pass-through for GPU cost in many PowerEdge XE configurations; expanding margin on the GPU component is structurally impossible without NVIDIA’s cooperation. Dell’s margin lever is the surrounding services stack — ProSupport contracts, deployment services, financing — which are attached to AI server sales but not dependent on NVIDIA pricing.

The structural question for the next three years is whether Dell’s position compounds or commoditizes. Compounding scenario: AI server deployments at enterprise accounts generate ProSupport and managed services revenue that deepens Dell’s data center relationship, making Dell the incumbent for the next infrastructure refresh cycle. Commoditization scenario: enterprise buyers move toward hyperscaler managed AI services (co-location, cloud bursting), reducing on-premise AI hardware demand and shifting Dell’s addressable market. The $10B is a current milestone; the competitive structure determines whether it becomes a floor or a ceiling.

What Dell’s $10 Billion AI Server Business Reveals About the Brand Problem Every Hardware Vendor Faces in a Services Market

Dell’s $10 billion AI server milestone carries a branding problem inside it. The story Dell has been telling — that AI hardware at enterprise scale runs through the same buying relationships, the same reseller networks, and the same ProSupport infrastructure that Dell has built over 40 years — is a compelling story for CFOs and CIOs who want AI infrastructure without the procurement friction of a new vendor relationship. But the story has a ceiling. Once AI infrastructure is established as a buy-from-existing-vendor proposition, the brand work that drove adoption becomes the commodity. Dell’s AI server brand is strongest during the adoption phase, when enterprise buyers are making first commitments to on-premise AI infrastructure. That phase has a limited duration. The brand question Dell has not yet answered is what Dell stands for after “familiar vendor for AI infrastructure” is no longer a differentiated message.

The framing problem is that Dell is selling a category rather than a position. $10 billion in AI server revenue is an impressive sales achievement, but it measures a category (enterprise AI hardware) not a defensible position within it. HPE, Supermicro, and ODMs compete in the same category with similar hardware, and their competitive pitch is also “we sell AI infrastructure to enterprises.” The differentiated framing that makes a category position into a brand position would look like: “Dell is the vendor enterprises choose when uptime and support SLA matter more than spec-sheet efficiency,” or “Dell is the only AI infrastructure vendor whose total cost of ownership model includes ProSupport’s labor cost reduction at scale.” These are positions, not categories. Dell’s current brand story emphasizes volume ($10B milestone) and availability (supply chain relationships with Nvidia). Neither is a brand position; both are table-stakes claims once the category has matured.

The branding implication of the $10 billion milestone is that Dell needs to move its brand message one layer down the value stack before commoditization pressures drive it there involuntarily. A hardware company that defines its brand at the product layer (“our AI servers run at X teraflops”) will be commoditized by spec convergence. A hardware company that defines its brand at the service layer (“our infrastructure delivers X uptime with Y support response time backed by Z labor-cost reduction model”) creates a positioning moat that spec comparison shopping cannot easily overcome. Dell’s $10 billion is the commercial validation that it has won the adoption phase. The brand work for the commoditization phase — when price pressure and ODM alternatives intensify — has not yet begun in earnest.

02/07/2026
Nvidia Stock Stayed Flat as AI Chip Demand Kept Growing
Nvidia’s stock has gone almost nowhere in 2026 while the PHLX Semiconductor index has climbed 79%, according to The Motley Fool. That gap is not a warning that Nvidia is weakening — its Vera Rubin systems are in mass production, shipping to North American tech giants from July, priced roughly 25% above Grace Blackwell at an estimated $3.5–4 million per system, per TradingKey and CNBC. The verdict is simpler and more useful: the AI trade is rotating away from the pick-seller and toward everyone the picks enable. And the most direct crypto-native beneficiary of that rotation is the group of former bitcoin miners quietly rebuilding themselves into AI landlords.

When a stock stops rewarding earnings growth, the market is telling you the easy money has moved downstream. Nvidia’s fiscal 2026 revenue hit $215.9 billion, up 65% year over year, per the same Motley Fool coverage, yet the multiple compressed anyway. That is the signature of a market repricing the value chain — moving margin from the chip designer toward the foundries, the power and cooling suppliers, and the operators who own the buildings the chips go into. In crypto, those operators already exist. They spent the last cycle mining bitcoin.

The rotation is priced, not predicted

The 79-point spread between Nvidia and the broader semiconductor index is the cleanest evidence that this is happening now, not later. Investors are still buying AI exposure aggressively — IDC forecasts semiconductor industry revenue jumping 53% in 2026 to $1.29 trillion, per the sector data cited across Motley Fool’s analysis. They are simply buying it somewhere other than the name that led the last three years.

Vera Rubin makes the point tangible. Nvidia claims 10x performance-per-watt over Blackwell, and the buildout has reportedly pushed Nvidia to more than 20% of TSMC’s revenue while enriching power and cooling suppliers, per TrendForce. The value is spreading to the ecosystem around the chip. We saw an early version of this rotation in Arm’s AI royalty revenue becoming its primary growth driver and in AMD’s accelerator business closing ground — the market rewarding the picks-adjacent layer even when the picks themselves stall.

The crypto-native beneficiary is hiding in plain sight

The downstream layer capturing this rotation is physical: power, land, cooling, and the operational competence to run gigawatt-scale facilities. Bitcoin miners spent years assembling exactly that. They hold interconnect agreements, energized substations, and the industrial discipline to run megawatts of hardware around the clock. In 2026 they are converting those assets into AI hosting contracts — and, tellingly, selling bitcoin to fund the conversion, per CoinDesk.

Core Scientific is the flagship. Having emerged from bankruptcy in 2024, it signed a $10.2 billion, 12-year agreement with CoreWeave and is building six AI data centers under that lease, according to CoinDesk’s report on its subsequent $3.3 billion bond sale to finance the shift. The deal is expected to generate roughly $10 billion in revenue. This is a company that mined bitcoin turning its power footprint into a decade-long contract with one of the fastest-growing GPU clouds — the same CoreWeave capacity that labs like OpenAI depend on.

TeraWulf has signed HPC contracts totaling $12.8 billion, anchored by Google-backed Fluidstack and other counterparties. Roughly 27% of its revenue already comes from AI, a figure projected to reach about 70% by year-end, per the insights4vc 2026 thesis. IREN, formerly Iris Energy, secured a $9.7 billion deal with Microsoft to host 76,000 Nvidia GB300 GPUs across 200MW at its Childress, Texas campus. The pattern repeats because the asset that matters — energized, permitted, coolable power capacity — is the exact bottleneck the Vera Rubin buildout is straining, the same physical constraint we traced in the 2026 memory crunch and DePIN’s demand case.

Why the miners, specifically

Anyone can want to build an AI data center. Very few can plug one in. Grid interconnection queues in the US stretch years, and energized capacity at scale is the scarcest input in the entire AI buildout. Miners front-ran that scarcity by accident — they chased cheap power for bitcoin and ended up holding the one asset the AI boom cannot manufacture on demand. The Vera Rubin systems shipping in July need somewhere to live, and the somewhere has to already have power.

That is why the contracts are structured as decade-long leases rather than spot arrangements. CoreWeave’s 12-year commitment to Core Scientific and Microsoft’s deal with IREN are hyperscalers locking in scarce, ready capacity before competitors do. The miners are not pivoting into a crowded market; they are monetizing a moat they did not know they were digging. It is a cleaner version of the infrastructure logic behind the institutional flows we covered in Bitcoin ETF inflows crossing $50 billion — capital rewarding crypto-adjacent operators for owning something structurally scarce.

The financing decision that proves the thesis

The sharpest signal is what the miners are willing to give up. Selling bitcoin — the asset their entire prior thesis was built to accumulate — to fund an AI conversion is a revealed preference, not a press release. It says management believes a 12-year HPC lease is worth more than continued exposure to the coin they were founded to mine. Core Scientific’s willingness to take on $3.3 billion in junk-rated debt on top of that says the same thing with a credit rating attached.

That is a real reallocation of conviction, and it deserves the skeptical footnote: it also concentrates these companies’ fortunes on Nvidia’s roadmap and a handful of hyperscaler counterparties. If AI capex cools, a miner that sold its bitcoin and levered up on GPU-hosting debt is exposed on both sides. The pivot is rational given today’s demand curve. It is not risk-free, and the ones that sold the most bitcoin have the least cushion if the curve bends.

What Nvidia’s flat chart actually tells operators

For operators and investors, the read is to stop treating Nvidia’s stock as the thermometer for the AI trade. The chip is more essential than ever — 10x performance-per-watt, a 25% price increase the market is absorbing, mass production confirmed — and the stock is flat anyway. That combination means the returns are migrating to whoever owns the scarce complements: TSMC’s capacity, the power and cooling supply chain, and the physical hosting footprint that former miners happen to control.

The crypto angle here is not a token. It is equity and infrastructure. The clearest way to express “AI compute demand keeps rising” through a crypto-native lens in 2026 is not a GPU-rental token but the miners converting hash power into AI landlording. For the fuller map of which decentralized and crypto-adjacent infrastructure is generating durable revenue versus running on narrative, VaaSBlock’s breakdown of what is working in DePIN in 2026 is the reference worth keeping open.

FAQ

Why is Nvidia’s stock flat in 2026 if its chips are selling so well?

Because the market is rotating AI exposure downstream. Nvidia’s fiscal 2026 revenue rose 65% to $215.9 billion and Vera Rubin is in mass production at a 25% price premium, yet the stock has gone nearly nowhere while the PHLX Semiconductor index climbed 79%, per The Motley Fool. When a stock stops rewarding strong earnings, it usually means the easy returns have moved to the rest of the value chain — foundries, power and cooling suppliers, and the operators who own the data centers. The chip is more essential and the stock is flat, which is the signature of a rotating trade.

How are bitcoin miners connected to the AI chip boom?

They own the scarcest input: energized, permitted, coolable power capacity at industrial scale. US grid interconnection queues stretch years, so ready power is the bottleneck the AI buildout cannot manufacture on demand. Miners assembled that footprint chasing cheap electricity for bitcoin and are now converting it into AI hosting contracts. Core Scientific signed a $10.2 billion, 12-year deal with CoreWeave, TeraWulf holds $12.8 billion in HPC contracts, and IREN secured a $9.7 billion Microsoft deal for 76,000 GB300 GPUs, per CoinDesk and insights4vc.

Why are miners selling bitcoin to fund the pivot?

It is a revealed preference. Selling the asset their entire prior strategy was built to accumulate signals that management believes a decade-long AI hosting lease is worth more than continued bitcoin exposure, per CoinDesk’s reporting. Core Scientific also took on $3.3 billion in junk-rated debt to accelerate the shift. The willingness to give up bitcoin and take on that debt is the strongest evidence the pivot is a genuine reallocation of conviction rather than a marketing rebrand — though it also raises risk if AI capex cools.

Is the miner-to-AI pivot risky?

Yes. Converting to AI hosting concentrates a miner’s fortunes on Nvidia’s roadmap and a handful of hyperscaler counterparties like CoreWeave, Microsoft, and Fluidstack. A company that sold its bitcoin and levered up on GPU-hosting debt is exposed on both sides if AI capital spending slows — it has neither the coin upside nor a diversified tenant base. The decade-long lease structures mitigate some of this by locking in revenue, but the miners that sold the most bitcoin have the least cushion. The pivot is rational given 2026 demand, not risk-free.

What is the best crypto-native way to express AI compute demand in 2026?

Through infrastructure and equity rather than a single GPU-rental token. The most direct expression is the former bitcoin miners converting power footprints into AI landlording — Core Scientific, TeraWulf, and IREN — because they capture the scarce physical complement that Nvidia’s chips require. Decentralized compute networks like Akash and Render capture the inference layer. The common thread is that returns in 2026 accrue to whoever owns the scarce complements to the chip, not to the chip stock itself.

Sources
What Nvidia’s Flat Stock During Growing AI Chip Demand Reveals About the Psychology of Priced-In Expectations

Nvidia’s stock price staying roughly flat while AI chip demand continues growing looks like a paradox to many observers. It is not. It is one of the most documented mechanisms in market psychology: when a future earnings trajectory is already widely understood and consensus-priced, incremental confirmation of that trajectory moves the price less than newcomers expect. Each new data point that confirms what the market already believed is worth less than the previous one.

Nvidia’s stock compounded roughly 2,200 percent between January 2023 and its peak valuation in mid-2025. That compounding was driven by genuine price discovery — the market progressively repricing Nvidia’s future earnings as AI training demand became real, then accelerating, then clearly durable. Each Nvidia earnings report from 2023 through early 2025 genuinely surprised the market upward, because each report demonstrated that the previous consensus estimate of AI chip demand had been wrong in the same direction: too conservative.

By FY2026, the AI chip demand story is not surprising anyone. Institutional investors, retail participants, and sell-side analysts all expect Nvidia’s data center revenue to grow. When earnings confirm what the market already believed, the surprise-weighted mechanism produces a flat stock. The flat price is not the market losing faith in Nvidia’s growth; it is the market saying the growth was already in the price.

The Bitcoin miner AI pivot trade described in this article represents a classic rotation from priced-in to mispriced. Mining infrastructure companies reconfiguring GPU capacity for AI inference are a derivative play that has not yet been fully priced by the market — they carry AI infrastructure exposure without the valuation premium Nvidia already trades at. Institutional capital rotating from a fully priced asset into a derivative asset with similar exposure and lower current valuation is the structural driver of that trade, not any fundamental change in AI chip demand. Nvidia’s flat stock is a signal about pricing, not about fundamentals.

What Nvidia’s Flat Stock on Growing AI Chip Revenue Reveals About the Growth Loop That Drives Next-Stage AI Infrastructure Adoption

The growth loop perspective on Nvidia’s flat stock requires separating the performance of the product from the performance of the investment. Nvidia’s AI chip revenue is genuinely compounding — each generation of infrastructure deployed at hyperscaler, enterprise, and research institution scale creates the trained models, the inference workloads, and the developer ecosystem that generates demand for the next generation of infrastructure. This is a supply-side growth loop: Nvidia chips enable AI capability that creates demand for more Nvidia chips. The loop has been running since 2023 and is not yet showing structural signs of slowing. What is slowing is the investment return from holding Nvidia stock — because the loop’s existence and durability is now fully priced into the equity.

The growth loop that matters for the next stage of Nvidia adoption is not the hyperscaler training loop (which is already mature) but the enterprise inference loop. The training market is concentrated in a small number of hyperscalers and large model labs with known procurement patterns. The inference market is distributing across a much larger population of enterprises deploying AI-powered applications in production. The enterprise inference loop has different properties: more heterogeneous workloads, lower tolerance for infrastructure complexity, stronger preference for managed services, and lower capital budgets per deployment than hyperscalers. This creates a different distribution motion — more channel-dependent, more ISV-mediated, more sensitive to total cost of ownership than raw training throughput.

The growth loop implication for Nvidia’s flat stock is that the equity market has run ahead of the enterprise inference loop’s actual monetization timeline. The market priced the hyperscaler training loop’s potential in 2023-2024, and the training-era revenue realization followed. The enterprise inference loop’s monetization will follow a longer, more distributed timeline — more customers making smaller decisions at irregular cadences rather than a few customers making enormous decisions at predictable cycles. That distribution flattens the revenue growth curve compared to the training-era step function. A flat stock price on growing revenue implies: the market sees the loop running but is waiting for the enterprise inference monetization timeline to inflect before moving the multiple higher.
02/07/2026
Cisco’s AI Networking Revenue Crossed $5 Billion

Cisco’s AI Networking Revenue Has Crossed $5 Billion and Enterprise Data Centers Are Being Rebuilt for East-West GPU Traffic

Cisco disclosed in its Q3 FY2026 earnings call on May 14, 2026, that AI-related product orders had crossed $5 billion in the trailing twelve months — the first time Cisco has broken out AI networking as a separate revenue metric, reflecting both the size of the segment and the need to explain why Cisco’s networking hardware business is recovering after five consecutive quarters of enterprise spending contraction that followed the COVID-era overbuild cycle. Cisco’s Q3 FY2026 investor materials identify two distinct AI networking revenue streams: Cisco Nexus 9000 series switches being configured as AI cluster fabrics (replacing the traditional InfiniBand networking used in early GPU clusters with Ethernet-based connectivity that integrates with enterprise customers’ existing Cisco networking infrastructure), and Cisco Nexus HyperFabric, an AI-specific networking product launched in 2024 that provides a pre-configured fabric architecture optimised for the east-west GPU-to-GPU communication patterns that large language model training and inference require. The east-west traffic pattern is the defining architectural difference between AI data centers and traditional enterprise data centers: conventional enterprise networking was designed around north-south traffic — data moving between end-user devices and servers, or between on-premise infrastructure and the internet — where a hierarchy of distribution and access layer switches routes traffic through a central spine. AI training clusters require a fundamentally different architecture because the dominant traffic pattern is between GPUs within the same cluster during distributed training, where each GPU must communicate with dozens or hundreds of other GPUs simultaneously to synchronise gradient updates, parameter values, and activation states across the model training run. This east-west traffic pattern generates aggregate bandwidth demands of 400 to 800 gigabits per second per GPU node — orders of magnitude higher than the 10 to 25 gigabits per second per server that traditional enterprise networking was designed to support — requiring fabric architectures with near-zero latency, extremely high bandwidth-to-switch-port density, and lossless transport that preserves packet ordering across thousands of simultaneous flows. ARM Holdings’ compute subsystem royalties flow in part from the AI chip designs that generate this extreme east-west networking demand — every GPU sold into an AI training cluster creates a corresponding networking infrastructure requirement that Cisco’s HyperFabric products are designed to address.

Cisco’s competitive position in AI networking faces a structural challenge from Nvidia, which has its own high-performance networking division (formerly Mellanox) that sells InfiniBand interconnects — the dominant networking technology in GPU clusters before Ethernet became a viable alternative for AI workloads. InfiniBand’s historical advantage was its remote direct memory access capability, which allows GPUs to read and write each other’s memory without CPU intermediation, reducing the latency of gradient synchronisation during training by 2 to 3 times compared to standard Ethernet. Cisco’s Nexus HyperFabric and the broader Ultra Ethernet Consortium standard (of which Cisco is a founding member alongside AMD, Broadcom, and Intel) are attacking the InfiniBand dominance by demonstrating that modern 400G and 800G Ethernet fabrics with RoCE (RDMA over Converged Ethernet) achieve latency performance that is within 15 to 20 percent of InfiniBand in large-cluster training environments — a gap that Cisco argues is more than compensated by the operational advantage of running AI cluster networking on the same Ethernet infrastructure that enterprise customers already manage with Cisco tools, eliminating the need for a separate InfiniBand management layer that requires specialised expertise. The enterprise customer preference for single-vendor networking management is Cisco’s primary commercial advantage in AI networking: the 85 percent of Fortune 500 companies that run Cisco as their primary enterprise networking vendor have a strong default preference for extending that infrastructure into their AI cluster buildouts rather than introducing a new networking vendor and a new operational framework for AI-specific infrastructure. Cloudflare’s AI Gateway and edge inference products operate at the software layer above the physical networking fabric that Cisco provides — both companies are capturing value from the AI infrastructure buildout at different layers of the stack, with Cisco owning the physical transport layer and Cloudflare owning the API management and edge delivery layer above it.

What Cisco AI Defense Adds to the Networking Business

Cisco launched AI Defense in Q1 2026 as a security product specifically designed for enterprises deploying AI applications — addressing the security risks that emerge when employees and developers connect enterprise data to AI APIs (OpenAI, Anthropic, Google) without the visibility, access control, and data loss prevention mechanisms that IT security teams apply to conventional application traffic. AI Defense monitors and enforces policy on AI API calls from within the enterprise network perimeter: when a developer in a finance department submits customer account data to ChatGPT for analysis through an approved productivity tool, AI Defense classifies the data type, applies the enterprise’s data classification policy (marking customer PII as restricted and blocking the API call if the destination model provider’s data handling terms do not meet the enterprise’s compliance requirements), and logs the interaction for audit purposes. This is the same data loss prevention (DLP) function that Cisco’s existing security portfolio applies to email, USB transfers, and web uploads — extended to AI API traffic, which has emerged as the fastest-growing uncategored egress channel in enterprise networks since the commercial deployment of AI productivity tools accelerated in 2024 and 2025. Cisco’s integration of AI Defense into its existing security portfolio means enterprise customers can enforce AI API traffic policies through the same management console they use for all other network security policies, rather than deploying a standalone AI security tool from a new vendor. Palantir’s AIP platform addresses a complementary problem — ensuring that the AI-generated decisions and analytics that enterprises act on are grounded in verified enterprise data rather than model hallucinations — but the governance problem Palantir solves is at the application and decision layer, while Cisco AI Defense solves it at the network transport layer. Gartner’s networking and AI infrastructure research for 2026 projects that AI networking infrastructure — combining AI cluster fabrics, AI security tooling, and AI traffic management — will represent 35 percent of total enterprise networking spend by 2028, up from less than 10 percent in 2024, a trajectory that validates Cisco’s decision to break out AI networking as a separate revenue disclosure and to restructure its product development priorities around the AI data center buildout cycle.

Why the AI Networking Market Allows Cisco to Escape the Hardware Commoditisation Cycle

Cisco’s historical vulnerability in networking hardware has been commoditisation: white-box switching vendors (Arista’s whitebox alternatives, barefoot-based Broadcom-chipset switches programmed with open-source P4) have eroded Cisco’s pricing power in commodity 10G and 25G access-layer switching by offering comparable packet forwarding at significantly lower price per port. AI networking infrastructure is structurally resistant to this commoditisation pressure for two reasons specific to the AI cluster deployment context. First, AI cluster networking performance is directly tied to training throughput — a 20 percent improvement in fabric latency translates to a proportional improvement in training speed for distributed models, which at the scale of a 200,000 GPU cluster like xAI’s Colossus represents hundreds of millions of dollars in compute cost per training run saved or lost depending on fabric quality. Enterprises and hyperscalers buying AI networking infrastructure are willing to pay a meaningful premium for performance and reliability because the cost of a fabric-induced slowdown during a large training run far exceeds the cost difference between a premium and commodity switch. Second, AI cluster networking requires deep integration with GPU vendor drivers, RDMA network libraries, and cluster management software in ways that commodity white-box switches managed by generic open-source software cannot currently support with the same operational reliability as Cisco’s validated HyperFabric stack. Workday’s enterprise software business demonstrates a parallel commoditisation-resistance dynamic: HCM functionality in isolation is available from lower-cost vendors, but Workday’s data moat (1.5 billion skill inferences) and validated compliance workflows justify premium pricing for enterprise HR automation because the cost of errors in payroll, compliance, and headcount planning exceeds the cost of the software. Cisco’s AI networking premium is analogously justified by the training throughput cost of fabric underperformance at scale. The Wall Street Journal’s enterprise technology coverage through Q2 2026 frames Cisco’s AI networking pivot as the most important product strategy shift at the company since its 2015 to 2019 pivot to subscription software — a pivot that reduced Cisco’s hardware revenue dependence but took five years to reflect in financial results, while the AI networking cycle is producing immediate hardware revenue growth in the current quarter rather than requiring a multi-year transition period.

What the East-West Traffic Paradigm Reveals About Enterprise IT’s Mental Model Gap

Don Norman’s central insight in The Design of Everyday Things is that products fail not because users are unintelligent but because the designer’s mental model of how the product works and the user’s mental model of how the task works have diverged. Applied to enterprise AI networking, the east-west GPU traffic problem is exactly this kind of design mismatch — but the gap is not between Cisco’s design model and the user’s task model. It is between the task the AI infrastructure needs to perform and the conceptual framework enterprise IT has spent twenty years developing to think about networking.

Enterprise IT built its networking intuitions around north-south traffic: requests from devices to servers, responses from servers to devices, data moving between premises and the internet. The hierarchy of access, distribution, and core switches was designed for this pattern. Enterprise IT professionals who understand Cisco’s routing and switching architecture fluently are skilled at reasoning about traffic that originates at the edge and terminates at the center. The AI cluster networking problem is the structural inverse: 400 to 800 gigabits per second of simultaneous GPU-to-GPU communication moving laterally across the cluster rather than vertically through a hierarchy. The switches that enterprise IT knows how to configure for north-south traffic are the wrong conceptual tool for east-west cluster fabric — not wrong on technical merit, but wrong as a mental model for understanding where the bottlenecks live and how to diagnose them. An engineer who has optimized north-south latency for fifteen years and then tries to troubleshoot an east-west fabric congestion event will reach for the wrong instruments because the failure mode is in a dimension their mental model does not track.

What Cisco’s HyperFabric product does well from a design standpoint is make the AI cluster networking problem tractable for professionals whose mental models are north-south oriented. HyperFabric’s management interface uses the same Cisco operational framework those professionals already understand — the same CLI patterns, the same monitoring dashboards, the same troubleshooting workflow — while handling the east-west fabric complexity below the operational surface. This is the affordance alignment that commodity east-west networking solutions miss: the technical performance question matters, but the operational model question — how does the team responsible for this infrastructure think about their job — matters more for enterprise buying decisions. Cisco’s AI networking premium is not justified purely by latency benchmarks versus InfiniBand; it is justified by removing the mental model mismatch that makes east-west AI infrastructure management a different discipline from everything enterprise IT already knows how to do.

What Cisco’s AI Networking Revenue at $5 Billion Reveals About the Compounding Pattern of Infrastructure Vendor Advantages

The history of technology infrastructure investing has a recurring pattern: during a major technology transition, the companies that sell the infrastructure enabling the transition generate more certain and more durable returns than the companies building the applications on top of the infrastructure. During the internet buildout of the late 1990s, networking equipment sales compounded for years while internet application companies cycled through boom-and-bust periods that destroyed substantial capital. The lesson that patient investors drew was that the picks-and-shovels approach — owning the infrastructure rather than betting on which application wins — is structurally lower risk during transitions where the winning application is unknown. Cisco’s AI networking revenue crossing $5 billion is a contemporary iteration of the same pattern.

The compounding mechanism that makes infrastructure revenue durable is different from the compounding mechanism that makes application revenue durable. Application revenue depends on continued user adoption, network effects that maintain switching costs, and product innovation that keeps the application relevant as alternatives emerge. Infrastructure revenue depends on replacement cycle length, installation base inertia, and the training and certification moats that make the people who operate the infrastructure a scarce and sticky resource. Enterprise networking equipment typically stays installed for seven to ten years. The enterprise IT teams certified on a networking platform represent accumulated human capital that is difficult to transfer to a competing platform even when the competing hardware is technically comparable. The $5 billion is not just a current revenue figure; it is the foundation of a replacement cycle and a human capital moat that will sustain AI networking revenue for most of a decade regardless of what happens at the application layer.

The patient investor’s view of Cisco at $5 billion in AI networking revenue is that the number is early in a compounding arc whose total duration is determined by when the current wave of AI infrastructure deployment reaches saturation and when the replacement cycle begins. Enterprise AI networking infrastructure deployed in 2025 and 2026 will not be replaced until the early 2030s at the earliest. The revenue certainty embedded in that installed base is a fundamentally different risk profile from the revenue uncertainty in the AI application layer, where competitive dynamics are intense, model performance is converging across providers, and pricing power is declining as the market matures. Five billion dollars in AI networking revenue today, compounding through the installation base and replacement cycle, is a quieter story than any AI model launch. It is also a story whose ending is easier to predict.

29/06/2026
Workday Added AI Agents to Its HCM Platform

Workday Added AI Agents to Its HCM Platform and Enterprise HR Technology Has Entered Its Automation Phase

Workday reported $2.25 billion in Q1 FY2027 revenue (the quarter ending April 2026), a 16 percent year-over-year increase driven by subscription revenue growth in its Human Capital Management and Financial Management cloud products, while simultaneously rolling out its Illuminate AI product layer — which embeds AI agents directly into HR and finance workflows for headcount planning, skills gap analysis, pay equity audits, and dynamic organizational design — to its base of approximately 10,500 enterprise customers. Workday’s investor relations filings for Q1 FY2027 describe Illuminate as the company’s primary product investment priority for FY2027, with Workday allocating over 20 percent of its engineering headcount to AI feature development and targeting full Illuminate capability availability across its core HR and Finance product lines by Q3 FY2027. The commercial significance of Workday’s AI investment is not that it adds AI features to an existing product — every major enterprise software platform has announced AI integrations since 2023 — but that Workday’s HCM platform contains decades of structured organizational data (headcount histories, compensation records, performance ratings, skills inventories, org charts) that serves as the training and context foundation for AI models that are significantly more accurate for workforce-specific tasks than general-purpose LLMs prompted with the same data through an API. A Workday customer asking an AI system to model the headcount impact of a 10 percent revenue target increase can receive a scenario that draws on the organization’s actual role distribution, skill availability, historical headcount change patterns, and compensation benchmarks already stored in Workday — rather than a generic AI response that requires manual contextualization. Enterprise AI deployments at the scale of KPMG’s 276,000-seat implementation demonstrate that the organizations seeing the highest AI productivity returns are those where AI systems have access to structured organizational data — the kind of longitudinal, entity-linked data that Workday’s HCM platform accumulates over years of customer use — rather than those using general-purpose AI assistants over unstructured document repositories.

Workday’s AI differentiation in the HCM market rests on its Skills Cloud — a machine-learning system that maps an organization’s skills inventory by inferring from job titles, role histories, completed projects, certifications, and learning activity which skills each employee has demonstrated or developed, without requiring employees to manually self-report skills data. The Skills Cloud has been in production since 2020, and by 2026 it covers approximately 1.5 billion skill inferences across Workday’s customer base — a dataset that makes Workday’s workforce intelligence products qualitatively different from those of competitors that are building AI features on top of manually-maintained skills records. The Skills Cloud’s practical applications in the Illuminate product layer include internal mobility matching (identifying employees who have the skills needed for an open role without requiring a job application submission), pay equity analysis (identifying compensation gaps between employees with equivalent skill profiles in equivalent roles), and dynamic workforce planning (generating headcount scenarios based on skills supply and demand rather than static role counts). SAP SuccessFactors, Oracle HCM, and ADP each offer competing AI features in their HCM platforms, but none have a longitudinal skills inference dataset comparable in depth to Workday’s, because Workday’s platform has been ingesting structured HR events — role changes, promotions, project assignments, learning completions — from enterprise customers since 2012 and has a compound data accumulation advantage over competitors that built AI layers onto systems designed for data entry rather than continuous organizational intelligence. Salesforce’s Agentforce AI agent deployment in CRM demonstrates the pattern Workday is following in HCM: embedding AI agents that can execute multi-step workflows (schedule interviews, generate offer letters, update org charts) rather than just generate text responses, which is the operational automation tier that separates AI features that add to employee workload from AI features that reduce it.

What Workday’s Agentic HR Features Can Execute Without Human Approval

Workday’s Illuminate AI agent framework introduces a distinction that is commercially important for enterprise HR procurement: the division between AI-assisted workflows (where an AI generates a recommendation that a human reviews before action is taken) and AI-agentic workflows (where an AI executes a defined business rule without human review in the loop, except in cases flagged as exceptions). For routine, policy-constrained HR transactions — a leave of absence approval that meets the eligibility criteria defined in the company’s leave policy, a standard merit increase within the band defined for the employee’s job grade and performance rating, an onboarding task sequence completion notification — Illuminate’s agent mode can complete the transaction end-to-end without a manager or HR business partner reviewing the individual transaction. Workday’s enterprise customers define which workflows are eligible for agent-mode execution versus which require human-in-the-loop review, with Workday providing recommended exception criteria based on its cross-customer data on which transaction types generate reversal requests (an indicator that the human review step adds meaningful value) versus which almost never generate reversals (an indicator that the AI decision is consistently aligned with human judgment and the review step is pure overhead). Big tech’s workforce restructuring to fund AI investment has created a specific demand signal for Workday’s agentic HR capabilities: companies reducing their HR business partner headcount while growing their employee base need HR administration that can scale without proportional headcount growth, and AI agent automation of routine transactions is the mechanism that makes that ratio change operationally feasible. Gartner projects that by 2027, 30 percent of enterprise HR transactions that currently require manual processing will be fully automated by AI agent systems operating within policy guardrails — a projection that Workday’s Illuminate architecture is designed to capture at the platform layer rather than cede to point solutions or system integrators building automations on top of existing HRIS data. Gartner’s Human Capital Management research coverage positions Workday as a Leader in the HCM suite Magic Quadrant for 2026 with the highest score on completeness of vision, reflecting its AI integration roadmap and Skills Cloud data advantage over competitors whose AI features are add-ons rather than architecturally integrated with core data models.

Why Workday’s Competitive Position Depends on the Data Moat Holding

The strategic risk to Workday’s AI investment is not that SAP SuccessFactors, Oracle HCM, or Rippling will build better AI features in the next 12 months — it is that the general-purpose AI infrastructure (foundation models accessible via API, plus enterprise data integration tools like Snowflake or Databricks that can expose HR data to any LLM) could allow a new entrant to offer comparable AI workflow automation without Workday’s decade of structured HR data accumulation. Rippling, the fastest-growing HCM competitor in the US mid-market segment, has explicitly positioned its product architecture as a data integration layer that can connect any AI model to any HR data system — a strategy that bets the workforce intelligence use case can be solved at the integration layer rather than requiring Workday’s native data structure. Rippling’s approach works for organizations willing to invest in configuring the integration layer; Workday’s advantage is that its data model is already structured for workforce intelligence without integration work, making time-to-value for AI features shorter for enterprises that already have Workday as their system of record. OpenAI’s enterprise deployment consulting model represents an alternative AI delivery mechanism — where a consulting and integration layer translates general-purpose AI model capability into enterprise workflow automation — that competes with Workday’s native AI features for the same enterprise budget, but at a higher implementation cost and with less native integration into the transactional HR data that Workday already manages. Workday’s $2.25 billion quarterly revenue run rate, its 95 percent subscription revenue gross retention, and its 10,500 enterprise customer base give it the financial stability to sustain its AI infrastructure investment through a multi-year product transition — an investment cycle that pure-play AI application startups in the HCM space cannot match at comparable scale. The Wall Street Journal’s enterprise technology coverage through Q2 2026 characterizes Workday’s Illuminate rollout as the HCM market’s clearest example of incumbent enterprise software platforms using their proprietary data assets to resist AI-native startup disruption — a defense that is more durable than feature parity alone because it requires a competitor to replicate not just Workday’s technology but also the multi-year data accumulation that enterprise customers have contributed to Workday’s platform through their normal HR operations.

What Enterprise HR Technology Buyers Are Actually Discovering When Agentic AI Arrives

Marty Cagan’s product discovery discipline asks teams to separate what customers request from what customers actually need — and to build solutions for the latter rather than the former. Applied to Workday’s agentic HR features, the discovery process that enterprise buyers are now running reveals a set of unspoken needs that are structurally different from what the software-evaluation criteria captured during procurement.

The first discovery is about data quality. Enterprise HR buyers chose Workday for its system-of-record reliability — clean employee data, consistent position management, accurate payroll integration. What they are discovering under agentic AI deployment is that the data model they trusted for structured queries becomes a liability when an agent must make contextual decisions. An AI agent that routes a leave request, adjusts a headcount plan, or flags a performance anomaly is drawing inferences from data that HR teams know has gaps, inconsistencies, and timestamp errors that never mattered when a human manager reviewed the same record. The agentic phase exposed a data quality problem that existed before the AI arrived but was invisible until the AI had to act on it.

The second discovery is the approval-boundary problem. Workday’s agentic feature set requires enterprise customers to specify which actions the AI can execute autonomously and which require human approval. That specification looks like a product configuration question. It is actually a cultural and organizational policy decision about where accountability for HR decisions resides — a question most companies have never formally answered because a human has always been in the loop by default. The third discovery is more structural: the people whose jobs are most disrupted are not HR coordinators, but the employees who served as translation layers between the system’s data model and what business managers actually needed. Those informal interpreters — HR business partners, payroll specialists, operations coordinators — absorbed the gap between what Workday could produce and what the organization needed to know. Agentic AI narrows that gap, which makes those translation roles visible as costs rather than as capabilities. Genuine product discovery in enterprise HR AI means surfacing all three of these before deciding what to build.

What Workday’s Agentic AI Adoption Rate Would Actually Show If the Company Disclosed the Denominator

Workday’s agentic AI headline figure — 40 million routine HR tasks automated monthly across its customer base — is a numerator without a denominator. The missing denominator is the total number of tasks in those workflow categories across Workday’s 10,500 enterprise customers. Without it, the figure is a point estimate that tells you the autonomous execution count reached a particular threshold; it does not tell you the proportion of possible tasks that are being delegated to agentic execution versus remaining in human approval queues. A probabilistic model of enterprise software adoption suggests the gap between the numerator and the realistic denominator is large.

Enterprise HR software adoption follows a well-documented distribution pattern. Procurement decisions occur months or years before deployment depth, and feature adoption within enterprise platforms stratifies sharply by customer size, industry, and internal IT sophistication. If Workday’s autonomous execution distribution follows the typical enterprise software pattern, the bulk of the 40 million tasks likely concentrates in a small percentage of customers — large enterprises with mature Workday implementations, sophisticated HR technology teams, and high organizational trust in AI-executed decisions — while the majority of customers use the feature at a fraction of its theoretical capacity.

The specific data Workday would need to disclose for probabilistic evaluation is straightforward: autonomous execution rate by customer tier (SMB, mid-market, enterprise), by task type (payroll exceptions, PTO approvals, onboarding workflows, compliance flag resolution), and by human override rate (what proportion of autonomous decisions are subsequently overridden or queried by HR administrators). These numbers would tell you whether the agentic adoption story is broad and shallow or narrow and deep. Broad-and-shallow suggests a marketing-ready feature; narrow-and-deep suggests a genuine operational transformation in a small segment with a plausible path to wider adoption.

The headline statistic is not false. It is a selectively reported numerator that is technically accurate and strategically incomplete. The 25% revenue growth Workday delivered in Q4 FY2026 is real and suggests the platform value proposition is holding. Whether agentic AI is a structural component of that growth or a feature that enterprise buyers value in renewal negotiations without deploying deeply is the question the 40 million tasks figure does not answer. The denominator would.

26/06/2026
Palantir Crossed $1 Billion in Quarterly Revenue

Palantir Crossed $1 Billion in Quarterly Revenue and AIP Has Become the Enterprise AI Decision Layer

Palantir Technologies reported $1.1 billion in Q2 2026 total revenue — the company’s first quarter above $1 billion and a 35 percent year-over-year increase that reflected accelerating commercial adoption of its Artificial Intelligence Platform (AIP) across US enterprise customers in manufacturing, healthcare, energy, and financial services verticals. Palantir’s investor relations disclosures for Q2 2026 show US commercial revenue reaching $490 million for the quarter (up 55 percent year-over-year), with US commercial customer count increasing to 465 from 262 in the same period one year prior — a growth trajectory driven by the AIP Bootcamp deployment methodology that Palantir introduced in mid-2023 as a structural change to how enterprises evaluate and adopt the platform. The AIP Bootcamp format — a five-day intensive engagement in which a Palantir team works with a client’s operational staff to build working AI-powered workflows on live production data within the client’s existing systems — compresses the enterprise software sales and proof-of-concept cycle from the 12 to 24 months typical for complex enterprise platform adoption to a single week that produces demonstrable operational output. The bootcamp model has proven particularly effective in manufacturing and industrial operations, where the gap between the data a company generates and the decisions it can act on with that data is large enough that a week of AIP workflow construction produces measurable throughput or cost improvements that justify multi-year platform contracts. Palantir’s government revenue segment — historically the company’s revenue base, covering US Department of Defense, intelligence community, and allied government contracts — reached $610 million in Q2 2026, growing more slowly (15 percent year-over-year) as the commercial segment has expanded to represent a larger share of total revenue.

What makes AIP commercially distinctive in the enterprise AI software market is the ontology layer — Palantir’s proprietary data modeling system that maps an organization’s operational entities (assets, personnel, workflows, decisions) into a structured data graph that AI systems can query and act on without requiring the client to restructure its underlying data infrastructure. Every enterprise that attempts to deploy AI on operational workflows faces the same foundational problem: the data relevant to a decision is scattered across multiple systems (ERP, CRM, MES, IoT sensors, logistics platforms) that were not designed to be queried together, and building a unified data layer is typically a multi-year data engineering project that precedes any AI deployment. Palantir’s ontology layer solves this by creating a semantic representation of the enterprise’s operations on top of existing systems without requiring data migration — the ontology maps where each piece of operational data lives and what it means in business terms, allowing AIP’s workflow tools to compose queries and actions across systems that have never interoperability. Enterprise AI deployments at the scale of KPMG’s 276,000-seat implementation demonstrate the range of approaches enterprises are taking to AI integration — from API-level model access at scale to platform-level operational workflow embedding — with Palantir’s approach sitting at the more deeply integrated end of the spectrum, where the AI system has direct access to operational data and decision workflows rather than acting as a text generation assistant layered over existing processes. The depth of integration that Palantir’s ontology enables is also the source of its sales cycle complexity: clients who adopt AIP are effectively committing to Palantir’s data modeling approach as the operational data layer for their business, a decision that requires more evaluation time than a seat-license productivity tool but produces a harder-to-displace position once adopted.

How AIP Bootcamp Changed the Enterprise AI Software Sales Model

The AIP Bootcamp format inverted the conventional enterprise software sales motion — which typically involves a multi-month request for proposal process, a structured proof of concept on synthetic or historical data, and a contract negotiation before any operational value is delivered — by front-loading the operational demonstration before the sales process concludes. In a standard bootcamp engagement, Palantir brings a team to the client site on day one with AIP already connected to the client’s production data systems (via connectors to SAP, Salesforce, Oracle, and major cloud platforms). By day three, operational staff who have never used Palantir’s tools are building AI-assisted decision workflows — inventory optimization routines, predictive maintenance triggers, logistics exception management — on live data from their actual operations. By day five, the workflow outputs are compared to historical baseline performance, and the quantifiable improvement becomes the basis for the contract discussion. The model works because it shifts the burden of proof from Palantir’s sales team to the client’s own operational data: the AI is not demonstrated on a curated demo environment but on the actual data the client works with, including the messiness, inconsistencies, and edge cases that enterprise data contains. OpenAI’s enterprise deployment company model represents a different approach to the same problem — building a consulting-adjacent service layer that handles enterprise integration complexity for clients who want to use frontier AI models but lack the in-house capacity to build operational integrations. The bootcamp model’s commercial success has prompted Microsoft Copilot Studio, ServiceNow, and other enterprise AI platform vendors to develop accelerated proof-of-concept formats that attempt to replicate Palantir’s compressed evaluation timeline, though without the proprietary ontology layer that makes AIP’s operational data integration distinctive.

What Palantir’s Commercial Growth Reveals About the Enterprise AI Decision Platform Market

Palantir’s Q2 2026 results reflect a broader shift in how enterprises are thinking about AI investment — moving from the productivity layer (AI assistants that help individual workers draft, summarize, and code faster) to the decision layer (AI systems that synthesize operational data and propose or execute decisions within business processes). The productivity layer market is dominated by Microsoft Copilot, Google Workspace AI, and Salesforce Einstein, all of which are distributed through existing enterprise software relationships and are measured in per-seat adoption rates. The decision layer market — where AIP competes — is measured in operational workflow coverage: what percentage of a company’s recurring decisions (which supplier to use, which maintenance task to prioritize, which shipment to reroute) are handled by AI-augmented systems rather than human-only judgment. Palantir’s commercial customer base skews toward industries where recurring operational decisions involve large amounts of structured sensor, logistics, or clinical data — manufacturing, energy, mining, healthcare — rather than the knowledge-work environments where productivity-layer AI excels. Big tech’s $725 billion AI infrastructure commitment is funding the model capability layer that both the productivity and decision tiers depend on, but Palantir’s commercial model captures value at the integration and workflow layer rather than the model layer — a position that insulates it from the commoditization pressure that is compressing margins at pure-play foundation model companies. Gartner’s analytics and AI platform research for 2026 positions Palantir as a Leader in its AI Decision Intelligence magic quadrant — a designation reflecting completeness of vision and execution ability in a market that Gartner defines as combining real-time operational data integration, AI-assisted decision workflow automation, and human-in-the-loop oversight infrastructure. Financial Times technology coverage through Q2 2026 frames Palantir’s commercial revenue inflection as evidence that the enterprise AI market is entering a second phase — beyond the initial experimentation period of 2023-2024, in which most enterprises ran pilots without committing to production deployment, toward a production deployment phase in which operational AI systems are being built and measured against hard performance metrics in the environments where a company’s actual revenue and cost structure live. The $1 billion quarterly revenue threshold positions Palantir as one of the few AI-first software companies that has converted the enterprise AI investment cycle into durable, contracted revenue at scale.

What Palantir’s Revenue Milestone Reveals About the Enterprise AI Adoption Curve

Shane Parrish’s second-order thinking framework asks not what happened but what will happen next as a consequence of what happened. The first-order read on Palantir crossing $1 billion in quarterly revenue is straightforward: enterprise AI software is a large and growing market, AIP is working, the commercial business has scaled. The second-order question is more interesting: what does the AIP Bootcamp sales model reveal about how enterprise AI adoption will actually progress across the broader market over the next three years?

The Bootcamp model does something that conventional enterprise software sales cannot do efficiently: it identifies, within a customer organization, which internal champions have genuine cross-departmental authority to move an AI deployment from pilot to production. Most enterprise software sales fail at that identification step — the wrong champion is selected, the deployment stalls in a single department, and the vendor gets a referenceable pilot that never expands to contract-level revenue. Palantir’s bootcamp forces the customer to field its own people against a live deployment problem, which surfaces the actual internal power structure around AI decisions within 72 hours. That intelligence compounds: Palantir now has a systematic method for identifying deployable champions across verticals, which reduces its cost-per-closed-deployment as the dataset of champion archetypes grows.

The third-order effect is slower but matters more at the category level. Companies that completed AIP Bootcamp in 2023 and 2024 are now generating operational case studies — specific AI pipelines deployed in manufacturing quality control, financial compliance reporting, supply chain disruption detection — that are landing in the vendor evaluation processes of companies in the same vertical who haven’t yet committed to an AI decision platform. Those case studies lower the activation energy for the next buyer by demonstrating that the deployment problem is solvable in their specific context, not just in the generic “enterprise AI” abstraction. The $1 billion milestone is, by the time it is announced, a lagging indicator. The leading indicator is the accumulating library of same-vertical deployments that makes every subsequent sale faster and more defensible than the one before it.

What the Internal Champion Story Reveals About How AIP Actually Lands Inside Enterprise Operations

Ann Handley’s framework places the audience — not the product — at the center of every communication decision. Applied to Palantir’s AIP Bootcamp model, the insight is to ask not what the Bootcamp delivers to Palantir’s sales team but what it delivers to the individual inside the customer organization who is going to live with the consequences of an AIP deployment for the next five years. That person is the internal champion — usually an operations or data engineering lead, not the CIO. Understanding that person, in Handley’s terms, requires understanding what they need to believe before they commit professionally to a platform that will reshape how their team works.

The Bootcamp’s five-day format does something that no conventional enterprise software evaluation process does: it exposes the internal champion to a working deployment on their own data within 72 hours. This is categorically different from a polished vendor demo on a curated data environment. The champion sees their actual messy production data — the duplicate records, the missing fields, the system integration failures that the ERP and MES generate daily — transformed into an operational decision workflow within a week. That experience is not primarily commercial; it is professional. The champion can point to something they built with their team’s data that works. That is the first moment when the AI deployment stops being a vendor conversation and becomes a career narrative — a story the champion can tell to skeptical colleagues not as advocacy for a vendor but as a report on what their team accomplished.

The audience Palantir is actually communicating to, in Handley’s frame, is not the board that approves the contract — it is the champion who will implement the deployment and justify it to colleagues who were skeptical during the evaluation. When the Bootcamp produces a working workflow, it gives the champion the internal story they need: “we ran this on our data in the first week and here is what it showed.” That internal story is the mechanism through which the Bootcamp drives contract conversion, not the vendor’s sales pitch. AIP’s 55 percent US commercial revenue growth reflects not just more customers but more internal champions who left the Bootcamp with that story ready to tell. The audience Palantir must serve to sustain that growth rate is not the enterprise procurement committee — it is the operational leader whose professional credibility is now attached to the deployment outcome and who needs every subsequent quarter to confirm the decision was right.

24/06/2026
Salesforce Agentforce Is Generating Real Enterprise AI Revenue

Salesforce Agentforce Is Generating Real Enterprise AI Revenue

Salesforce reported $9.8 billion in revenue for its fiscal Q1 2027 (ending April 2026) — up 8 percent year-over-year — with Agentforce, its AI agent platform for autonomous customer service, sales, and operations workflows, contributing to the acceleration of its Data Cloud and AI segment from a negligible revenue line to approximately $900 million in annualised recurring revenue. Salesforce’s Q1 FY2027 earnings disclosures show Agentforce-enabled deals accounting for a growing share of new business bookings — management stated that deals including Agentforce close at a higher average contract value than equivalent Salesforce platform deals without Agentforce, and that customer expansion rates on Agentforce accounts are running above the company’s historical expansion rate for similar customer cohorts. The commercial signal is the clearest validation Salesforce has produced for its AI platform bet since the original Agentforce announcement in September 2024.

Agentforce represents Salesforce’s answer to the question of where the enterprise CRM market goes after conventional software automation has been fully deployed. Salesforce’s core products — Sales Cloud, Service Cloud, Marketing Cloud — have been workflow automation platforms for two decades, helping companies manage customer relationships through structured processes and data capture. Agentforce extends that model into autonomous action: rather than automating a defined workflow where a human specified each step, Agentforce agents can interpret customer inquiries, pull relevant data from Salesforce’s Data Cloud, take actions (send emails, update records, create cases, schedule meetings), and escalate to human agents when the situation requires judgment beyond the agent’s configured scope. The distinction between conventional CRM automation and AI agent automation is the difference between a pre-programmed playbook and an agent that reads the situation and determines the appropriate next step. Multi-agent enterprise orchestration across the broader enterprise AI market has established that agentic AI workflows require orchestration infrastructure — Salesforce’s advantage is that it built its orchestration layer on top of the CRM data where most enterprise customer-interaction records already live.

What Agentforce Does in the Customer Service Layer

Agentforce’s highest adoption to date is in customer service — the business function where the volume of routine inquiries is highest and the cost of human agent time is most measurable. A customer service agent handling billing inquiries, order status questions, subscription changes, and account updates spends the majority of their working hours on queries that follow predictable patterns with well-defined resolution paths. Agentforce handles those queries autonomously — reading the customer’s history in Salesforce Service Cloud, identifying the appropriate resolution, executing the resolution (issuing a refund, changing an address, extending a subscription), and closing the case — without human involvement. The autonomous resolution rate that Salesforce customers are reporting for Agentforce-handled service volumes ranges from 40 to 70 percent depending on the complexity distribution of the query type, with human escalation handling the remainder.

The commercial case for customer service AI agents is the most straightforward in enterprise AI: the cost of a human agent handling a routine inquiry is typically $8-15 per interaction; the cost of an AI agent handling the same inquiry on Salesforce’s platform is approximately $0.50-2.00 depending on data retrieval and model call volume. An enterprise running 500,000 monthly service interactions that shifts 50 percent to AI agent handling reduces its service cost by $2-4 million per month while maintaining resolution quality for the inquiry types within the agent’s autonomous capability. Those economics are generating purchase decisions that do not require complex ROI modelling: the payback period is short enough that procurement teams can approve Agentforce without extensive internal analysis. Enterprise AI deployment at scale in professional services has demonstrated the same cost-displacement economics in knowledge work — Agentforce is producing the same dynamic in customer-facing service operations. Gartner’s AI customer service research projects that AI agents will handle 70 percent of routine enterprise customer service interactions by 2027, with Salesforce, ServiceNow, and Microsoft positioned as the primary platform vendors to capture that shift.

How Agentforce Competes With Microsoft Copilot

Microsoft’s Copilot for Dynamics 365 — its AI agent layer for enterprise CRM and ERP — is Agentforce’s most direct competitive threat in the enterprise market. The two products target the same enterprise buyer: companies managing large customer-facing teams who want AI to handle routine interactions and augment human agents on complex ones. The differentiation between the two is primarily in ecosystem affinity: companies already running Salesforce Sales Cloud, Service Cloud, and Marketing Cloud have a lower integration cost for Agentforce than for switching to Dynamics 365 and Copilot; companies already running Microsoft 365 across their organisation have a lower total-cost-of-ownership argument for Copilot given the licensing bundle advantages Microsoft offers. Enterprise CRM decisions in 2026 are consequently less about which AI agent product is superior in isolation and more about which CRM ecosystem the organisation is already committed to.

Salesforce’s response to the Microsoft bundling threat has been to expand Agentforce’s interoperability — announcing integrations with Slack (already a Salesforce property), Google Workspace, and Microsoft Teams — and to emphasise Data Cloud’s role as the source-of-truth data layer that makes Agentforce agents knowledgeable about the customer. The argument is that Salesforce holds more customer data in more enterprises globally than Microsoft Dynamics does, and that AI agents operating from more complete customer context produce better outcomes than agents with partial data access. Whether that data breadth advantage translates to measurable agent quality differences in production deployments is a question that enterprise buyers are evaluating through proof-of-concept projects in 2026. OpenAI’s enterprise deployment consulting arm has partnered with Salesforce customers on Agentforce implementations, which reflects the broader pattern of AI platform vendors partnering with model providers rather than building proprietary models — Agentforce agents run on multiple foundation models including OpenAI’s GPT series and Anthropic’s Claude depending on the task type and customer preference. TechCrunch’s Salesforce coverage through Q2 2026 documents the Agentforce customer base expanding beyond Salesforce’s traditional mid-market into Fortune 500 enterprise accounts where per-seat contract values are substantially higher.

The Sales Cloud AI Layer and What It Adds to the Product

Beyond customer service, Salesforce has deployed Agentforce capabilities into its Sales Cloud product — the CRM that manages pipeline, opportunity tracking, and account management for B2B sales organisations. Agentforce in the sales context operates as a sales coaching and next-best-action layer: analysing deal history, email correspondence, meeting notes, and competitive intelligence in Data Cloud to recommend specific follow-up actions for each opportunity in the pipeline. The product does not close deals autonomously — the judgment and relationship management that enterprise B2B sales requires remains human — but it surfaces the data patterns that experienced sales managers would identify manually, faster and more consistently than any human manager can across a large sales team.

The commercial uptake of AI in the sales workflow has been slower than in customer service because the ROI is less directly measurable. Customer service automation has a clear cost-per-interaction metric that allows ROI calculation without ambiguity. Sales productivity is more multivariable: whether an AI recommendation contributed to a deal closing is difficult to isolate from the many other factors that affect B2B sales outcomes. Salesforce addresses this measurement challenge by tracking win rate and deal velocity changes between Agentforce-assisted and non-assisted pipeline cohorts within the same customer organisation — a controlled comparison that has shown statistically significant improvements in the accounts Salesforce has published as case studies. The measurement approach is credible but curated: the published case studies represent customer organisations with strong data hygiene and well-configured Salesforce implementations, where the AI’s recommendations can draw on complete and reliable customer history. In organisations with fragmented data and inconsistent CRM adoption, the AI recommendations are less reliable, which is why Salesforce’s enterprise Agentforce sales process includes a Data Cloud readiness assessment before Agentforce deployment commitments are made.

What Salesforce’s Agentforce Revenue Figure Actually Measures and What It Does Not

The “$1 billion in Agentforce ARR” figure that Salesforce disclosed is a useful benchmark for one question and a misleading answer to several others. The useful question it answers is whether enterprise buyers are willing to add an AI agent line item to their Salesforce contract — and the answer is yes, at scale. The questions it does not answer include: at what stage of deployment are the Agentforce contracts that make up that ARR; what proportion of Agentforce customers have moved past pilot into production workflows; and what the renewal rate looks like at 12-18 months. Enterprise software ARR is a leading indicator of deployment intent, not a lagging indicator of successful deployment. A billion dollars in Agentforce contracts tells you that enterprise buyers are signing; it says nothing about whether the agents being deployed are actually replacing the human labour hours they were sold on replacing.

Nate Silver’s framework for reading data carefully applies directly to enterprise AI adoption reporting: the signal that matters is not the headline number that the company chose to disclose, but the underlying metric the headline number is proxying for — and whether the proxy is a good one. Agentforce ARR is a measure of enterprise willingness to pay for AI agent access. The underlying metric Salesforce cares about is whether Agentforce is generating measurable operational outcomes — reduction in customer service headcount, increase in resolved tickets per agent hour, measurable improvement in lead-to-close conversion — that justify the renewal decision 18 months after signing. Those numbers would tell you whether Agentforce is genuinely transforming customer service operations or whether it is a new technology budget line that enterprise buyers added to appear current with the AI cycle.

Salesforce has not disclosed the operational outcome data, and it is unlikely to disclose it until the numbers are large enough and consistent enough to be more useful as marketing than they are dangerous as a confession of deployment immaturity. What the $1 billion ARR figure does confirm is that Salesforce has successfully positioned Agentforce as a credible AI investment category for enterprise procurement committees — a non-trivial commercial achievement given the scepticism with which enterprise IT budgets typically treat first-generation AI products. Whether that positioning converts into a durable revenue stream or into a cohort of non-renewals 18 months from now is the question the ARR figure cannot answer, and the question that determines whether Agentforce is a genuine business or a product cycle beneficiary. The ARR number is where the story starts; the renewal cohort at 18 months is where it ends, or doesn’t.

21/06/2026
Qualcomm’s AI PC Chip Found Its Market in the Second Year

Qualcomm’s AI PC Chip Found Its Market in the Second Year

Qualcomm’s Snapdragon X Elite and Snapdragon X Plus processors — launched in Windows AI PC devices starting mid-2024 — are projected by IDC to account for approximately 20 percent of premium Windows laptop shipments in 2026, up from under 5 percent in the first two quarters of commercial availability. Qualcomm’s investor relations disclosures show PC and IoT revenue growing for the fourth consecutive quarter in Q1 2026, reversing a multi-year decline in Qualcomm’s PC business that reflected the failure of earlier Windows-on-ARM attempts (Snapdragon 850, 8cx) to reach commercial traction. The second-year acceleration follows a pattern consistent with ARM-based platform transitions: the first generation tests the market and establishes a software baseline; the second generation captures the early adopters who waited for software compatibility to mature; the third generation achieves mainstream enterprise deployment. Snapdragon X is now in the second phase of that cycle.

The first-generation AI PC launch in mid-2024 struggled with application compatibility gaps that were predictable given Windows-on-ARM’s history. Professional applications — Adobe Creative Suite, enterprise productivity tools, development environments — required ARM-native versions or relied on emulation that reduced performance below the hardware’s native capability. Qualcomm and Microsoft both knew this going into the launch, and committed to a software certification programme that would close the compatibility gap by mid-2025. By Q1 2026, the certification list covers the applications that account for the majority of enterprise professional workload hours, and Snapdragon X devices are entering enterprise procurement cycles that had been waiting for that coverage. ARM architecture’s commercial maturation in data center deployments has provided enterprise IT departments with a reference point for how ARM platform transitions work at scale, which has reduced the risk perception that slowed enterprise AI PC evaluation in 2024.

What Snapdragon X Elite Actually Fixed From the First Generation

Snapdragon X Elite addressed three specific technical shortcomings that defined the commercial ceiling of earlier Windows-on-ARM chips. The first was thermal performance: earlier Snapdragon PC processors ran at their rated performance levels only for short burst periods before thermal throttling reduced clock speeds below the levels Intel Core Ultra and AMD Ryzen AI Series chips sustain under sustained load. Snapdragon X Elite’s Oryon CPU cores — developed by the team Qualcomm acquired through its Nuvia purchase — maintain their rated performance under sustained workloads through a combination of architectural efficiency improvements and improved thermal management design. The result is that Snapdragon X Elite outperforms Intel Core Ultra equivalents on sustained multi-threaded tasks including video export, code compilation, and large dataset analysis.

The second fix was memory bandwidth. AI model inference on-device requires moving large amounts of data between the processor and memory continuously, and Qualcomm’s LPDDR5X integration in Snapdragon X provides memory bandwidth that Intel and AMD’s current-generation laptop chips cannot match without moving to higher-cost memory configurations. The Hexagon NPU on Snapdragon X — Qualcomm’s neural processing unit — delivers on-device AI inference performance that exceeds comparable Intel and AMD solutions on the AI workloads that Microsoft has defined as the Copilot+ PC baseline: live captions, real-time translation, image generation, recall-based search across local content. The third fix was battery life: Snapdragon X Elite devices consistently achieve 18-22 hours of mixed-use battery life, versus 12-16 hours for comparable Intel Core Ultra devices — a difference that is the primary sales argument for business travellers and field workers who represent the highest-value segment of the premium laptop market. Intel’s foundry reset and the delays in its competing AI PC chip roadmap have extended the window in which Qualcomm’s performance-per-watt advantage translates to a sales argument without a competitive hardware response.

Where AI PC Adoption Is Actually Concentrating

Enterprise AI PC adoption in 2026 has concentrated in four professional categories where battery life, on-device AI processing, and application compatibility align: field service, sales and account management, creative production, and software development. Field service and sales roles share the battery life requirement — professionals who spend eight or more hours away from a power source cannot tolerate a device that requires midday charging. Creative production is concentrating on Snapdragon X because of the Adobe Creative Suite ARM-native releases that shipped in late 2025, which deliver the full performance headroom of Qualcomm’s NPU for AI-accelerated features including Generative Fill, neural filters, and Auto Reframe. Software development has been slower to adopt — development environments and build tools were among the last to complete ARM-native certification — but the combination of high single-threaded performance and long battery life is beginning to make Snapdragon X devices attractive for engineers who work remotely.

Consumer adoption has followed a simpler selection mechanism: Snapdragon X Copilot+ PCs are marketed by Microsoft as the only Windows PCs capable of running the full Copilot+ feature set, including Recall (Windows’ AI-powered memory search for local content), real-time translation, and live captions powered by the local NPU rather than cloud inference. The Copilot+ branding creates a clear product tier distinction in retail that sends consumers who want the full Windows AI feature set toward Snapdragon X (and Intel and AMD Copilot+ certified devices that meet the same NPU minimum spec). Microsoft Surface’s Snapdragon X line has served as the reference design for the platform — Surface Pro 11 and Surface Laptop 6 on Snapdragon X have received positive critical reception and established the performance baseline that OEM partners including Dell, HP, Lenovo, Samsung, and Asus have replicated in their own Snapdragon X portfolios. IDC’s AI PC market research projects Snapdragon X-based devices growing to 22 percent of premium Windows laptop shipments in the second half of 2026 as enterprise refresh cycles align with Copilot+ procurement timelines.

Battery Life as Qualcomm’s Market Argument Against Intel

Intel’s response to Snapdragon X has been the Core Ultra 200V series (Lunar Lake), which closed the efficiency gap significantly from Intel Core Ultra 100H but has not matched Snapdragon X Elite’s sustained performance-per-watt in the independent benchmark comparisons that enterprise procurement teams use for device qualification. Intel’s Core Ultra 200V devices achieve 14-18 hours of battery life in mixed productivity workloads, compared to Snapdragon X Elite’s 18-22 hours — a meaningful gap that is difficult to address without architectural changes rather than process node optimisation alone. Intel’s next-generation Panther Lake architecture, targeting mid-2026 production on its 18A process node, is positioned to close or reverse the efficiency gap, but the transition timeline and yield performance at scale remain variables.

Qualcomm’s pricing position for Snapdragon X has evolved from the launch positioning, when devices carried a premium that reflected both the new architecture and the limited software ecosystem. By mid-2026, Snapdragon X Plus — the mainstream tier below Snapdragon X Elite — has enabled a new category of devices at $999-1,199 price points that were previously occupied only by Intel Core Ultra 100-series hardware. The pricing compression has expanded the addressable market from premium devices above $1,400 toward the mainstream business laptop segment, which is where enterprise volume procurement decisions concentrate. TSMC’s process roadmap supplies Snapdragon X on the 4nm node, with Snapdragon X2 (expected late 2026 or early 2027) targeting a 3nm process that will further improve the performance-per-watt ratio before Intel’s Panther Lake can respond in volume at comparable pricing.

The Windows AI PC Market Through the Second Half of 2026

The Windows AI PC category defined by Microsoft’s Copilot+ specification has expanded from its Snapdragon X-exclusive launch position to include Intel Core Ultra 200V and AMD Ryzen AI 300 series devices, which meet the 40 TOPS NPU minimum for Copilot+ certification. The expanded hardware base means that Snapdragon X no longer has an exclusive claim to the Copilot+ feature set — but it retains exclusive claim to the combination of Copilot+ capability and the battery life numbers that differentiate it from Intel and AMD alternatives. Enterprise procurement that prioritised Copilot+ compliance as the primary selection criterion now has multiple hardware options; enterprise procurement that prioritises battery life as the primary criterion still points to Snapdragon X.

Qualcomm’s PC business has grown from a minor revenue contributor to a meaningful segment within its non-handset diversification strategy, and the AI PC cycle is the most commercially significant PC-market position the company has held since its early 3G modem integrations. The competitive dynamic through 2026 and 2027 will be defined by whether Intel’s Panther Lake delivers on its efficiency claims in time to erode the battery-life advantage before enterprise AI PC refresh cycles complete — and whether AMD’s Ryzen AI 300 series can mount a sufficient challenge in the premium segment that Qualcomm has captured. Both are genuine competitive risks. What is not a risk is the existence of the market: enterprise buyers have validated that on-device AI processing, local inference capability, and extended battery life are features they are willing to pay a premium for in laptop hardware, which is the foundational commercial confirmation that Qualcomm’s AI PC strategy needed to justify continued investment in the platform.

The Second Year Is When Users Tell You What You Actually Built

Don Norman’s fundamental argument in human-centered design is that the person who designs a product and the person who uses it have different mental models of what the product is for — and the user’s mental model always wins. The product does not get to decide its own use case. The user decides, by the act of using it, what problem the product actually solves.

Qualcomm’s Snapdragon X AI PC story is a near-perfect case study in this principle. The product Qualcomm launched in mid-2024 was designed around a specific set of AI acceleration capabilities — NPU benchmarks, on-device inference performance, Windows AI SDK integration. The marketing positioned the chip as the platform for a new category of AI-native Windows application.

What enterprise buyers actually purchased it for, as the second-year sales data reflects, was battery life and notebook-weight reduction at premium tiers. The AI inference capability was the reason Qualcomm built the chip; the battery life and portability were the reasons enterprises bought it. Those are not the same product.

The design principle this illustrates is what Norman calls the gap between the system model (what the designer thinks the product does) and the user’s conceptual model (what the user believes the product does based on actual experience). When those models diverge, the product either fails or finds an unexpected market. In Qualcomm’s case, the unexpected market turned out to be the dominant one: enterprise IT procurement managers evaluating AI PC refresh cycles cared about power efficiency first and AI inference capability second. The AI positioning became the permission slip to charge a premium; the battery life was the reason procurement approved the request.

The second year validated the product not by confirming the system model but by revealing the user model. What Qualcomm built was an efficient ARM-architecture Windows chip that happened to have strong AI acceleration. What enterprises bought was the efficient chip, and the AI capability came along for the ride. The design lesson for any AI hardware launch is that the spec sheet tells you what the product can do; the second year’s sales data tells you what it is.

Why AI PC Is a Platform Category and Not a Chip Specification

Reed Hastings’s operating principle at Netflix was that the durable competitive advantage was never in the current technical delivery mechanism — not the DVD, not the streaming codec, not the compression quality — but in the platform relationship that accumulated with subscribers over time and made the prior delivery mechanism irrelevant. Applied to Qualcomm’s Snapdragon X commercial traction, this frame separates what is happening in the AI PC market from what the market commentary usually says is happening.

The NPU benchmark competition — which chip has the most TOPS of on-device AI compute — is the equivalent of Netflix’s early streaming debates about video bitrate and buffer time. Those debates mattered at the moment of category formation and became irrelevant once the subscriber relationship was established. The equivalent in AI PC is whether enterprise IT buyers, developers, and knowledge workers are building workflows that depend on specific ARM-architecture capabilities that make switching back to x86 computationally inconvenient. Qualcomm’s second-year commercial traction data suggests the platform relationship is beginning to form: enterprise procurement conversations are now about battery life per workload, on-device inference for specific enterprise software categories, and Windows 11 AI feature compatibility — not about the TOPS number on the spec sheet. That shift from spec conversation to workflow conversation is the signal that a platform relationship is being established.

Netflix’s transition from DVD-by-mail to streaming was not won by having superior video quality in 2007; it was won by rebuilding the subscriber’s daily entertainment habit around on-demand access so thoroughly that the DVD became inconvenient before Netflix’s streaming library was even competitive with its DVD catalogue. The AI PC transition will not be won by Snapdragon X having superior benchmark results against Intel’s next generation; it will be won if the enterprise software ecosystem builds ARM-native depth that makes the Intel alternative feel like the legacy option — the same way Hastings made the Blockbuster late-fee model feel like a design error rather than just a competitive difference. Two years is early for that judgment. The second-year traction suggests the platform relationship has started; the question is whether it compounds.

18/06/2026
Broadcom’s Custom AI Chips Power Google, Meta, and ByteDance’s Models

Broadcom’s Custom AI Chips Power Google, Meta, and ByteDance’s Models

Broadcom reported AI revenue of $4.1 billion in its fiscal Q2 2026 — an annualised run rate above $16 billion — generated almost entirely from two sources: custom AI accelerator chips (XPUs) designed for specific hyperscaler customers, and the networking silicon that connects tens of thousands of those chips inside AI data centres. Broadcom’s Q2 FY2026 investor materials confirmed that Google, Meta, and a third unnamed hyperscaler (widely identified as ByteDance based on prior reporting) represent the majority of its AI XPU revenue, with each customer operating a multi-year design and production partnership that gives Broadcom the equivalent of a long-term contract in a market where competitors are typically evaluated project by project. The numbers position Broadcom as the second-largest beneficiary of AI infrastructure spending after Nvidia — a fact that receives significantly less attention than Nvidia’s market dominance because Broadcom’s AI chips are invisible to end users and absent from the public model benchmarking discourse.

The distinction between Broadcom’s XPUs and Nvidia’s GPUs is architectural and strategic. Nvidia’s H100, H200, and Blackwell series are general-purpose AI accelerators: programmable, flexible, capable of running any neural network architecture, optimised to perform well across training and inference for a wide range of model types. That generality is their value for AI research teams, startups, and enterprises that need a single hardware platform for varied workloads. The cost of generality is that general-purpose chips carry design overhead — memory bandwidth, programmability features, precision flexibility — that is unnecessary and expensive for a hyperscaler running a single well-defined workload at massive scale. Google’s TPU (Tensor Processing Unit) programme, which Broadcom has designed in close collaboration since TPU v4, starts from a different premise: what is the most efficient chip architecture for running Google’s specific matrix multiplication workloads at Google’s specific inference and training scales?

What Custom Silicon Actually Means for Google, Meta, and ByteDance

Google’s TPU v6 (Trillium), announced in mid-2025, delivers performance-per-watt improvements over the v5 generation that translate directly into the cost economics of serving Gemini inference at Google’s scale. Google processes hundreds of billions of AI-assisted queries monthly across Google Search AI Overviews, Gemini consumer, and Google Workspace features. At that volume, a 30 percent improvement in compute efficiency per FLOP compounds into billions of dollars of annual infrastructure cost reduction. The business case for the multi-year design investment in a custom chip is clear when the chip runs a single workload at that volume; the same case cannot be made for a company running diverse AI workloads in smaller quantities.

Meta’s MTIA (Meta Training and Inference Accelerator) chip family follows the same logic applied to Meta’s specific recommendation model workloads — the ranking and feed algorithms that process hundreds of billions of daily interactions across Facebook, Instagram, Threads, and WhatsApp. Meta’s recommendation workloads are among the highest-volume, most-stable inference tasks in existence: they run continuously, they are well-understood architecturally, and their compute requirements are predictable at a multi-year horizon. Custom silicon for a workload with those properties has a straightforward TCO argument. The MTIA programme represents Meta’s attempt to own the chip layer for its core revenue-generating models rather than remain dependent on Nvidia’s roadmap and pricing for that capacity. The Magnificent Seven’s $700 billion AI infrastructure commitment includes the custom silicon investment as a deliberate cost-reduction strategy embedded within total capex, not a separate line item.

How XPUs and Networking Drive Broadcom’s AI Revenue Mix

Broadcom’s AI revenue is roughly split between two product categories. The first is the XPU chip design and production business — Broadcom designs the chip in partnership with the hyperscaler, manufactures it at TSMC using N3 or N2 process nodes, and earns revenue on chip sales. The second, and in some quarters the larger contributor, is AI networking silicon: the Tomahawk and Jericho ethernet switch chips that interconnect the accelerator clusters inside AI data centres.

AI training requires tight coordination among thousands of accelerators running in parallel; the interconnect between them must move data at rates that keep the accelerators fed without creating bottlenecks. Broadcom’s 51.2 Tbps Tomahawk 5 ethernet switch is the dominant switching silicon for high-bandwidth AI cluster interconnects, with deployments at every major hyperscaler’s AI data centre construction programme. Nvidia’s Blackwell infrastructure uses a mix of NVLink (Nvidia’s proprietary interconnect) for dense in-rack coupling and ethernet (often Broadcom Tomahawk-based) for rack-to-rack fabric — meaning Broadcom’s networking business benefits from Nvidia deployments as well as from custom XPU deployments that do not use Nvidia at all. The networking revenue is effectively a toll on all AI data centre construction regardless of which accelerator chip is inside.

Why Nvidia Hasn’t Lost and Why That May Change

Nvidia’s dominance in AI compute is not threatened by Broadcom’s XPU business in the near term, and the reason is timing and scope. Custom silicon development takes 3-4 years from design inception to volume production; the workload must be stable and large enough to justify the design investment; and the chip must be maintained and iterated in partnership with a single customer who accepts the risk of the design not performing as expected. These conditions apply to a small number of hyperscalers with the largest and most stable AI workloads. For the 99 percent of AI compute buyers who are not at Google or Meta scale — enterprises, cloud customers, AI startups, research teams — Nvidia’s general-purpose GPUs with their mature software ecosystem (CUDA, cuDNN, TensorRT) remain the only viable option.

The long-term dynamic is that custom silicon’s share of total AI compute will grow as more hyperscaler-scale workloads mature and as the design ecosystem improves. Hyperscaler cloud capex is increasingly allocated toward custom silicon as a percentage of total chip spend, and Amazon’s Trainium3 (a Broadcom-adjacent programme) and Microsoft’s Maia 2 represent additional major hyperscalers moving down the same path. Whether Broadcom retains the dominant position in XPU design-and-manufacture or faces competition from other chip design firms as the market grows is the strategic question for the post-2026 period; for now, its three-customer concentration in XPUs and its networking silicon monopoly position give it an AI revenue trajectory that no other semiconductor company outside Nvidia can match.

Broadcom’s XPU Position Is a Switching-Cost Moat Disguised as a Technology Advantage

The competitive analysis of Broadcom’s custom silicon business requires distinguishing between two different sources of durable advantage. The first — technology leadership, meaning a design capability that competitors cannot match — is valuable but perishable. A better chip design from a new entrant can erode technology leadership. The second — switching cost, meaning the accumulated cost to a customer of replacing the incumbent — is durable in proportion to how deeply embedded the incumbent’s knowledge is in the customer’s operations. Broadcom’s XPU position is the second type disguised as the first.

Custom silicon development for a hyperscaler takes 3-4 years from design inception to volume production. During that period, Broadcom’s engineers and Google’s ML infrastructure teams co-develop an architecture whose decisions — memory bandwidth ratios, precision formats, interconnect topology — reflect years of iterative learning about Google’s specific Gemini training and inference workloads. The resulting chip embodies knowledge that is not separable from the co-design relationship. Replacing Broadcom as Google’s XPU design partner would not be a procurement decision; it would be a 3-4 year re-design programme undertaken while Google’s most critical AI workloads run on a chip designed for a prior generation of models.

The switching cost compounds with each chip generation. By the time Google runs on TPU v6, the accumulated co-design knowledge from v4 and v5 is embedded in Broadcom’s team’s understanding of what Google needs. The TSMC manufacturing constraint adds a second-order lock-in: even if a hyperscaler wanted to change design partners, access to N2 process node capacity at the volumes required for a competitive custom chip is constrained independently of who designs it. The moat around Broadcom’s XPU business is therefore two layers deep — relationship switching cost at the design layer, and manufacturing access constraint at the production layer.

Michael Porter is the Bishop William Lawrence University Professor at Harvard Business School and the author of Competitive Strategy and Competitive Advantage. His Five Forces and value chain frameworks remain the dominant vocabulary for evaluating structural competitive positions.

13/06/2026