BNB$579.18▲ 1.57%XRP$1.10▲ 3.40%USDS$0.9998▲ 0.00%NATGAS$2.92▲ 0.52%TRX$0.3264▲ 0.55%HYPE$66.79▲ 5.45%SOL$78.02▲ 3.98%FIGR_HELOC$1.04▲ 0.37%BTC$64,734.00▲ 3.30%LEO$9.80▲ 2.76%DOGE$0.0741▲ 2.91%ETH$1,875.59▲ 5.14%XAG$58.81▲ 0.06%WTI$79.75▲ 0.52%XLM$0.1832▲ 2.44%XAU$4,040.30▼ 0.51%ZEC$552.79▲ 9.50%RAIN$0.0147▲ 2.99%WBT$56.54▲ 2.83%BRENT$85.48▲ 0.89%BNB$579.18▲ 1.57%XRP$1.10▲ 3.40%USDS$0.9998▲ 0.00%NATGAS$2.92▲ 0.52%TRX$0.3264▲ 0.55%HYPE$66.79▲ 5.45%SOL$78.02▲ 3.98%FIGR_HELOC$1.04▲ 0.37%BTC$64,734.00▲ 3.30%LEO$9.80▲ 2.76%DOGE$0.0741▲ 2.91%ETH$1,875.59▲ 5.14%XAG$58.81▲ 0.06%WTI$79.75▲ 0.52%XLM$0.1832▲ 2.44%XAU$4,040.30▼ 0.51%ZEC$552.79▲ 9.50%RAIN$0.0147▲ 2.99%WBT$56.54▲ 2.83%BRENT$85.48▲ 0.89%

Prices as of 04:57 UTC

Author: Alani Tahir

AMD Outran Nvidia by More Than 100 Points in 2026. The AI Chip Trade Just Priced In Commoditization
The single most important number in semiconductors this year is not Nvidia’s revenue growth. It is the spread between two stock charts. Nvidia’s shares are up roughly 13% year to date in 2026 despite 85% revenue growth last quarter and analyst expectations of 96% growth next quarter. AMD is up somewhere between 130% and 150% over the same stretch. A company growing revenue at 85% is being treated by the market as ex-growth, while its distant number-two competitor is treated as the growth story. That inversion is not noise. It is the market pricing in the commoditization of AI compute, and that repricing has direct consequences for crypto’s compute-adjacent trades.

The lazy read is that AMD is winning and Nvidia is losing. That is not what the spread means. Nvidia still commands roughly 80% of the AI accelerator market against AMD’s 5% to 7%. What the spread means is subtler and more important: investors have stopped paying for Nvidia’s dominance because they have started to believe that dominance no longer commands monopoly pricing. The AI chip trade has rotated from betting on one supplier’s moat to betting on the supply chain that erodes it.

The spread, not the leader, is the signal

Start with the raw performance. The PHLX Semiconductor Sector index has gained roughly 79% in 2026. Inside that index, the dispersion is enormous. Nvidia, still the largest AI chipmaker by far, has delivered a low-double-digit return. AMD has more than doubled. AMD’s data center revenue hit a record $5.8 billion, up 57% year over year, now more than half of total company revenue. The market is rewarding the trajectory of the challenger far more than the scale of the incumbent.

Why would a market do that to a company still growing revenue 85%? Because stock prices discount the future, not the present, and the future the market is now pricing for Nvidia is one of margin compression. Nvidia’s gross margins have run in the mid-70s, a level that only survives while it is the sole credible supplier of frontier training silicon. Every credible second source — AMD’s Instinct line, the hyperscalers’ custom chips — chips away at the pricing power those margins depend on. AMD does not have to take Nvidia’s market share to hurt Nvidia’s multiple. It only has to be good enough that buyers can negotiate.

And buyers now can. OpenAI signed a multi-year commitment for 6 gigawatts of AMD GPUs, with the first gigawatt landing in the second half of 2026 on the MI450. Meta committed to up to 6 gigawatts of custom AMD Instinct MI450 deployments, an arrangement reported to carry a multi-year value near $60 billion. When the two most compute-hungry buyers in the world publicly diversify away from a single vendor, they are not just buying chips. They are demonstrating to the market that the single-vendor premium is over.

Why the MI400 series changes the negotiation, not the market share

AMD’s technical position is better than its 5% to 7% share suggests. The MI400 series flagship, the MI455X, is specified at 40 PFLOPS of FP4 performance and 432 GB of HBM4, with Helios rack systems shipping in the third quarter of 2026. On paper, that is competitive with Nvidia’s current generation, and AMD claims a first-to-2nm advantage on part of the line. AMD’s own November 2025 analyst day set a target of double-digit AI accelerator market share within three to five years.

Hitting that target is not the point for the stock, and this is where most coverage gets the causality backwards. AMD’s share could stall at 10% and the thesis still works, because AMD’s real product is not the GPU. It is optionality for the buyer. Nvidia’s late-August earnings and the fall shipment of its next-generation Vera Rubin systems will almost certainly show strong numbers. But strong numbers into a market that now has a credible second source produce a different multiple than strong numbers into a monopoly. The market has already made that adjustment. It did it in the spread between the two stocks, months before either company’s next earnings call.

This is a classic late-cycle pattern in a hardware supercycle: the trade broadens from the obvious leader to the picks-and-shovels tier and the second sources. It happened to Cisco in the networking build-out and to the memory makers in prior data-center cycles. The leader keeps growing revenue while the market’s incremental dollar rotates to whatever is earlier in its own re-rating. Recognizing that pattern is worth more than debating whether Nvidia is a good company. It obviously is. The question the spread answers is whether it is still a monopoly, and the market has voted no.

The crypto and Web3 read: this rotation is the DePIN entry signal

Commoditizing AI silicon is the single most bullish structural development for crypto’s compute-adjacent sector, and the AMD-Nvidia spread is the cleanest signal that it is happening. The entire thesis behind decentralized GPU networks and Bitcoin miners pivoting to AI hosting depends on one condition: that AI compute stops being a proprietary bottleneck and becomes a rentable commodity. A market that is actively de-rating the monopoly supplier and re-rating the challenger is a market telling you that condition is arriving.

The most direct beneficiaries are the Bitcoin miners that have converted power and cooling infrastructure into AI hosting — a rotation we flagged when Nvidia’s flat stock signaled the AI trade was rotating toward miners. Crusoe, IREN, and the CoreWeave-style operators whose cloud revenue crossed $1.5 billion built their moat on cheap, contracted power — the one input a hyperscaler cannot conjure quickly. When the GPU itself commoditizes, the scarce input shifts from silicon to megawatts, and the miners already own the megawatts. That is why Bitcoin miners have repeatedly outperformed on AI-hosting news even when Bitcoin itself was flat.

On the decentralized side, GPU-rental networks like Akash Network and io.net benefit from a wider pool of non-Nvidia hardware they can aggregate. A network’s addressable supply grows every time a credible non-Nvidia accelerator ships, because it means more heterogeneous hardware that a coordination layer can pool and route. AMD’s MI450 ramp, the hyperscalers’ custom chips, and the broadening supply base are collectively the supply-side unlock these networks have been waiting for. The tokens tied to render and inference distribution — Render Network among them — are levered to exactly this commoditization. The caveat, as always, is that a wider supply base does not by itself create demand; it lowers the cost floor these networks must clear. But a falling cost floor is precisely what the AMD-Nvidia spread is pricing in.

The verdict

Do not read the AMD-Nvidia spread as a horse race between two chipmakers. Read it as a referendum on whether AI compute stays a monopoly-priced good or becomes a competitively supplied one. The market has already ruled: a company growing revenue 85% is priced for margin compression, while its challenger is priced for the share it has not yet taken. That verdict is the clearest macro signal available that the AI compute layer is commoditizing — and commoditizing compute is the precondition every crypto compute trade, from AI-pivoting miners to decentralized GPU networks, has been waiting on. The spread between two stock charts is telling you the door is opening. The question is which crypto-adjacent operators are positioned to walk through it, and the answer is the ones that already own the input silicon cannot replace: power.

Frequently asked questions

Why is AMD up over 100% while Nvidia is roughly flat in 2026? Nvidia still dominates the AI accelerator market with around 80% share and posted 85% revenue growth last quarter, but the market discounts the future rather than the present. Investors have begun pricing in margin compression for Nvidia as credible second sources — AMD’s Instinct line and hyperscaler custom silicon — erode its monopoly pricing power. AMD, starting from a low base, is being re-rated for the share it could take. The spread reflects a rotation from betting on one supplier’s moat to betting on the supply chain that erodes it, not a simple win-lose outcome.

Does AMD need to take Nvidia’s market share for the trade to work? No, and that is the most misunderstood part. AMD’s share could stall in the low double digits and the thesis still holds, because AMD’s real product for the market is buyer optionality. Once large buyers like OpenAI and Meta have a credible second source, they can negotiate, and Nvidia’s mid-70s gross margins compress even if its unit volume keeps growing. The stock market impact comes from the change in Nvidia’s pricing power, which a viable challenger creates regardless of whether it wins the majority of sockets.

What are the OpenAI and Meta AMD commitments? OpenAI signed a multi-year commitment for 6 gigawatts of AMD GPUs, with the first gigawatt deploying in the second half of 2026 on the MI450. Meta committed to up to 6 gigawatts of custom AMD Instinct MI450 deployments, an arrangement reported to carry a multi-year value near $60 billion. These commitments matter beyond the revenue because they publicly demonstrate that the most compute-hungry buyers in the world are diversifying away from single-vendor dependence, which is the market signal driving the re-rating.

How does semiconductor commoditization help crypto? The decentralized-compute and Bitcoin-miner-pivot theses depend on AI compute becoming a rentable commodity rather than a proprietary bottleneck. A market actively de-rating the monopoly supplier and re-rating the challenger is evidence that commoditization is underway. As the GPU itself commoditizes, the scarce input shifts from silicon to power and cooling — which Bitcoin miners like IREN and Crusoe already own — and decentralized GPU networks like Akash and io.net gain a wider heterogeneous hardware pool to aggregate. Commoditization lowers the cost floor these operators must clear.

Is Nvidia in trouble? Not operationally. Nvidia remains the largest AI chipmaker by a wide margin, its Vera Rubin systems begin shipping this fall, and its late-August earnings will very likely show strong growth. The 2026 stock underperformance is a valuation story, not a business-deterioration story: the market is unwilling to keep paying a monopoly multiple for a company that now faces credible competition. Strong numbers into a contested market simply command a lower multiple than strong numbers into a monopoly, which is the adjustment the AMD-Nvidia spread has already made.

Sources
14/07/2026
ServiceNow Now Assist Reached 2,600 Enterprise Customers

ServiceNow Now Assist Reached 2,600 Enterprise Customers in Q1 2026

ServiceNow reported in its Q1 2026 earnings (January through March 2026, results published April 23, 2026) that Now Assist — the generative AI layer integrated across ServiceNow’s IT Service Management, Customer Service Management, HR Service Delivery, and Security Operations product lines — had reached 2,600 paying enterprise customers, up from approximately 800 at the close of Q1 2025 and representing a 225 percent year-over-year growth rate that makes Now Assist one of the fastest-scaling enterprise AI products in the SaaS industry by customer count. ServiceNow’s Q1 2026 earnings disclosures show total revenue reached $3.24 billion in the quarter, up 18 percent year-over-year from $2.75 billion in Q1 2025, with subscription revenue of $3.13 billion (up 19 percent) and a current remaining performance obligation — the forward revenue under contract — of $12.1 billion, reflecting the multi-year nature of enterprise ServiceNow agreements and providing revenue visibility through FY2027. ServiceNow’s net revenue retention rate of 128 percent in Q1 2026 — which measures how much revenue from the prior year’s customer cohort has grown through expansion purchases — is the primary indicator that Now Assist is generating meaningful expansion within ServiceNow’s existing enterprise customer base rather than contributing primarily through new customer acquisitions. A net revenue retention rate of 128 percent means that for every dollar of Q1 2025 subscription revenue, ServiceNow’s same customer cohort generated $1.28 of Q1 2026 subscription revenue — a 28-cent expansion per dollar, the majority of which ServiceNow attributes on its earnings call to Now Assist and Pro Plus tier upsell within existing enterprise accounts. Now Assist’s commercial structure reinforces the expansion dynamic: Now Assist is not a standalone product but a per-seat add-on license to existing ServiceNow product subscriptions — an enterprise that pays for ServiceNow ITSM can add Now Assist for ITSM at an incremental per-seat charge, applying AI summarisation, resolution recommendation, and automated routing to its existing ITSM incident workflow without implementing a new product or changing its operational processes. This add-on structure means Now Assist’s addressable market within ServiceNow’s existing 8,100-plus enterprise customers is the near-entirety of that installed base, and the 2,600 Now Assist customers as of Q1 2026 represent 32 percent penetration of the installed base — a penetration rate that, if it continues expanding to 50 or 60 percent by FY2027, implies several hundred million dollars of incremental annual contract value without any new-logo enterprise acquisition. Cisco’s AI networking revenue crossing $5 billion for enterprise data centre fabric infrastructure serves the physical networking layer that ServiceNow’s cloud-delivered platform relies on for enterprise connectivity, but the two companies’ AI revenue stories are structurally complementary rather than overlapping: Cisco sells AI-capable network hardware to the enterprise data centres and colocation facilities that host the ServiceNow cloud infrastructure, while ServiceNow sells AI workflow software that runs on that infrastructure — with both companies’ AI revenue growth driven by the same underlying enterprise AI adoption trend at different layers of the stack.

Now Assist’s commercial differentiation from general-purpose enterprise AI platforms (Google Gemini in Workspace, Microsoft Copilot in Office 365) is the vertical depth of its workflow integration: rather than providing a horizontal AI assistant that can answer questions and draft text across any business context, Now Assist is specifically trained and integrated into the exact workflow steps that ServiceNow orchestrates for IT, customer service, and HR operations. A Now Assist incident summarisation in ITSM does not simply produce a text summary of the incident ticket — it pulls the incident’s full resolution history, cross-references similar past incidents from the enterprise’s historical ITSM data, identifies the most-applied resolution patterns for incidents with similar symptom combinations, and presents the on-call engineer with a pre-formatted next-action recommendation that links to the relevant knowledge base articles and assigns estimated resolution time based on historical data for similar incidents at the same enterprise. This vertical integration is possible because ServiceNow has more than a decade of structured ITSM workflow data — hundreds of millions of incidents, changes, and service requests from 8,100-plus enterprise customers — that provides the training signal for workflow-specific AI that general-purpose foundation model training data cannot replicate. ServiceNow’s partnership with Nvidia — announced in 2024 and expanded in Q1 2026 to include Now Assist powered by Nvidia NIM microservices for enterprises that choose to run Now Assist inference on Nvidia-based private cloud infrastructure rather than ServiceNow’s shared cloud — gives enterprises with data residency or compliance requirements an on-premises Now Assist deployment option that maintains workflow integration depth while keeping inference compute within the enterprise’s own infrastructure boundary. Gartner’s 2026 Magic Quadrant for IT Service Management places ServiceNow in the Leaders quadrant with the highest overall placement, with Gartner’s evaluation noting that Now Assist reduced mean time to resolve for P1 incidents by an average of 22 percent across the enterprise deployments in Gartner’s survey data, and reduced the proportion of incidents requiring human escalation from 47 percent to 31 percent in deployments where Now Assist was fully integrated into the first-line response workflow. Gartner’s survey data also shows that 61 percent of enterprises using ServiceNow ITSM as their primary incident management platform planned to add Now Assist in the next 12 months as of Q1 2026 — the highest stated AI feature adoption intent of any enterprise workflow product category Gartner surveys, which Gartner attributes to the measurable operational outcome improvement (MTTR reduction) being more direct and quantifiable than the productivity improvements claimed by horizontal AI assistant products. Cloudflare’s AI Gateway for multi-provider API management addresses an adjacent infrastructure need for enterprises deploying Now Assist in multi-cloud environments: Cloudflare AI Gateway can sit between an enterprise’s ServiceNow environment and the external Nvidia NIM or AWS Bedrock-hosted model inference endpoint that Now Assist uses, providing rate limiting, cost monitoring, and fallback routing across inference providers — a complementary toolchain position that illustrates how enterprise AI deployments increasingly require multiple vendor layers even for a single application workflow like ITSM AI assistance.

What Now Assist’s 225 Percent Growth Rate Tells Enterprises About AI Workflow ROI

The 225 percent year-over-year customer growth rate for Now Assist is anomalously fast even within the context of enterprise AI adoption in 2025-2026, and its explanation is specific to the measurability of ITSM workflow AI outcomes. Enterprise AI products that address productivity (Copilot in Word, Gemini in Docs) generate diffuse benefits — individual employee time savings on tasks that were previously done manually, which are difficult to aggregate into a CFO-legible ROI figure for renewal justification. Enterprise AI products that address operational workflows (Now Assist reducing incident MTTR, Agentforce reducing service case handle time) generate concentrated, measurable benefits — a 22 percent reduction in P1 incident MTTR translates directly into fewer engineer-hours per incident, reduced service downtime per incident, and lower SLA breach penalties for the enterprise, all of which can be quantified against the cost of the Now Assist per-seat license with enough precision to generate a positive ROI in the first six months of deployment. The measurability advantage compounds over renewal cycles: an enterprise that renewed Now Assist after a one-year ITSM deployment can present its IT operations data showing MTTR trend, escalation rate trend, and ticket auto-close rate trend as direct evidence of ROI, making the renewal budget justification a data presentation rather than a value narrative. ServiceNow’s customer success organisation contributes to the renewal evidence base: ServiceNow provides enterprises with a “Now Assist Impact Dashboard” that aggregates Now Assist utilisation metrics, resolution time comparisons between AI-assisted and non-AI-assisted incidents, and estimated time-savings calculations in the same reporting interface as the enterprise’s broader ServiceNow operational analytics. The combination of measurable ROI and in-product ROI reporting creates a renewal dynamic that explains Now Assist’s 128 percent net revenue retention: enterprises that see measurable MTTR improvement in year one upgrade to broader Now Assist coverage (adding CSM or HRSD modules alongside ITSM) in year two, increasing annual contract value while the measurable outcome data continues to justify the expanded spend. Palantir AIP’s enterprise AI revenue and government contract growth demonstrates the contrasting enterprise AI adoption dynamic in high-value, low-volume deployments: Palantir’s AIP platform generates per-customer contract values of $5 million to $50 million annually, with deployment complexity requiring Palantir’s professional services “boot camp” methodology, while ServiceNow’s Now Assist generates $50,000 to $500,000 per customer annually with deployment primarily handled by the enterprise’s existing ServiceNow administrators — a per-customer revenue difference of roughly 10-to-1 but a customer count scaling advantage for ServiceNow of roughly 100-to-1 at equivalent market penetration rates. Workday’s AI HCM features for workforce management represents the HR workflow AI market that Now Assist for HRSD competes with directly: both products embed AI summarisation and recommendation into HR service requests (benefits queries, payroll corrections, onboarding task management), with Workday’s advantage being deeper integration with payroll and financial data and ServiceNow’s advantage being broader integration with ITSM and customer service workflows in enterprises that use ServiceNow as their cross-departmental service management platform. The Wall Street Journal’s coverage of ServiceNow’s Q1 2026 results frames the 2,600 Now Assist customer milestone as the point at which enterprise workflow AI has proven its ROI at sufficient scale and breadth of deployment to be considered a standard enterprise software procurement category rather than an experimental technology investment — a framing that, if accurate, implies the next competitive cycle in ITSM and CSM software will be defined by AI workflow depth and measurability rather than by the feature breadth and integration ecosystem factors that have defined the category since ServiceNow’s inception.

What ServiceNow’s 2,600 Now Assist Customers Reveal About the Narrative That Closes Enterprise AI Deals

ServiceNow’s 2,600 Now Assist customer count is a sales narrative as much as a product metric. The story it tells to the enterprise buying committee is that AI workflow automation in ITSM and CSM is no longer experimental — that 2,600 enterprises of scale have evaluated the technology and found it production-worthy. This is the social proof layer of enterprise sales content: not claims about features but claims about what peers have already decided. The number functions precisely because it signals that the risk of being first has passed. The buyer who signs in the second half of 2026 is not an early adopter; they are joining an established community of production users, which is a fundamentally different risk story to bring to a budget committee.

The content that converts enterprise buyers is not technical specification but outcome narrative. ServiceNow’s most effective marketing for Now Assist will not be latency benchmarks or training dataset descriptions; it will be stories about specific enterprises that used the platform to close a measurable operational gap. Which customer reduced average handle time by a specific percentage? Which IT operations team deflected a specific volume of tier-1 tickets in the first quarter? Which deployment generated a specific ROI inside twelve months? The 2,600 number is the container for those stories, but the stories themselves are what convert the buying committee member who needs to justify the investment in a board presentation. The count is the headline; the outcome narrative is the body copy that makes the headline credible.

The content marketing risk for ServiceNow at 2,600 customers is that the success story pool diversifies across industries, workflow types, and deployment sizes to the point where the generic Now Assist narrative loses its specificity. The most effective enterprise content marketing segments its social proof by buyer persona — showing a healthcare CIO a healthcare ITSM outcome, a financial services IT leader a financial services Now Assist result — rather than presenting undifferentiated aggregate counts. 2,600 customers is the ceiling of what a count-based claim can do. The next growth phase belongs to persona-specific outcome narratives that speak directly to the highest-anxiety objections of each specific buyer type.

What ServiceNow’s 2,600-Customer Aggregate Reveals About the Design Problem Every Enterprise AI Platform Eventually Faces

The 2,600-customer count is a system-level metric. It describes the platform, not the experience any single IT service agent has when they open a ticket and Now Assist offers a suggested resolution. This distinction matters more than it appears to, because the design principle that governs whether an enterprise AI feature actually gets used is discoverability at the point of need — not aggregate adoption at the company level. A 2,600-customer count tells you the platform has cleared procurement. It tells you nothing about whether the individual agent using it every day finds the AI suggestion helpful, trustworthy, or worth the cognitive overhead of evaluating before accepting.

Good design makes the right action obvious without making the system feel like it is making decisions for the user. Now Assist’s AI suggestions inside a ticketing workflow succeed or fail based on a narrow, specific design question: does the suggested resolution appear at the moment the agent needs it, with enough context to evaluate quickly, and with an easy path to override if it’s wrong? Get that interaction pattern right, and the AI becomes an invisible accelerant — the agent barely notices they are using it because it simply makes their existing workflow faster. Get it wrong, and the AI becomes an obstacle the agent has to work around, regardless of how sophisticated the underlying model is. The 2,600-customer number cannot tell you which of these is happening inside any given deployment.

The design signal worth watching as Now Assist scales past 2,600 customers is not the count but the interaction friction: how many suggested resolutions are accepted without modification, how many are edited before use, and how many are dismissed outright. That breakdown is a design health metric in a way the customer count is not. A high dismissal rate signals a mismatch between what the AI suggests and what the agent’s actual context requires — a design failure, not a capability failure, because the underlying model may be technically correct and still be wrong for the moment it was deployed into. ServiceNow’s next milestone worth publishing is not a bigger customer number. It is the interaction-level evidence that Now Assist has solved the harder problem: making AI assistance feel like a natural extension of the agent’s workflow rather than a system they have to manage.

03/07/2026
The 2026 Memory Crunch Hands DePIN Its Best Demand Case Yet
The memory shortage gripping the chip industry in mid-2026 is not a cyclical blip waiting to correct. It is a structural reallocation of the world’s most fungible hardware resource away from consumers and toward AI data centers, and it has quietly built the strongest demand case decentralized infrastructure networks have ever had. When DRAM prices surged by up to 89% in Q2 2026 and Samsung, SK Hynix and Micron warned the squeeze could run past 2027, they confirmed something that crypto’s compute and storage projects have argued for two years: hardware capacity is now the scarce asset, and whoever can mobilize idle silicon at the edges wins.

This is the article’s claim, stated plainly: the AI memory crunch is a one-way door, not a price cycle, and that permanence is what turns DePIN from a token-subsidy experiment into a real arbitrage against rationed centralized supply.

What Actually Happened To Memory In 2026

The numbers are not subtle. TrendForce flagged the surge persisting into Q1 2026, with smartphone and notebook brands already raising prices and downgrading specs. By Q2, specific components told the story: a 96Gb (12GB) LPDDR5X module climbed from $77.1 to $145.9 — an 89% jump in a single quarter, per component pricing tracked across the consumer segment. Gartner estimated a 130% combined surge in DRAM and SSD prices by the end of 2026, translating into a 17% rise in PC prices and 13% on smartphones.

The cause is a deliberate manufacturing decision, not an accident. Samsung, SK Hynix and Micron shifted the bulk of combined production toward high-bandwidth memory for AI servers, with HBM consuming 23% of total DRAM wafer output, up from 19% in 2025. The margin logic is brutal: a single HBM3E module sells for roughly $60 to $100, versus $5 to $10 for a comparable amount of conventional DDR5. When the same wafer can be sold at eight to ten times the price into AI demand, consumer DRAM does not get expanded — it gets starved.

The consequence flows straight to buyers. Gartner projects worldwide PC shipments down 10.4% and smartphones down 8.4% in 2026. Lenovo, Dell, HP, Acer and ASUS have warned of 15-20% hikes and contract resets, and base-model phones are sliding back toward 4GB of RAM. The market is not absorbing a price increase. It is shrinking.

Why This Is Structural, Not Cyclical

Memory has always been the most cyclical corner of semiconductors — gluts and shortages on a roughly two-year clock. The reason 2026 breaks the pattern is supply timing. New fab capacity from Micron and SK Hynix will not reach volume production until 2027 at the earliest, so the gap is locked in by physics and construction schedules, not sentiment. Samsung and SK Hynix have told customers the AI-driven shortage could last until 2027 and beyond, with buyers already reserving supply years in advance.

The demand side compounds the problem. AI infrastructure spending from Amazon, Microsoft, Meta and Alphabet alone is expected to reach a combined $700 billion in 2026, and memory is a non-negotiable input to every GPU cluster those dollars buy. This is the same compute build-out we tracked when xAI scaled Colossus toward a million GPUs and when AMD pushed the MI300X into enterprise data centers. Every one of those accelerators needs HBM stacked beside it. The hyperscalers are not competing with consumers for memory at the margin — they are buying the entire margin and then some.

There is also a contested layer to the story. Samsung, SK Hynix and Micron face a class-action antitrust suit in California alleging the three coordinated capacity constraints under cover of the HBM transition, with plaintiffs claiming restricted conventional DRAM supply drove an extreme price surge. The legal merits are unproven and the companies dispute the framing. But the suit matters editorially for one reason: it puts three firms in control of a resource the entire AI economy now depends on, and concentration of that kind is exactly the condition decentralized alternatives are built to exploit.

The Crypto Angle: DePIN Gets Its Best Demand Environment Ever

Here is where the memory crunch stops being a hardware story and becomes a crypto one. Decentralized Physical Infrastructure Networks — DePIN — coordinate idle real-world hardware (GPUs, storage, bandwidth) through token incentives and sell that capacity into open markets. For years the bear case was simple: token subsidies, not real demand, kept the lights on. Rationed centralized supply changes that math.

Akash Network is the clearest example. It posted a record $5 million in compute spend in Q1 2026, with its AkashML platform processing 1.7 billion tokens daily on OpenRouter for AI inference, according to DePIN revenue analysis from BlockEden. Its March 2026 Burn-Mint Equilibrium mechanism automatically buys and burns AKT whenever customers pay for compute, tying token scarcity to actual usage rather than emission schedules. That is the pivot that matters: demand-driven deflation replacing inflationary subsidy.

The pattern repeats across the sector. Filecoin has shifted toward paid storage deals with AI firms and researchers, with revenue per terabyte stabilizing as genuine customers commit to longer terms. GPU-focused networks Render, Aethir and io.net compete on inference workloads, where roughly 70% of 2026 GPU demand now sits — and where decentralized networks hold a structural cost edge over hyperscalers because inference tolerates distributed, lower-tier hardware better than training does. Bittensor coordinates open AI model markets. Grass monetizes residential bandwidth. None of these networks needs to beat NVIDIA on raw performance. They need to be available and cheaper when the centralized supply is rationed, reserved years out, and priced like a luxury good.

This connects to a broader thesis we have argued before: the most durable crypto demand comes from tokenizing real-world economic activity, not from speculative loops. DePIN tokenizes the supply side of the compute economy. When memory and GPU capacity become the bottleneck for a $700 billion build-out, any network that can credibly aggregate spare hardware at the edges is selling into the single hottest market in technology. The crunch did not create DePIN. It gave DePIN a customer.

The honest caveat: DePIN’s addressable demand is still small against hyperscaler scale, and a16z-tracked Web3 compute usage remains a rounding error next to AWS or Azure. Token mechanics can still mask thin real revenue. But the direction is unambiguous — every quarter of rationed centralized memory pushes marginal AI workloads to look harder at decentralized supply, and the cost gap is widening in DePIN’s favor, not narrowing.

Who Gets Hurt And Who Gets Paid

The losers are easy to name. Consumers buying PCs and phones in 2026 are paying a memory tax measured in double-digit percentages, with worse specs at the low end. PC and smartphone OEMs eat margin compression and shrinking unit volumes. Any AI startup without reserved memory contracts faces supply uncertainty that compounds its compute bill.

The winners are equally clear. SK Hynix has seen revenue from AI-related memory products more than triple since 2024, and all three memory giants are posting margins they have not enjoyed in a decade. The hyperscalers locking in supply years ahead protect their roadmaps. And at the speculative edge, DePIN tokens get a fundamental tailwind that does not depend on a broad crypto bull market — it depends on memory staying scarce, which the fab timelines say it will. We saw a similar capacity-as-moat dynamic when Oracle turned raw AI infrastructure into a revenue engine: in a shortage, whoever controls the capacity sets the terms.

What To Watch Next

Three signals will confirm or break this thesis over the next two quarters. First, the antitrust suit: if discovery shows deliberate constraint, expect regulatory pressure that could ironically accelerate interest in decentralized supply as a hedge against concentrated control. Second, DePIN paid-revenue curves: if Akash, Filecoin and the GPU networks keep converting AI demand into recurring on-chain payments rather than one-off spikes, the “real demand” case is proven. Third, the 2027 fab timeline: any slippage in Micron or SK Hynix volume production extends the shortage and the DePIN tailwind with it.

The cleanest way to read 2026 is this. The AI build-out turned memory into the new oil, three companies into its OPEC, and consumers into the people paying at the pump. DePIN is the wildcat driller betting the shortage lasts long enough to make distributed supply worth the friction. On current fab math, that is not a bad bet.

FAQ

Why are memory chip prices surging so much in 2026?

The surge is driven by a deliberate manufacturing shift. Samsung, SK Hynix and Micron reallocated production capacity toward high-bandwidth memory (HBM) for AI data centers because it sells for eight to ten times the price of conventional consumer DRAM. That left fewer wafers for the DDR5 and LPDDR5X chips used in PCs and phones. With HBM consuming 23% of total DRAM wafer output and AI infrastructure spending heading toward $700 billion in 2026, consumer memory supply is being starved. New fab capacity will not reach volume until 2027, so the shortage is locked in by construction timelines rather than short-term sentiment.

Is the memory shortage a normal cycle or something permanent?

Memory is historically the most cyclical part of the chip industry, but 2026 breaks the usual two-year pattern. The difference is timing: demand from AI compute build-outs is structural and growing, while new supply is physically constrained until at least 2027. Samsung and SK Hynix have told customers the shortage could persist past 2027, with buyers reserving capacity years ahead. It is better understood as a multi-year reallocation of hardware toward AI than a temporary spike that will quickly reverse, though the eventual fab expansions will ease it.

How does the memory crunch help crypto DePIN projects?

DePIN networks like Akash, Filecoin, Render and Aethir coordinate idle hardware and sell its capacity into open markets using token incentives. When centralized memory and GPU supply is rationed, reserved years ahead, and priced like a luxury good, decentralized alternatives become more competitive on availability and cost — especially for AI inference, which tolerates distributed hardware better than training. Akash posted a record $5 million in Q1 2026 compute spend and now burns AKT tokens tied to real usage. The crunch gives these networks genuine demand rather than token-subsidy demand, though their scale is still small against hyperscalers.

What is the antitrust lawsuit against the memory makers about?

Samsung, SK Hynix and Micron face a class-action antitrust suit in California alleging they coordinated capacity constraints under the cover of transitioning production to HBM. Plaintiffs claim the firms restricted conventional DRAM supply to drive prices sharply higher. The legal merits are unproven and the companies dispute the framing, so the allegations should be read as claims, not findings. The case matters strategically because it highlights how three firms now control a resource the entire AI economy depends on — the kind of concentration that strengthens the argument for decentralized supply alternatives.

Will PC and smartphone prices keep rising because of this?

For 2026, yes. Gartner estimates a 130% combined surge in DRAM and SSD prices by year-end, raising PC prices roughly 17% and smartphones 13% versus 2025. OEMs including Lenovo, Dell, HP, Acer and ASUS have confirmed 15-20% hikes and spec downgrades, with low-end phones returning to 4GB of RAM. Worldwide PC shipments are projected to fall 10.4% and smartphones 8.4%. Prices should ease once new fab capacity reaches volume production from 2027 onward, but the relief depends on those timelines holding and AI demand not absorbing the new supply first.

Sources
What the DePIN Demand Case for Memory Does Not Reveal About Who Controls the Shortage Narrative

The memory crunch demand argument for decentralized physical infrastructure networks has a clean logic: AI training and inference require more HBM bandwidth than current fabrication supply can satisfy; DePIN networks can aggregate distributed excess capacity; therefore DePIN has its first legitimate, non-speculative demand case. The logic is not wrong. The question that serious journalism asks before endorsing a demand case is: who is publishing it, and who benefits from its adoption?

Data center operators holding HBM3e inventory have a direct interest in scarcity framing — it validates premium pricing and justifies capital expenditure cycles that are already committed. NVIDIA, which bundles HBM supply with its GPU allocation process, benefits from the narrative that memory shortage is the binding constraint on AI expansion rather than GPU allocation itself. DRAM manufacturers — Samsung, SK Hynix, and Micron — collectively benefit from a ‘permanent scarcity’ characterization of the memory market that supports pricing power through 2027 and beyond. DePIN projects benefit from the demand case regardless of whether decentralized memory nodes can actually deliver at the latency and bandwidth specifications AI training requires.

The technical complication that the demand case sidesteps is not minor. AI training workloads using gradient checkpointing require memory bandwidth with nanosecond synchronization. Distributed memory nodes introduce latency from physical distance and network routing that is fundamentally incompatible with the synchronization requirements of serious training runs. DePIN may be viable for inference workloads with less stringent latency requirements. It is not a practical substitute for HBM in model training at current architectures.

The investigative question is straightforward: which DePIN project has demonstrated actual AI training workloads running on distributed memory infrastructure at the bandwidth and latency specifications real training requires? Not proof-of-concept tests. Not synthetic benchmarks. Actual training runs on production models with disclosed performance data. The memory crunch is real and well-documented. Whether DePIN is the structural solution or the narrative beneficiary of the shortage framing is a distinction the demand case as presented does not help you make.
30/06/2026
Cloudflare’s AI Gateway Has Processed 100 Billion Inference Requests

Cloudflare’s AI Gateway Has Processed 100 Billion Inference Requests and Edge Delivery Has Become the Default AI Infrastructure Layer

Cloudflare reported Q1 2026 revenue of $580 million — up 27 percent year-over-year from $456 million in Q1 2025 — with its AI product portfolio (AI Gateway, Workers AI, Vectorize, and NeuralSwitch) collectively crossing $500 million in annualised run-rate revenue and AI Gateway processing more than 100 billion inference API requests in the quarter, a figure that positions Cloudflare not as a foundation model provider competing with OpenAI or Anthropic but as the delivery and management layer through which enterprise applications route their AI API calls before those calls reach the model provider. Cloudflare’s Q1 2026 investor materials describe AI Gateway’s commercial function precisely: it sits between an enterprise application and the AI model providers that application calls (OpenAI, Anthropic, Google Vertex AI, Meta Llama API, Cohere), providing a unified management layer for caching, rate limiting, cost analytics, fallback routing, and compliance logging across all AI provider relationships from a single configuration interface. An enterprise running five different AI models across its product suite — a coding assistant on GitHub Copilot’s model, a customer service bot on Claude 3.5, a document summariser on GPT-4o, an image generator on Gemini, and a search augmentation layer on Llama 4 — previously had to manage API keys, usage monitoring, cost allocation, and failure handling for each provider independently, with no unified view of total AI spending or provider reliability across the stack. Cloudflare AI Gateway eliminates that management complexity by treating AI provider APIs the same way its core CDN product treats origin server connections: as a class of upstream resources to be routed, cached, monitored, and load-balanced from a single control plane. The caching feature specifically — which stores AI API responses for semantically similar queries and serves the cached response to subsequent requests without calling the model provider — produces the most immediate commercial ROI: AI Gateway customers see an average 30 to 40 percent reduction in AI API costs in the first 90 days of deployment as repeated queries to the same model provider (a documentation chatbot being asked common questions, an internal knowledge assistant handling routine lookups) are served from cache rather than generating new inference calls. ARM Holdings’ royalty revenue from AI chips demonstrates how infrastructure-layer companies capture value from the AI compute stack without needing to develop the applications that run on it — Cloudflare’s AI Gateway occupies a comparable infrastructure position one layer above the compute, capturing value from the routing and management of AI API calls that every enterprise application generates regardless of which model or cloud provider the application ultimately uses.

Workers AI, Cloudflare’s serverless inference product, addresses a different part of the enterprise AI infrastructure problem: latency and data sovereignty for inference workloads that cannot tolerate the 100 to 200 millisecond round-trip times that centralized cloud AI endpoints produce for users located far from the AWS us-east-1, GCP us-central1, or Azure East US regions where the majority of commercial AI model endpoints are hosted. Cloudflare runs Workers AI on its global network of 330-plus data centers in more than 120 countries, meaning that a user in São Paulo, Lagos, or Singapore making an AI inference request through a Workers AI deployment receives a response from a node that is typically within 20 milliseconds of their physical location rather than from a US East Coast endpoint at 150 to 250 milliseconds. The latency advantage matters most for AI applications where the interaction is synchronous and user-facing — a real-time translation feature, a customer-facing chatbot with a visible typing indicator, an AI image filter applied to a video stream — because human perception of response delay degrades interaction quality noticeably above 100 milliseconds in conversational contexts. Workers AI currently supports inference for open-weight models including Llama 4 Scout, Mistral 7B, Gemma 2, and Whisper (audio transcription), with Cloudflare running the model weights on its own GPU infrastructure distributed across the global network. Workers AI is not positioned to replace centralised cloud AI for training workloads (which require GPU clusters with high-bandwidth interconnects that Cloudflare’s distributed single-node architecture does not support) but specifically targets inference workloads where geographic distribution, latency, and data residency are constraints that centralised cloud endpoints cannot satisfy. Palantir’s AIP analytics platform operates at the application layer above cloud AI infrastructure, deploying ontology-driven decision intelligence for enterprises that have already solved their AI infrastructure procurement — Cloudflare Workers AI sits at the infrastructure layer below, providing the edge inference capacity that applications like AIP call when they need low-latency inference outside the US regions where centralised cloud AI is optimised.

What Cloudflare NeuralSwitch Does for Enterprise AI Cost Management

Cloudflare launched NeuralSwitch in June 2026 as an extension of AI Gateway that applies automated routing logic to AI API calls based on real-time model availability, cost, and task complexity — selecting the lowest-cost capable model for each inference request from a configured pool of providers rather than routing all requests to a single model regardless of whether that model’s capability level is necessary for the task. The commercial rationale is straightforward: an enterprise application that routes all AI inference requests to GPT-4o at $15 per million output tokens is overpaying for simple classification, extraction, and structured generation tasks that Llama 4 Scout at zero marginal cost (via Workers AI) or Claude Haiku at $1.25 per million output tokens handles with equivalent output quality. NeuralSwitch’s routing logic evaluates each incoming prompt against a task complexity classifier (itself a small, fast inference model running at the edge) and selects the appropriate model tier from the enterprise’s configured provider pool: a multi-step reasoning task routes to GPT-4o or Claude 3.5 Sonnet; a document summarisation request routes to a mid-tier model; a simple keyword extraction or classification request routes to a fast, inexpensive model running on Workers AI at the edge. Early NeuralSwitch deployments are reporting 50 to 65 percent reductions in total AI API spend compared to single-provider configurations, by matching task complexity to model capability rather than using frontier-class models for tasks where 95 percent of the value is available from a model that costs 10 percent as much. Workday’s Illuminate AI layer applies similar task routing logic within the HCM context — agentic workflows that require policy-constrained human-level judgment route to different model configurations than the routine data extraction and summarisation tasks that Illuminate handles without human review. Gartner’s edge computing research for 2026 projects that by 2028, 40 percent of enterprise AI inference workloads will run at the edge rather than through centralised cloud endpoints, driven by latency requirements, data sovereignty obligations under the EU AI Act and equivalent regional regulations, and cost optimisation through geographic proximity to end users. The Wall Street Journal’s enterprise technology coverage through Q2 2026 frames Cloudflare’s AI infrastructure business as the clearest example of a network-layer company converting its existing infrastructure advantage (330-plus global PoPs, 20 percent of internet traffic) into AI delivery value — a conversion that required no new physical infrastructure buildout but rather a software and services layer deployed on existing hardware that was already positioned at the edge of the global internet.

Why Cloudflare’s Network Position Makes AI Gateway Defensible Against Hyperscaler Competition

The strategic risk for Cloudflare’s AI Gateway and Workers AI business is that AWS, Azure, and Google Cloud each have financial and technical resources to build identical management and edge inference products within their existing cloud platforms, and that enterprise customers already running AI workloads on a single hyperscaler could be retained by that hyperscaler’s native AI management tools rather than routing through a third-party like Cloudflare. The defence against this risk is Cloudflare’s multi-cloud positioning: enterprises that use AI models from multiple providers — which, as of Q1 2026, is the majority of large enterprise AI deployments according to Cloudflare’s customer data — have a structural reason to prefer a neutral management layer like AI Gateway over any single hyperscaler’s native AI management tools, because a neutral layer does not create pricing dependency on a single provider and does not expose query data to a hyperscaler that is simultaneously a competitor in the foundation model market. An enterprise using both Azure OpenAI Service and Anthropic’s Claude (a common combination where GPT-4o handles general tasks and Claude handles compliance-sensitive document review) has an alignment problem with Microsoft’s native AI management tools: Microsoft Azure’s observability and cost tools are well-instrumented for Azure OpenAI Service calls but do not treat Anthropic API calls with the same native fidelity. Cloudflare AI Gateway is provider-neutral by design and commercial interest, because its business model depends on managing calls to all AI providers rather than favouring any single one. GitHub Copilot’s enterprise growth has been driven in part by Microsoft’s ability to bundle AI coding assistance with its existing developer toolchain — a distribution advantage that works in Microsoft’s favour for single-provider enterprise AI deployments but that creates a friction point for multi-provider enterprise AI architectures where Cloudflare’s neutrality is a commercial advantage rather than a disadvantage. Cloudflare’s Vectorize vector database product — which stores embedding vectors for retrieval-augmented generation applications at the edge, adjacent to the Workers AI inference endpoints that generate them — further deepens the AI infrastructure relationship by making Cloudflare the storage layer for the context data that AI inference calls retrieve, creating a data gravity effect that is structurally similar to how Amazon S3’s dominance in object storage has reinforced AWS compute adoption by keeping data and compute co-located within the same provider’s network.

What Cloudflare’s 100 Billion Inference Request Milestone Reveals About Where Startups Build AI Infrastructure

One hundred billion inference requests is a number produced by cumulative developer decisions. Every startup that chose Cloudflare Workers AI over a direct API call to a hyperscaler added to that total. The decision pattern is not primarily about cost, though Cloudflare’s pricing is competitive. It is about the combination of low lock-in risk and zero operational overhead. A startup that routes inference through Cloudflare’s AI Gateway has not committed to a specific model provider, a specific cloud vendor, or a specific inference architecture. That optionality has real value when the underlying model landscape is changing at the pace it changed between 2024 and 2026.

NeuralSwitch is interesting precisely because it makes a previously manual decision automatic. Multi-model routing — using the cheapest adequate model for each request type — is something sophisticated teams were doing in configuration files. NeuralSwitch makes it a platform default. The economics of AI product building have made this decision valuable: a startup spending $40,000 per month on inference with an 8x cost spread between frontier and small models has genuine P&L incentive to get routing right. The startups that will benefit most from NeuralSwitch are those running heterogeneous workloads where prompt complexity varies enough to justify the routing overhead.

The hyperscaler comparison matters but is asymmetric. AWS, Azure, and Google Cloud offer inference as part of a larger platform value proposition. Cloudflare offers inference as infrastructure for builders who want to stay independent of that platform consolidation. The 100 billion milestone is evidence that a meaningful segment of the developer market has made the independence bet. Whether that bet compounds into enterprise adoption or remains developer-tier depends on whether Cloudflare can bring enterprise-grade compliance, audit logging, and SLA guarantees to match the hyperscalers’ enterprise motion. That is the product gap the enterprise version of Cloudflare AI Gateway still has to close.

What Cloudflare’s 100 Billion Inference Requests Reveal About the Pricing Structure Behind AI Infrastructure Independence

The 100 billion milestone has been covered as a developer adoption story. The more important number Cloudflare has not published is the average revenue per inference request. Cloudflare Workers AI pricing uses a neuron-based compute abstraction rather than a flat per-request or per-token rate, which means 100 billion inference requests translates to a revenue figure that varies enormously by model size and request complexity. The hyperscalers publish exact per-token pricing. Cloudflare’s neuron abstraction makes apples-to-apples comparison deliberately difficult, which is a pricing strategy as much as a product decision.

The investigative question is who is actually capturing the margin on these 100 billion inferences. Cloudflare’s gross margin on Workers AI is structurally different from its CDN and security products, where bandwidth costs have been compressed through network scale over many years. GPU compute has no equivalent commodity dynamic: Nvidia’s H100 and H200 utilization pricing has not fallen the way bandwidth costs fell, and demand from hyperscalers, AI labs, and cloud providers is outpacing supply. Cloudflare is routing inference requests through GPU capacity it does not own, positioning on routing efficiency and developer ergonomics rather than on controlling the underlying compute. That margin structure looks more like a marketplace premium than an infrastructure moat. The developer who values Cloudflare’s multi-model routing and independence from a single provider is paying for orchestration, not for compute at cost.

The enterprise version of this story is also a pricing and commercial model story. Enterprise buyers pay premiums for accountability infrastructure: compliance attestations, audit logs, SLA guarantees, and the enterprise agreement mechanism that makes infrastructure purchases predictable within annual budget cycles. The hyperscalers have spent decades perfecting the enterprise agreement model, including committed spend tiers, negotiated credits, and dedicated technical account management that makes switching costs prohibitive even when a competitor’s unit pricing is lower. Cloudflare closing the enterprise gap requires not just audit logging and SLA product features but a commercial motion that competes with the enterprise agreement mechanism itself. The 100 billion inference requests establishes credibility in the developer tier. The commercial model that converts developer credibility into enterprise revenue at enterprise margins has not yet been publicly demonstrated, and that gap is the one that determines whether the independence bet compounds into durable enterprise revenue.

What Cloudflare’s 100 Billion AI Inference Requests Reveal About the Underground Developer Psychology Driving the Independence-First AI Infrastructure Movement

The 100 billion inference request milestone carries a specific meaning in developer culture that is not visible in the headline number. Cloudflare’s AI Gateway has reached scale because of a decision — made by hundreds of thousands of individual developers, independently — to route their AI inference through a layer that abstracts across model providers and retains user control over provider selection. That decision is not primarily a technical one. It reflects a value orientation: the developers who built into Cloudflare’s AI Gateway did so specifically because they distrust the lock-in structures that major model providers would prefer they accept. The 100 billion requests are a measure of how many inference calls were made by people who cared enough about AI infrastructure independence to make the architectural choice to care about it.

Developer culture has a social structure that looks chaotic from the outside but is highly legible from inside. Within that structure, infrastructure independence is a status signal. Choosing to depend on a model provider’s API directly — without an abstraction layer — is a statement about what you believe your options are. Choosing to route through an abstraction layer that lets you swap providers is a statement about what you believe your options should be. The first choice is made by developers who trust the dominant model provider not to change its pricing, its terms, its availability, or its capability parity in ways that hurt them. The second choice is made by developers who have read enough technical history to know that trust in platform providers is structurally fragile. The 100 billion requests are a tally of how many developers have thought through that question and arrived at the independence-first answer.

The gap between developer credibility and enterprise revenue at enterprise margins is ultimately a psychology gap, not a commercial gap. Enterprise procurement does not operate through the same value system as the developer underground. Enterprise buyers are not optimizing for infrastructure independence; they are optimizing for risk reduction, vendor accountability, and the ability to explain their technology choices to a risk committee. Cloudflare’s path from 100 billion inference requests to durable enterprise margins requires translating the independence-first developer argument into the risk-reduction enterprise argument — and those are not the same argument. The developers who built into Cloudflare’s AI Gateway did so because independence mattered to them. The enterprise buyers who will generate enterprise-margin revenue will adopt Cloudflare because vendor accountability and compliance auditability matter to them. That translation is the commercial work the milestone has not yet done.

28/06/2026
Arm’s AI Chip Royalty Revenue Became Its Primary Growth Driver

Arm Holdings Royalty Revenue From AI Chips Has Become the Primary Growth Driver and Compute Subsystem Licensing Is Expanding the Model

Arm Holdings reported royalty revenue of $1.1 billion in fiscal Q4 2026 (the quarter ended March 2026), a 46 percent year-over-year increase that reflected the accelerating shipment of AI chips — including Nvidia’s Blackwell B200 GPU family, Apple’s M4 processor series, Qualcomm’s Snapdragon X Elite, and Amazon’s Graviton 4 data center processor — that use ARM-designed processor cores and pay Arm a per-chip royalty each time a chip ships at commercial volume. Arm Holdings’ investor relations filings for FY2026 show total annual revenue reaching $4.7 billion (up 47 percent from FY2025’s $3.2 billion), with royalty revenue growing faster than licensing revenue for the second consecutive year — a mix shift that is commercially significant because royalties scale with semiconductor unit shipments and average selling prices without requiring Arm to add customers or renegotiate contracts. The royalty acceleration reflects two compounding factors: the transition from ARM v8 architecture to ARM v9 architecture (which carries a higher royalty rate per chip than v8) is now approximately 70 percent complete across major chip designs, and the average selling price of ARM-based chips is increasing as AI processing requirements push chip designers toward higher compute, higher memory bandwidth designs that carry larger royalty obligations. Arm does not disclose per-chip royalty rates publicly, but management has indicated that the blended royalty rate improvement from v8 to v9 is in the mid-single-digit percentage range as a fraction of chip ASP — a modest per-chip improvement that compounds into material revenue growth at the billions of chips that ARM-architecture designs ship annually. Qualcomm’s Snapdragon X commercial traction in AI PCs exemplifies the royalty revenue dynamic: each Snapdragon X Elite shipped generates an ARM v9 royalty payment at a higher rate than the Snapdragon 8 Gen series it replaces for comparable market segments, so Qualcomm’s AI PC market share growth directly expands Arm’s royalty revenue without requiring any change to the licensing arrangement between the two companies.

Arm’s Compute Sub-System (CSS) licensing program represents the structural innovation in Arm’s business model that has the most significant long-term implications for the company’s revenue per customer relationship. Traditional ARM licensing involves a chip designer licensing the ARM instruction set architecture (ISA) or a specific CPU IP core design, then integrating that core with other IP (GPU, memory controller, interconnect) to build a complete chip. Arm’s CSS program packages pre-integrated compute cluster designs — a CPU IP core, memory subsystem, interconnect, and coherency fabric that are already validated to work together — that chip designers can license as a complete module and integrate into a larger SoC without performing the architectural integration work themselves. CSS licensing carries a materially higher annual license fee than individual IP licensing, because the value delivered is not just the CPU IP but the engineering work of making a validated, production-ready compute cluster. Major semiconductor companies that have adopted CSS include MediaTek (for its Dimensity AI chiplets), Marvell Technology (for its data center interconnect and infrastructure processors), and several smartphone chip vendors that use CSS to accelerate their design cycles. The CSS model is Arm’s response to the competitive pressure from RISC-V — an open-source instruction set architecture that chip designers can use without royalty payments. RISC-V adoption has grown from negligible to approximately 3 to 4 percent of addressable embedded and microcontroller chip designs by 2026, but has not yet penetrated the high-performance computing segments (smartphones, data center CPUs, AI accelerators) where Arm earns the majority of its royalty revenue. TSMC’s 2nm N2 process node expansion is relevant to Arm’s royalty trajectory because TSMC N2 is being used for ARM v9-based designs from Apple (M5), Qualcomm (Snapdragon X successor), and MediaTek — each additional N2 tapeout at TSMC typically represents an ARM v9-based design that will generate royalties at the higher v9 rate.

How Arm’s Data Center Revenue Has Grown Beyond AWS Graviton

Arm’s data center CPU revenue was initially concentrated in Amazon Web Services, which began deploying its custom Graviton ARM-based server processors in 2018 and has expanded Graviton to power approximately 40 percent of AWS EC2 compute capacity by 2026. The data center story has broadened since 2023-2024 in ways that are materially increasing Arm’s data center royalty exposure: Microsoft Azure has deployed Arm-based Cobalt 100 processors in its cloud infrastructure, Google has deployed custom ARM-based Axion processors for internal and customer workloads, and Ampere Computing (an ARM-based server CPU startup backed by Oracle) has captured meaningful share in hyperscale-adjacent deployments. The Nvidia GH200 and GB200 superchips — the configurations that hyperscalers are buying at scale for AI training and inference — integrate Nvidia’s Grace CPU (ARM Neoverse V2 core design) with an H200 or B200 GPU on a high-bandwidth NVLink interconnect, making every AI accelerator sale at the datacenter hyperscaler level simultaneously an ARM v9 royalty event. Arm management has not disclosed the Grace royalty rate specifically, but the Neoverse V2 license agreement was disclosed as a multi-hundred-million-dollar arrangement at the time of the Nvidia-Arm acquisition attempt (2020-2022), and the per-chip royalty on each GH200/GB200 sold represents a significantly higher absolute royalty than a standard server CPU because the overall chip selling price is dramatically higher. The $700+ billion in cloud capex commitments from Amazon, Microsoft, and Google for AI infrastructure translates directly into a sustained multi-year demand signal for ARM-based data center processors — both hyperscaler-custom (Graviton, Cobalt, Axion) and standard Arm Neoverse-based server CPUs from Ampere and other vendors — that Arm’s management has explicitly cited as a visibility improvement in royalty revenue forecasting. The earlier trajectory of ARM’s data center penetration through AWS Graviton established the initial data center royalty stream that Arm’s FY2026 results now show has expanded to multiple hyperscaler-custom designs and Nvidia’s AI accelerator superchip configurations.

What Arm’s Stock Performance and Valuation Reflect About the Royalty Model’s Growth Ceiling

Arm Holdings stock (NASDAQ: ARM) traded near $200 per share by mid-2026, approximately four times its $51 IPO price from September 2023, giving the company a market capitalization of approximately $170 billion on roughly $4.7 billion in FY2026 revenue — a revenue multiple of approximately 36 times that reflects investor expectations of sustained high-rate royalty revenue growth as AI chip designs proliferate and ARM v9 adoption nears completion. The valuation premium relative to standard semiconductor companies (which trade at 5 to 8 times revenue) reflects Arm’s royalty model characteristic: revenue grows with each new ARM-based chip that ships globally without requiring Arm to add manufacturing capacity, hire proportional headcount, or incur proportional cost of goods sold. Arm’s operating margin profile — approximately 45 percent in FY2026 — is structurally above semiconductor manufacturers’ margins because royalties are essentially pure margin above the fixed costs of maintaining the ISA and supporting licensees’ design implementations. The growth ceiling question is the central investor debate: how much higher can royalty revenue grow if ARM already powers approximately 99 percent of smartphones, approximately 40 to 60 percent of data center CPUs by unit count, and a growing share of AI accelerators? Arm’s answer is that royalty revenue per ARM chip shipped has not yet peaked, because the shift from v8 to v9 architecture and the shift from general-purpose CPUs to AI-specific compute designs (higher ASP, higher royalty absolute value) will continue to expand revenue per unit even as total unit growth moderates from peak smartphone growth rates. IDC’s semiconductor market research for 2026 projects AI-accelerated chip demand to sustain ARM royalty revenue growth above 20 percent annually through 2027, with the primary risk factor being an acceleration of RISC-V adoption in edge AI and IoT applications that could reduce Arm’s royalty exposure in embedded markets while leaving the high-value data center and mobile segments intact. Financial Times technology coverage through Q2 2026 characterizes Arm as the most structurally advantaged company in the AI infrastructure supply chain because its revenue scales with AI chip deployment volume across every chip vendor rather than being concentrated in a single product line, hardware generation, or customer relationship — a diversification that Nvidia, TSMC, and ASML each lack in their respective portions of the AI chip supply chain.

What Discipline in the Licensing Model Produced When Everyone Else Was Building Chips

Jocko Willink’s framework is that discipline is the mechanism by which long-term advantage compounds — not inspiration, not market timing, not the right product at the right moment, but the sustained execution of a correct strategic choice under pressure to deviate from it. ARM Holdings’ royalty revenue growth in fiscal Q4 2026 is what 30 years of licensing discipline looks like when the market it embedded itself in becomes the most valuable infrastructure category in technology.

The discipline ARM maintained was the decision not to manufacture chips. This sounds like a simple business model choice. It required sustained resistance to a pressure that every generation of semiconductor expansion intensified: the argument that vertically integrating — owning the foundry, controlling the packaging, capturing the margin at the chip level rather than the architecture level — would produce better financial outcomes. Intel made that vertical integration bet and spent the 2010s defending a manufacturing lead that eventually became a liability when TSMC’s process technology outpaced its own. Qualcomm’s history is partly a story of the difficulties of designing chips when ARM’s architectural alternatives created commoditization pressure at the SoC level. NVIDIA built its own architectural alternatives in parallel with using ARM in mobile. In each case, the alternative to ARM’s pure licensing model created strategic complications that ARM avoided by maintaining the discipline not to compete with its own customers.

The 46% royalty revenue growth in the quarter reflects that discipline compounding across NVIDIA’s Blackwell GPUs, Apple’s M4, Qualcomm’s Snapdragon X Elite, and Amazon’s Graviton 4 simultaneously — four of the most significant chip programs in the market, all generating royalties for the same licensor. ARM did not build any of those chips. It built the architecture that made each of them possible, took its royalty, and allowed the chip designers to compete with each other for the market share that ARM collects regardless of who wins. The compute subsystem licensing expansion — the article’s additional revenue layer — is the same discipline applied to a new design scope: as chip architects integrate more subsystems at the IP level, ARM’s addressable royalty surface expands without ARM manufacturing anything. Discipline produced this. Thirty years of it.

25/06/2026
Oracle Cloud Infrastructure Is Taking AI Revenue From AWS and Azure

Oracle Cloud Infrastructure Is Taking AI Revenue From AWS, Azure, and Google

Oracle reported $14.3 billion in total revenue for fiscal Q4 2026 (ending May 31, 2026) — up 15 percent year over year — with cloud infrastructure revenue growing 53 percent annually to reach a $25 billion annualized run-rate and remaining performance obligations (contracted-but-not-yet-recognized future revenue) crossing $130 billion for the first time in the company’s history. Oracle’s investor relations disclosures show AI GPU cluster demand filling Oracle Cloud Infrastructure capacity faster than Oracle can build it — with management confirming on the Q4 earnings call that every GPU cluster Oracle has provisioned in 2026 has been sold before it came online, and that the constraint on Oracle’s cloud revenue growth is now data center construction pace and power provisioning capacity, not customer demand. A $130 billion backlog growing faster than a $25 billion annualized revenue run-rate is the most direct available measure that Oracle Cloud Infrastructure has shifted from a secondary cloud option to a primary AI compute destination for enterprises that cannot secure equivalent GPU cluster availability from AWS, Azure, or Google Cloud on comparable timelines.

Oracle’s position in the cloud market as recently as 2023 was that of a credible but tertiary player in a market structured around three incumbents. Gartner’s cloud infrastructure market share tracking shows AWS commanding roughly 30 percent of global cloud infrastructure revenue, Microsoft Azure at approximately 22 percent, and Google Cloud at 11 percent, with Oracle OCI below 5 percent and characterized primarily by customers running Oracle databases who found OCI the lowest-friction option for adjacent workloads. What changed between 2023 and 2026 was not primarily Oracle’s product quality — OCI had been technically competitive for years — but the arrival of a demand category, AI GPU compute, where the three incumbents were simultaneously supply-constrained and where Oracle had made early infrastructure commitments that gave it provisioned capacity at exactly the moment enterprise demand for that capacity reached its highest point. Oracle’s NVIDIA H100 GPU cluster buildout, accelerated through 2023-2024 ahead of broader AI demand confirmation, positioned the company to fill the availability gap that AWS and Azure left open during their own GPU capacity expansion programs. The capital commitments that Amazon, Microsoft, and Google made to AI infrastructure in 2026 have expanded the total market significantly, but they have not eliminated the gap Oracle exploited — enterprise customers who secured OCI GPU clusters in Q4 2023 and Q1 2024 are now under multi-year committed-spend contracts, and their contracted revenue is in Oracle’s $130 billion backlog.

What the $130 Billion Backlog Actually Signals

Remaining performance obligations represent contracted future revenue the customer has committed to purchase but Oracle has not yet recognized. A $130 billion RPO is not a guarantee of $130 billion in next-year revenue — Oracle will recognize roughly $25-30 billion of it in FY2027 — but it is a guarantee of customer commitment. Multi-year cloud contracts are directional infrastructure decisions: enterprises signing 3- to 5-year OCI committed-spend agreements have selected Oracle as a primary vendor, not a backup. The switching cost of migrating AI workloads off OCI — re-architecting pipelines, migrating data, retraining operations teams, renegotiating NVIDIA licensing and support terms — is high enough that contract renewal is the default behavior unless Oracle gives a customer a specific reason to leave. A $130 billion backlog growing by $20-25 billion per quarter is, structurally, a measurement of how many enterprise AI workload decisions have been made in Oracle’s favor and how durable those decisions are through the medium term.

The Oracle-Microsoft infrastructure interconnect partnership, active since 2024 and expanded to additional regions through 2025-2026, is the clearest market-structure signal in the AI infrastructure race. Under the partnership, enterprises can deploy workloads that span OCI and Azure through a unified management interface, with compute charges billed through whichever provider’s infrastructure runs the workload. The arrangement is commercially unusual because it acknowledges explicitly that Microsoft Azure alone cannot satisfy the AI GPU cluster demand of its enterprise customer base — and that routing overflow demand to Oracle is preferable to losing those customers to AWS or Google. For Oracle, the partnership provides Azure’s distribution reach and enterprise customer relationships at no direct sales cost. Microsoft’s AI revenue gap relative to its capex commitment reflects the same supply constraint Oracle has benefited from — Azure has been spending aggressively on AI infrastructure and still cannot satisfy all the GPU cluster demand from its installed enterprise base. The interconnect partnership resolves that tension while it persists, and Oracle’s contracted backlog captures the revenue from that resolution window regardless of what the hyperscaler capacity picture looks like when the contracts expire.

How Oracle Won AI Compute Without Playing the Hyperscaler Game

Oracle’s competitive advantages in 2026 are specific rather than broad. In AI GPU compute availability — particularly for NVIDIA H100 and H200 clusters where AWS and Azure had allocation queues measured in quarters during peak demand — OCI’s wait times have been materially shorter at comparable demand peaks, and OCI’s per-GPU pricing has been consistently at or below the hyperscaler rate cards for equivalent instance types. In Oracle database workloads, OCI is the natural home because running Oracle databases on AWS or Azure introduces licensing and support complications that OCI avoids by design. In dedicated single-tenant AI infrastructure for regulated industries — healthcare organizations with HIPAA requirements, financial institutions with data sovereignty constraints, federal contractors with FedRAMP obligations — OCI’s willingness to provision isolated customer-dedicated GPU clusters at pricing that matches the hyperscaler equivalents has differentiated Oracle in enterprise competitive evaluations where multi-tenant compute is disqualifying. Where OCI trails materially: managed services breadth (AWS has approximately 220 distinct managed services, OCI roughly 150), developer tooling ecosystem, geographic data center coverage outside North America and Western Europe, and the startup-to-enterprise pipeline that AWS and Azure have built through startup credit programs over fifteen years. The $700 billion AI infrastructure commitment from the Magnificent Seven in 2026 will eventually close the GPU availability gap that gave Oracle its market entry window — the question is whether Oracle can use the contracted backlog period to embed OCI more deeply in enterprise database and application estates before the availability advantage normalizes.

Why Enterprise AI Buyers Are Signing Multi-Year OCI Contracts

Enterprise infrastructure decisions in 2026 are being made under conditions that favor Oracle’s specific strengths: GPU cluster availability is the highest-priority variable for AI workload infrastructure selection, pricing transparency matters more than ecosystem breadth for workloads where the compute requirement is defined before the infrastructure is selected, and multi-year committed-spend economics reward the vendor that can guarantee delivery timeline over the vendor with the largest service catalogue. Reuters technology coverage through Q2 2026 characterizes Oracle’s cloud position as a specialist that has successfully exploited the AI compute bottleneck without becoming a full-stack hyperscaler competitor — a viable long-term position if AI infrastructure spending remains concentrated at the top of the enterprise budget stack. The strategic risk Oracle faces is the 2027 timeline: Gartner projects that hyperscaler GPU capacity reaches parity with enterprise demand by late 2027, at which point Oracle’s availability advantage normalizes and pricing becomes the primary competitive variable. Oracle’s answer to that risk is the depth-of-integration strategy — embedding OCI into enterprise database operations, analytics workflows, and application estates through APEX, Autonomous Database, and Oracle Analytics Cloud integrations that run materially better on OCI than on competing infrastructure. Whether that depth-of-integration moat is sufficient to retain backlog customers when their multi-year contracts expire is the key question Oracle’s FY2029 revenue will answer. For FY2027 and FY2028, the contracted backlog makes the answer clear: the AI compute window has already been converted into durable forward revenue regardless of what the competitive infrastructure market does in the interim.

How Oracle Wins AI Infrastructure Without Competing in the Model Layer

Aggregation theory — the framework Ben Thompson has applied most consistently to understanding platform power — distinguishes between platforms that control access to users and platforms that control access to supply. Google aggregates user attention and uses that control to set the terms of the advertising market. Netflix aggregates content rights and uses that control to negotiate with studios. Oracle’s position in enterprise AI is neither of those things. Oracle controls access to GPU compute at a price point and contract structure that enterprises buying multi-year AI infrastructure commitments find more predictable than the hyperscaler alternatives — and it has no competing AI model product that creates the channel conflict that makes AWS, Azure, and Google Cloud structurally uncomfortable for enterprises that want neutral infrastructure.

The $130 billion contract backlog that OCI has accumulated is not primarily a signal about Oracle’s model capabilities. It is a signal about enterprise procurement decisions made by companies that need GPU capacity and are specifically avoiding concentrating that dependency in the same vendor from whom they buy their foundational AI services. A pharmaceutical company using Microsoft’s Copilot and Azure OpenAI for internal productivity tooling has a rational reason to prefer Oracle or another provider for its GPU training infrastructure: it does not want Microsoft to have both the client-facing AI model relationship and the underlying compute contract. The structural separation between model vendor and infrastructure vendor has commercial value to the enterprise even when the infrastructure vendor’s technology is not meaningfully differentiated from the hyperscaler equivalent.

Oracle’s $130 billion backlog reflects this procurement logic at scale. The enterprises signing 3-5 year OCI contracts are not doing so because OCI’s performance benchmarks consistently outperform AWS or Azure. They are doing so because Oracle occupies the “neutral compute” position in the AI infrastructure market — a position that AWS, Azure, and Google Cloud structurally cannot occupy because each of them is also selling the frontier AI models that enterprise AI buyers are using as their primary application layer. Oracle’s competitive moat in enterprise AI is not technical leadership. It is the absence of a model business — which looks like a weakness from a product innovation standpoint and functions as a trust advantage in the enterprise procurement conversations that are producing the $130 billion backlog.

What the Enterprise Procurement Chain Behind Oracle’s $130 Billion AI Backlog Actually Looks Like

A $130 billion contract backlog is a number that needs to be followed through the organizations that signed those contracts. Enterprise AI infrastructure commitments of this size do not emerge from a CTO’s preference for a particular vendor’s GPU throughput metrics. They emerge from procurement processes that involve budget committees, vendor risk assessments, legal review of multi-year commitment structures, board-level capital allocation approvals, and finance team analysis of vendor diversification mandates. The people who matter in a $130 billion GPU compute commitment decision are not the ones quoted in Oracle’s investor presentations. They are the procurement officers, the CFOs, and the board risk committees who authorized the capital.

Oracle’s neutral compute positioning resonates in enterprise procurement because the procurement officers have a specific mandate concern: dependency concentration risk when their infrastructure vendor also competes with their AI model vendor. AWS, Azure, and Google Cloud are all pursuing enterprise AI application revenues that compete with enterprise-built and third-party AI applications. An enterprise that signs a five-year GPU compute commitment with AWS is signing with an entity that may be competing with its own AI application layer in eighteen months. The procurement chain recognizes that risk because recognizing it is the procurement chain’s job. Oracle’s $130 billion backlog reflects, in significant part, that risk committee consensus — not just Oracle’s technical superiority.

What an investigative inquiry would want to know is the composition of that backlog. How much is driven by genuine compute-cost efficiency versus vendor diversification mandates? How much is concentration from a small number of very large customers versus broad enterprise distribution? What are the exit clauses, and what do they reveal about how committed these customers actually are? A $130 billion number with three customers and flexible exits means something structurally different from a $130 billion number with forty customers on strict multi-year terms. Oracle has not disclosed the backlog composition at that level of detail. The vendor narrative is that demand is strong and structural. The follow-the-money question is who signed, why, and what they have the right to do if the rationale changes.

22/06/2026
Palo Alto Networks Pushed Platform Consolidation and It Worked

Palo Alto Networks Pushed Platform Consolidation and It Worked

Palo Alto Networks reported $2.3 billion in revenue for its fiscal Q3 2026 — up 15 percent year-over-year — with its platformisation strategy producing the specific commercial outcome its management team had staked the company’s growth narrative on: customers consolidating multiple point-solution security vendors onto Palo Alto’s integrated platform were generating significantly higher annualised recurring revenue per account than customers running individual products. Palo Alto Networks’ investor disclosures show that accounts with three or more platform modules — combining its SASE (Secure Access Service Edge), Cloud Security, and Security Operations products — churn at substantially lower rates than single-module customers and expand faster over a two-to-three-year relationship. The platformisation bet, which Palo Alto announced aggressively in early 2024 and which initially spooked investors when it offered free product trials to accelerate consolidation, appears to be generating the land-and-expand economics that justify the short-term revenue deferral it required.

The cybersecurity market’s structural shift toward platform consolidation is one of the defining buyer-behaviour changes of 2025-2026. Enterprise security buyers who spent the 2018-2023 period assembling best-of-breed point solutions — individual vendors for endpoint detection, cloud workload protection, network security, identity management, email security, and security operations — are now evaluating the total cost of running 20-35 separate vendor relationships against the operational overhead and integration complexity that stack creates. AI-driven cyber attack sophistication has accelerated this evaluation: a security stack that requires manual correlation of alerts across 15 different vendor consoles cannot respond to AI-accelerated attacks that move from initial access to lateral movement in minutes rather than hours. The case for consolidation is now being driven by operational necessity rather than cost alone.

What Platformisation Actually Means in Practice

Palo Alto’s platformisation strategy is built around three product families that can be sold individually or as a unified platform. The first is Prisma SASE — a cloud-delivered network security product that combines secure web gateway, cloud access security broker, zero-trust network access, and SD-WAN into a single cloud service that replaces the fragmented collection of network security appliances many enterprises operate. The second is Prisma Cloud — a cloud security posture management and cloud workload protection platform that monitors cloud infrastructure across AWS, Azure, and GCP for misconfigurations, vulnerabilities, and runtime threats. The third is Cortex XSIAM — an AI-driven security operations platform that replaces traditional SIEM (Security Information and Event Management) systems with a model that ingests security telemetry at much larger scale and applies machine learning to reduce alert volume and prioritise genuine threats.

The commercial logic is that an enterprise running all three families through Palo Alto is spending more per year than it would on any individual product, but is replacing a larger number of point-solution vendor contracts that together cost more than the consolidated platform price. Enterprise security procurement teams have validated this math in enough RFP processes that platformisation consolidation is now a standard consideration in annual security budget cycles rather than a novel concept requiring extensive internal advocacy. Regulatory AI risk frameworks in financial services and healthcare have also created demand for unified audit trails and evidence of comprehensive security posture — requirements that fragmented point-solution stacks struggle to satisfy without significant manual integration effort, and that consolidated platforms address natively.

How Palo Alto Is Separating From CrowdStrike in the Platform Narrative

Palo Alto and CrowdStrike are the two companies most associated with the cybersecurity platform consolidation narrative, but they have attacked it from different starting positions. CrowdStrike’s Falcon platform originated in endpoint detection and response — it built outward from the endpoint into identity security, cloud workload protection, and threat intelligence. Palo Alto originated in network security — its next-generation firewall business established its enterprise relationships, from which it expanded into cloud security and security operations. The two platform stories therefore land differently with different security buyer personas: CrowdStrike’s narrative resonates most strongly with security operations teams focused on endpoint and identity visibility; Palo Alto’s resonates most with network and infrastructure security teams managing cloud and hybrid environments.

The distinction matters for understanding which accounts each company is likely to consolidate versus which it will share. A financial services enterprise that already runs CrowdStrike across 50,000 endpoints and trusts its Falcon platform for endpoint detection is unlikely to replace it with Palo Alto’s endpoint product; but that same enterprise may adopt Palo Alto Prisma SASE for its network security layer and Cortex XSIAM for security operations, creating a multi-platform outcome rather than a single-vendor outcome. The security market’s actual trajectory in 2025-2026 is less about one platform winning across all layers and more about two or three platforms each winning across specific layers — with the integration work between platforms becoming the residual complexity that both companies sell professional services to manage. Gartner’s cybersecurity market research characterises this as a “platform of platforms” outcome rather than a single-vendor winner-takes-all scenario — the consolidation is real, but the number of remaining platforms stabilises at two to four rather than collapsing to one. AI agent orchestration in enterprise workflows has added a security dimension that Palo Alto is addressing through Cortex XSIAM’s AI-driven detection — a layer where the platform’s ability to correlate telemetry across network, cloud, and endpoint gives it an advantage that point-solution vendors cannot replicate without the same breadth of data.

The Free-Trial Strategy and Why It Deferred Revenue to Build ARR

Palo Alto’s controversial decision in early 2024 to offer its Cortex XSIAM platform to existing customers at no charge for a defined trial period — a move it called “platformisation acceleration” — was widely misread as a sign of pricing weakness or competitive desperation. The actual strategic logic was a customer acquisition model borrowed from enterprise SaaS: offering a premium product at zero cost for a defined period to existing customers converts a theoretical sales conversation into a live deployment, at which point the switching cost of removing the product from production creates a negotiating position for paid conversion that the vendor did not have before deployment.

The XSIAM trial strategy required Palo Alto to defer approximately $400-600 million in revenue that would have been recognised sooner under a traditional paid deployment model. That deferral produced the guidance shortfall that rattled investors in early 2024. What it also produced, by fiscal Q3 2026, is a cohort of enterprise accounts that have run Cortex XSIAM in production for 12-18 months, have built security workflows around it, have trained their security operations teams on it, and have generated 18 months of historical telemetry that makes the platform more valuable with each passing month. Converting those accounts to paid contracts at renewal has proceeded at a higher rate than Palo Alto’s own internal targets, confirming that the trial-to-paid conversion model works in enterprise security when the product generates genuine operational dependency during the trial period. S&P Global’s cybersecurity market analysis through Q1 2026 shows Palo Alto gaining enterprise account share in security operations at the expense of traditional SIEM vendors including IBM QRadar and Splunk — the segment where the XSIAM free trial was concentrated.

The Risk That Remains in the Consolidated Platform Bet

Platform consolidation strategies carry a specific failure mode: a security incident caused by a platform failure affects every layer simultaneously rather than being contained within one product’s scope. The CrowdStrike July 2024 outage — in which a faulty content update to the Falcon sensor caused millions of Windows systems to crash globally — illustrated the systemic risk that single-platform concentration introduces. Enterprises that had consolidated their endpoint security entirely onto Falcon had no fallback; enterprises that maintained some platform diversity could route around the affected product. The incident did not slow enterprise platform consolidation meaningfully in 2025, but it permanently altered the risk conversation: enterprise security buyers now routinely ask consolidated platform vendors how their products fail safe and what redundancy exists at the product layer.

Palo Alto has addressed this directly in its enterprise sales conversations, emphasising the modular architecture of its platform products — each family (SASE, Cloud, XSIAM) can be operated independently rather than requiring the entire platform to function for any individual component to operate. The modularity argument is real but partial: a security operations team that has built its workflows around Cortex XSIAM’s unified telemetry stream cannot simply substitute another product in the event of a Palo Alto service disruption without disrupting those workflows. The dependency that makes consolidated platforms commercially sticky is the same dependency that creates operational risk under failure scenarios. Managing that risk is now a central component of enterprise security architecture conversations — and it is a conversation that Palo Alto’s sales and solutions engineering teams are better equipped to have in 2026 than they were in 2024, when the CrowdStrike incident first made platform concentration risk a board-level security topic.

Why Distributed Security Responsibility Equals No Security Accountability

In security, complexity is the enemy of ownership. When an enterprise operates 25 distinct security vendor relationships — endpoint detection from one vendor, cloud workload protection from a second, identity security from a third, network traffic analysis from a fourth — every potential failure point has a different contractual owner. The SIEM vendor sees the alert. The endpoint vendor controls the remediation tool. The cloud security vendor manages the posture gap. The identity vendor holds the authentication log. When a breach happens, and it will, each vendor can point to the boundary of their product’s scope. No single team inside the enterprise owns the sequence of events connecting initial access to lateral movement to data exfiltration. That is not a technology problem. It is a leadership problem wearing a technology mask.

Jocko Willink’s principle of extreme ownership is simple: the person responsible for an outcome owns everything that contributes to it, including the failures of the systems they depend on. Applying that principle to enterprise security means the CISO who owns the security outcome owns the entire stack — not the individual contractual boundaries between vendors. A security architecture that distributes responsibility across 25 vendors distributes accountability in a way that makes clean ownership structurally impossible. When the breach investigation report arrives, the narrative almost always reveals that the signals were present across multiple vendor dashboards simultaneously and that no single team had both the visibility and the authority to connect them fast enough to intervene. The distributed architecture was not a cost-optimisation failure. It was an accountability failure that the architecture made inevitable.

Palo Alto’s platformisation thesis addresses this directly, and the commercial traction it has produced in the two years since the company staked its growth narrative on it confirms that CISOs are recognising the ownership problem. A CISO running Palo Alto’s three integrated product families — Prisma SASE for network security, Prisma Cloud for cloud workload protection, Cortex XSIAM for security operations — owns a single integrated telemetry layer with one escalation path, one quarterly business review, and one contract to hold accountable when the system misses something. That accountability structure does not exist in a 25-vendor stack, because no individual vendor accepts responsibility for the aggregate outcome. The enterprises consolidating onto Palo Alto are not only buying operational efficiency or lower total vendor cost. They are buying back the ability to own their security posture without apology — and for a CISO who answers to a board after a breach, that clarity of ownership is worth more than any individual product’s feature checklist.

What the Enterprise Buyer Experiences When Security Platform Consolidation Actually Happens

Julie Zhuo’s product-empathy framework starts from the lived experience of the person using the product, not the intended experience described in the vendor’s case study. Applied to Palo Alto’s platform consolidation story, this means asking not whether the architecture is structurally superior to a 25-vendor stack — it clearly is — but what the enterprise security buyer actually experiences during the consolidation process.

The first experience is a parallel-run period that the consolidation narrative rarely describes honestly. Enterprise security teams do not decommission legacy tools when the new platform is installed. They run both systems simultaneously, often for 12 to 18 months, because no security operations leader will accept a detection gap during the transition period. The parallel run doubles operational complexity before it reduces it — security analysts must monitor outputs from both environments, triage alerts from two detection systems, and maintain expertise in tools they know they are eventually deprecating. The efficiency gain that the consolidated platform promises is real but deferred. The immediate experience of consolidation is more work, not less.

The second experience is internal political friction. The teams that owned the legacy tools — endpoint security, network monitoring, cloud posture management — built their expertise, their vendor relationships, and often their career identities around those specific tools. Consolidation on Palo Alto does not just change the software; it changes the organizational structure of who owns security decisions and whose expertise is now the centre of gravity in the security operations team. That friction is not a product failure — it is the normal human response to a major organisational change. The third experience is the operational-readiness lag: the platform can be technically installed and reporting events from day one, but the security operations team is not yet trained to use the new detection logic, the new response playbooks, or the new investigation workflows effectively. The free-trial strategy accelerated the procurement decision. It did not accelerate the operational readiness. The honest product-empathy reading of platform consolidation is: the architecture is better, the path to realising that better architecture takes longer and costs more in disruption than the consolidation narrative acknowledges, and the enterprise buyer who understands that going in is better positioned than the one who expected the efficiency gains to arrive in quarter one.

19/06/2026
AMD’s AI Chip Revenue Is Growing but Still Far Behind Nvidia

AMD’s AI Chip Revenue Is Growing but Still Far Behind Nvidia

AMD’s data center GPU segment generated $3.7 billion in Q1 2026 revenue, driven by continued deployment of MI300X accelerators at Microsoft Azure, Oracle Cloud, and Meta’s AI inference infrastructure — a result that confirms AMD has secured a durable second-tier position in the AI accelerator market while illustrating the scale of the gap it still needs to close against Nvidia. AMD’s Q1 2026 earnings showed data center GPU revenue growing 80 percent year-over-year, the fourth consecutive quarter of strong growth in the segment, but Nvidia’s data center revenue in the same period exceeded $39 billion — a ratio that makes AMD’s position look like a distinct competitor rather than a credible challenger at the top of the market.

The distinction matters for how the AI accelerator market is characterised. AMD is not a niche alternative to Nvidia; it has secured real commitments from three of the world’s largest AI infrastructure buyers and is embedded in production inference workloads at scale. But AMD is also not competing with Nvidia for the same purchase decisions at the same customers. The MI300X’s commercial success has concentrated in inference — running trained models against new inputs — rather than training, where Nvidia’s H100 and H200 Blackwell GPUs dominate the large-scale cluster deployments that drive the largest purchase orders. The inference-vs-training split in AMD’s deployment base is not a weakness; it reflects a deliberate market positioning decision that has allowed AMD to grow revenue without winning head-to-head against Nvidia on the workload type where Nvidia’s CUDA software ecosystem advantage is strongest. Nvidia’s Q1 FY27 $81 billion revenue quarter illustrates the scale of the training-cluster market that AMD is not yet competing for at full scale.

Where MI300X Deployments Actually Landed

The three largest confirmed MI300X deployments in 2025-2026 share a common characteristic: they are at hyperscalers with the in-house engineering capacity to work around CUDA’s absence by investing in ROCm software optimisation at scale. Microsoft Azure launched MI300X-based virtual machine instances in mid-2025, targeting inference workloads where customers are running open-source models — Llama, Mistral, Falcon — rather than models that have been specifically optimised for Nvidia GPU memory architecture. Oracle Cloud Infrastructure became AMD’s largest enterprise cloud deployment partner for MI300X, positioning the accelerators as an alternative to Nvidia for customers facing GPU availability constraints during periods of high demand. Meta has disclosed using MI300X for portions of its internal AI inference infrastructure, primarily for serving recommendation models and content ranking systems where throughput at a given cost per query is the primary performance metric — a deployment confirmed in Meta’s Q1 2026 earnings disclosures as part of the company’s broader AI infrastructure diversification away from single-vendor GPU dependency.

Each of these deployments reflects a specific economic argument for MI300X rather than an assertion that it outperforms Nvidia across all workloads. At Microsoft Azure, MI300X instances offer a lower cost per token for inference on specific open-source models because AMD has invested in ROCm optimisation for those model architectures. At Oracle, the argument is availability — MI300X hardware is accessible on timescales where H100 allocation queues extend months. At Meta, the argument is cost-per-query at inference scale, where the volume of recommendation requests is high enough that even modest per-query cost advantages justify the engineering investment in non-CUDA infrastructure. Broadcom’s custom XPU deployments at Google and Meta reflect the same pressure: hyperscalers are actively building alternatives to Nvidia procurement dependency, and AMD is one of those alternatives alongside custom silicon programmes. SemiAnalysis’s AI chip deployment tracking through Q1 2026 shows MI300X’s share of inference-workload GPU hours at the three largest cloud providers consistently growing quarter-over-quarter.

AMD’s Revenue Gap From Nvidia in AI Accelerators

The revenue ratio between Nvidia and AMD in AI accelerators has not narrowed meaningfully even as AMD’s absolute revenue has grown. Nvidia’s Blackwell-generation accelerators have captured the large-scale training cluster market at a higher price point than the prior H100 generation, which has expanded Nvidia’s total AI revenue faster than AMD’s 80 percent year-over-year growth can close the gap. The gap between $3.7 billion and $39 billion in a single quarter is not a trajectory problem — AMD’s growth rate is real — but a market-share problem: Nvidia is growing at comparable rates off a much larger base, and the training-cluster market that drives Nvidia’s highest revenue per accelerator is the segment AMD has not entered at comparable scale.

The software ecosystem gap explains the structural difficulty. CUDA, Nvidia’s parallel computing platform and programming model, has been the default development environment for AI research and production engineering since 2007. The models that enterprise AI teams have trained, the inference optimisations they have developed, and the deployment pipelines they have built are all CUDA-native. Migrating a production AI workload from CUDA to AMD’s ROCm is an engineering project that most organisations have not prioritised when Nvidia hardware is available, because the cost of the migration exceeds the cost savings on the hardware at typical deployment scales. The customers who have migrated to MI300X are either hyperscalers with the engineering capacity to absorb migration costs, or new deployments where a team is building a pipeline from scratch and can choose ROCm from the outset. AMD’s MI350 roadmap targets the training market more directly than MI300X did, with architectural improvements aimed at reducing the performance gap that makes CUDA migration less economically compelling.

What the MI325X Changes for 2026

AMD launched the MI325X as an incremental upgrade to the MI300X in late 2025, with increased HBM3E memory capacity and bandwidth improvements that address the specific constraint — memory ceiling — that limits MI300X on the largest inference batch sizes. The MI325X does not close the training-cluster performance gap against Nvidia’s Blackwell series, which is designed around a very different memory architecture and interconnect scheme for multi-GPU training jobs. What MI325X does is extend AMD’s competitiveness in inference for larger models and larger batch sizes, which is the workload category where AMD’s commercial traction has been strongest.

The MI350X, announced for late 2026 production, is the generation where AMD has stated ambitions to compete more directly with Nvidia for training cluster deployments. MI350X is expected to use AMD’s CDNA4 architecture with a substantially larger compute die and improved interconnect for multi-GPU configurations. Whether it succeeds in winning training cluster commitments at hyperscalers depends as much on ROCm software maturity and CUDA-compatibility layer development as on hardware specifications — the hardware gap is closing faster than the software gap, and the software gap is where AMD’s commercial ceiling currently sits. TSMC’s N2 process ramp supplies both AMD and Nvidia with leading-edge silicon, which means AMD’s hardware performance trajectory is constrained by design and architecture rather than by process technology access — the same N2 node is available to both.

Custom Silicon as the Third Option for Hyperscalers

The framing of the AI accelerator market as an Nvidia-vs-AMD competition understates a third trajectory that is growing in parallel: hyperscaler custom silicon. Google’s TPU, Meta’s MTIA, Amazon’s Trainium and Inferentia, and Microsoft’s Maia all represent investments in accelerators that bypass both Nvidia and AMD for specific workloads where the hyperscaler’s software team can optimise a custom design more effectively than a general-purpose GPU architecture. The custom silicon programmes do not eliminate Nvidia or AMD from the hyperscaler market — they complement them by handling the workloads that are most cost-sensitive and most amenable to specialised optimisation, while Nvidia and AMD handle the workloads where flexibility and general-purpose performance are more valuable than unit economics at fixed workload types.

For AMD, the growth of custom silicon programmes at hyperscalers is a different kind of competitive pressure than Nvidia’s dominance. Nvidia’s lead is a software ecosystem problem; custom silicon programmes are a customer insourcing problem. Customers that invest in their own accelerator designs are reducing their dependency on both Nvidia and AMD simultaneously, which limits the total addressable market for merchant silicon in the long run even as AI infrastructure spend grows. AMD’s strategic response has been to pursue customers who do not have the scale to justify custom silicon investment — enterprise AI buyers, mid-tier cloud providers, research institutions — where the cost of building a proprietary accelerator is prohibitive and the choice is between Nvidia and AMD rather than between external vendors and internal design. The revenue growth demonstrates that segment is commercially viable; the trajectory will depend on whether AMD’s software investment in ROCm can make the migration argument compelling at enterprise scale beyond the hyperscalers who have already committed.

The Software Stack Is the Real Product

Steve Jobs told Stanford graduates in 2005 that you cannot connect the dots looking forward — you can only connect them looking backward. In AMD’s case, the dots connect to a pattern the semiconductor industry has traced before: a company with excellent hardware and a structural disadvantage in the developer ecosystem that makes the hardware less commercially accessible than its specifications suggest it should be.

The MI300X’s benchmark performance is not in question. At specific inference tasks — particularly the memory-bandwidth-intensive workloads where HBM3E capacity gives AMD an architectural edge — the chip competes effectively. The question that the revenue gap reveals is not whether the chip is good. The question is why the chip’s customers are mostly the three or four hyperscalers who have the engineering teams to bear the switching cost.

The answer is ROCm. Nvidia’s CUDA ecosystem is not primarily a collection of compute primitives — it is fifteen years of accumulated developer tooling, library support, debugging environments, profiling tools, and documented workflows that lower the cost of integrating Nvidia hardware into any AI system. ROCm is technically functional; the gap is in the depth of the ecosystem surrounding it. An enterprise AI team evaluating AMD MI300X is not evaluating the chip — they are evaluating whether their engineers’ existing skills, the frameworks their models run on, and the libraries they depend on will work without modification. For most enterprise teams below the hyperscaler tier, the answer is still uncertain enough to make Nvidia the lower-risk choice regardless of the hardware economics.

This is the pattern Jobs understood about Apple’s Macintosh era against Microsoft’s platform dominance: the product that wins is not always the product with the best specs. It is the product that costs the least to use given everything the developer already knows and has built. AMD is building better chips. The real product it needs to build is the one that reduces the gap between what an Nvidia-trained engineer knows and what an AMD-deployed system requires. ROCm investment is heading in that direction; the revenue data measures how far it still has to go.

What the AMD and NVIDIA Enterprise AI Chip Deals Actually Show About Who Controls the Market

Bob Woodward’s investigative method is to follow the specific decision chain rather than the headline metric. The headline metric in the AI accelerator market is revenue gap: NVIDIA’s quarterly GPU revenue is several multiples of AMD’s. That number is accurate. The investigation that matters for understanding who controls the market is not the quarterly delta but the structure of the procurement decisions that produced it — specifically, which buyers chose AMD, under what constraints, and what that reveals about where NVIDIA’s market control is strongest and where it is not.

NVIDIA’s enterprise AI hardware position is not primarily a silicon advantage. MI300X and AMD’s Instinct line benchmark comparably to NVIDIA hardware in several inference workloads — the performance gap in training is real and significant, but inference-optimized deployments are a large and growing share of enterprise AI compute. NVIDIA’s advantage is an enterprise sales architecture built through fifteen years of CUDA developer tooling, direct-to-customer sales relationships at hyperscalers and AI research labs, and a software ecosystem (cuDNN, cuBLAS, NCCL) so deeply embedded in the ML developer workflow that migrating off it requires retraining teams, rewriting libraries, and accepting a productivity hit during transition. The investigation of AMD’s 2026 wins shows a consistent pattern: AMD is closing deals in workloads where CUDA’s network effect is weakest — inference-only clusters at hyperscalers, Meta’s custom-model inference pools, Microsoft Azure’s inference-optimized VM families where GPU vendor mixing is acceptable. Those aren’t AMD defeating NVIDIA in the AI chip market; they are AMD finding specific market pockets where the switching cost of CUDA is lower than the cost of NVIDIA’s pricing premium.

Following the specific procurement decisions leads to a different picture than the aggregate revenue gap implies. AMD’s 2026 commercial AI revenue is concentrated in a small number of large hyperscaler contracts for specific inference workloads, not distributed across the enterprise customer base the way NVIDIA’s training-oriented deployments are. The concentration makes AMD’s position more fragile than the revenue growth rate suggests — a single hyperscaler decision to consolidate on NVIDIA or a custom silicon alternative would produce a large discrete revenue drop rather than a gradual share erosion. The investigation of what AMD actually controls in 2026 shows a real but narrow market position, in specific workload categories, at specific customers. That is more precise than “second place in AI chips” and more useful for understanding how the market actually works.

17/06/2026
Snowflake and Databricks Are Converging on the Same AI Data Platform

Snowflake and Databricks Are Converging on the Same AI Data Platform

Snowflake’s Q1 FY2027 product revenue reached $996 million — its first brush with the $1 billion quarterly mark — growing 26 percent year-over-year under CEO Sridhar Ramaswamy, who has spent the 18 months since taking over repositioning the company’s roadmap around AI workloads. In the same period, Databricks crossed $3.5 billion in annualised revenue, growing above 50 percent year-over-year, and is preparing for an IPO that analysts are pricing in the $80-100 billion range. Snowflake’s Q1 FY2027 investor materials and Databricks’ most recent funding disclosures confirm that the two largest independent data platforms are now competing for the same enterprise budget category — AI-ready data infrastructure — despite having been built for different purposes from opposite ends of the same problem. The convergence is creating a consolidation moment in enterprise data architecture that CIOs and chief data officers are being forced to navigate without a clear answer about which platform wins.

Snowflake was built as a cloud-native SQL data warehouse: governed, performant, accessible to business analysts through standard SQL interfaces, priced on compute and storage consumption. Its architecture separated storage from compute in a way that made it radically more flexible than on-premise data warehouses and drove one of the most successful enterprise software IPOs in history in 2020. Databricks was built as a unified analytics platform on top of Apache Spark, designed by the academic team that created Spark and optimised for data engineering and machine learning workloads that required Python notebooks, distributed compute, and direct access to raw data lakes rather than warehouses. The two platforms attracted different buyers — Snowflake’s SQL-first approach won with analytics and BI teams; Databricks’ code-first approach won with data science and ML engineering teams — and for several years co-existed without direct competition in most enterprise accounts.

Snowflake and Databricks Started at Opposite Ends of the Same Stack

The architectural gap that kept the two platforms non-competing has closed from both sides. Snowflake added Snowpark — a framework enabling Python, Java, and Scala workloads to run directly in Snowflake — and acquired Neeva (an AI search company) to accelerate its AI feature roadmap. Snowflake Cortex, the company’s AI/ML layer, provides large language model inference, text-to-SQL capabilities, and document AI directly within the Snowflake environment, enabling analysts who have never written Python to run LLM-powered queries against their governed data. Databricks, moving in the opposite direction, added Databricks SQL — a high-performance SQL warehouse that competes directly with Snowflake’s core competency — and has aggressively marketed the Lakehouse architecture as a unified replacement for the Snowflake-plus-Databricks two-platform approach that many enterprises currently operate.

The strategic logic of both movements is the same: the AI era has elevated the importance of the data layer, and the platform that wins the AI data layer wins a multi-decade renewal of enterprise software spend. Cloud infrastructure capex is growing at rates that reflect AI workload growth, and both Snowflake and Databricks are positioned to capture the application layer above that infrastructure if they can deliver the governed, accessible, AI-augmented data platform that enterprises actually need. The problem for enterprise buyers is that both platforms are credibly claiming to be that platform, and neither has yet demonstrated that its historically weaker side — Snowflake’s ML credentials, Databricks’ governance and BI credentials — has fully caught up to the other’s core strength.

What Cortex and Mosaic AI Actually Deliver

Snowflake Cortex and Databricks Mosaic AI are the respective AI product layers that each company is betting on to differentiate in the AI era. Cortex provides LLM functions accessible via SQL: COMPLETE (text generation), EXTRACT_ANSWER (question answering over documents), SENTIMENT, SUMMARIZE, and TRANSLATE. These are high-level, low-friction functions that allow a data analyst to run AI against their Snowflake data without writing Python or managing model infrastructure. The value proposition is accessibility — the analyst who has been using SQL for a decade can now apply AI to their data without crossing a technical threshold they have not previously had to cross.

Mosaic AI on Databricks targets a different user: the ML engineer or data scientist who wants to fine-tune foundation models on proprietary data, run large-scale model training on distributed Databricks clusters, and deploy models into production with MLflow tracking. The Databricks approach assumes a higher technical floor and delivers deeper capability at that floor — model customisation, vector search, AI agent tooling, and the Unity Catalog governance layer that bridges ML model management with data governance. The practical division is that Cortex is winning with centralised analytics teams who need AI features without ML expertise, while Mosaic AI is winning with data science organisations that are building bespoke AI products. Enterprise AI cost management is a concern on both platforms: Cortex’s per-call LLM pricing and Mosaic AI’s GPU compute charges add cost layers that data platform budgets did not previously carry.

Microsoft Fabric as the Third Competitor

The Snowflake-Databricks duopoly framing obscures a significant third force: Microsoft Fabric, announced in 2023 and generally available since late 2023, which attempts to unify the data engineering, analytics, and AI layers within Microsoft’s existing enterprise ecosystem. Fabric integrates OneLake storage, Synapse Analytics, Power BI, Azure ML, and Real-Time Intelligence into a single governance and management surface. For enterprises already paying for Microsoft Azure and Microsoft 365, Fabric’s pricing is bundled in ways that make standalone Snowflake or Databricks economics harder to justify to a CFO — not because Fabric has matched either platform’s depth, but because the incremental cost of Fabric for an existing Microsoft customer is often near zero relative to the existing enterprise agreement. Microsoft’s positioning is laid out on the Fabric product page.

Snowflake and Databricks are both aware of the Microsoft bundling risk and have positioned their independence — and their multi-cloud neutrality, running natively on AWS, Azure, and Google Cloud — as the differentiator that Fabric cannot replicate. A company standardised on Fabric is a company standardised on Azure; a company on Snowflake or Databricks can shift cloud providers without losing their data platform investment. The precedent from enterprise workflow platforms is instructive: platform independence has consistently commanded a premium when the alternative is lock-in to a single hyperscaler’s ecosystem, and the enterprise data category — where data gravity is even higher than workflow gravity — may prove more resistant to hyperscaler consolidation than adjacent categories. Whether that premium is sufficient to sustain two independent unicorns plus an IPO candidate in a category that Microsoft is bundling aggressively is the question that will resolve in the next platform purchasing cycle.

Why Enterprises Are Running Both and Whether That Can Last

The most common enterprise data architecture in 2026 is a combination of Snowflake for governed SQL analytics and Databricks for ML and data engineering, with data shared between them via open formats (Delta Lake, Iceberg, Parquet) that both platforms support. This two-platform approach is expensive — licencing both platforms for a large enterprise adds several million dollars annually to data infrastructure costs — and creates operational complexity around data synchronisation, access governance, and skills development. Databricks’ messaging has explicitly targeted this two-platform reality as the argument for consolidating onto a single Lakehouse; Snowflake’s messaging has equally explicitly targeted it as the argument for staying with Snowflake and using Cortex rather than maintaining a separate ML platform. This is exactly the kind of platform-monetisation overlap hedge funds were exiting tech to avoid.

The two-platform situation will not last indefinitely: at some point in the next two to three years, the enterprise organisations that currently run both will face a renewal cycle in which one platform’s AI capabilities have become strong enough to justify consolidation, and the switching-cost analysis will tip toward whichever platform has closed the capability gap more convincingly. Which direction that consolidation goes — Lakehouse unifying data engineering and analytics, or cloud data warehouse expanding into ML — will determine which of the two companies captures the majority of the enterprise AI data infrastructure category that both are competing to own.

When Two Competitors Converge on the Same Architecture, the Category Wins

Shane Parrish at Farnam Street builds on Charlie Munger’s observation that the best mental models force you to look at a situation from a different level of abstraction than the one that feels most natural. The natural way to read the Snowflake-Databricks convergence story is as a competitive battle — two well-funded companies fighting for the same enterprise data contracts. The second-order read is more useful: when two strong competitors converge on an identical architecture, the category they are both converging toward tends to beat the alternatives they are both abandoning.

Through 2022, the enterprise data infrastructure market had two genuine camps: cloud data warehouses optimised for structured SQL analytics, of which Snowflake was the commercial leader, and data lakehouse platforms optimised for ML pipelines and unstructured data engineering, of which Databricks was the commercial leader. Enterprises had a real architectural choice. The SQL shop and the Python shop pointed at different platforms and the platforms were genuinely different.

What the 2026 convergence eliminates is that real choice. Databricks’ SQL Analytics has closed the performance gap with Snowflake’s data warehouse sufficiently that a new enterprise buyer evaluating both platforms faces two products that can do most of what the other one does. Snowflake’s ML and Spark integration has similarly closed the gap with Databricks’ native data engineering environment. The buyer now chooses based on pricing, existing contracts, support relationships, and which sales team showed up better — not based on fundamental architectural fit.

The mental model that applies here is what Parrish calls “avoiding the obvious wrong choice” — the observation that eliminating clearly bad options is more valuable than optimising among equivalent good ones. For enterprise AI data infrastructure buyers in 2026, the obvious wrong choices (proprietary on-premise databases, first-generation Hadoop stacks, single-workload solutions) have been eliminated by the convergence. Snowflake and Databricks have each become defensible enough that either is a reasonable choice. The category — unified cloud AI data platform — has already won. Which company captures the larger share of that category over the next three years is a second-order question, and it will be decided by sales execution and switching costs rather than architectural differentiation.

12/06/2026
AI-Generated Attacks Are Reshaping Cybersecurity Spending in 2026

AI-Generated Attacks Are Reshaping Cybersecurity Spending in 2026

CrowdStrike’s 2026 Global Threat Report recorded a median adversary “breakout time” — the elapsed time between initial access to a network and lateral movement to other systems — of 2 minutes and 48 seconds, down from 7 minutes in 2024. The CrowdStrike 2026 Global Threat Report attributes the compression primarily to AI-assisted attack automation: intrusion tools that identify exploitable network paths, generate privilege escalation commands, and exfiltrate target data with minimal human attacker intervention between steps. The breakout-time figure is the most directly operational of the report’s metrics — defenders have, in theory, a window to detect and contain an intrusion before lateral movement; at under 3 minutes, that window requires automated detection to be practically useful.

The budget response from enterprise security teams is measurable in the earnings reports of the two dominant pure-play cybersecurity platforms. Palo Alto Networks reported $2.3 billion in quarterly revenue in its most recent fiscal quarter, with “next-generation security” (its AI-integrated product suite) growing at 37% year-on-year. CrowdStrike’s Falcon platform added 800 net new customers in its most recent quarter despite an already-large installed base. Both companies are attributing the demand acceleration to AI-augmented threat sophistication raising the minimum viable security posture for enterprises that previously considered themselves below the targeting threshold for sophisticated intrusions.

Sub-Three-Minute Breakout Times Are Forcing a Defence Redesign

The practical implication of sub-3-minute breakout time is not that human security analysts are useless — it is that human-speed detection is structurally insufficient for the initial containment decision. Security operations centres built around human review of alerts, with analysts triaging and escalating, operate on timelines that were adequate when breakout time was measured in hours. At sub-3-minute breakout, the containment decision must be automated: a detection event triggers isolation of the affected endpoint before an analyst reviews it, with human review of the isolation decision happening after the fact.

This constraint is reshaping the security architecture buying pattern more than any specific threat. Identity and access management (IAM) — which controls what any authenticated session can access — is receiving the largest incremental budget because it can constrain lateral movement even when initial access succeeds. If an attacker compromises a user credential, IAM controls limit what that credential can reach. The speed of the intrusion is less consequential when the available lateral paths are constrained.

Anthropic’s Project Glasswing zero-day research, which identified 10,000 software vulnerabilities using Claude’s Mythos Preview, is a direct example of how AI is accelerating the vulnerability discovery side of the security landscape. The same capability that enables defensive research enables offensive discovery; the 1% patch rate that Anthropic observed in their responsible disclosure programme is a measure of how far patch velocity lags behind vulnerability identification velocity — a gap that AI-assisted scanning is widening.

Where Security Budgets Are Flowing

The allocation shift in enterprise security budgets in 2026 has two dominant destinations: identity security and AI-integrated detection tooling. Identity security — Microsoft Entra ID, Okta, CyberArk — is growing because the attack vector for most AI-assisted intrusions is credential compromise rather than technical exploitation. Phishing emails generated by LLMs at scale, with personalisation that previously required individual attacker research, are producing credential compromise rates that exceed prior-year baselines even at organisations with mature security training programmes.

AI-integrated detection — CrowdStrike Falcon’s AI correlation layer, Palo Alto’s Cortex XSIAM, Darktrace’s autonomous response — is growing because the volume of security telemetry generated by modern enterprise environments exceeds human analyst review capacity. A mid-size enterprise generates millions of log events per day; the security operations centre cannot review them without automated triage. AI-driven triage — classifying events by severity, correlating related events into incidents, and suppressing noise — is becoming a prerequisite for staffed security operations at any scale, not a premium capability.

Cloudflare’s record revenue alongside workforce reduction demonstrates the same pattern in network security infrastructure: AI is enabling more traffic analysis, more bot detection, and more DDoS mitigation with fewer human operators per unit of protected traffic. The Cloudflare case study is widely cited in enterprise security discussions because it shows that the productivity gain from AI-integrated security tooling can be substantial even when the overall threat volume is rising.

The Small Business and Mid-Market Exposure Gap

The cybersecurity budget acceleration is concentrated in large enterprises. The CISA AI Cybersecurity Collaboration Playbook, published in early 2026, explicitly acknowledges that smaller organisations face the same AI-augmented threat landscape as large enterprises but lack the budget and staffing to deploy equivalent defensive tooling. The playbook’s recommendations for smaller organisations centre on identity hygiene (multi-factor authentication, privileged access management) and managed detection and response (MDR) services that outsource the AI-integrated security operations function to a third-party provider.

The MDR market — where a vendor operates the security operations function as a service — is growing faster than the enterprise security product market, partly for this reason. Small and mid-size businesses that cannot build an AI-integrated security operations function internally are outsourcing it to MDR vendors who amortise the tooling investment across a larger client base. CrowdStrike’s Falcon Complete (managed detection and response), Microsoft’s Defender for Business, and SentinelOne’s Vigilance are all reporting mid-market growth that exceeds their enterprise segment growth rates.

Big tech’s workforce reductions to fund AI infrastructure have reduced the headcount of security teams at companies simultaneously increasing their AI exposure surface. This tension — fewer security engineers at organisations deploying more AI-integrated infrastructure — is one of the structural dynamics that MDR vendors are capitalising on. The security staffing market has not kept pace with the security posture requirements created by AI infrastructure deployment, and the gap is being closed by managed services rather than internal hiring.

The Security Industry Measures the Threats Its Products Address

Glenn Greenwald’s core investigative question — who benefits from the narrative, and who provided the data that constructs it — applies with particular force to cybersecurity threat reporting. The 2 minutes 58 seconds breakout time figure, cited as the justification for a fundamental re-architecture of enterprise security spending, comes from CrowdStrike’s own threat intelligence report. CrowdStrike sells the AI-powered detection tools that the sub-3-minute breakout time makes necessary. The circularity here is not evidence of bad faith — the data may be accurate — but it is evidence that the reader should know who is making the measurement before accepting what the measurement implies about spending requirements.

This is not unique to CrowdStrike. The major cybersecurity vendors — Palo Alto Networks, SentinelOne, Microsoft Defender — all publish annual threat intelligence reports that document the threat landscape their own tools are designed to address. The reports are methodologically rigorous and the data is generally reliable. The question is not accuracy but completeness: what is not measured, and what conclusions does the unmeasured data prevent? Breakout time tells you about lateral movement velocity once a network is breached. It does not tell you about initial breach vector distribution, which determines whether endpoint detection speed is actually the bottleneck in a typical enterprise compromise. If 70% of breaches begin with phishing-enabled credential theft, then sub-3-minute breakout detection is solving the second problem, not the first.

The small and mid-market exposure gap is real. The concentration of AI-augmented security tools in large enterprise deployments creates an asymmetric vulnerability that is not well served by the current security vendor market structure — the tools that address AI-generated attack volume are priced and architected for organisations with dedicated security operations teams. This is a structural market failure that managed security service providers are filling more effectively than direct vendor channels. The question that the next security budget cycle should be asking is not “which AI detection tool performs best on the benchmark” but “which threat vector is actually responsible for the most breaches in our organisation’s category, and is our current spend addressing that vector or a more visible but less prevalent one?”

09/06/2026