reddit.com via Reddit May 26th 2026

NVIDIA, AMD, Intel, Apple GPU ISAs Converge, Dev Proves

chips open source gpu-architecture compute open-source

Key insights

All four major GPU vendors (NVIDIA, AMD, Intel, Apple) independently converged on the same fundamental ISA patterns across 16 microarchitectures.
One developer read 5,000+ pages of GPU architecture documentation to validate that vendor-agnostic kernel abstraction is technically feasible.
The portable ISA layer directly targets CUDA ecosystem lock-in, with early benchmarks suggesting competitive cross-vendor performance.

Why this matters

The CUDA monopoly is one of NVIDIA's most durable competitive moats, and credible evidence that multi-vendor GPU programming is viable without performance sacrifice puts real pressure on that lock-in. A working portable ISA layer could shift procurement calculus for AI infrastructure teams, making AMD, Intel, and Apple GPUs viable at production scale rather than fallback options. If the convergence thesis holds under broader testing, toolchain and compiler vendors face a structural shift in what abstraction layers they need to build and maintain.

Summary

A solo developer built a portable GPU ISA abstraction after reading 5,000+ pages across 16 microarchitectures from NVIDIA, AMD, Intel, and Apple. The central finding: all four vendors are independently converging on the same fundamental ISA patterns, making a vendor-agnostic kernel layer viable without performance cost. Essentially: (NVIDIA, AMD, Intel, Apple) are co-evolving GPU ISA primitives despite competing products. - Source material: NVIDIA PTX, AMD ISA guides, Intel Xe specs, and reverse-engineered Apple GPU docs - The abstraction targets CUDA lock-in directly, enabling a single kernel codebase across all four vendors - Early benchmarks and architectural debate are active in the thread GPU silicon is maturing toward commodity primitives, pushing vendor differentiation into toolchains and ecosystems rather than instruction sets.

Potential risks and opportunities

Risks

Apple or NVIDIA could pursue legal action over reverse-engineered GPU specs used in the abstraction layer, exposing the project to cease-and-desist pressure before wider adoption
Enterprise AI teams adopting the portable ISA early could face breakage if vendors introduce microarchitecture changes in 2026 hardware generations that diverge from the documented convergence patterns
NVIDIA could accelerate CUDA toolchain improvements or extend PTX in proprietary directions, widening the performance gap and undermining the benchmark case for vendor-agnostic kernels

Opportunities

AMD and Intel GPU divisions could directly fund or integrate the portable ISA project to erode NVIDIA's CUDA ecosystem advantages with enterprise AI infrastructure teams
ML framework vendors (PyTorch, JAX, XLA teams) could use the convergence findings to build more aggressive vendor-agnostic compiler backends, reducing their own CUDA dependency in the next major release cycles
AI infrastructure providers offering non-NVIDIA GPU fleets (Lambda Labs, CoreWeave competitors) gain a technical foundation to demonstrate competitive performance parity across vendor hardware

What we don't know yet

Whether the portable ISA performance benchmarks hold at production-scale AI workloads beyond the early results shown in the Reddit thread
Which specific sources were used for Apple GPU reverse-engineering, and whether Apple's undocumented ISA details are stable enough across hardware generations to build on reliably
Whether any of the four vendors (NVIDIA, AMD, Intel, Apple) have acknowledged or responded to the documented ISA convergence finding

Originally reported by reddit.com

Read the original article →

Original headline: r/MachineLearning: Developer Builds Portable GPU ISA After Reading 5,000+ Pages Across 16 Microarchitectures — All Four Vendors Converging on Same Patterns