Neural network with bright connections representing algorithmic discovery and AI self-improvement
machine-learning

AlphaEvolve: 11 Records Proving ML Is Already Redesigning Itself

NeuralPulse|25 de maio de 2026|9 min read|Ler em Português

56 years. 20,442 days. That's exactly how long Volker Strassen's record for complex 4×4 matrix multiplication lasted — 49 scalar multiplications, the best humanity could produce since 1969. Until May 2026, when an AI system called AlphaEvolve, from Google DeepMind, simply delivered 48.

Strassen was no ordinary name in computing history. In 1969, his algorithm proved it was possible to multiply 4×4 matrices with just 49 multiplications — versus 64 with the naive method. For 56 years, successive generations of researchers tried, unsuccessfully, to reach 48. What none of them achieved in over half a century of intellectual effort, AlphaEvolve accomplished in weeks of computational search.

One fewer multiplication. It seems small, almost irrelevant to those who don't live by computational efficiency. But in computing, every operation counts — especially when that operation is executed trillions of times per second worldwide. Matrix multiplication lies at the heart of absolutely everything involving machine learning — from training transformers to graphics rendering, from scientific simulation to cryptography. A gain that seems marginal at micro scale becomes colossal savings when multiplied by billions of daily executions.

The feat, however, goes far beyond mathematics. AlphaEvolve is not a model that solves known problems — it is an algorithm discovery engine. It generates, tests, validates, and optimizes new ways of computing, without relying on human intuition. The closest historical parallel might be the moment when early compilers began optimizing code better than assembly programmers — only now the leap is several orders of magnitude in scale and scope. And, in a loop that until recently seemed reserved for science fiction, it is redesigning the very physical infrastructure that runs machine learning. The system that optimizes the chips that run the models that, in turn, will feed and turbocharge the next version of the system.

In our view, this is no longer just an article about an AI record. It is about the exact moment when ML began to systematically and intentionally self-improve — and the numbers proving this is already running at industrial scale, in real companies, with tangible and measurable results.

What is AlphaEvolve? (And Why It's Different from Everything Before)

Before diving into the impressive numbers, it's worth understanding what makes AlphaEvolve so different from earlier systems like AlphaGo, AlphaFold, or AlphaDev.

Most AI systems we know are models that solve specific, well-defined problems. ChatGPT generates coherent text. AlphaFold predicts three-dimensional protein structures with accuracy comparable to lab experiments. Gemini processes language and images simultaneously. They all receive input, process it according to a fixed objective, and return output — solving a problem that is already mapped and understood.

The crucial difference lies in what is optimized. Traditional reinforcement learning models learn a policy to win within a closed environment — like AlphaGo on the Go board. AlphaEvolve operates one layer above: it discovers algorithms, which are sequences of mathematical operations, validated by the universal criterion of computational efficiency. Its search space doesn't have 19×19 positions — it's a combinatorial universe of algebraic operations whose size defies any estimate.

AlphaEvolve doesn't work that way. It is an algorithmic discovery system: instead of solving a problem directly, it discovers new ways of solving problems. The distinction is subtle in description but revolutionary in practice.

The method is simple in concept yet brutal in computational execution. The system starts from an immense space of possible mathematical operations — a universe of combinations no human could explore in many lifetimes — and, using search guided by reinforcement learning, converges on algorithms that are simultaneously logically correct and computationally efficient. It's as if a software engineer could test billions of different implementations per second and, in the end, choose the best for each context.

The crucial difference from any human approach? An engineer optimizes within what they know. They start from existing solutions and refine them incrementally, guided by intuition and experience. AlphaEvolve optimizes within what is mathematically possible — without bias, without attachment to traditions, without established paradigms. While a human tends to seek familiar solutions, the algorithmic engine explores paths no one ever considered because they violate assumptions we didn't even know we were making. And sometimes what is possible is counterintuitive, strange, almost wrong to human eyes, and yet dramatically more efficient. Solutions no engineer would consider because they completely deviate from established patterns of thought.

The result of this unconstrained approach is a list of records spanning areas as diverse as pure mathematics, data center infrastructure, genomics, warehouse logistics, quantum computing, and molecular simulation. Eleven completely different areas, a single algorithmic discovery engine serving as the common denominator.

The 11 Records in Numbers (and What Each Means in Practice)

We compiled the main results released by DeepMind in May 2026 (Source). It's worth studying this table carefully — the orders of magnitude and diversity reveal a pattern that goes beyond isolated numbers:

Application AreaAchieved ResultImmediate Practical Impact
Mathematics (4×4 matrices)48 multiplications — Strassen's record brokenFirst improvement to the classic algorithm since 1969
Google Infrastructure0.7% of global computational resources recoveredTens of thousands of H100 GPUs freed for other workloads
Gemini (Pallas kernel)23% speedup in kernel1% less total model training time
Genomics (PacBio)30% fewer errors in variant detectionPreviously hidden pathogenic mutations now detectable
Power grids (AC OPF)14% → 88% viable solutions for GNNEnables practical optimization of energy distribution
Quantum circuits (Willow)10x less error in quantum processorsReliability leap for quantum hardware
Klarna (transformer models)2x faster trainingSame output quality, half the GPU consumption
FM Logistic (routing)10.4% improvement in logistics routes~15,000 km/year of travel avoided
Schrödinger (MLFF)4x speedup in training and inferenceAccelerated R&D cycles in drug discovery
50 open mathematical problems75% rediscovered SOTA; 20% found better solutionRobust validation on unsolved problems
Spanner (Google)20% less write amplificationLSM-tree storage optimized at planetary scale

Eleven completely distinct application areas. Eleven results proving the central thesis. In several cases, AlphaEvolve improved what dedicated teams of human engineers and scientists couldn't improve in decades of specialized work.

The most eloquent aspect, however, is not any individual record — it's the impressive diversity of them. The same system that breaks a 56-year-old mathematical record also optimizes warehouse logistics and accelerates drug discovery. This is not a purpose-specific tool. It is a general algorithmic optimization capability, applicable to virtually any domain that can be formulated as a search problem for computational efficiency. And that, indeed, is a historic milestone.

Four Real Cases Showing the Scale of the Thing

AlphaEvolve is not a lab experiment or a controlled academic demonstration. Real companies, in real sectors of the economy, are already using the algorithms it discovered in production — and the results are concrete, measurable, and in some cases, impressive.

Klarna — the Swedish fintech, globally known for its online payment solutions, used AlphaEvolve to optimize one of its largest transformer models in production. The result was direct and unambiguous: double the training speed without any measurable loss in model output quality. In day-to-day practice, this means Klarna's machine learning team can iterate twice as fast on new architectures and datasets, spend half the compute budget per training cycle, and deliver better models in less time. For a company processing millions of financial transactions daily — Klarna operates in over 45 countries, managing payments totaling tens of billions of dollars annually — the impact on operational cost and innovation speed is immediate and significant. Transformer models are the heart of fraud detection and real-time risk analysis systems; training them at double the speed means responding to new fraud patterns in half the time. In a sector where response agility defines business security, this speed gain has value far beyond GPU cost reduction.

FM Logistic — the French logistics company applied AlphaEvolve to the classic Traveling Salesman Problem (TSP), one of the most studied problems in computer science and operations research. The problem was tackled at warehouse scale, with dozens of delivery points per route. The gain achieved was 10.4% in routing efficiency. The percentage sounds modest until you translate the number into tangible terms: over 15,000 kilometers per year of travel avoided. That's trucks not running, diesel not being burned, driver hours being saved, fleet maintenance being reduced. In logistics, where margins are tight, 10% efficiency is often the difference between a profitable contract and one in the red.

Schrödinger — specialized in computational discovery of materials and drugs, Schrödinger uses Machine Learned Force Fields (MLFF) to simulate molecular interactions with high precision, replacing traditional molecular dynamics simulations that are much slower. AlphaEvolve accelerated the training and inference of these force fields by approximately 4 times. Gabriel Marques, Technical Lead of Machine Learning at Schrödinger, contextualized the business impact: "AlphaEvolve allows us to explore larger chemical spaces faster and more efficiently than ever. Faster MLFF inference has real business impact, shortening R&D cycles in drug discovery, catalyst design, and materials development." (Source) In pharmaceutical and materials industries, where each month of R&D saved is worth millions of dollars in avoided costs and anticipated revenue, a 4x gain in simulation speed is potentially transformative for a company's competitiveness. Instead of simulating dozens of drug candidates per month, researchers can now explore hundreds — dramatically expanding the chemical search space without increasing the computational budget.

PacBio — in precision genomics, AlphaEvolve significantly improved DeepConsensus, a DNA sequencing error correction system developed by PacBio. The error rate in genetic variant detection dropped by 30% — a substantial leap for a field where every percentage point of accuracy is hard-won. Aaron Wenger, Senior Director at PacBio, highlighted the rel

#machine-learning#deepmind#alphaevolve#self-improvement#algorithms
Compartilhar: