AI Milestones - Timeline of Key Achievements | MidnightAI.org

Humanoid Robot Mass Production

June 1, 2027

Predicted | High Impact

First mass production of AI-powered humanoid robots for commercial use.

Confidence: 40%

Autonomous Research Agent

December 1, 2026

Predicted | High Impact

AI systems capable of conducting independent scientific research.

Confidence: 50%

AGI-Level Reasoning

June 1, 2026

Predicted | High Impact

Models achieve human-expert level on complex multi-step reasoning benchmarks.

Confidence: 40%

Claude Discovers macOS Kernel Vulnerability CVE-2026-28952

May 25, 2026

Achieved | Anthropic

First documented case of an AI system independently discovering a critical operating system kernel vulnerability.

OpenAI Prepares for IPO Filing

May 20, 2026

Achieved | OpenAI

OpenAI announces preparation to file for initial public offering, marking a major corporate milestone for the AI industry leader.

Anthropic Expands to Colossus2 with GB200 GPUs

May 20, 2026

Achieved | Anthropic

Anthropic announces major infrastructure expansion to Colossus2 cluster using NVIDIA's GB200 chips for next-generation model training.

Lance: Unified Image/Video Generation and Understanding

May 20, 2026

Achieved | ByteDance (Doubao)

ByteDance releases Lance, an open-source 3B parameter model combining image/video generation and understanding capabilities in a single architecture.

OpenAI Model Disproves Discrete Geometry Conjecture

May 20, 2026

Achieved | OpenAI

An OpenAI model successfully disproved a central conjecture in discrete geometry, demonstrating advanced mathematical reasoning capabilities.

Gemini 3.5 Flash: Frontier Intelligence with Action

May 19, 2026

Achieved | Google DeepMind

Google releases Gemini 3.5 Flash, representing a major advancement in their flagship model series with enhanced action capabilities.

AI Outperforms Physicians in Clinical Reasoning Study

April 30, 2026

Achieved

Largest comparative study shows AI system surpasses doctors in clinical diagnosis and decision-making using real emergency department data.

Tuna-2: Direct Pixel-to-Multimodal Model

April 27, 2026

Achieved

First unified multimodal model performing visual understanding and generation directly from pixel embeddings without vision encoders.

Amateur Solves 60-Year Erdős Problem with ChatGPT

April 25, 2026

Achieved | OpenAI

First documented case of AI assistance enabling solution to a major unsolved mathematical problem.

UAE Announces 50% Government AI Automation by 2028

April 25, 2026

Achieved

UAE becomes first country to commit to running half of government operations via autonomous AI systems.

Google Announces $40B Investment in Anthropic

April 24, 2026

Achieved | Anthropic

Google commits up to $40 billion in cash and compute resources to Anthropic in massive AI partnership deal.

DeepSeek V4 Pro Model Released

April 24, 2026

Achieved | DeepSeek

DeepSeek releases V4 Pro, a major new flagship model representing significant advancement in their model capabilities.

GPT-5.5 Released with Enhanced Capabilities

April 23, 2026

Achieved | OpenAI

OpenAI releases GPT-5.5, a major flagship model update with significant capability improvements.

Anthropic Secures $5B Amazon Investment

April 21, 2026

Achieved | Anthropic

Anthropic receives $5B from Amazon with commitment to $100B in cloud spending, marking major industry partnership.

Apollo: 25B Record Healthcare Foundation Model

April 20, 2026

Achieved

Apollo foundation model trained on 25 billion medical records from 7.2 million patients across 28 modalities, representing the largest healthcare AI model.

Humanoid Robot Breaks Human Half-Marathon Record

April 19, 2026

Achieved

Honor's 'Lightning' robot completed the Beijing half-marathon in 50:26, faster than the human world record, marking a breakthrough in autonomous athletic performance.

White House Emergency Meeting on Claude Mythos

April 17, 2026

Achieved | Anthropic

Anthropic CEO meets with White House officials over national security concerns regarding Claude Mythos's autonomous vulnerability discovery capabilities.

SpecGuard: Step-Level Verification for LLM Reasoning

April 16, 2026

Achieved

New framework enables verification-aware speculative decoding with step-level error detection without external reward models, improving multi-step reasoning accuracy.

R3D: Scalable 3D Policy Learning for Robotics

April 16, 2026

Achieved

Breakthrough in 3D policy learning with transformer-based encoder solving training instabilities and overfitting issues.

Think in Latent Thoughts: Sign Language Translation

April 16, 2026

Achieved

Novel reasoning-driven framework treats sign language translation as cross-modal reasoning task with latent thought sequences.

See, Point, Refine: Multi-Turn GUI Grounding

April 14, 2026

Achieved | OpenAI

OpenAI introduces multi-turn approach for precise GUI interaction with visual feedback and error correction.

AiScientist: Autonomous Long-Horizon ML Research

April 14, 2026

Achieved

System enabling autonomous AI agents to sustain coherent ML research progress across comprehension, implementation, and experimentation over days.

Visual Preference Optimization with Rubric Rewards

April 14, 2026

Achieved | OpenAI

rDPO framework introduces instance-specific rubrics for fine-grained visual reasoning preference optimization in multimodal tasks.

N-Day-Bench: Real Vulnerability Discovery Benchmark

April 13, 2026

Achieved

Monthly-refreshed benchmark testing whether LLMs can find known security vulnerabilities in real repository codebases with sandboxed exploration.

Berkeley Exposes Critical AI Agent Benchmark Flaws

April 11, 2026

Achieved

Berkeley researchers demonstrate systematic ways to break top AI agent benchmarks, highlighting fundamental evaluation methodology issues.

MolmoWeb: Open Visual Web Agent Framework

April 9, 2026

Achieved

Open-source visual web agent with transparent training data and methodology for autonomous web navigation tasks.

ClawBench: Real-World AI Agent Evaluation Framework

April 9, 2026

Achieved | Anthropic

Anthropic introduces ClawBench, a comprehensive evaluation framework testing AI agents on 153 everyday online tasks across 144 live platforms.

Act Wisely: Meta-Cognitive Tool Use Framework

April 9, 2026

Achieved

Research breakthrough addressing agents' meta-cognitive deficits in arbitrating between internal knowledge and external tool usage.

Meta Announces Muse Spark Personal Superintelligence

April 8, 2026

Achieved | Meta AI

Meta introduces Muse Spark, positioning it as a step toward personal superintelligence capabilities for individual users.

MegaTrain Enables 100B+ Parameter Training on Single GPU

April 8, 2026

Achieved

Research breakthrough allows full-precision training of 100+ billion parameter language models on a single GPU, dramatically reducing training costs.

Claude Mythos Preview for Cybersecurity Released

April 7, 2026

Achieved | Anthropic

Anthropic releases specialized Claude model variant focused on advanced cybersecurity capabilities with detailed system card documentation.

Google Gemma-4 Multimodal Model Series Released

April 2, 2026

Achieved | Google DeepMind

Google releases Gemma-4 series with any-to-any and image-text-to-text capabilities across multiple parameter sizes (4B-31B).

Claude Demonstrates Full OS Kernel Exploit Generation

April 1, 2026

Achieved | Anthropic

Claude successfully wrote a complete FreeBSD remote kernel RCE exploit with root shell, demonstrating advanced cybersecurity capabilities.

Former Qwen Lead's Agentic Thinking Manifesto

March 26, 2026

Achieved | Alibaba (Qwen)

Former Alibaba Qwen technical lead publishes an influential essay on transitioning from reasoning to agentic thinking paradigms.

ARC-AGI-3 Benchmark Released

March 25, 2026

Achieved

New benchmark designed to measure artificial general intelligence through novel reasoning tasks, addressing limitations of previous AI evaluation methods.

VTAM: Video-Tactile-Action Models for Robotics

March 24, 2026

Achieved

First multimodal framework combining video, tactile sensing, and action prediction for contact-rich physical interactions.

SpecEyes: Speculative Acceleration for Agentic AI

March 24, 2026

Achieved | OpenAI

OpenAI introduces framework to accelerate multimodal agent reasoning through speculative perception and planning.

GPT-5.4 Pro Solves Frontier Math Open Problem

March 24, 2026

Achieved | OpenAI

First AI system confirmed to solve an open mathematical research problem, marking breakthrough in AI mathematical reasoning capabilities.

iPhone 17 Pro Runs 400B Parameter Model

March 23, 2026

Achieved

First demonstration of a 400 billion parameter language model running natively on a mobile device, showcasing dramatic advances in on-device AI.

Reasoning Circuits Discovery in Transformers

March 18, 2026

Achieved

Researchers discover discrete 3-4 layer 'reasoning circuits' in transformers that can be duplicated to dramatically improve logical deduction performance without training.

Online Experiential Learning Framework

March 17, 2026

Achieved

Research introduces framework enabling language models to continuously improve from real-world deployment experience rather than offline training only.

Nvidia Launches Vera CPU for Agentic AI

March 16, 2026

Achieved

Nvidia introduces purpose-built CPU architecture specifically designed for agentic AI workloads, marking hardware specialization for autonomous agents.

Morgan Stanley Predicts Major AI Breakthrough in H1 2026

March 14, 2026

Achieved

Investment bank warns of imminent AI breakthrough driven by rapid computing expansion that could strain power grids and disrupt jobs globally.

John Carmack Challenges AGI Timeline Predictions

March 14, 2026

Achieved

Legendary programmer John Carmack publicly disputes OpenAI and other labs' aggressive AGI timelines, stating 'We Are Not on the Brink of AGI' with significant implications for industry investment.

Claude Opus/Sonnet 4.6 Achieves 1M Context Window

March 13, 2026

Achieved | Anthropic

Anthropic's Claude models now support 1 million token context windows in general availability, enabling processing of extremely long documents.

Understudy: Teach-by-Demonstration Desktop Agent

March 12, 2026

Achieved

First desktop agent that learns tasks from single demonstrations across GUI apps, browsers, terminals, and messaging tools in unified sessions.

Nvidia Invests $26B in Open-Source AI Development

March 12, 2026

Achieved

Nvidia announces major strategic shift with $26 billion investment in open-source AI models over five years, competing directly with OpenAI and other closed-source providers.

How We Track Milestones

Milestones are identified through analysis of research publications, product announcements, and expert assessments. Predictions are based on current progress trajectories and capability assessments.
