Leaderboard

    Best AI for Science 2026.

    Find the best AI for science and research. Ranked by GPQA Diamond, FrontierMath, Frontier Science Research, CharXiv, and MMMU-Pro benchmarks.

    Gemini 3 Pro Preview | Google | 91.9% | 91.9% | 81% | 81.4% | - | - | -
    gpt-5.4-pro | OpenAI | - | 90.5% | - | - | 17.4% | - | 10.8%
    gpt-5.4 | OpenAI | 92.8% | 89.4% | 81.2% | - | 15.6% | 47.6% | 9.4%
    gpt-5.1 | OpenAI | 88.1% | 88.1% | - | - | - | 26.7% | -
    gpt-5.2-pro | OpenAI | 93.2% | 87.9% | - | - | 14.9% | - | 8.7%
    gpt-5.3-codex | OpenAI | - | 87.7% | - | - | 14.6% | - | 7.3%
    Grok 4 | xAI | 87.5% | 87.5% | - | - | - | - | -
    gpt-5 | OpenAI | 85.7% | 87.3% | 78.4% | 81.1% | - | 26.3% | -
    Claude Opus 4.5 | Anthropic | 87% | 87% | - | - | - | - | -
    Gemini 2.5 Pro | Google | 83% | 86.4% | - | - | - | - | -
    gpt-5.2 | OpenAI | 92.4% | 85.4% | 79.5% | 82.1% | 12.9% | 40.3% | 7.6%
    Grok 3 | xAI | 84.6% | 84.6% | - | - | - | - | -
    Kimi K2 Thinking | Moonshot AI | - | 84.5% | - | - | - | - | -
    Claude Sonnet 4.5 | Anthropic | 83.4% | 83.4% | - | - | - | - | -
    o3 | OpenAI | 83.3% | 83.3% | 76.4% | 78.6% | 10.3% | 15.8% | 0%
    o4-mini | OpenAI | 81.4% | 81.4% | - | 72% | - | - | -
    o3-mini | OpenAI | 77.2% | 79.7% | - | - | - | 9.2% | -
    Claude Opus 4 | Anthropic | 79.6% | 79.6% | - | - | - | - | -
    Gemini 2.5 Flash | Google | 82.8% | 78.3% | - | - | - | - | -
    o1 | OpenAI | 78% | 75.7% | - | - | - | 5.5% | -
    Claude Sonnet 4 | Anthropic | 75.4% | 75.4% | - | - | - | - | -
    Claude Haiku 4.5 | Anthropic | 73% | 73% | - | - | - | - | -
    DeepSeek R1 (OSS) | DeepSeek | - | 71.5% | - | - | - | - | -
    gpt-4.5-preview | OpenAI | - | 71.4% | - | - | - | - | -
    Llama 4 Maverick (OSS) | Meta | 69.8% | 69.8% | 59.6% | - | - | - | -
    Claude Sonnet 3.7 | Anthropic | 84.8% | 68% | - | - | - | - | -
    gpt-4.1-mini | OpenAI | 65% | 65% | - | 56.8% | - | - | -
    Claude 3.5 Sonnet | Anthropic | 67.2% | 65% | - | - | - | - | -
    DeepSeek V3 (OSS) | DeepSeek | 59.1% | 64.8% | - | - | - | - | -
    Gemini 2.0 Flash | Google | 62.1% | 62.1% | - | - | - | - | -
    o1 mini | OpenAI | 60% | 60% | - | - | - | - | -
    Llama 4 Scout (OSS) | Meta | 57.2% | 57.2% | - | - | - | - | -
    GPT-4o | OpenAI | 70.1% | 56.1% | 59.9% | 58.8% | - | - | -
    Llama 3.3 70B | Meta | - | 50.5% | - | - | - | - | -
    gpt-4.1-nano | OpenAI | 50.3% | 50.3% | - | 40.5% | - | - | -
    Llama 3.1 405B | Meta | - | 49% | - | - | - | - | -
    Amazon Nova Pro | Amazon | - | 46.9% | - | - | - | - | -
    gpt-4.1 | OpenAI | 66.3% | 43.4% | - | 56.7% | 13.2% | - | -
    Claude 3.5 Haiku | Anthropic | 41.6% | 41.6% | - | - | - | - | -
    GPT-4o-mini | OpenAI | 40.2% | 40.2% | - | - | - | - | -
    Amazon Nova 2 Lite | Amazon | - | - | - | - | - | - | -
    Amazon Nova 2 Omni | Amazon | - | - | - | - | - | - | -
    Amazon Nova 2 Pro | Amazon | - | - | - | - | - | - | -
    Amazon Nova Lite | Amazon | - | - | - | - | - | - | -
    Amazon Nova Micro | Amazon | - | - | - | - | - | - | -
    Amazon Nova Premier | Amazon | - | - | - | - | - | - | -
    Claude 2 | Anthropic | - | - | - | - | - | - | -
    Claude 2.1 | Anthropic | - | - | - | - | - | - | -
    Claude 3 Haiku | Anthropic | 33.3% | - | - | - | - | - | -
    Claude 3 Opus | Anthropic | 50.4% | - | - | - | - | - | -
    Showing 1–50 of 336 models

    All Large Language Models

    Perplexity: 5 models
    MiniMax: 4 models
    01.ai: 1 model
    AI21 Labs: 2 models
    Anyscale: 2 models
    Baidu: 1 model
    Cohere: 4 models
    Inception: 1 model
    LG AI Research: 1 model
    Nous Research: 1 model
    Reka: 3 models
    StepFun: 1 model
    Xiaomi: 1 model
    Z AI: 8 models