    LLM Leaderboard & Arena Rankings 2026.

    Complete LLM leaderboard with daily-updated benchmark scores. Compare and rank the best LLMs by GPQA, MMLU, HLE, SWE-bench, and more alongside API pricing.

    Gemini 3 Pro Preview
    Google
    45.8% | 91.9% | 91.9% | 76.2% | 72.1% | 100% | - | - | - | -
    gpt-5.4-pro
    OpenAI
    - | - | 90.5% | - | - | - | - | - | - | -
    gpt-5.4
    OpenAI
    39.8% | 92.8% | 89.4% | - | - | - | - | - | - | -
    gpt-5.1
    OpenAI
    - | 88.1% | 88.1% | 76.3% | - | 94% | - | - | - | -
    gpt-5.2-pro
    OpenAI
    36.6% | 93.2% | 87.9% | - | - | 100% | - | 74.1% | - | -
    gpt-5.3-codex
    OpenAI
    - | - | 87.7% | - | - | - | - | - | - | -
    Grok 4
    xAI
    40% | 87.5% | 87.5% | - | - | 91.7% | - | - | - | -
    gpt-5
    OpenAI
    24.8% | 85.7% | 87.3% | 74.9% | - | 94.6% | - | - | - | -
    Claude Opus 4.5
    Anthropic
    - | 87% | 87% | 80.9% | - | - | - | 90.8% | - | -
    Gemini 2.5 Pro
    Google
    17.8% | 83% | 86.4% | 63.2% | 50.8% | 83% | - | 88.9% | 90.9% | 88.2%
    gpt-5.2
    OpenAI
    34.5% | 92.4% | 85.4% | 80% | - | 100% | - | 88% | - | -
    Grok 3
    xAI
    - | 84.6% | 84.6% | - | - | 93.3% | - | 79.9% | - | -
    Kimi K2 Thinking
    Moonshot AI
    - | - | 84.5% | - | - | 99.1% | - | - | - | -
    Claude Sonnet 4.5
    Anthropic
    - | 83.4% | 83.4% | - | - | 87% | - | - | - | -
    o3
    OpenAI
    14.7% | 83.3% | 83.3% | 69.1% | - | 98.4% | - | 93.4% | - | -
    o4-mini
    OpenAI
    14.7% | 81.4% | 81.4% | 68.1% | - | 92.7% | - | - | - | -
    o3-mini
    OpenAI
    - | 77.2% | 79.7% | 49.3% | 15% | - | 97.9% | - | - | -
    Claude Opus 4
    Anthropic
    - | 79.6% | 79.6% | 72.5% | - | 75.5% | - | 88.8% | - | -
    Gemini 2.5 Flash
    Google
    11% | 82.8% | 78.3% | 60.4% | 26.9% | 72% | - | - | 50.4% | -
    o1
    OpenAI
    - | 78% | 75.7% | 41% | 47% | - | 96.4% | - | - | -
    Claude Sonnet 4
    Anthropic
    - | 75.4% | 75.4% | 72.7% | - | 70.5% | - | 84% | 91.4% | 88.7%
    Claude Haiku 4.5
    Anthropic
    - | 73% | 73% | 73.3% | - | 80.7% | - | 90.8% | - | 85.2%
    DeepSeek R1 (OSS)
    DeepSeek
    - | - | 71.5% | - | - | - | 97.3% | - | - | -
    gpt-4.5-preview
    OpenAI
    - | - | 71.4% | - | - | - | - | - | - | -
    Llama 4 Maverick (OSS)
    Meta
    - | 69.8% | 69.8% | - | - | - | - | - | - | -
    Claude Sonnet 3.7
    Anthropic
    - | 84.8% | 68% | 70.3% | - | 54.8% | 82.2% | - | - | -
    gpt-4.1-mini
    OpenAI
    3.7% | 65% | 65% | 23.6% | - | 40.2% | - | 80.1% | 9.8% | 54.6%
    Claude 3.5 Sonnet
    Anthropic
    - | 67.2% | 65% | 49% | - | - | 78% | 90.4% | 95.4% | 96.4%
    DeepSeek V3 (OSS)
    DeepSeek
    - | 59.1% | 64.8% | 42% | 24.9% | - | 94% | - | - | -
    Gemini 2.0 Flash
    Google
    - | 62.1% | 62.1% | - | - | - | 89.7% | - | - | -
    o1 mini
    OpenAI
    - | 60% | 60% | - | - | - | 90% | 85.2% | - | 92.4%
    Llama 4 Scout (OSS)
    Meta
    - | 57.2% | 57.2% | - | - | - | - | 79.6% | - | -
    GPT-4o
    OpenAI
    5.3% | 70.1% | 56.1% | 33.2% | 38.2% | - | 60.3% | 88.7% | 95% | 90%
    Llama 3.3 70B
    Meta
    - | - | 50.5% | - | - | - | 77% | 86% | 86.5% | 88.4%
    gpt-4.1-nano
    OpenAI
    - | 50.3% | 50.3% | - | - | - | - | 80.1% | - | 54.6%
    Llama 3.1 405B
    Meta
    - | - | 49% | - | - | - | 73.8% | 87% | - | 89%
    Amazon Nova Pro
    Amazon
    - | - | 46.9% | - | - | - | 76.6% | 85.9% | - | -
    gpt-4.1
    OpenAI
    5.4% | 66.3% | 43.4% | 54.6% | - | 46.4% | - | - | - | 67%
    Claude 3.5 Haiku
    Anthropic
    - | 41.6% | 41.6% | 40.6% | - | - | 69.4% | - | - | -
    GPT-4o-mini
    OpenAI
    - | 40.2% | 40.2% | 8.7% | - | - | 70.2% | 82% | - | 87.2%
    Amazon Nova 2 Lite
    Amazon
    - | - | - | - | - | - | - | - | - | -
    Amazon Nova 2 Omni
    Amazon
    - | - | - | - | - | - | - | - | - | -
    Amazon Nova 2 Pro
    Amazon
    - | - | - | - | - | - | - | - | - | -
    Amazon Nova Lite
    Amazon
    - | - | - | - | - | - | - | 75% | - | -
    Amazon Nova Micro
    Amazon
    - | - | - | - | - | - | - | - | - | -
    Amazon Nova Premier
    Amazon
    - | - | - | - | - | - | - | 87.4% | - | -
    Claude 2
    Anthropic
    - | - | - | - | - | - | - | 78.5% | 92.3% | 94.5%
    Claude 2.1
    Anthropic
    - | - | - | - | - | - | - | 78.5% | 91% | 87.5%
    Claude 3 Haiku
    Anthropic
    - | 33.3% | - | - | - | - | - | 76.7% | 85.9% | -
    Claude 3 Opus
    Anthropic
    - | 50.4% | - | - | - | - | - | 88.2% | 95.4% | 73%
    Showing 1–50 of 336 models

    All Large Language Models

    Perplexity

    5 models

    MiniMax

    4 models

    01.ai

    1 model

    AI21 Labs

    2 models

    Anyscale

    2 models

    Baidu

    1 model

    Cohere

    4 models

    Inception

    1 model

    LG AI Research

    1 model

    Nous Research

    1 model

    Reka

    3 models

    StepFun

    1 model

    Xiaomi

    1 model

    Z AI

    8 models