Leaderboard

    Best AI for Reasoning 2026.

    Find the best AI for reasoning. Ranked by Humanity's Last Exam, GPQA Diamond, ARC-AGI 2, SimpleQA, and more. Compare intelligence and reasoning across all major LLMs.

    Claude Opus 4.6
    Anthropic
    53.1%91.3%โ€”โ€”โ€”โ€”68.8%โ€”โ€”
    Gemini 3.1 Pro
    Google
    51.4%94.3%โ€”โ€”โ€”โ€”77.1%โ€”โ€”
    M
    Kimi K2-Thinking-0905OSS
    Moonshot AI
    51%84.5%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Kimi K2.5OSS
    Moonshot AI
    50.2%87.6%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Claude Sonnet 4.6
    Anthropic
    49%89.9%โ€”โ€”โ€”โ€”58.3%โ€”โ€”
    Gemini 3 Pro Preview
    Google
    45.8%91.9%91.9%72.1%โ€”โ€”31.1%โ€”โ€”
    Gemini 3 Flash Preview
    Google
    43.5%90.4%โ€”68.7%โ€”โ€”33.6%โ€”โ€”
    GLM-4.7OSS
    Z AI
    42.8%85.7%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Grok 4
    xAI
    40%87.5%87.5%โ€”โ€”โ€”15.9%โ€”โ€”
    gpt-5.4
    OpenAI
    39.8%92.8%89.4%โ€”27.4%37%73.3%8.7%76.7%
    gpt-5.2-pro
    OpenAI
    36.6%93.2%87.9%โ€”25.4%35.9%54.2%5.9%70%
    gpt-5.2
    OpenAI
    34.5%92.4%85.4%โ€”23.4%33.7%52.9%4.6%60%
    Qwen 3.5 397B A17BOSS
    Qwen
    28.7%88.4%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    M
    LongCat-Flash-Thinking-2601OSS
    Meituan
    25.2%80.5%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    gpt-5
    OpenAI
    24.8%85.7%87.3%โ€”โ€”โ€”โ€”โ€”โ€”
    X
    MiMo-V2-FlashOSS
    Xiaomi
    22.1%83.7%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    MiniMax M2.1OSS
    MiniMax
    22%81%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Gemini 2.5 Pro Preview 06-05
    Google
    21.6%86.4%โ€”54%โ€”โ€”โ€”โ€”โ€”
    Grok 4 Fast
    xAI
    20%85.7%โ€”95%โ€”โ€”โ€”โ€”โ€”
    DeepSeek-V3.2-ExpOSS
    DeepSeek
    19.8%79.9%โ€”97.1%โ€”โ€”โ€”โ€”โ€”
    Qwen3-235B-A22B-Thinking-2507OSS
    Qwen
    18.2%81.1%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Gemini 2.5 Pro
    Google
    17.8%83%86.4%50.8%โ€”โ€”4.9%โ€”โ€”
    DeepSeek-R1-0528OSS
    DeepSeek
    17.7%81%โ€”92.3%โ€”โ€”โ€”โ€”โ€”
    GLM-4.6OSS
    Z AI
    17.2%81%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    gpt-5-mini
    OpenAI
    16.7%82.3%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Gemini 3.1 Flash Lite Preview
    Google
    16%86.9%โ€”43.3%โ€”โ€”โ€”โ€”โ€”
    DeepSeek V3.1OSS
    DeepSeek
    15.9%74.9%โ€”93.4%โ€”โ€”โ€”โ€”โ€”
    N
    Nemotron 3 Nano (30B A3B)OSS
    NVIDIA
    15.5%75%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    GPT OSS 120BOSS
    OpenAI
    14.9%80.1%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    o4-mini
    OpenAI
    14.7%81.4%81.4%โ€”โ€”โ€”โ€”โ€”โ€”
    o3
    OpenAI
    14.7%83.3%83.3%โ€”17.7%25.4%6.5%4%53.3%
    GLM-4.7-FlashOSS
    Z AI
    14.4%75.2%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    GLM-4.5OSS
    Z AI
    14.4%79.1%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Qwen3 VL 235B A22B ThinkingOSS
    Qwen
    13.6%โ€”โ€”44.4%โ€”โ€”โ€”โ€”โ€”
    MiniMax M2OSS
    MiniMax
    12.5%78%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Gemini 2.5 Flash
    Google
    11%82.8%78.3%26.9%โ€”โ€”โ€”โ€”โ€”
    GPT OSS 20BOSS
    OpenAI
    10.9%71.5%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Magistral MediumOSS
    Mistral
    9%70.8%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    gpt-5-nano
    OpenAI
    8.7%71.2%โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    MiniMax M1 80KOSS
    MiniMax
    8.4%70%โ€”18.5%โ€”โ€”โ€”โ€”โ€”
    gpt-4.1
    OpenAI
    5.4%66.3%43.4%โ€”15.7%21.4%โ€”2.3%30%
    GPT-4o
    OpenAI
    5.3%70.1%56.1%38.2%โ€”โ€”โ€”โ€”โ€”
    Gemini 2.5 Flash LiteOSS
    Google
    5.1%64.6%โ€”10.7%โ€”โ€”โ€”โ€”โ€”
    M
    Kimi K2 InstructOSS
    Moonshot AI
    4.7%75.1%โ€”31%โ€”โ€”โ€”โ€”โ€”
    gpt-4.1-mini
    OpenAI
    3.7%65%65%โ€”โ€”โ€”โ€”โ€”โ€”
    Amazon Nova 2 Lite
    Amazon
    โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Amazon Nova 2 Omni
    Amazon
    โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Amazon Nova 2 Pro
    Amazon
    โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Amazon Nova Lite
    Amazon
    โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Amazon Nova Micro
    Amazon
    โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”โ€”
    Showing 1โ€“50 of 336 models

    Building with these APIs?

    Get 10+ Next.js AI templates with auth, payments, and more.

    Get Templates โ€” $249

    All Large Language Models

    Perplexity

    5 models

    MiniMax

    4 models

    01.ai

    1 models

    AI21 Labs

    2 models

    Anyscale

    2 models

    Baidu

    1 models

    Cohere

    4 models

    Inception

    1 models

    LG AI Research

    1 models

    Nous Research

    1 models

    Reka

    3 models

    StepFun

    1 models

    Xiaomi

    1 models

    Z AI

    8 models