EXPLORE MODELS Header Image

    EXPLORE MODELS

    Explore and experiment with today's leading models. Use our model documentation to setup your model of choice in minutes.

    TEXT-TO-TEXT

    Prices shown are per 1 million tokens

    DeepSeek R1 visualization
    DeepSeek
    FP8

    DeepSeek R1

    DeepSeek-R1 is an open-source first-generation reasoning model leveraging large-scale reinforcement learning to achieve state-of-the-art performance in math, code, and reasoning tasks, and includes distilled models suitable for various applications.

    $0.45 / $2.15
    125K Context
    DeepSeek R1 0528 visualization
    DeepSeek
    FP8

    DeepSeek R1 0528

    DeepSeek-R1 is an open-source first-generation reasoning model leveraging large-scale reinforcement learning to achieve state-of-the-art performance in math, code, and reasoning tasks, and includes distilled models suitable for various applications.

    $0.50 / $2.15
    125K Context
    DeepSeek R1 Distill Llama 70B visualization
    DeepSeek
    FP8

    DeepSeek R1 Distill Llama 70B

    Feel the power of reasoning models. This distilled model beats GPT-4o on math & matches o1-mini on coding.

    $0.10 / $0.40
    125K Context
    DeepSeek V3 0324 visualization
    DeepSeek
    FP8

    DeepSeek V3 0324

    DeepSeek-V3-0324 is an advanced language model with improved reasoning capabilities, enhanced web development support, superior Chinese writing proficiency, and refined function calling accuracy, designed to provide detailed search analysis and high-quality interactive experiences.

    $0.45 / $1.45
    125K Context
    Llama 3.1 70B Instruct visualization
    Meta
    FP16

    Llama 3.1 70B Instruct

    The Meta Llama 3.1 collection consists of high-performing, multilingual large language models optimized for dialogue and capable of handling text and code across 8 languages, available in 8B, 70B, and 405B parameter sizes, with a focus on safety, inclusivity, and societal benefit.

    $0.30 / $0.40
    16K Context
    JSON
    Llama 3.1 8B Instruct visualization
    Meta
    FP8
    FP16

    Llama 3.1 8B Instruct

    Meta Llama 3.1 is a collection of advanced, multilingual large language models designed for dialogues, available in 8B, 70B, and 405B sizes, that outperform many chat models on industry benchmarks and emphasize safe, responsible use in various applications.

    $0.025 / $0.025
    16K Context
    JSON
    Llama 3.2 1B Instruct visualization
    Meta
    FP16

    Llama 3.2 1B Instruct

    Llama 3.2 is a multilingual large language model collection from Meta, fine-tuned for dialogue and summarization tasks in multiple languages, designed for enhanced retrieval and conversational agents.

    $0.01 / $0.01
    16K Context
    JSON
    Llama 3.2 3B Instruct visualization
    Meta
    FP16

    Llama 3.2 3B Instruct

    Llama 3.2 is a multilingual large language model collection optimized for dialogue, retrieval, and summarization tasks with enhanced performance on industry benchmarks, employing supervised fine-tuning and reinforcement learning for safety and human-aligned responses.

    $0.02 / $0.02
    16K Context
    JSON
    Tool Calling
    Llama 3.3 70B Instruct visualization
    Meta
    FP16

    Llama 3.3 70B Instruct

    Meta's Llama 3.3 is a 70B parameter multilingual instruction-tuned language model designed for dialogue use, outperforming many open and closed-source models and incorporating safety features such as supervised fine-tuning and reinforcement learning with human feedback.

    $0.30 / $0.40
    125K Context
    JSON
    Tool Calling
    Mistral Nemo 12B Instruct visualization
    Mistral
    FP8

    Mistral Nemo 12B Instruct

    Mistral-NeMo-12B-Instruct is a 12-billion-parameter multilingual large language model designed for English-language chat applications, featuring impressive multilingual and code comprehension, with customization options via NVIDIA's NeMo Framework.

    $0.038 / $0.10
    16K Context
    JSON
    Tool Calling
    Osmosis Structure 0.6B visualization
    Osmosis
    FP32

    Osmosis Structure 0.6B

    Osmosis-Structure-0.6B is a small but capable language model optimized for generating structured outputs, particularly excelling in mathematical reasoning and problem-solving tasks with impressive performance enhancements through its structured training methodology.

    $0.10 / $0.50
    4K Context
    JSON
    Qwen 3 30B A3B visualization
    Qwen
    FP8

    Qwen 3 30B A3B

    Qwen3-30B-A3B-FP8 is a cutting-edge large language model that offers seamless switching between complex reasoning and general-purpose dialogue, excelling in reasoning, instruction-following, multilingual support, and agent integration, with a robust capacity for handling long contexts and supporting over 100 languages.

    $0.08 / $0.29
    16K Context
    JSON

    IMAGE-TO-TEXT

    Prices shown are per 1 million tokens

    Llama 3.2 11B Vision Instruct visualization
    Meta
    FP16

    START BUILDING TODAY

    15 minutes could save you 50% or more on compute.