EXPLORE MODELS Header Image

    EXPLORE MODELS

    Explore and experiment with today's leading models. Use our model documentation to setup your model of choice in minutes.

    TEXT-TO-TEXT

    Prices shown are per 1 million tokens

    Llama 3.1 8B Instruct visualization
    Meta
    FP8
    FP16

    Llama 3.1 8B Instruct

    Meta Llama 3.1 is a collection of advanced, multilingual large language models designed for dialogues, available in 8B, 70B, and 405B sizes, that outperform many chat models on industry benchmarks and emphasize safe, responsible use in various applications.

    $0.025 / $0.025
    16K Context
    JSON
    Llama 3.2 1B Instruct visualization
    Meta
    FP16

    Llama 3.2 1B Instruct

    Llama 3.2 is a multilingual large language model collection from Meta, fine-tuned for dialogue and summarization tasks in multiple languages, designed for enhanced retrieval and conversational agents.

    $0.01 / $0.01
    16K Context
    JSON
    Llama 3.2 3B Instruct visualization
    Meta
    FP16

    Llama 3.2 3B Instruct

    Llama 3.2 is a multilingual large language model collection optimized for dialogue, retrieval, and summarization tasks with enhanced performance on industry benchmarks, employing supervised fine-tuning and reinforcement learning for safety and human-aligned responses.

    $0.02 / $0.02
    16K Context
    JSON
    Tool Calling
    Mistral Nemo 12B Instruct visualization
    Mistral
    FP8

    Mistral Nemo 12B Instruct

    Mistral-NeMo-12B-Instruct is a 12-billion-parameter multilingual large language model designed for English-language chat applications, featuring impressive multilingual and code comprehension, with customization options via NVIDIA's NeMo Framework.

    $0.038 / $0.10
    16K Context
    JSON
    Tool Calling
    OpenAI gpt-oss 120B visualization
    OpenAI
    MXFP4

    OpenAI gpt-oss 120B

    The gpt-oss-120b model is OpenAI's powerful open-weight model designed for high-level reasoning and diverse developer applications, featuring customizable reasoning effort, permissive licensing, agentic capabilities, and the ability to be fully fine-tuned for specific use cases.

    $0.05 / $0.45
    128K Context
    JSON
    Tool Calling
    OpenAI gpt-oss 20B visualization
    OpenAI
    MXFP4

    OpenAI gpt-oss 20B

    The GPT-OSS-20B is an open-weight AI model by OpenAI, designed for versatile applications in reasoning, function calling, and agentic tasks, offering fine-tuning capabilities and configurable reasoning levels for specialized use, with a distribution under the Apache 2.0 license for both experimentation and commercial deployment.

    $0.05 / $0.15
    128K Context
    JSON
    Tool Calling
    Osmosis Structure 0.6B visualization
    Osmosis
    FP32

    Osmosis Structure 0.6B

    Osmosis-Structure-0.6B is a small but capable language model optimized for generating structured outputs, particularly excelling in mathematical reasoning and problem-solving tasks with impressive performance enhancements through its structured training methodology.

    $0.10 / $0.50
    4K Context
    JSON

    START BUILDING TODAY

    15 minutes could save you 50% or more on compute.