EXPLORE MODELS Header Image

    EXPLORE MODELS

    Explore and experiment with today's leading models. Use our model documentation to setup your model of choice in minutes.

    TEXT-TO-TEXT

    Prices shown are per 1 million tokens

    Llama 3.1 8B Instruct visualization
    Meta
    FP8
    FP16

    Llama 3.1 8B Instruct

    Meta Llama 3.1 is a collection of advanced, multilingual large language models designed for dialogues, available in 8B, 70B, and 405B sizes, that outperform many chat models on industry benchmarks and emphasize safe, responsible use in various applications.

    $0.025 / $0.025
    16K Context
    JSON
    Llama 3.2 1B Instruct visualization
    Meta
    FP16

    Llama 3.2 1B Instruct

    Llama 3.2 is a multilingual large language model collection from Meta, fine-tuned for dialogue and summarization tasks in multiple languages, designed for enhanced retrieval and conversational agents.

    $0.01 / $0.01
    16K Context
    JSON
    Llama 3.2 3B Instruct visualization
    Meta
    FP16

    Llama 3.2 3B Instruct

    Llama 3.2 is a multilingual large language model collection optimized for dialogue, retrieval, and summarization tasks with enhanced performance on industry benchmarks, employing supervised fine-tuning and reinforcement learning for safety and human-aligned responses.

    $0.02 / $0.02
    16K Context
    JSON
    Tool Calling
    Mistral Nemo 12B Instruct visualization
    Mistral
    FP8

    Mistral Nemo 12B Instruct

    Mistral-NeMo-12B-Instruct is a 12-billion-parameter multilingual large language model designed for English-language chat applications, featuring impressive multilingual and code comprehension, with customization options via NVIDIA's NeMo Framework.

    $0.038 / $0.10
    16K Context
    JSON
    Tool Calling
    Osmosis Structure 0.6B visualization
    Osmosis
    FP32

    Osmosis Structure 0.6B

    Osmosis-Structure-0.6B is a small but capable language model optimized for generating structured outputs, particularly excelling in mathematical reasoning and problem-solving tasks with impressive performance enhancements through its structured training methodology.

    $0.10 / $0.50
    4K Context
    JSON

    EMBEDDINGS

    Prices shown are per 1 million tokens

    START BUILDING TODAY

    15 minutes could save you 50% or more on compute.