PRICING

    The best prices on the market. 90% lower cost than other providers. All models are billed by usage.

    PRICING Hero Image
    ModelPricing
    Meta Llama 3.1 8B Instruct FP8
    Maker:Meta
    Price/Million:$0.025
    Meta Llama 3.1 8B Instruct FP16
    Maker:Meta
    Price/Million:$0.03
    Meta Llama 3.1 70B Instruct FP8
    Maker:Meta
    Price/Million:$0.30
    Meta Llama 3.1 70B Instruct FP16
    Maker:Meta
    Price/Million:$0.40
    Meta Llama 3.3 70B Instruct FP8
    Maker:Meta
    Price/Million:$0.30
    Meta Llama 3.3 70B Instruct FP16
    Maker:Meta
    Price/Million:$0.40
    Meta Llama 3.2 1B Instruct FP16
    Maker:Meta
    Price/Million:$0.01
    Meta Llama 3.2 3B Instruct FP16
    Maker:Meta
    Price/Million:$0.02
    Meta Llama 3.2 11B Instruct FP16
    Maker:Meta
    Price/Million:$0.055
    Mistral Nemo 12B Instruct FP8
    Maker:Mistral
    Price/Million:$0.10
    DeepSeek V3 FP8
    Maker:DeepSeek
    Price/Million:$2.00
    DeepSeek R1 FP8
    Maker:DeepSeek
    Price/Million:$5.00
    Qwen 2.5 72B Instruct FP8
    Maker:Qwen
    Price/Million:$0.35
    UnslopNemo 12B v4.1 BF16
    Maker:TheDrummer
    Price/Million:$0.50
    Rocinante 12B v1.1 BF16
    Maker:TheDrummer
    Price/Million:$0.80
    Mythomax L2 13B BF16
    Maker:Gryphe
    Price/Million:$0.19

    NEED A RESEARCH GRANT?

    Inference’s Grants program offers free compute resources to researchers and developers working on open-source AI projects.

    NEED ENTERPRISE PRICING?

    Inference is the best solution for large scale operations looking to source affordable inference compute. Leverage our network's capabilities and our team's expertise for your next initiative.

    START BUILDING TODAY

    15 minutes could save you 50% or more on compute.