News

    Introducing Catalyst: Train self-improving AI models

    Learn more
    Blog

    Latest Updates

    Stay informed about models we're releasing, upgrades to our API services and our thoughts on the industry.

    Schematron V2: Frontier HTML-to-JSON extraction at a fraction of the cost

    Schematron V2: Frontier HTML-to-JSON extraction at a fraction of the cost

    Schematron is a family of small, purpose-built models that transform messy HTML into clean, structured JSON, delivering frontier-level extraction quality at a fraction of the cost and latency of large general-purpose LLMs.

    Apr 16, 2026

    A

    Amar Singh

    Introducing Catalyst: Monitor, train, and deploy self-improving AI models

    Introducing Catalyst: Monitor, train, and deploy self-improving AI models

    Build self-improving AI-systems from production data

    Apr 14, 2026

    S

    Sam Hogan

    How Inference.net trains Specialized Language Models that cut AI costs by up to 50x

    How Inference.net trains Specialized Language Models that cut AI costs by up to 50x

    Learn how Inference.net trains Specialized Language Models using the NVIDIA NeMO framework to delivery frontier accuracy at up to 50x lower cost.

    Mar 11, 2026

    S

    Sam Hogan

    Specialized LLMs: The model you need doesn't exist yet

    Specialized LLMs: The model you need doesn't exist yet

    Specialized LLMs trained on your own user data can match frontier quality for a fraction of the cost

    Feb 5, 2026

    S

    Sam Hogan

    Project OSSAS: Custom LLMs to process 100 Million Research Papers

    Project OSSAS: Custom LLMs to process 100 Million Research Papers

    Project OSSAS is a large-scale open-science initiative to make the world’s scientific knowledge accessible through AI-generated summaries of research papers.

    Nov 11, 2025

    S

    Sam Hogan

    LOGIC: Trustless Inference through Log-Probability Verification

    LOGIC: Trustless Inference through Log-Probability Verification

    A practical method for verifying LLM inference requests in trustless environments.

    Nov 5, 2025

    A

    Amar Singh

    Hybrid-Attention models are the future for SLMs

    Hybrid-Attention models are the future for SLMs

    Hybrid attention delivers up to 3x cost reduction compared to traditional transformer models

    Nov 3, 2025

    A

    Amar Singh

    Announcing our $11.8M Series Seed

    Announcing our $11.8M Series Seed

    We raised $11.8 million in funding led by Multicoin Capital and a16z CSX to train and hosti custom language models that are faster, more affordable, and more accurate than what the Big Labs offer.

    Oct 14, 2025

    S

    Sam Hogan

    Schematron: An LLM trained for HTML -> JSON at scale

    Schematron: An LLM trained for HTML -> JSON at scale

    Schematron-8B and Schematron-3B deliver frontier-level extraction quality at 1-2% of the cost and 10x+ faster inference than large, general-purpose LLMs.

    Sep 9, 2025

    S

    Sam Hogan

    Introducing ClipTagger-12b: SoTA Video Understanding at 15x Lower Cost

    Introducing ClipTagger-12b: SoTA Video Understanding at 15x Lower Cost

    We're thrilled to announce the release of ClipTagger-12b, a groundbreaking open-source vision-language model that delivers GPT-4.1-level performance for video understanding at a fraction of the cost.

    Aug 14, 2025

    S

    Sam Hogan