

    EXPLORE MODELS

    Explore and experiment with today's leading models. Use our model documentation to set up your model of choice in minutes.
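
    To make "minutes" concrete, here is a minimal Python sketch of what that setup usually looks like against an OpenAI-compatible chat completions endpoint. The base URL, API key environment variable, and model identifier are placeholders, not values from this page; take the real ones from the model documentation.

        import os
        from openai import OpenAI

        # Minimal sketch, assuming an OpenAI-compatible API.
        # The base URL, API key variable, and model name below are placeholders;
        # substitute the values given in the model documentation.
        client = OpenAI(
            base_url="https://api.example.com/v1",      # placeholder endpoint
            api_key=os.environ["INFERENCE_API_KEY"],    # placeholder env var
        )

        response = client.chat.completions.create(
            model="deepseek-v3",                        # placeholder model identifier
            messages=[{"role": "user", "content": "Say hello in one sentence."}],
        )
        print(response.choices[0].message.content)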

    TEXT TO TEXT

    Prices shown are per 1 million tokens
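
    As a quick worked example of how those rates translate into a bill, the sketch below estimates the cost of one request. It assumes the two figures on each card are input and output prices respectively, which is the common convention but worth confirming in the pricing documentation.

        # Back-of-the-envelope cost estimate from per-1M-token rates.
        # Assumes a listed "$X / $Y" pair means input / output pricing.
        def request_cost(input_tokens, output_tokens, input_price, output_price):
            """All prices are USD per 1 million tokens."""
            return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

        # Example: a model priced at $0.40 / $1.20, with 250,000 input tokens
        # and 50,000 output tokens in a request.
        print(request_cost(250_000, 50_000, 0.40, 1.20))  # -> 0.16 (USD)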

    DeepSeek
    FP8

    DeepSeek R1 Distill Llama 70B

    Feel the power of reasoning models. This distilled model beats GPT-4o on math & matches o1-mini on coding.

    $0.40 / $0.40
    16K Context
    DeepSeek
    FP8

    DeepSeek V3

    DeepSeek-V3 is a 671 billion parameter Mixture-of-Experts (MoE) language model optimized for efficiency and performance, demonstrating superior results across various benchmarks through innovative strategies and extensive pre-training on high-quality data.

    $0.40 / $1.20
    125K Context
    Google
    BF16

    Google Gemma 3

    Gemma 3 is a versatile, lightweight, multimodal open model family from Google DeepMind. It handles text and image inputs and generates text, supports over 140 languages with a 128K context window, and is designed for easy deployment in resource-constrained environments.

    $0.30 / $0.40
    125K Context
    JSON
    Meta
    FP16

    Llama 3.1 70B Instruct

    The Meta Llama 3.1 collection consists of high-performing, multilingual large language models optimized for dialogue and capable of handling text and code across 8 languages, available in 8B, 70B, and 405B parameter sizes, with a focus on safety, inclusivity, and societal benefit.

    $0.30 / $0.40
    16K Context
    JSON
    Meta
    FP8
    FP16

    Llama 3.1 8B Instruct

    Meta Llama 3.1 is a collection of advanced, multilingual large language models designed for dialogues, available in 8B, 70B, and 405B sizes, that outperform many chat models on industry benchmarks and emphasize safe, responsible use in various applications.

    $0.025 / $0.025
    16K Context
    JSON
    Meta
    FP16

    Llama 3.2 11B Vision Instruct

    Llama 3.2-Vision, developed by Meta, is a state-of-the-art multimodal language model optimized for image recognition, reasoning, and captioning, surpassing both open and closed models in industry benchmarks.

    $0.055 / $0.055
    16K Context
    JSON
    Meta
    FP16

    Llama 3.2 1B Instruct

    Llama 3.2 is a multilingual large language model collection from Meta, fine-tuned for dialogue and summarization tasks in multiple languages, designed for enhanced retrieval and conversational agents.

    $0.01 / $0.01
    16K Context
    JSON
    Meta
    FP16

    Llama 3.2 3B Instruct

    Llama 3.2 is a multilingual large language model collection optimized for dialogue, retrieval, and summarization tasks with enhanced performance on industry benchmarks, employing supervised fine-tuning and reinforcement learning for safety and human-aligned responses.

    $0.02 / $0.02
    16K Context
    JSON
    Tool Calling
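
    For models carrying the Tool Calling tag, a request can include function definitions that the model may choose to invoke. The sketch below shows the general shape of such a call through an OpenAI-compatible client; the endpoint, model identifier, and get_weather tool are illustrative placeholders, and exact support should be checked in the model documentation.

        import os
        from openai import OpenAI

        # Hedged sketch of a tool-calling request via an OpenAI-compatible API.
        # Base URL, model name, and the get_weather tool are placeholders.
        client = OpenAI(
            base_url="https://api.example.com/v1",
            api_key=os.environ["INFERENCE_API_KEY"],
        )

        tools = [{
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }]

        response = client.chat.completions.create(
            model="llama-3.2-3b-instruct",  # placeholder identifier
            messages=[{"role": "user", "content": "What's the weather in Lisbon?"}],
            tools=tools,
        )

        # If the model decided to call the tool, the arguments arrive as JSON.
        for call in response.choices[0].message.tool_calls or []:
            print(call.function.name, call.function.arguments)
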
    Mistral
    FP8

    Mistral Nemo 12B Instruct

    Mistral-NeMo-12B-Instruct is a 12-billion-parameter multilingual large language model designed for English-language chat applications, featuring impressive multilingual and code comprehension, with customization options via NVIDIA's NeMo Framework.

    $0.038 / $0.10
    16K Context
    JSON
    Tool Calling
    Qwen
    BF16

    Qwen 2.5 7B Vision Instruct

    Qwen2.5-7B-Instruct is a multilingual large language model from Alibaba Cloud, offering enhanced capabilities in knowledge, coding, mathematics, and instruction-following, along with support for processing long texts and generating structured outputs like JSON.

    $0.20 / $0.20
    125K Context
    JSON
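
    The JSON tag on this and several other cards refers to structured output. As a minimal sketch, an OpenAI-compatible client can request it with the response_format parameter; whether a given model here honors it, and the model identifier used below, are assumptions to verify against the model documentation.

        import json
        import os
        from openai import OpenAI

        # Hedged sketch of requesting JSON-formatted output.
        # Assumes the endpoint honors the OpenAI-style response_format parameter.
        client = OpenAI(
            base_url="https://api.example.com/v1",
            api_key=os.environ["INFERENCE_API_KEY"],
        )

        response = client.chat.completions.create(
            model="qwen-2.5-7b-instruct",  # placeholder identifier
            messages=[
                {"role": "system", "content": "Reply only with a JSON object."},
                {"role": "user", "content": 'List three model families as {"families": [...]}.'},
            ],
            response_format={"type": "json_object"},
        )

        print(json.loads(response.choices[0].message.content))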

    START BUILDING TODAY

    15 minutes could save you 50% or more on compute.