API USAGE
API IDENTIFIER
inference-net/schematron-3b
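import OpenAI from "openai";

// Point the OpenAI client at the Inference.net endpoint.
const openai = new OpenAI({
  baseURL: "https://api.inference.net/v1",
  apiKey: process.env.INFERENCE_API_KEY,
});

// Request a streamed chat completion.
const completion = await openai.chat.completions.create({
  model: "inference-net/schematron-3b",
  messages: [
    {
      role: "user",
      content: "What is the meaning of life?",
    },
  ],
  stream: true,
});

// Print each text delta as it arrives; some chunks carry no content,
// so fall back to an empty string instead of writing undefined.
for await (const chunk of completion) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}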
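When the full reply is needed as a single string rather than printed piece by piece, the streamed deltas can be accumulated. The following is a minimal sketch, not part of the original page: `collectStream`, `StreamChunk`, and `fakeStream` are hypothetical names, and the chunk type models only the fields the snippet above reads. The stand-in stream lets the helper run without network access.

```typescript
// Minimal shape of a streamed chunk; only the fields used here are modeled.
interface StreamChunk {
  choices: Array<{ delta?: { content?: string | null } }>;
}

// Hypothetical helper: concatenates the text deltas of a stream into one string.
async function collectStream(
  stream: AsyncIterable<StreamChunk>,
): Promise<string> {
  let text = "";
  for await (const chunk of stream) {
    // Chunks without text (or without choices) contribute nothing.
    text += chunk.choices[0]?.delta?.content ?? "";
  }
  return text;
}

// Stand-in stream for local experimentation (assumption: a real stream would
// come from openai.chat.completions.create({ ..., stream: true })).
async function* fakeStream(): AsyncGenerator<StreamChunk> {
  yield { choices: [{ delta: { content: "Hel" } }] };
  yield { choices: [{ delta: { content: "lo" } }] };
  yield { choices: [{ delta: {} }] }; // chunk with no text
  yield { choices: [] }; // chunk with no choices
}
```

With the client from the snippet above, `await collectStream(completion)` would be the analogous call, assuming the streamed chunks have this shape.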
PLAYGROUND
System prompt: tweak the overall style and tone of the conversation.
Temperature: control how creative the model should be when responding.
Max tokens: set the maximum length, in tokens, of the generated text.
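In an API call, the three playground settings described above plausibly correspond to a system message, `temperature`, and `max_tokens`, which are standard parameters of OpenAI-compatible chat completion requests. That mapping is an assumption, as are the names `buildRequest` and `PlaygroundSettings`; the sketch below only assembles a request object and sends nothing.

```typescript
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Hypothetical container for the three playground controls.
interface PlaygroundSettings {
  systemPrompt?: string; // style and tone of the conversation
  temperature?: number; // higher values give more varied output
  maxTokens?: number; // upper bound on generated tokens
}

interface ChatRequest {
  model: string;
  messages: ChatMessage[];
  temperature?: number;
  max_tokens?: number;
}

// Builds a chat completion request body; unset options are left out so
// the server's defaults apply.
function buildRequest(
  userMessage: string,
  settings: PlaygroundSettings = {},
): ChatRequest {
  const messages: ChatMessage[] = [];
  if (settings.systemPrompt) {
    messages.push({ role: "system", content: settings.systemPrompt });
  }
  messages.push({ role: "user", content: userMessage });

  const request: ChatRequest = {
    model: "inference-net/schematron-3b",
    messages,
  };
  if (settings.temperature !== undefined) {
    request.temperature = settings.temperature;
  }
  if (settings.maxTokens !== undefined) {
    request.max_tokens = settings.maxTokens;
  }
  return request;
}
```

The resulting object could be passed to `openai.chat.completions.create(...)`, optionally with `stream: true` added.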
RELATED MODELS

ClipTagger 12B
ClipTagger-12b is an efficient, open-source, 12-billion-parameter vision-language model designed for scalable video understanding. It produces schema-consistent JSON outputs for video frames, offering frontier-quality performance at a fraction of the cost of leading closed-source models.

Google Gemma 3
Gemma 3 is a lightweight, multimodal open-source model family from Google DeepMind. It processes text and images and generates text, supports over 140 languages with a 128K context window, and is designed for easy deployment in resource-constrained environments.

Llama 3.1 8B Instruct
Meta Llama 3.1 is a collection of multilingual large language models designed for dialogue and available in 8B, 70B, and 405B sizes. The models outperform many chat models on industry benchmarks and emphasize safe, responsible use across applications.