Interfaze's architecture lets you programmatically run individual parts of the model or its built-in tools without activating the full model, which makes these calls significantly faster and cheaper.
| Task name | Description |
|---|---|
| `ocr` | Optical character recognition on images and documents |
| `object_detection` | Detect objects in images |
| `gui_detection` | Detect GUI elements in images |
| `web_search` | Web search |
| `scraper` | Extract structured data from web pages |
| `speech_to_text` | Speech-to-text transcription |
| `translate` | Translation |
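The task names in the table can be captured as a TypeScript union so that a typo is caught before a request is sent. This is a minimal sketch: `TASK_NAMES`, `TaskName`, and `isTaskName` are illustrative names chosen here, not part of any Interfaze SDK.

```typescript
// Task names copied from the table above.
const TASK_NAMES = [
  "ocr",
  "object_detection",
  "gui_detection",
  "web_search",
  "scraper",
  "speech_to_text",
  "translate",
] as const;

// Union of the literal task names: "ocr" | "object_detection" | ...
type TaskName = (typeof TASK_NAMES)[number];

// Hypothetical helper: narrows an arbitrary string to a known task name.
function isTaskName(value: string): value is TaskName {
  return TASK_NAMES.some((name) => name === value);
}
```

A guard like this is mainly useful when the task name comes from user input or configuration rather than from a literal in the code.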
To run a task, set the system prompt to `<task>task_name</task>` and use an `any` or empty schema as the response format.

Example of system prompt: `<task>web_search</task>`

OpenAI SDK
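```typescript
import OpenAI from "openai";
import { z } from "zod";
import { zodResponseFormat } from "openai/helpers/zod";

// Assumption: `interfaze` is an OpenAI SDK client pointed at Interfaze.
// The environment variable names below are placeholders.
const interfaze = new OpenAI({
  baseURL: process.env.INTERFAZE_BASE_URL,
  apiKey: process.env.INTERFAZE_API_KEY,
});

const response = await interfaze.chat.completions.create({
  model: "interfaze-beta",
  messages: [
    {
      // The system message selects the task to run.
      role: "system",
      content: "<task>speech_to_text</task>",
    },
    {
      role: "user",
      content: [
        { type: "text", text: "Transcribe the audio file https://r2public.jigsawstack.com/interfaze/examples/stt_long_audio_sample_3.mp3" },
      ],
    },
  ],
  // An `any` (empty) schema keeps the run-task path active.
  response_format: zodResponseFormat(z.any(), "empty_schema"),
});

console.log(response.choices[0].message.content);
```

Output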
The output contains the name of the task and the raw result. The result schema is different depending on the task. The output is truncated for this example.
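Since the output carries the task name and a task-dependent raw result, it can help to validate the outer shape before touching the result. The sketch below assumes the message content is a JSON string with `name` and `result` fields. Those field names, the `TaskOutput` interface, and `parseTaskOutput` are assumptions made for illustration and may not match the actual response.

```typescript
// Assumed shape of a run-task output; the field names are illustrative.
interface TaskOutput {
  name: string; // name of the task that ran
  result: unknown; // raw result; its schema depends on the task
}

// Hypothetical helper: parses the message content and checks the outer shape.
function parseTaskOutput(content: string | null): TaskOutput {
  if (content === null) {
    throw new Error("Response message has no content");
  }
  const parsed: unknown = JSON.parse(content);
  if (
    typeof parsed !== "object" ||
    parsed === null ||
    typeof (parsed as { name?: unknown }).name !== "string" ||
    !("result" in parsed)
  ) {
    throw new Error("Unexpected run-task output shape");
  }
  const { name, result } = parsed as TaskOutput;
  return { name, result };
}
```

Keeping `result` typed as `unknown` forces each caller to narrow it for the specific task it ran, which fits a result schema that changes from task to task.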
- The `<task>` tag is parsed from the system message (the first match). Only one task can be run at a time.
- If a non-empty schema is provided together with a `<task>` tag, the schema takes priority: the run task execution is skipped and the full model is activated to generate a structured response following the schema.
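The two rules above can be pictured as a small decision function. This is only an illustration of the behavior as described, not Interfaze's actual implementation. The `Execution` type, `resolveExecution`, and the regular expression used to find the tag are assumptions.

```typescript
type Execution =
  | { mode: "task"; task: string }
  | { mode: "full_model" };

// Hypothetical sketch of the routing rules described above.
function resolveExecution(
  systemMessage: string,
  hasNonEmptySchema: boolean,
): Execution {
  // Without the global flag, exec() returns only the first match,
  // so any later <task> tags are ignored.
  const match = /<task>([a-z_]+)<\/task>/.exec(systemMessage);
  const task = match === null ? undefined : match[1];
  if (task === undefined) {
    return { mode: "full_model" };
  }
  // A non-empty schema takes priority over the tag: the task is skipped.
  if (hasNonEmptySchema) {
    return { mode: "full_model" };
  }
  return { mode: "task", task };
}
```

Under these assumptions, a system message containing both `<task>ocr</task>` and `<task>translate</task>` resolves to `ocr` only, and passing a real schema alongside either tag falls back to the full model.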