Interfaze

logo

Beta

pricing

docs

blog

sign in

Get Started

Introduction

Examples

Vision

Concepts

Resources

Integrations

OCR (Optical Character Recognition)

copy markdown

Extract text and bounds with confidence scores from dense images and large documents including handwritten text, printed documents, screenshots, and other visual content.

  • Over 100+ languages with mixed language support. View all supported languages
  • Charts, tables, graphs, mathematical equations, latex, code blocks, and more
  • Multi-page documents and large PDFs
  • High density images and low quality images

OCR extraction of an image

OpenAI SDK

Vercel AI SDK

LangChain SDK

...

Bounding boxes mapped to the image

ID bounding boxes mapped

JSON output

object contains the extracted information defined in the schema. precontext contains the raw metadata such as bounding boxes and confidence scores.

...

OCR extraction of a PDF document

Document: https://arxiv.org/pdf/2602.04101

OpenAI SDK

Vercel AI SDK

LangChain SDK

...

JSON output

...

The output is truncated for this example.

Run OCR task with raw output

Running OCR as a tasks with <task>ocr</task> in the system message make it cheaper and faster with a fixed structured output that's pre-defined.

Learn more about running a task.

OpenAI SDK

Vercel AI SDK

LangChain SDK

...

JSON output

...