Get Started
Examples
Concepts
Resources
Integrations
copy markdown
Detect and identify objects from an input image. It locates bounding boxes, classify objects, and return structured metadata describing each detected entity.
OpenAI SDK
Vercel AI SDK
LangChain SDK
Bounding boxes mapped to the image

JSON output
object contains the extracted information defined in the schema. precontext contains the raw metadata such as bounding boxes and confidence scores.
Running object detection as a single task with <task>object_detection</task> in the system message makes it cheaper and faster with a fixed structured output that's pre-defined.
Learn more about running a task.
OpenAI SDK
Vercel AI SDK
LangChain SDK
JSON output