# Viki Clip Models
Viki Clip Models by jnurik, an image-to-text model. Understand and compare its features, benchmarks, and capabilities.
## Comparison
| Feature | Viki Clip Models | Interfaze |
|---|---|---|
| Input Modalities | image | image, text, audio, video, document |
| Native OCR | No | Yes |
| Long Document Processing | No | Yes |
| Language Support | unknown | 162+ |
| Native Speech-to-Text | No | Yes |
| Native Object Detection | No | Yes |
| Guardrail Controls | No | Yes |
| Context Input Size | unknown | 1M |
| Tool Calling | No | Tool calling supported, plus built-in browser, code execution, and web search |
## Scaling
| Feature | Viki Clip Models | Interfaze |
|---|---|---|
| Scaling | Self-hosted/Provider-hosted with quantization | Unlimited |
View model card on Hugging Face
Live App: WikiLens Space
This repository contains the weights and FAISS indexes for the WikiLens app.
- Base Model: openai/clip-vit-base-patch32
- Training Data: 120k Wikipedia photo-article pairs
- Methods: Zero-shot, Frozen Encoders, DoRA fine-tuning
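The zero-shot method listed above can be sketched in a few lines: CLIP scores an image against one text embedding per candidate label by cosine similarity, scales by a learned temperature, and softmaxes into probabilities. The sketch below uses random vectors in place of real CLIP encoder outputs; the function name and the logit scale of 100 (CLIP's published value is about e^4.6 ≈ 100) are illustrative, not from this repo.

```python
import numpy as np

def zero_shot_scores(image_emb, text_embs, logit_scale=100.0):
    """CLIP-style zero-shot scoring: cosine similarity between an image
    embedding and each candidate label's text embedding, temperature-scaled
    and softmaxed into per-label probabilities."""
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = logit_scale * (txt @ img)
    e = np.exp(logits - logits.max())  # stable softmax
    return e / e.sum()

rng = np.random.default_rng(1)
image = rng.standard_normal(512)          # stand-in for an image embedding
labels = rng.standard_normal((3, 512))    # stand-ins for label text embeddings
labels[1] = image + 0.1 * rng.standard_normal(512)  # make label 1 near the image
probs = zero_shot_scores(image, labels)
print(int(probs.argmax()))  # → 1
```

Frozen-encoder and DoRA fine-tuning keep this same scoring rule but adapt the embeddings: the former trains only a head on top of fixed encoders, while DoRA updates the encoders through low-rank plus magnitude decomposition of the weights.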