FormulaNet
FormulaNet by alephpi, a image-to-text model with OCR capabilities. Understand and compare OCR features, benchmarks, and capabilities.
Comparison
| Feature | FormulaNet | Interfaze |
|---|---|---|
| Input Modalities | image | image, text, audio, video, document |
| Native OCR | Yes | Yes |
| Long Document Processing | No | Yes |
| Language Support | unknown | 162+ |
| Native Speech-to-Text | No | Yes |
| Native Object Detection | No | Yes |
| Guardrail Controls | No | Yes |
| Context Input Size | unknown | 1M |
| Tool Calling | No | Tool calling supported + built in browser, code execution and web search |
OCR Capabilities
| Feature | FormulaNet | Interfaze |
|---|---|---|
| Text Bounding Boxes | No | Yes |
| Confidence Scores | No | Yes |
| Dense Image Processing | No | Yes |
| Low Quality Images | No | Yes |
| Handwritten Text | No | Yes |
| Charts, Tables & Equations | No | Yes |
Scaling
| Feature | FormulaNet | Interfaze |
|---|---|---|
| Scaling | Self-hosted/Provider-hosted with quantization | Unlimited |
View model card on Hugging Face
See more details in https://github.com/alephpi/Texo