
OnnxOCR
基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
The Lens
OnnxOCR is a fast, multilingual OCR engine, the technology that turns images of text into actual text you can use. It is a rebuild of the popular PaddleOCR that strips out the heavy PaddlePaddle training framework and runs on ONNX Runtime instead, which makes it lean and quick. It reads Simplified and Traditional Chinese, English, Japanese, and more, and handles tables, document layout, and even license plates. Apache-2.0 and free.
Setup is a pip install with Python 3.8 or newer. You can run it locally with a test script, stand it up as a JSON API, or launch a browser UI, and Docker support is included. Because it dropped the training framework, it runs well on edge devices and on both ARM and x86, which is the whole point: OCR without dragging a deep learning stack along. Accuracy is reported to match PaddleOCR 3.0.
Developers who need to pull text from images or scanned documents and would rather not pay per-call cloud fees should look here. Solo and small teams get production-capable OCR for free, running on their own hardware. Larger teams processing high volumes save the most, since cloud OCR APIs bill per image. It is free at every scale.
The catch is that you own the deployment and the accuracy tuning. Cloud OCR services like Google Vision or AWS Textract hand you an API and a support line; here you manage the models and the edge cases yourself. For high volume or privacy-sensitive work, that trade is usually worth it.
Free vs Self-Hosted vs Paid
fully freeFree tier: Everything. Apache-2.0 licensed, the full multilingual OCR engine with table, layout, and license-plate recognition.
Self-hosted: A pip install with Python 3.8+. Run it as a local script, a JSON API, or a browser UI, with Docker support included. Because it dropped the PaddlePaddle training framework, it runs well on edge devices and both ARM and x86.
Paid: None. The relevant comparison is cloud OCR APIs like Google Vision and AWS Textract, which bill per image; running OnnxOCR yourself trades that per-call fee for your own hardware.
Completely free and open source under Apache-2.0. The savings come from running OCR yourself instead of paying per-image cloud fees.
Get tools like this every Wednesday
One featured tool, three on the radar. No fluff.
License: Apache License 2.0
Use freely. Patent grant included.
Commercial use: ✓ Yes
About
- Owner
- jingsongliu (User)
- Stars
- 1,784
- Forks
- 195
Explore Further
More tools in the directory
openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
373.3k ★everything-claude-code
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
192.9k ★claw-code
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
192.0k ★