STACKQUADRANT

PaddlePaddle/PaddleOCR

RAG Libraries

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Overall score: 8.6/10
GitHub Metrics
Stars: 75.7k
Forks: 10.2k
Open Issues: 236
Watchers: 532
Contributors: 288
Weekly Commits: 0
Language: Python
License: Apache-2.0
Last Commit: Apr 16, 2026
Created: May 8, 2020
Latest Release: v3.4.1
Release Date: Apr 14, 2026
Synced: Apr 16, 2026
Quality Scores
Documentation Quality (weight: 20%)
8.4

Has docs site (https://www.paddleocr.com). Description: 176 chars. Stars signal: 75,709. Contributors: 288. Score: 8.4/10

Community Health (weight: 20%)
9.7

Stars: 75,709. Contributors: 288. Watchers: 532. Forks: 10,244. Issue ratio: 0.3%. Score: 9.7/10

Maintenance Velocity (weight: 15%)
7.7

Last commit: 0d ago. Weekly commits: 0. Latest release: v3.4.1. Maturity bonus: 5.9y old. Score: 7.7/10
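The "0d ago" and "5.9y old" figures can be reproduced from the dates listed under GitHub Metrics. The sketch below assumes both are measured against the sync date (Apr 16, 2026) and that a year is counted as 365.25 days; the review does not state how it computes either value, so the variable names and the day-count convention are illustrative.

```python
from datetime import date

# Dates copied from the GitHub Metrics block.
created = date(2020, 5, 8)
last_commit = date(2026, 4, 16)
synced = date(2026, 4, 16)

# Assumption: ages are measured relative to the sync date,
# and a year is treated as 365.25 days.
DAYS_PER_YEAR = 365.25

age_years = round((synced - created).days / DAYS_PER_YEAR, 1)
days_since_commit = (synced - last_commit).days

print(f"Maturity: {age_years}y old, last commit: {days_since_commit}d ago")
```

Under these assumptions the result agrees with the note above: the repository is 5.9 years old and the last commit falls on the sync date itself.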

API Design & DX (weight: 20%)
8.4

Stars/issues ratio: 321. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 75,709 stars. Score: 8.4/10
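The two issue-based figures in these notes (the 0.3% issue ratio under Community Health and the stars/issues ratio of 321 here) appear to be the same pair of counts divided in opposite directions. A minimal sketch, assuming the ratios use the exact star count (75,709) and the open-issue count (236), with rounding chosen to match the displayed precision:

```python
# Counts copied from the review; the formulas are an assumption
# inferred from the displayed values, not documented by the site.
stars = 75_709
open_issues = 236

# Open issues as a percentage of stars, one decimal place.
issue_ratio_pct = round(open_issues / stars * 100, 1)

# Stars per open issue, rounded to the nearest integer.
stars_per_issue = round(stars / open_issues)

print(f"Issue ratio: {issue_ratio_pct}%")
print(f"Stars/issues ratio: {stars_per_issue}")
```

Both values match the published notes, which suggests the two metrics carry the same underlying signal rather than independent evidence.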

Production Readiness (weight: 15%)
8.6

Battle-tested: 75,709 stars. Peer review: 288 contributors. Versioned: v3.4.1. Licensed: Apache-2.0. Age: 5.9 years. Maintenance: last commit 0d ago. Score: 8.6/10

Ecosystem Integration (weight: 10%)
8.9

Fork interest: 10,244. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 75,709 stars. Has web presence. Score: 8.9/10
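The six category weights sum to 100%, so the overall score is plausibly a weighted mean of the category scores. The sketch below checks that reading; the aggregation formula is an assumption, since the review lists the weights but does not say how they are combined.

```python
# (score, weight in percent) per category, copied from the Quality Scores block.
categories = {
    "Documentation Quality": (8.4, 20),
    "Community Health": (9.7, 20),
    "Maintenance Velocity": (7.7, 15),
    "API Design & DX": (8.4, 20),
    "Production Readiness": (8.6, 15),
    "Ecosystem Integration": (8.9, 10),
}

total_weight = sum(weight for _, weight in categories.values())

# Assumption: overall score = weighted arithmetic mean of category scores.
overall = sum(score * weight for score, weight in categories.values()) / total_weight

print(f"Total weight: {total_weight}%")
print(f"Overall score: {round(overall, 1)}/10")
```

The weighted mean comes to roughly 8.635, which rounds to the 8.6 shown at the top of the page, so the weighted-mean interpretation is consistent with the published figure.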

Tags
ai4science, chineseocr, document-parsing, document-translation, kie, ocr, paddleocr-vl, pdf-extractor-rag, pdf-parser, pdf2markdown
Radar chart (axes: Documentation Quality, Community Health, Maintenance Velocity, API Design & DX, Production Readiness, Ecosystem Integration)
PaddlePaddle/PaddleOCR — 8.6/10 — AI/LLM Repository Review — StackQuadrant