STACKQUADRANT

PaddlePaddle/PaddleOCR

RAG Libraries

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Overall score: 8.6
GitHub Metrics
Stars: 84.1k
Forks: 10.9k
Open Issues: 221
Watchers: 551
Contributors: 297
Weekly Commits: 0
Language: Python
License: Apache-2.0
Last Commit: Jun 26, 2026
Created: May 8, 2020
Latest Release: v3.7.0
Release Date: Jun 11, 2026
Synced: Jun 28, 2026
Quality Scores
Documentation Quality (weight: 20%)
8.4

Has docs site (https://www.paddleocr.com). Description: 176 chars. Stars signal: 84,122. Contributors: 297. Score: 8.4/10

Community Health (weight: 20%)
9.7

Stars: 84,122. Contributors: 297. Watchers: 551. Forks: 10,904. Issue ratio: 0.3%. Score: 9.7/10
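
The issue ratio above appears to be open issues divided by stars, expressed as a percentage. The short sketch below reproduces that figure from the listed metrics, along with its inverse, the stars-per-issue ratio reported under API Design & DX. The formula is an assumption inferred from the numbers on this page, not a documented StackQuadrant method, and the variable names are illustrative.

```python
# Metrics as listed on this page.
stars = 84_122
open_issues = 221

# Assumed formula: open issues as a percentage of stars.
issue_ratio_pct = round(open_issues / stars * 100, 1)

# Inverse view: how many stars there are per open issue.
stars_per_issue = round(stars / open_issues)

print(f"Issue ratio: {issue_ratio_pct}%")
print(f"Stars/issues ratio: {stars_per_issue}")
```

Both values match the page (0.3% and 381), which suggests the two ratios are the same signal viewed from opposite directions.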

Maintenance Velocity (weight: 15%)
7.7

Last commit: 2d ago. Weekly commits: 0. Latest release: v3.7.0. Maturity bonus: 6.1y old. Score: 7.7/10
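
The "2d ago" and "6.1y old" figures are consistent with the dates in the GitHub Metrics block when measured against the sync date. A minimal sketch of that arithmetic follows. Using the sync date as the reference point and 365.25 days per year are assumptions, since the page does not state how it derives these values.

```python
from datetime import date

# Dates as listed in the GitHub Metrics block.
created = date(2020, 5, 8)
last_commit = date(2026, 6, 26)
synced = date(2026, 6, 28)

# Days between the last commit and the sync (assumed reference point).
days_since_commit = (synced - last_commit).days

# Repository age in years, assuming an average year of 365.25 days.
age_years = round((synced - created).days / 365.25, 1)

print(f"Last commit: {days_since_commit}d ago")
print(f"Age: {age_years}y")
```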

API Design & DX (weight: 20%)
8.4

Stars/issues ratio: 381. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 84,122 stars. Score: 8.4/10

Production Readiness (weight: 15%)
8.6

Battle-tested: 84,122 stars. Peer review: 297 contributors. Versioned: v3.7.0. Licensed: Apache-2.0. Age: 6.1 years. Maintenance: last commit 2d ago. Score: 8.6/10

Ecosystem Integration (weight: 10%)
8.9

Fork interest: 10,904. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 84,122 stars. Has web presence. Score: 8.9/10
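
The six category weights sum to 100%, and a weighted average of the category scores reproduces the 8.6 shown near the top of the page. The sketch below works through that calculation. Treating the headline number as a plain weighted mean is an assumption based on the arithmetic matching, and the dictionary layout is illustrative only.

```python
# (score, weight in percent) for each category, as listed on this page.
categories = {
    "Documentation Quality": (8.4, 20),
    "Community Health": (9.7, 20),
    "Maintenance Velocity": (7.7, 15),
    "API Design & DX": (8.4, 20),
    "Production Readiness": (8.6, 15),
    "Ecosystem Integration": (8.9, 10),
}

# Weights are kept as integers so the 100% check is exact.
total_weight = sum(weight for _, weight in categories.values())

# Assumed formula: weighted mean of the category scores.
overall = sum(score * weight for score, weight in categories.values()) / total_weight
overall_rounded = round(overall, 1)

print(f"Total weight: {total_weight}%")
print(f"Overall score: {overall_rounded}")
```

Because Community Health carries a 20% weight and the highest score, it contributes the most to the overall figure, while Maintenance Velocity pulls it down the furthest.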

Tags
ai4science, chineseocr, document-parsing, document-translation, kie, ocr, paddleocr-vl, pdf-extractor-rag, pdf-parser, pdf2markdown
Radar chart (axes: Documentation Quality, Community Health, Maintenance Velocity, API Design & DX, Production Readiness, Ecosystem Integration)