tesseract-ocr
Portfolio concentration
95%
Top three share
Shows whether the organization is driven by one breakout repo or several visible projects.
Breadth
14 repos
Visible snapshot
2 repositories updated in the last 90 days.
Leading language
Unknown
Portfolio mix
Unknown (7), HTML (2), C++ (1)
Average size
6.3K
Stars per repository
Useful for distinguishing one flagship-heavy publisher from a repeatable portfolio.
95% of the visible star count comes from this organization's top three repositories.
6.3K stars per repository in this same snapshot.
Unknown is the most common language here, with 2 repositories updated in the last 90 days.
Why this rank
This organization stands out because one flagship repo drives 84% of its visible star count.
Organization pages work best when you separate portfolio breadth from flagship concentration. In tesseract-ocr's case, the visible top three repositories account for about 95% of total stars in this snapshot, which helps explain whether the organization is known for one breakout project or for a broader repeatable portfolio.
The dominant language mix here is Unknown (7), HTML (2), C++ (1), where Unknown counts the repositories with no language listed in the table below. That makes this page useful not just for popularity checks, but also for seeing what technical shape an organization's public ecosystem actually has.
Top Repositories
| # | Repository | Description | Language | ⭐ Stars | 🍴 Forks | Updated |
|---|---|---|---|---|---|---|
| 1 | tesseract-ocr/tesseract | Tesseract Open Source OCR Engine (main repository) | C++ | 74.3K | 10.6K | 1 month ago |
| 2 | tesseract-ocr/tessdata | Trained models with fast variant of the "best" LSTM models + legacy models | | 7.5K | 2.4K | 2 years ago |
| 3 | tesseract-ocr/tessdoc | Tesseract documentation | HTML | 2.4K | 437 | 1 month ago |
| 4 | tesseract-ocr/tessdata_best | Best (most accurate) trained LSTM models. | | 1.6K | 425 | 2 years ago |
| 5 | tesseract-ocr/langdata | Source training data for Tesseract for lots of languages | | 868 | 877 | 1 year ago |
| 6 | tesseract-ocr/tesstrain | Train Tesseract LSTM with make | Python | 720 | 215 | 1 year ago |
| 7 | tesseract-ocr/tessdata_fast | Fast integer versions of trained LSTM models | | 601 | 164 | 1 year ago |
| 8 | tesseract-ocr/docs | Various documents related to Tesseract OCR | | 267 | 125 | 4 years ago |
| 9 | tesseract-ocr/langdata_lstm | Data used for LSTM model training | | 126 | 154 | 2 years ago |
| 10 | tesseract-ocr/tesseract-ocr.github.io | Tesseract documentation | Ruby | 75 | 63 | 4 years ago |
| 11 | tesseract-ocr/tessconfigs | Tesseract Config files | Makefile | 36 | 22 | 4 years ago |
| 12 | tesseract-ocr/test | Repository for tesseract testing | Shell | 35 | 31 | 1 year ago |
| 13 | tesseract-ocr/tessdata_contrib | User contributed (non Google) OCR models for Tesseract | | 31 | 24 | 1 year ago |
| 14 | tesseract-ocr/tessapi | Tesseract source code and API documentation | HTML | 13 | 11 | 4 years ago |
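The concentration and average figures quoted on this page can be roughly reproduced from the table above. The following is a minimal sketch, not GitStar's actual method: the helper names (`top_n_share`, `average_stars`, `language_mix`) are hypothetical, and the star counts are the rounded values shown in the table, so the results are approximate.

```python
from collections import Counter

# (repository, language or None, stars) transcribed from the table above.
# Star counts are the rounded display values, e.g. 74.3K -> 74_300 (assumption).
REPOS = [
    ("tesseract", "C++", 74_300),
    ("tessdata", None, 7_500),
    ("tessdoc", "HTML", 2_400),
    ("tessdata_best", None, 1_600),
    ("langdata", None, 868),
    ("tesstrain", "Python", 720),
    ("tessdata_fast", None, 601),
    ("docs", None, 267),
    ("langdata_lstm", None, 126),
    ("tesseract-ocr.github.io", "Ruby", 75),
    ("tessconfigs", "Makefile", 36),
    ("test", "Shell", 35),
    ("tessdata_contrib", None, 31),
    ("tessapi", "HTML", 13),
]


def top_n_share(repos, n):
    """Fraction of all stars held by the n most-starred repositories."""
    stars = sorted((s for _, _, s in repos), reverse=True)
    total = sum(stars)
    return sum(stars[:n]) / total if total else 0.0


def average_stars(repos):
    """Mean star count per repository."""
    return sum(s for _, _, s in repos) / len(repos) if repos else 0.0


def language_mix(repos):
    """Count repositories per language; a missing language counts as 'Unknown'."""
    return Counter(lang or "Unknown" for _, lang, _ in repos)


if __name__ == "__main__":
    print(f"Top three share: {top_n_share(REPOS, 3):.0%}")   # 95%
    print(f"Flagship share:  {top_n_share(REPOS, 1):.0%}")   # 84%
    print(f"Average stars:   {average_stars(REPOS) / 1000:.1f}K")  # 6.3K
    print(language_mix(REPOS).most_common(3))
```

Comparing the top-one share with the top-three share is what separates a single-flagship organization from a broader portfolio: here the two values sit close together, so most of the concentration comes from one repository.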
How to read this organization snapshot
Total stars are useful as a discovery signal, but they do not tell you whether a team maintains every repository equally. Pair this page with release cadence, maintainer activity, and the flagship concentration shown above before making adoption decisions.
For broader background on GitStar's ranking logic and editorial guidance, see Methodology & Editorial Standards.