Erpnext_ocr

🐍 ⚗️ Optical Character Recognition using tesseract within Frappe.
Alternatives To Erpnext_ocr
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Tesseract51,145514 days ago1February 27, 2018396apache-2.0C++
Tesseract Open Source OCR Engine (main repository)
Tesseract.js30,533221125a day ago50September 20, 202214apache-2.0JavaScript
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
Siyuan10,603
18 hours ago1July 07, 202251agpl-3.0TypeScript
A privacy-first, self-hosted, fully open-source personal knowledge management software, written in typescript and golang.
Ocrmypdf9,000674 days ago205July 04, 202298mpl-2.0Python
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Faceai6,666
3 years ago29mitPython
一款入门级的人脸、视频、文字检测以及识别的项目.
Tessdata5,117
16 days ago3May 19, 202047apache-2.0
Trained models with fast variant of the "best" LSTM models + legacy models
Pytesseract4,82371424224 days ago27February 19, 202227apache-2.0Python
A Python wrapper for Google Tesseract
Tesseract Ocr Ios4,038
1723 years ago7April 04, 2015112mitC
Tesseract OCR iOS is a Framework for iOS7+, compiled also for armv7s and arm64.
Tess Two3,642
a year ago15February 11, 2021apache-2.0C
Fork of Tesseract Tools for Android
Ocrad.js3,2272293 years ago1May 06, 201424gpl-3.0JavaScript
OCR in Javascript via Emscripten
Alternatives To Erpnext_ocr
Select To Compare


Alternative Project Comparisons
Readme

License: MIT Managed with Taiga.io Build Status Codacy Badge Coverage Status

ERPNext OCR

⚗️ Experimental Frappe OCR application with tesseract.

This project is a fork of ERPNext-OCR by John Vincent Fiel. Its aim is to fix and cleanup the original source code and add some new features.

Check out more on ERPNext Discuss.

📈 Changes

See CHANGELOG

🔖 Roadmap

See Taiga.io

🚧 Install

Pre-requisites: tesseract-python and imagemagick

Install tesseract-ocr, plus imagemagick and ghostscript (to work with pdf files) using this command on Debian:

sudo apt-get install tesseract-ocr imagemagick libmagickwand-dev ghostscript

Install Frappe application

bench get-app --branch develop erpnext_ocr https://github.com/Monogramm/erpnext_ocr
bench install-app erpnext_ocr

When installing Frappe app, the following python requirements will be installed:

  • python binding for tesseract, tesserocr

  • image processing library in python, pillow

  • HTTP library in python, requests

  • python binding for imagemagick, wand

🚀 Usage

File Being Read:

File Being Read

Sample Screenshot:

Sample Screenshot

Tesseract trained data

In order to use OCR with different languages, you need to install the appropriate trained data files. Check tesseract Wiki for details: https://github.com/tesseract-ocr/tesseract/wiki/Data-Files

Development

If you wish to develop or just test locally this application, you can use docker-compose up -d at the root of the this repository. You can then access your ERPNext OCR dev env at http://localhost:8080.

Known issues

✅ Run tests

bench run-tests --app erpnext_ocr

👤 Authors

Monogramm

John Vincent Fiel

🤝 Contributing

Contributions, issues and feature requests are welcome!
Feel free to check issues page. Check the contributing guide.

👍 Show your support

Give a ⭐️ if this project helped you!

📄 License

Copyright © 2019 Monogramm.
This project is MIT licensed.


This README was generated with ❤️ by readme-md-generator

Popular Tesseract Projects
Popular Ocr Projects
Popular Media Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Ocr
Imagemagick
Tesseract
Pdf Files
Tiff