Awesome Open Source
Search results for python ocr
840 search results found
Paddleocr
⭐
36,076
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
Easyocr
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Paddlehub
⭐
12,193
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
Ocrmypdf
⭐
11,136
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Latex Ocr
⭐
8,088
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Ddddocr
⭐
7,369
ddddocr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI version
Faceai
⭐
6,666
An entry-level project for face, video, and text detection and recognition.
Dango Translator
⭐
5,689
Dango Translator (团子翻译器): an OCR-based translator built as a personal hobby project
Parsr
⭐
5,423
Transforms PDF, Documents and Images into Enriched Structured Data
Paperless Ng
⭐
5,371
A supercharged version of paperless: scan, index and archive all your physical documents
Pytesseract
⭐
5,312
A Python wrapper for Google Tesseract
Chineseocr
⭐
4,953
yolo3+ocr
Donut
⭐
4,651
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Video Subtitle Extractor
⭐
4,267
GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework.
Layout Parser
⭐
4,107
A Unified Toolkit for Deep Learning Based Document Image Analysis
Pymupdf
⭐
3,908
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Mmocr
⭐
3,771
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Text Detection Ctpn
⭐
3,368
Text detection mainly based on the CTPN (Connectionist Text Proposal Network) model in TensorFlow; ID card detection
Adelaidet
⭐
3,275
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Manga Image Translator
⭐
3,225
Translate manga/images: one-click translation of text inside all kinds of images. https://cotrans.touhou.ai/
Textrecognitiondatagenerator
⭐
2,901
A synthetic data generator for text recognition
Craft Pytorch
⭐
2,797
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Doctr
⭐
2,636
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Captcha_trainer
⭐
2,505
[CAPTCHA recognition: training] This project is based on CNN/ResNet/DenseNet + GRU/LSTM + CTC/CrossEntropy to realize verification code identification. This project is only for training the model.
Open Paperless
⭐
2,492
Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Papermerge
⭐
2,201
Open Source Document Management System for Digital Archives (Scanned Documents)
Chinese_ocr
⭐
2,037
CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras
Trwebocr
⭐
2,021
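Open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to that of major vendors; it also provides an easy-to-use web page and web API for everyday work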
Pdftabextract
⭐
1,994
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Chinese Ocr
⭐
1,985
[python3.6] Natural-scene text detection implemented with TensorFlow; CTPN + CRNN + CTC implemented with Keras/PyTorch for variable-length scene text
Deepdoctection
⭐
1,929
A Repo For Document AI
Tesserocr
⭐
1,853
A Python wrapper for the tesseract-ocr API
Ballonstranslator
⭐
1,730
Deep-learning-assisted manga translation tool with one-click machine translation and simple image/text editing | Yet another computer-aided comic/manga translation tool powered by deep learning
Simplehtr
⭐
1,719
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Invoice2data
⭐
1,570
Extract structured data from PDF invoices
Rapidocr
⭐
1,551
A cross platform OCR Library based on PaddleOCR & OnnxRuntime & OpenVINO.
Textshot
⭐
1,518
Python tool for grabbing text via screenshot
Deep_ocr
⭐
1,452
Make a better Chinese character recognition OCR than Tesseract
Topsup
⭐
1,332
Answer-assist decision tool for quiz games such as 头号英雄
Normcap
⭐
1,291
OCR powered screen-capture tool to capture information instead of images
Pix2text
⭐
1,261
Pix In, Latex & Text Out. Recognize Chinese, English Texts, and Math Formulas from Images. 80+ languages are supported.
Doc2text
⭐
1,221
Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.
Manga Ocr
⭐
1,172
Optical character recognition for Japanese text, with the main focus being Japanese manga
Tr
⭐
1,143
Free Offline OCR: offline Chinese text detection + recognition SDK
Keras Ocr
⭐
1,114
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Faspell
⭐
1,069
2019 SOTA spell checker for Simplified and Traditional Chinese: FASPell Chinese Spell Checker (Chinese spell check / Chinese spelling error detection / Chinese spelling correction)
Openseg.pytorch
⭐
1,052
The official Pytorch implementation of OCNet series and SegFix.
Text_renderer
⭐
1,039
Generate text images for training deep learning ocr model
Attention Ocr
⭐
973
Visual Attention based OCR
Attention Ocr
⭐
957
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Rpaframework
⭐
941
Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python
Text_select_captcha
⭐
897
Recognition of click-the-text, character-selection, and touch CAPTCHAs, trained with PyTorch
Cps Ocr Engine
⭐
892
An awesome OCR engine developed by SYSU DeepDriving Lab
Dbnet.pytorch
⭐
772
A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization
Ocr_dataset
⭐
760
Collects and organizes OCR-related datasets and unifies their annotation format for experimental use
Qiji Font
⭐
742
齊伋體 - typeface from Ming Dynasty woodblock printed books
Open Semantic Search
⭐
741
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
2019 Ccf Bdci Ocr Mczj Ocr Identificationidelement
⭐
706
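2019 CCF-BDCI competition: winner of the Best Innovation and Exploration Award and champion of the OCR-based ID card element extraction track; competition source code from team 天晨破晓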
One Python
⭐
655
We don't need a lot of libraries. We just need the best ones. | Unofficial recommended first choice.
Princess Connection Farm
⭐
640
Multi-instance automatic farming script for the Chinese server of Princess Connect (PCR, 公主连结), based on OpenCV + UIAutomator
Aster
⭐
628
Recognizing cropped text in natural images.
Captcha Break
⭐
626
CAPTCHA breaking based on opencv2, tesseract-ocr, and some machine learning algorithms.
Tensorflow Ocr
⭐
616
🖺 OCR using tensorflow with attention
Idcardgenerator
⭐
605
ID card image generation tool: generates an ID card picture
Fots.pytorch
⭐
604
FOTS Pytorch Implementation
Kraken
⭐
600
OCR engine for all the languages
Paddle2onnx
⭐
584
ONNX Model Exporter for PaddlePaddle
Handwriting Ocr
⭐
560
OCR software for recognition of handwritten text
Paddleocr2pytorch
⭐
553
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/Paddle
Captcha_platform
⭐
548
[CAPTCHA recognition: deployment] This project is based on CNN+BLSTM+CTC to realize verification code identification. This project is only for deployment models.
Millionheroassistant
⭐
526
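Quiz answering assistant for 百万 / 冲顶 / 芝士 / UC and general use (more specialized knowledge graph, automatic answer recommendation, automatic screen adaptation for Android phones, emulator support, multi-instance)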
Tesstrain
⭐
517
Train Tesseract LSTM with make
Cnstd
⭐
498
CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
Octopii
⭐
486
An AI-powered Personally Identifiable Information (PII) scanner.
Simple Ocr Opencv
⭐
476
A simple python OCR engine using opencv
R2cnn_faster Rcnn_tensorflow
⭐
475
Rotational region detection based on Faster-RCNN.
Cnn_lstm_ctc_ocr
⭐
470
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR
Lackey
⭐
467
Lackey - Graphical desktop automation with Python
Seglink
⭐
457
An implementation of the SegLink algorithm from the paper Detecting Oriented Text in Natural Images by Linking Segments
Stn Ocr
⭐
450
Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
Videocr
⭐
439
Extract hardcoded subtitles from videos using machine learning
Parseq
⭐
429
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Nomeroff Net
⭐
425
Nomeroff Net. Automatic numberplate recognition system.
Basecrack
⭐
422
Decode All Bases - Base Scheme Decoder
East
⭐
419
This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.
Pymupdf Utilities
⭐
414
Demos, examples and utilities using PyMuPDF
Psenet.pytorch
⭐
412
A pytorch re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network
Tensorflow_psenet
⭐
401
This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network. My blog:
Mayan Edms
⭐
398
Free Open Source Document Management System (mirror, no pull request or issues)
Attention Ocr Chinese Version
⭐
394
Attention OCR Based On Tensorflow
Ocr_densenet
⭐
393
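First place in the 1st Xi'an Jiaotong University Artificial Intelligence Practice Competition (2018 AI Practice Competition: image text recognition); uses only densen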
Vedastr
⭐
389
A scene text recognition toolbox based on PyTorch
Dewarpnet
⭐
389
Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)
Autocorrect
⭐
376
Spelling corrector in python
Idcardocr
⭐
375
Recognition of second-generation resident ID card information in offline environments
Tarsier
⭐
372
Vision utilities for web interaction agents 👀
Card Ocr
⭐
364
ID card recognition OCR
Synthtiger
⭐
358
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
Lstm_ctc_ocr
⭐
358
Use CTC + TensorFlow for OCR
Ocr.pytorch
⭐
350
A pure pytorch implemented ocr project including text detection and recognition
1-100 of 840 search results