Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for document analysis
document-analysis
x
29 search results found
Pdfpig
⭐
1,361
Read and extract text and other content from PDFs in C# (port of PDFBox)
Awesome Document Understanding
⭐
783
A curated list of resources for Document Understanding (DU) topic
Curve Text Detector
⭐
532
This repository provides train&test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.
Advancedliteratemachinery
⭐
464
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
Pick Pytorch
⭐
442
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
Pandora
⭐
223
Pandora is an analysis framework to discover if a file is suspicious and conveniently show the results
Assemblyline
⭐
157
AssemblyLine 4: File triage and malware analysis
Robin
⭐
157
RObust document image BINarization
Lilt
⭐
139
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Amazon Textract Transformer Pipeline
⭐
76
Post-process Amazon Textract results with Hugging Face transformer models for document understanding
Docextractor
⭐
73
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
Pydoxtools
⭐
56
Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable pipelines and diverse sources for your projects.
Local_adaptive_binarization
⭐
54
Local adaptive image binarization
Vibertgrid Pytorch
⭐
47
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Detectron2 Publaynet
⭐
24
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Adversebinet
⭐
19
Improving Document Binarization via Adversarial Noise-Texture Augmentation
Enhanced Document Understanding On Aws
⭐
14
Enhanced Document Understanding on AWS delivers an easy-to-use web application that ingests and analyzes documents, extracts content, identifies and redacts sensitive customer information, and creates search indexes from the analyzed data.
Indesign Cep
⭐
13
Adobe CEP extension for InDesign to use the Bookalope cloud services.
Docvisor
⭐
11
An open-source tool for visualisation of outputs of deep-learning models for document analysis tasks such as fully automatic, bounding box and OCR.
Kuzushiji_recognition
⭐
10
[Late Submission] Solution for Kuzushiji recognition (Kaggle competition)
Document_layout_analysis Monkai
⭐
9
DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.
Readmodules
⭐
9
CVL/READ Modules including Basic Layout Analysis and Writer Identification/Retrieval
Utrnet High Resolution Urdu Text Recognition
⭐
8
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
Nlapi Java
⭐
8
Java Client for the expert.ai Natural Language API
Bookalope
⭐
7
Everything related to Bookalope and its REST API.
Metalda
⭐
6
The code for MetaLDA in ICDM 2017
Readframework
⭐
5
The Core Framework for CVL/READ Modules
Dev
⭐
5
This is the repository for the backend, AI models and API development of AymurAI
Kws Sift
⭐
5
Python code to perform keyword spotting using SIFT features
1-29 of 29 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.