Airflow Pdf2embeddings

NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
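
The description above names three stages: extract text, embed each sentence, and rank sentences by similarity to a query. The sketch below illustrates only the ranking idea and is not taken from the project. It assumes the PDF text has already been extracted into a string, and it swaps in a toy word-count vector where the project would presumably use a learned embedding model. All function names (`split_sentences`, `embed`, `cosine`, `search`) and the sample text are hypothetical.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def embed(sentence):
    """Toy 'embedding': a sparse word-count vector (stand-in for a learned model)."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, sentences, top_k=3):
    """Return the top_k (score, sentence) pairs most similar to the query."""
    q = embed(query)
    scored = [(cosine(q, embed(s)), s) for s in sentences]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical text standing in for what a PDF scraper would return.
corpus_text = (
    "The pump must be inspected every six months. "
    "Invoices are paid within thirty days. "
    "Inspect the valve and the pump before each start-up."
)
sentences = split_sentences(corpus_text)
results = search("pump inspection schedule", sentences, top_k=2)
for score, sentence in results:
    print(f"{score:.3f}  {sentence}")
```

Word counts only match exact tokens, so this toy version cannot relate "inspection" to "inspected". Closing that gap is the usual reason for using learned sentence embeddings in a tool of this kind.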
Alternatives To Airflow Pdf2embeddings
| Project Name | Stars | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
|---|---|---|---|---|---|---|---|---|
| Pdf Corpora | 60 | 10 months ago | | | | cc-by-4.0 | | An index of PDF-centric corpora |
| Science Result Extractor | 42 | 3 years ago | | | 4 | apache-2.0 | Java | |
| Pdf Corpus | 15 | 7 years ago | | | | mit | Python | Python script to quickly create hand-crafted PDF files |
| Pdf2emb_nlp | 7 | 3 years ago | 2 | August 18, 2020 | | mit | Python | NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query |
| Airflow Pdf2embeddings | 6 | 4 years ago | 10 | September 28, 2020 | | mit | Python | NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query |

Categories: Python, Natural Language Processing, Corpus, Embeddings, PDF Files