Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Pdf Corpora | 60 | 10 months ago | cc-by-4.0 | |||||||
An index of PDF-centric corpora | ||||||||||
Science Result Extractor | 42 | 3 years ago | 4 | apache-2.0 | Java | |||||
Pdf Corpus | 15 | 7 years ago | mit | Python | ||||||
Python script to quickly create hand-crafted PDF files | ||||||||||
Pdf2emb_nlp | 7 | 3 years ago | 2 | August 18, 2020 | mit | Python | ||||
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query | ||||||||||
Airflow Pdf2embeddings | 6 | 4 years ago | 10 | September 28, 2020 | mit | Python | ||||
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query. |