Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Sotawhat | 1,280 | 7 months ago | 18 | Python | ||||||
Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday. | ||||||||||
Findpapers | 164 | 3 months ago | 24 | June 22, 2021 | 3 | mit | Python | |||
Findpapers: A tool for helping researchers who are looking for related works | ||||||||||
Wcep Mds Dataset | 49 | a year ago | mit | Python | ||||||
Cookies That Give You Away | 44 | 5 years ago | OpenEdge ABL | |||||||
Code release for: Cookies that give you away: The surveillance implications of web tracking | ||||||||||
Dark Patterns | 30 | 5 years ago | gpl-3.0 | Jupyter Notebook | ||||||
Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites". | ||||||||||
Tiny Crawler | 21 | 4 months ago | Python | |||||||
download the links from libgen.io, arxiv | ||||||||||
Arxiv Crawler | 17 | 6 years ago | Python | |||||||
crawling arXiv paper and organize as a database | ||||||||||
Kairos | 17 | 13 years ago | 1 | apache-2.0 | Java | |||||
Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled with fields of metadata that correspond to individual papers. Using event date metadata extracted from the conference website, Kairos proactively harvests metadata about the individual papers soon after they are made public. We use a Maximum Entropy classifier to classify uniform resource locators (URLs) as scientific conference websites and use Conditional Random Fields (CRF) to extract individual paper metadata from such websites. The crawler is built on top of the popular open-source crawler Nutch. | ||||||||||
Papercrawler | 16 | 5 years ago | gpl-3.0 | Python | ||||||
Crawler used to crawl papers | ||||||||||
Paperwebcrawler | 15 | 10 months ago | apache-2.0 | Java | ||||||
IEEE XPLORE等文献网站的爬虫工具/Crawler for Paper Website like IEEE XPLORE |