Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Simplehtr | 1,621 | 2 days ago | 2 | mit | Python | |||||
Handwritten Text Recognition (HTR) system implemented with TensorFlow. | ||||||||||
Osint_collection | 1,441 | 2 months ago | 2 | |||||||
Maintained collection of OSINT related resources. (All Free & Actionable) | ||||||||||
Person_search | 709 | 5 months ago | 27 | other | Python | |||||
Joint Detection and Identification Feature Learning for Person Search | ||||||||||
Nasbench | 583 | a year ago | 23 | apache-2.0 | Python | |||||
NASBench: A Neural Architecture Search Dataset and Benchmark | ||||||||||
Showcase Recipe Search | 400 | 2 years ago | 2 | apache-2.0 | JavaScript | |||||
Instantly search 2M cooking recipes using Typesense Search (an open source alternative to Algolia / ElasticSearch) ⚡ 🥘 🔍 | ||||||||||
Jstarcraft Rns | 359 | 8 months ago | apache-2.0 | Java | ||||||
专注于解决推荐领域与搜索领域的两个核心问题:排序预测(Ranking)和评分预测(Rating). 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了70多种排序预测与评分预测算法,是最快最全的Java推荐与搜索引擎. | ||||||||||
Idt | 206 | 2 years ago | 3 | October 19, 2020 | 2 | mit | Python | |||
Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process. | ||||||||||
Visualsearch_mxnet | 191 | 4 years ago | 3 | Jupyter Notebook | ||||||
Visual Search using Apache MXNet and gluon | ||||||||||
Uci Ml Api | 189 | 2 years ago | 3 | mit | Python | |||||
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze) | ||||||||||
Semantic Search | 145 | 3 years ago | 6 | Python | ||||||
Semantic search for images and words using neural networks. |
Datasets accompanying the paper Quasar: Datasets for QA by Search and Reading.
Both Quasar-S and Quasar-T are available for download here. See the accompanying readme.txt
for a description of the included files. The release includes:
The background corpus for Quasar-S can be downloaded from here (via the SO Scrape link at the bottom). The background corpus for Quasar-T is the ClueWeb09 dataset which you can access here.
An implementation of the F1 and EM evaluation metrics for Quasar-T is included in metric.py
.
Below you can find links to the web interface used for evaluating human performance and collecting annotations. This includes an interactive quiz in which the questions are presented one by one, and a search engine for querying the background corpus which you can use in a separate window. Please note that any answers and annotations you enter will be recorded!
Quasar-S: Quiz | Search Engine
Quasar-T: Quiz | Search Engine
The search engine used here is Solr, which uses a Lucene backend similar to the search phase described in the paper. The corpus in each case is the set of short pseudodocuments (sentences) pooled across all queries for that corpus, with no query identifiers or tag filtering. This is just a bare-bones search engine guaranteed not to yield the exact source sentence for the question (as you'd get by just using Google).
If you use these datasets please cite the following:
@article{dhingra2017quasar,
title={Quasar: Datasets for Question Answering by Search and Reading},
author={Dhingra, Bhuwan and Mazaitis, Kathryn and Cohen, William W},
journal={arXiv preprint arXiv:1707.03904},
year={2017}
}