Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Perceptualsimilarity | 2,618 | 2 | 3 months ago | 5 | August 25, 2021 | 42 | bsd-2-clause | Python | ||
LPIPS metric. pip install lpips | ||||||||||
Fastdup | 794 | 10 hours ago | 16 | other | Python | |||||
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale. | ||||||||||
Facial Similarity With Siamese Networks In Pytorch | 620 | 3 years ago | 8 | mit | Jupyter Notebook | |||||
Implementing Siamese networks with a contrastive loss for similarity learning | ||||||||||
Dataset Sts | 542 | 5 years ago | 23 | Python | ||||||
Semantic Text Similarity Dataset Hub | ||||||||||
Siamese Lstm | 172 | 5 years ago | 5 | Python | ||||||
Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset. | ||||||||||
Datagene | 170 | a year ago | 3 | May 10, 2020 | Jupyter Notebook | |||||
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai) | ||||||||||
Mp Cnn Torch | 106 | 5 years ago | 1 | Lua | ||||||
Multi-Perspective Convolutional Neural Networks for modeling textual similarity (He et al., EMNLP 2015) | ||||||||||
Metaod | 85 | a year ago | 7 | September 28, 2020 | 1 | bsd-2-clause | ||||
Automating Outlier Detection via Meta-Learning (Code, API, and Contribution Instructions) | ||||||||||
Japanesewordsimilaritydataset | 84 | a year ago | Python | |||||||
Japanese Word Similarity Dataset | ||||||||||
Multi Camera Vehicle Tracking And Reidentification | 78 | 5 years ago | 3 | Python | ||||||
This repository contains our source code of Track 3 in the NVIDIA AI City Challenge Workshop at CVPR 2018. Please find the full source code of both Track 1 and Track 3 here -> |
COS960 is a Chinese word similarity dataset of 960 word pairs. Each pair of words is annotated by 15 native speakers with a similarity score which reflects true similarity. The 960 word pairs are further divided into 3 groups according to their Part Of Speech tags, including 480 pairs of nouns, 240 pairs of verbs and 240 pairs of adjectives.
To use COS960 to test your word embedding, use command
python correlation_calcu.py {VECTOR_FILE}
The data in the files is formulated as
[Word1] [Word2] [Average] [Annotator1] ... [Annotator15]
小心谨慎 谨慎小心 4.0 4 ... 4
If you use the dataset, please cite this:
@article{huang2019COS960,
Author = {Junjie Huang and Fanchao Qi and Chenghao Yang and Zhiyuan Liu and Maosong Sun},
Title = {{COS960: A Chinese Word Similarity Dataset of 960 Word Pairs}},
journal={arXiv preprint arXiv:1906.00247},
Year = {2019},
}