Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Tidytuesday | 6,090 | 12 hours ago | 208 | cc0-1.0 | HTML | |||||
Official repo for the #tidytuesday project | ||||||||||
Php Ml Examples | 525 | 5 years ago | 11 | mit | PHP | |||||
Examples use case of PHP-ML library. | ||||||||||
Wikitables | 279 | 3 | 1 | 2 years ago | 14 | August 26, 2021 | 4 | mit | Python | |
Import tables from any Wikipedia article as a dataset in Python | ||||||||||
Wikihow Dataset | 250 | 8 months ago | 6 | Python | ||||||
A Large Scale Text Summarization Dataset | ||||||||||
Fakenewscorpus | 184 | 4 years ago | 2 | apache-2.0 | ||||||
A dataset of millions of news articles scraped from a curated list of data sources. | ||||||||||
Minhash | 166 | 5 years ago | 4 | mit | Python | |||||
Example Python code for comparing documents using MinHash | ||||||||||
Robin | 157 | a year ago | 2 | mit | Python | |||||
RObust document image BINarization | ||||||||||
Curation Corpus | 77 | 3 years ago | cc-by-4.0 | Python | ||||||
Code for obtaining the Curation Corpus abstractive text summarisation dataset | ||||||||||
Summarization | 70 | 6 years ago | Python | |||||||
A sequence to sequence model for abstractive text summarization | ||||||||||
Cgnn | 60 | 4 years ago | 2 | Python | ||||||
Replication code for the article "Learning Functional Causal Models with Generative Neural Networks" |
TidyTuesday
is a weekly social data project. All are welcome to participate! Please remember to share the code used to generate your results!TidyTuesday
is organized by the R4DS Online Learning Community. Join our Slack for free online help with R and other data-related topics, or to participate in a data-related book club!Our over-arching goal for TidyTuesday is to make learning to work with data easier, by providing real-world datasets.
Our goal for 2023-2024 is to increase usage of #TidyTuesday within classrooms. We would like to be used in at least 10 courses by September 2024. If you are using TidyTuesday to teach data-related skills, please let us know!
To cite the TidyTuesday
repo/project in publications use:
R4DS Online Learning Community (2023). Tidy Tuesday: A weekly social data project. rfordatascience/tidytuesday.
A BibTeX entry for LaTeX users is
@misc{tidytuesday,
title = {Tidy Tuesday: A weekly social data project},
author = {R4DS Online Learning Community},
url = {https://github.com/rfordatascience/tidytuesday},
year = {2023}
}
Note: If you would like to cite the tidytuesdayR package, you should use citation("tidytuesdayR")
instead.
TidyTuesday
is built around open datasets that are found in the "wild" or submitted as Issues on our GitHub.
If you find a dataset that you think would be interesting, you can approach it through two ways:
readme.md
file using a recent readme.md
as a template. Make sure to give yourself credit!