Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Transformers | 102,685 | 64 | 911 | 9 hours ago | 91 | June 21, 2022 | 735 | apache-2.0 | Python | |
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. | ||||||||||
D2l Zh | 44,047 | 1 | 3 days ago | 45 | March 25, 2022 | 34 | apache-2.0 | Python | ||
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60多个国家的400多所大学用于教学。 | ||||||||||
Made With Ml | 33,193 | a month ago | 5 | May 15, 2019 | 11 | mit | Jupyter Notebook | |||
Learn how to responsibly develop, deploy and maintain production machine learning applications. | ||||||||||
Spacy | 26,264 | 1,533 | 842 | a day ago | 196 | April 05, 2022 | 110 | mit | Python | |
💫 Industrial-strength Natural Language Processing (NLP) in Python | ||||||||||
Applied Ml | 24,242 | 9 days ago | 3 | mit | ||||||
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production. | ||||||||||
Nlp Progress | 21,649 | 4 days ago | 50 | mit | Python | |||||
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks. | ||||||||||
D2l En | 18,001 | 4 days ago | 99 | other | Python | |||||
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 universities from 60 countries including Stanford, MIT, Harvard, and Cambridge. | ||||||||||
Rasa | 16,439 | 32 | 28 | a day ago | 274 | July 06, 2022 | 122 | apache-2.0 | Python | |
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants | ||||||||||
Mindsdb | 16,375 | 3 | 1 | 13 hours ago | 42 | March 19, 2019 | 627 | gpl-3.0 | Python | |
MindsDB is a Server for Artificial Intelligence Logic. Enabling developers to ship AI powered projects to production in a fast and scalable way. | ||||||||||
Datasets | 16,357 | 9 | 208 | a day ago | 52 | June 15, 2022 | 615 | apache-2.0 | Python | |
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools |
Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
Project description is put into:
We use poetry
as an enhanced dependency resolver.
make poetry-download
poetry install --no-dev
To create datasets for the further classification, it is necessary to collect them. There are 2 available ways for it:
@msaidov
in order to have the access to the private Google Drive;poetry add "dvc[gdrive]"
Then, run dvc pull
. It will download preprocessed translation datasets
from the Google Drive.
To generate translations before artificial text detection pipeline,
install the detection
module from the cloned repo or PyPi (TODO):
pip install -e .
Then, run generate script:
python detection/data/generate.py --dataset_name='tatoeba' --size=20000 --device='cuda:0'
To run the artificial text detection classifier, execute the pipeline:
python detection/old.py