Artificial Text Detection

Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
Alternatives To Artificial Text Detection
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Transformers102,685649119 hours ago91June 21, 2022735apache-2.0Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
D2l Zh44,047
13 days ago45March 25, 202234apache-2.0Python
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60多个国家的400多所大学用于教学。
Made With Ml33,193
a month ago5May 15, 201911mitJupyter Notebook
Learn how to responsibly develop, deploy and maintain production machine learning applications.
Spacy26,2641,533842a day ago196April 05, 2022110mitPython
💫 Industrial-strength Natural Language Processing (NLP) in Python
Applied Ml24,242
9 days ago3mit
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Nlp Progress21,649
4 days ago50mitPython
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
D2l En18,001
4 days ago99otherPython
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 universities from 60 countries including Stanford, MIT, Harvard, and Cambridge.
Rasa16,4393228a day ago274July 06, 2022122apache-2.0Python
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Mindsdb16,3753113 hours ago42March 19, 2019627gpl-3.0Python
MindsDB is a Server for Artificial Intelligence Logic. Enabling developers to ship AI powered projects to production in a fast and scalable way.
Datasets16,3579208a day ago52June 15, 2022615apache-2.0Python
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Alternatives To Artificial Text Detection
Select To Compare


Alternative Project Comparisons
Readme

Build Status PyPi Versions Wheel codecov Code style: black Pre-commit DVC License

Artificial Text Detection

Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.

Contents

Project description is put into:

Installation steps:

We use poetry as an enhanced dependency resolver.

make poetry-download
poetry install --no-dev

Datasets for artificial text detection

To create datasets for the further classification, it is necessary to collect them. There are 2 available ways for it:

  • Via Data Version Control. Get in touch with @msaidov in order to have the access to the private Google Drive;
  • Via datasets generation. One dataset with a size of 20,000 samples was process with MT model on V100 GPU for 30 mins;

Data Version Control usage:

poetry add "dvc[gdrive]"

Then, run dvc pull. It will download preprocessed translation datasets from the Google Drive.

Datasets generation

To generate translations before artificial text detection pipeline, install the detection module from the cloned repo or PyPi (TODO):

pip install -e .

Then, run generate script:

python detection/data/generate.py --dataset_name='tatoeba' --size=20000 --device='cuda:0'

Simple run:

To run the artificial text detection classifier, execute the pipeline:

python detection/old.py
Popular Natural Language Processing Projects
Popular Machine Learning Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Machine Learning
Deep Learning
Pytorch
Natural Language Processing
Generative Model
Machine Translation
Attention Mechanism