|Project Name||Stars||Downloads||Repos Using This||Packages Using This||Most Recent Commit||Total Releases||Latest Release||Open Issues||License||Language|
|Tokenizers||6,601||131||a day ago||72||April 13, 2022||244||apache-2.0||Rust|
|💥 Fast State-of-the-Art Tokenizers optimized for Research and Production|
|Whatlang Rs||801||12||11||3 months ago||30||May 07, 2022||7||mit||Rust|
|Natural language detection library for Rust. Try demo online: https://whatlang.org/|
|Pypostal||615||8||3||9 months ago||14||March 25, 2022||36||mit||C|
|Python bindings to libpostal for fast international address parsing/normalization|
|Jieba Rs||486||5||11||3 months ago||39||October 03, 2022||10||mit||Rust|
|The Jieba Chinese Word Segmentation Implemented in Rust|
|Link Grammar||362||2 days ago||1||February 27, 2018||85||lgpl-2.1||C|
|The CMU Link Grammar natural language parser|
|Node Postal||176||7||2||a year ago||10||December 13, 2021||10||mit||C++|
|NodeJS bindings to libpostal for fast international address parsing/normalization|
|Vtext||110||2 years ago||4||June 14, 2020||14||apache-2.0||Rust|
|Simple NLP in Rust with Python bindings|
|Expostal||88||6 months ago||6||July 06, 2017||4||mit||Elixir|
|Elixir binding for Libpostal - a library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.|
|Open Nlp||88||20||8 years ago||7||May 28, 2014||2||other||Ruby|
|Ruby bindings to the OpenNLP Java toolkit.|
|Ruby Nlp||87||9 years ago||Ruby|
|Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer|
bindings by Cathal Garvey, Copyright 2017, released under MIT license.
This simply wraps the Whatlang-rs library by Sergey Potapov, exposing a single function to Python which returns a tuple of (language code, script code).
You will require:
pip install setuptools_rust
python3 setup.py install --user. The
--user part is essential, as
Rustup doesn't perform a system installation of Rust. Possibly, you could use
a system-managed Rust toolchain (e.g.
apt-get install cargo) to permit a
system-wide install of the library. Or, you could just use
python3 setup.py build
and then manually install the library system-wide.
[sudo] pip3 install https://github.com/cathalgarvey/whatlang-py/raw/master/dist/whatlang-0.0.1-cp35-cp35m-linux_x86_64.whl
[sudo] pip3 install *.whl
There is only one function in the library:
detect_language, which accepts a
single unicode string and returns a pair of strings or raises a
ValueError on failure.
The strings returned correspond to the three-character language code, and three-character
script code. These are enumerated in Whatlang's docs as the Enums
Script and are returned as strings in the same leading-uppercase format.