Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python multilingual
multilingual
x
python
x
208 search results found
Paddleocr
⭐
36,076
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Muse
⭐
2,844
A library for Multilingual Unsupervised or Supervised word Embeddings
Polyglot
⭐
2,212
Multilingual text (NLP) processing toolkit
Blade Build
⭐
2,019
Blade is a powerful build system from Tencent, supports many mainstream programming languages, such as C/C++, java, scala, python, protobuf...
Chatterbot Corpus
⭐
1,219
A multilingual dialog corpus
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Conceptnet Numberbatch
⭐
1,114
Bpemb
⭐
1,068
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
The Pile
⭐
1,048
Multilingual T5
⭐
962
Detoxify
⭐
774
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at
[email protected]
.
Multilingual_text_to_speech
⭐
740
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Uform
⭐
729
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Trankit
⭐
693
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Wiktextract
⭐
654
Wiktionary dump file parser and multilingual data extractor
Wordless
⭐
649
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Qbr
⭐
563
A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.
Xtreme
⭐
547
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
Deeptype
⭐
516
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"
Jekyll
⭐
498
Jekyll-based static site for The Programming Historian
Autocorrect
⭐
376
Spelling corrector in python
Djangocms Blog
⭐
373
django CMS blog application - Support for multilingual posts, placeholders, social network meta tags and configurable apphooks
Github Typo Corpus
⭐
289
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
Vosk
⭐
287
VOSK Speech Recognition Toolkit
Text2text
⭐
268
Text2Text: Crosslingual NLP/G toolkit
Bitextor
⭐
260
Bitextor generates translation memories from multilingual websites
Multi_rake
⭐
249
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Alfred Searchio
⭐
215
Alfred workflow to auto-suggest search results from multiple search engines and languages.
Xl Sum
⭐
209
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Is Chatgpt A Good Translator
⭐
209
A preliminary evaluation of ChatGPT/GPT-4 for machine translation.
Mkdocs Static I18n
⭐
177
MkDocs i18n plugin using static translation markdown files
Tt2020
⭐
173
TT2020 is an advanced, open source, hyperrealistic, multilingual typewriter font for a new decade.
Wn
⭐
170
A modern, interlingual wordnet interface for Python
Laserembeddings
⭐
163
LASER multilingual sentence embeddings as a pip package
Spacy Universal Sentence Encoder
⭐
156
Google USE (Universal Sentence Encoder) for spaCy
Mimick
⭐
149
Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Dl4mt C2c
⭐
145
Xtreme Distil Transformers
⭐
144
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
Django Tagging Ng
⭐
139
Django tagging with multilingual, synonyms and hierarchy!
Nautilus Copy Path
⭐
137
Configurable extension for Nautilus to copy path, URI, name or content
Kerning Pairs
⭐
137
The ultimate list of kerning pairs for type designers
Ph Submissions
⭐
133
The repository and website hosting the peer review process for new Programming Historian lessons
Mldoc
⭐
132
A Corpus for Multilingual Document Classification in Eight Languages.
Vecalign
⭐
122
Improved Sentence Alignment in Linear Time and Space
Multilingual_ner
⭐
120
Applying BERT to named entity recognition in English and Russian.
Bert Qa
⭐
119
BERT for question answering starting with HotpotQA
Mtdata
⭐
115
A tool that locates, downloads, and extracts machine translation corpora
Nerda
⭐
98
Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks
Ml Mkqa
⭐
94
We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. Please refer to our paper for details, MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
Lima
⭐
92
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Crosslingual Nlp
⭐
83
This repo supports various cross-lingual transfer learning & multilingual NLP models.
Odenet
⭐
81
Open German WordNet
Django Linguo
⭐
80
Linguo is a Django application that provides the ability to have multilingual models.
Lang Reps
⭐
75
Code accompanying our EMNLP paper Learning Language Representations for Typology Prediction
Mhan
⭐
73
Multilingual hierarchical attention networks toolkit
Multilingual Latent Dirichlet Allocation Lda
⭐
73
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Multilingual Model Transfer
⭐
72
In this project we develop new deep learning models for bootstrapping language understanding models for languages with no labeled data using labeled data from other languages.
Mmner
⭐
69
Massively Multilingual Transfer for NER
Tupa
⭐
67
Transition-based UCCA Parser
Nllb Serve
⭐
66
Meta's "No Language Left Behind" models served as web app and REST API
Agentocr
⭐
65
一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.
Glot500
⭐
65
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages (ACL'23)
Anuvaad
⭐
65
State of the art open-source translation for Indic languages.
Multilangstructurekd
⭐
64
[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
Simplenlp
⭐
63
Lightweight, multilingual natural language processing
Profanity Filter
⭐
60
A Python library for detecting and filtering profanity
Kwx
⭐
57
BERT, LDA, and TFIDF based keyword extraction in Python
Rakun2
⭐
56
RaKUn 2.0 - A fast keyword detection algorithm
Mlma_hate_speech
⭐
54
Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"
Integreat Cms
⭐
53
Simplified content management back end for the Integreat App - a multilingual information platform for newcomers
Neon Tts Plugin Coqui
⭐
52
Coqui AI TTS plugin
Xpersona
⭐
51
XPersona: Evaluating Multilingual Personalized Chatbot
Hurtlex
⭐
50
A multilingual lexicon of words to hurt.
Bangla Tts
⭐
50
Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library
Llmebench
⭐
49
Benchmarking Large Language Models
Xorqa
⭐
48
This is the official repository for "XOR QA: Cross-lingual Open-Retrieval Question Answering".
Multi2oie
⭐
47
Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)
Django Multilingual Model
⭐
43
Django Simple Multilingual Support for Models.
Prism
⭐
42
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
Groundedtranslation
⭐
41
Multilingual image description
Hottosns Bert
⭐
41
hottoSNS-BERT: 大規模SNSコーパスによる文分散表現モデル
Multilingual Kd Pytorch
⭐
41
ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation
Few Shot Lm
⭐
40
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
Ewiser
⭐
40
A Word Sense Disambiguation system integrating implicit and explicit external knowledge.
Okapi
⭐
36
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Multilingual_nmt
⭐
36
Experiments on Multilingual NMT
B4msa
⭐
34
A Baseline for Multilingual Sentiment Analysis
Multilingual Usas
⭐
32
Lexicons for the Multilingual UCREL Semantic Analysis System
Voice Assistant Chatgpt
⭐
31
Voice Assistant based on Whisper ASR and ChatGPT API
Extractive_rc_by_runtime_mt
⭐
30
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
News Clustering
⭐
29
News clustering algorithm. Implementation of the "Multilingual Clustering of Streaming News" paper submitted to EMNLP 2018
Yawd Translations
⭐
28
A set of tools for developing multilingual websites with Django
Multilingual Markdown
⭐
27
CLI and Python API to handle i18n markdowns (available on Linux, macOS, and Windows)
Multicapclip
⭐
27
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Opus 100 Corpus
⭐
27
Multilingual_event_extraction
⭐
27
Python code to run ACE-style event extraction on English, Chinese, or Spanish texts
Tok Tok
⭐
26
A fast, simple, multilingual tokenizer
Python Flask With Javascript
⭐
26
This repository contains an example app to communicate between JavaScript and Python.
Korquad
⭐
25
KorQuAD (Korean Question Answering Dataset) submission guide using PyTorch pretrained BERT
Exquisite Corpus
⭐
25
Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.
Related Searches
Python Django (28,897)
Python Dataset (14,792)
Python Machine Learning (14,099)
Python Deep Learning (13,092)
Python Html (10,924)
Python Database (9,975)
Python Natural Language Processing (9,064)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
Python Neural (7,444)
1-100 of 208 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.