Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python natural language processing
natural-language-processing
x
python
x
1,656 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
D2l Zh
⭐
56,684
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Ailearning
⭐
38,107
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
Made With Ml
⭐
35,496
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Spacy
⭐
28,628
💫 Industrial-strength Natural Language Processing (NLP) in Python
D2l En
⭐
20,613
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Unilm
⭐
16,971
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Ciphey
⭐
16,681
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Chinese Llama Alpaca
⭐
15,877
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Gensim
⭐
15,180
Topic Modelling for Humans
Best Of Ml Python
⭐
14,990
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Awesome Pytorch List
⭐
14,715
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
500 Ai Machine Learning Deep Learning Computer Vision Nlp Projects With Code
⭐
14,248
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Docsgpt
⭐
13,745
GPT-powered chat for documentation, chat with your documents
Virgilio
⭐
13,515
Your new Mentor for Data Science E-Learning.
Nltk
⭐
12,699
NLTK Source
Haystack
⭐
12,474
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Paddlehub
⭐
12,193
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
Moss
⭐
11,453
An open-source tool-augmented conversational language model from Fudan University
Allennlp
⭐
11,300
An open-source NLP research library, built on PyTorch.
Paddlenlp
⭐
10,908
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Stanford Tensorflow Tutorials
⭐
10,303
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Doccano
⭐
8,980
Open source annotation tool for machine learning practitioners.
Textblob
⭐
8,906
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Chinese Bert Wwm
⭐
8,600
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
It_book
⭐
8,543
本项目收藏这些年来看过或者听过的一些不错的常用的上千本书籍,没准你想找的书就在这里呢,包含了互联网行
Pattern
⭐
8,519
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Qwen
⭐
8,482
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Ai Learn
⭐
8,256
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Py tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Petals
⭐
8,040
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Attention Is All You Need Pytorch
⭐
7,910
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Machine_learning_examples
⭐
7,861
A collection of machine learning examples and tutorials.
Text_classification
⭐
7,628
all kinds of text classification models and more with deep learning
Llmsurvey
⭐
7,255
The official GitHub page for the survey paper "A Survey of Large Language Models".
Gpt2 Chinese
⭐
7,249
Chinese version of GPT2 training code, using BERT tokenizer.
Autogluon
⭐
7,109
Fast and Accurate ML in 3 Lines of Code
Stanza
⭐
6,931
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Models
⭐
6,819
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Text Generation Inference
⭐
6,743
Large Language Model Text Generation Inference
Danswer
⭐
6,435
Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.
Mycroft Core
⭐
6,430
Mycroft Core, the Mycroft Artificial Intelligence platform.
Txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Chinese Llama Alpaca 2
⭐
5,810
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Xlnet
⭐
5,709
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Bert Pytorch
⭐
5,605
Google AI 2018 BERT pytorch implementation
Bertviz
⭐
5,547
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Modelscope
⭐
5,517
ModelScope: bring the notion of Model-as-a-Service to life.
Flashtext
⭐
5,463
Extract Keywords from sentence or Replace keywords in sentences.
Parsr
⭐
5,423
Transforms PDF, Documents and Images into Enriched Structured Data
Bertopic
⭐
5,170
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Topdeeplearning
⭐
4,901
A list of popular github projects related to deep learning
Donut
⭐
4,651
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Nlp_ability
⭐
4,337
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提
Machine_learning_complete
⭐
4,296
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Resume Matcher
⭐
4,224
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Courses
⭐
4,018
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Openprompt
⭐
4,006
An Open-Source Framework for Prompt-Learning.
Data Science
⭐
3,898
Collection of useful data science topics along with articles, videos, and code
Marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Snips Nlu
⭐
3,796
Snips Python library to extract meaning from text
Huatuo Llama Med Chinese
⭐
3,776
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
Lm Evaluation Harness
⭐
3,768
A framework for few-shot evaluation of language models.
Awesome Pretrained Chinese Nlp Models
⭐
3,738
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Textract
⭐
3,699
extract text from any document. no muss. no fuss.
Text2vec
⭐
3,610
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-
Dive Into Dl Tensorflow2.0
⭐
3,588
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为TensorFlow 2.0实现,项目已得到李沐老师的认可
Baichuan2
⭐
3,527
A series of large language models developed by Baichuan Intelligent Technology
Flaml
⭐
3,500
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Text
⭐
3,411
Models, data loaders and abstractions for language processing, powered by PyTorch
Paper Qa
⭐
3,407
LLM Chain for answering questions from documents with citations
God Level Data Science Ml Full Stack
⭐
3,384
A collection of scientific methods, processes, algorithms, and systems to build stories & models. Whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI
Llm Foundry
⭐
3,354
LLM training code for MosaicML foundation models
Sumy
⭐
3,343
Module for automatic summarization of text documents and HTML pages.
Course Nlp
⭐
3,271
A Code-First Introduction to NLP course
Chatbot
⭐
3,252
ChatGPT带火了聊天机器人,主流的趋势都调整到了GPT类模式,本项目也与时俱进,会在近期更新GP
Ml Workspace
⭐
3,197
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Catalyst
⭐
3,151
Accelerated deep learning R&D
Picogpt
⭐
3,084
An unnecessarily tiny implementation of GPT-2 in NumPy.
Pyhanlp
⭐
3,036
中文分词
Simcse
⭐
2,983
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Fastnlp
⭐
2,940
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Argos Translate
⭐
2,929
Open-source offline translation library written in Python
Autotrain Advanced
⭐
2,928
🤗 AutoTrain Advanced
Gpt2 Chitchat
⭐
2,870
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
Chinese Clip
⭐
2,816
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Torchscale
⭐
2,804
Foundation Architecture for (M)LLMs
Uer Py
⭐
2,802
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Mitie
⭐
2,778
MITIE: library and tools for information extraction
Cluedatasetsearch
⭐
2,778
搜索所有中文NLP数据集,附常用英文NLP数据集
Lingvo
⭐
2,776
Lingvo
Thinc
⭐
2,774
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
Texthero
⭐
2,773
Text preprocessing, representation and visualization from zero to hero.
Ml Road
⭐
2,742
Machine Learning Resources, Practice and Research
Deepsparse
⭐
2,729
Sparsity-aware deep learning inference runtime for CPUs
Jionlp
⭐
2,724
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
Eli5
⭐
2,695
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Deepke
⭐
2,679
An Open Toolkit for Knowledge Graph Extraction and Construction published at EMNLP2022 System Demonstrations.
Cv
⭐
2,637
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
Neuralcoref
⭐
2,607
✨Fast Coreference Resolution in spaCy with Neural Networks
Related Searches
Python Jupyter Notebook (21,947)
Python Machine Learning (20,195)
Python Dataset (14,792)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Algorithms (10,033)
Python Artificial Intelligence (8,580)
Python Raspberry Pi (8,403)
Python Pytorch (7,877)
Python Neural (7,444)
1-100 of 1,656 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.