Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Dl Nlp Readings | 847 | 2 years ago | n,ull | mit | TeX | |||||
My Reading Lists of Deep Learning and Natural Language Processing | ||||||||||
Skychat Chinese Chatbot Gpt3 | 543 | a year ago | 4 | mit | C# | |||||
SkyChat是一款基于中文GPT-3 api的聊天机器人项目。它可以像chatGPT一样,实现人机聊天、问答、中英文互译、对对联、写古诗等任务。| SkyChat is a Chatbot project based on Chinese GPT3 API. Like chatGPT, it can do human-machine chat, question and answer, and can also complete tasks such as Chinese-English or English-Chinese translation, content continuation, couplets, and Chinese ancient poems writing. | ||||||||||
Textrl | 513 | 9 months ago | 33 | August 06, 2023 | 3 | mit | Python | |||
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL) | ||||||||||
Coderl | 423 | 7 months ago | 33 | bsd-3-clause | Python | |||||
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22). | ||||||||||
Llm Rlhf Tuning | 225 | 7 months ago | 1 | Python | ||||||
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA) | ||||||||||
Implicit Language Q Learning | 153 | 9 months ago | mit | Python | ||||||
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" | ||||||||||
Scienceworld | 151 | 3 months ago | 13 | apache-2.0 | Scala | |||||
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. | ||||||||||
Grounding_llms_with_online_rl | 135 | 5 months ago | 3 | mit | Python | |||||
We perform functional grounding of LLMs' knowledge in BabyAI-Text | ||||||||||
Gdc | 99 | a year ago | other | Python | ||||||
Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation" | ||||||||||
Pretraining With Human Feedback | 97 | a year ago | 2 | mit | Python | |||||
Code accompanying the paper Pretraining Language Models with Human Preferences |