Grounding_llms_with_online_rl

We perform functional grounding of LLMs' knowledge in BabyAI-Text.

Alternatives To Grounding_llms_with_online_rl
Dl Nlp Readings
Stars: 847; Most Recent Commit: 2 years ago; License: MIT; Language: TeX
My reading lists for deep learning and natural language processing.

Skychat Chinese Chatbot Gpt3
Stars: 543; Most Recent Commit: a year ago; Open Issues: 4; License: MIT; Language: C#
SkyChat is a chatbot project based on a Chinese GPT-3 API. Like ChatGPT, it supports human-machine chat and question answering, and can also handle tasks such as Chinese-English and English-Chinese translation, content continuation, couplets, and classical Chinese poetry writing.

Textrl
Stars: 513; Most Recent Commit: 9 months ago; Total Releases: 33; Latest Release: August 06, 2023; Open Issues: 3; License: MIT; Language: Python
Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face Transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL).

Coderl
Stars: 423; Most Recent Commit: 8 months ago; Open Issues: 33; License: BSD-3-Clause; Language: Python
Official code for the paper "CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning" (NeurIPS 2022).

Llm Rlhf Tuning
Stars: 225; Most Recent Commit: 7 months ago; Open Issues: 1; Language: Python
LLM tuning with PEFT (SFT + RM + PPO + DPO with LoRA).

Implicit Language Q Learning
Stars: 153; Most Recent Commit: 9 months ago; License: MIT; Language: Python
Official code for the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning".

Scienceworld
Stars: 151; Most Recent Commit: 4 months ago; Open Issues: 13; License: Apache-2.0; Language: Scala
ScienceWorld is a text-based virtual environment centered on accomplishing tasks from the standardized elementary science curriculum.

Grounding_llms_with_online_rl
Stars: 135; Most Recent Commit: 6 months ago; Open Issues: 3; License: MIT; Language: Python
We perform functional grounding of LLMs' knowledge in BabyAI-Text.

Gdc
Stars: 99; Most Recent Commit: a year ago; License: Other; Language: Python
Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation".

Pretraining With Human Feedback
Stars: 97; Most Recent Commit: a year ago; Open Issues: 2; License: MIT; Language: Python
Code accompanying the paper "Pretraining Language Models with Human Preferences".