Project Name	Stars	Most Recent Commit	Open Issues	License	Language
Llm Rlhf Tuning	225	9 months ago	1		Python
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
Open Chatgpt	66	a year ago		apache-2.0	Python
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
Llama Trl	38	a year ago	2	apache-2.0	Python
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Chatglm Lora Rlhf Pytorch	21	a year ago	1	mit	Python
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Vicuna Lora Rlhf Pytorch	17	a year ago	2	mit	Python
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
Alpaca Lora Rlhf Pytorch	10	a year ago		mit	Python
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

Alternatives To Llama Trl

Select To Compare

Llm Rlhf Tuning ⭐ 225

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

most recent commit 9 months ago

Open Chatgpt ⭐ 66

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

most recent commit a year ago

Llama Trl ⭐ 38

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

most recent commit a year ago

Chatglm Lora Rlhf Pytorch ⭐ 21

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

most recent commit a year ago

Vicuna Lora Rlhf Pytorch ⭐ 17

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

most recent commit a year ago

Alpaca Lora Rlhf Pytorch ⭐ 10

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

most recent commit a year ago

Suggest An Alternative To llama-trl

Alternative Project Comparisons

Llama Trl vs Llm Rlhf Tuning

Llama Trl vs Open Chatgpt

Llama Trl vs Chatglm Lora Rlhf Pytorch

Llama Trl vs Vicuna Lora Rlhf Pytorch

Llama Trl vs Alpaca Lora Rlhf Pytorch

Popular Lora Projects

Chinese Llama Alpaca ⭐ 15,877

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

most recent commit 6 months ago

Stable Diffusion Webui Colab ⭐ 14,090

stable diffusion webui colab

most recent commit 8 months ago

Peft ⭐ 12,271

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

dependent packages 101total releases 11latest release December 06, 2023most recent commit 5 months ago

Llama Factory ⭐ 10,715

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

total releases 19latest release December 03, 2023most recent commit 5 months ago

Lora ⭐ 7,814

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

dependent packages 16total releases 3latest release August 27, 2023most recent commit 6 months ago

Popular Ppo Projects

Baselines ⭐ 14,949

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

dependent packages 2total releases 6latest release February 26, 2018most recent commit 7 months ago

Reinforcement Learning With Tensorflow ⭐ 8,174

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

most recent commit a year ago

Easy Rl ⭐ 7,643

强化学习中文教程（蘑菇书），在线阅读地址：https://datawhalechina.github

most recent commit 5 months ago

Tianshou ⭐ 7,125

An elegant PyTorch deep reinforcement learning library.

dependent packages 10total releases 33latest release August 22, 2023most recent commit 5 months ago

Deep Reinforcement Learning ⭐ 4,635

Repo for the Deep Reinforcement Learning Nanodegree program

most recent commit 7 months ago

Popular Networking Categories