Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Llm Rlhf Tuning | 225 | 9 months ago | 1 | Python | ||||||
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA) | ||||||||||
Open Chatgpt | 66 | a year ago | apache-2.0 | Python | ||||||
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT. | ||||||||||
Llama Trl | 38 | a year ago | 2 | apache-2.0 | Python | |||||
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA | ||||||||||
Chatglm Lora Rlhf Pytorch | 21 | a year ago | 1 | mit | Python | |||||
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM | ||||||||||
Vicuna Lora Rlhf Pytorch | 17 | a year ago | 2 | mit | Python | |||||
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna | ||||||||||
Alpaca Lora Rlhf Pytorch | 10 | a year ago | mit | Python | ||||||
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca |