Llama Trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Alternatives To Llama Trl
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Llm Rlhf Tuning225
9 months ago1Python
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
Open Chatgpt66
a year agoapache-2.0Python
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
Llama Trl38
a year ago2apache-2.0Python
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Chatglm Lora Rlhf Pytorch21
a year ago1mitPython
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Vicuna Lora Rlhf Pytorch17
a year ago2mitPython
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
Alpaca Lora Rlhf Pytorch10
a year agomitPython
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
Alternatives To Llama Trl
Select To Compare


Alternative Project Comparisons
Popular Lora Projects
Popular Ppo Projects
Popular Networking Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Adapter
Lora
Ppo