Vicuna Lora Rlhf Pytorch

A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. It implements RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT, but with Vicuna.
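As a rough illustration of why LoRA makes fine-tuning feasible on consumer hardware, the sketch below compares the number of trainable weights in one fully fine-tuned linear layer against the two low-rank factors LoRA trains in its place. The layer size and rank are hypothetical values chosen for the arithmetic, not settings taken from this project or from the Vicuna configuration.

```python
def full_params(d_out: int, d_in: int) -> int:
    """Weights updated when the whole d_out x d_in matrix is fine-tuned."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Weights in the LoRA factors B (d_out x rank) and A (rank x d_in).

    LoRA keeps the original matrix W frozen and learns the update B @ A.
    """
    return rank * (d_out + d_in)


# Hypothetical layer size and rank, for illustration only.
d_out, d_in, rank = 4096, 4096, 8

full = full_params(d_out, d_in)
lora = lora_params(d_out, d_in, rank)
fraction = lora / full

print(f"full fine-tuning: {full:,} trainable weights")
print(f"LoRA (rank {rank}): {lora:,} trainable weights")
print(f"fraction trained: {fraction:.4%}")
```

With these assumed sizes, LoRA trains 1/256 of the weights in the layer, which is where most of the memory saving for gradients and optimizer state comes from.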
Alternatives To Vicuna Lora Rlhf Pytorch
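Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language

Stable Diffusion Webui Colab
Stars: 14,090 | Most Recent Commit: 7 months ago | Open Issues: 16 | License: unlicense | Language: Jupyter Notebook
stable diffusion webui colab

Peft
Stars: 12,271 | Downloads/Repos/Packages Using This: 101 | Most Recent Commit: 4 months ago | Total Releases: 11 | Latest Release: December 06, 2023 | Open Issues: 65 | License: apache-2.0 | Language: Python
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Lora
Stars: 7,814 | Downloads/Repos/Packages Using This: 16 | Most Recent Commit: 4 months ago | Total Releases: 3 | Latest Release: August 27, 2023 | Open Issues: 79 | License: mit | Language: Python
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Chatglm Efficient Tuning
Stars: 3,130 | Most Recent Commit: 7 months ago | Total Releases: 6 | Latest Release: August 12, 2023 | License: apache-2.0 | Language: Python
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

Adapters
Stars: 2,354 | Downloads/Repos/Packages Using This: 7 | Most Recent Commit: 2 months ago | Total Releases: 18 | Latest Release: April 06, 2023 | Open Issues: 51 | License: apache-2.0 | Language: Jupyter Notebook
A Unified Library for Parameter-Efficient and Modular Transfer Learning

Alpaca Cot
Stars: 2,235 | Most Recent Commit: 5 months ago | Open Issues: 30 | License: apache-2.0 | Language: Jupyter Notebook
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., LoRA, P-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We have built a fine-tuning platform that makes it easy for researchers to get started with and use large models, and we welcome open-source enthusiasts to submit any meaningful PR!

Chatglm_finetuning
Stars: 1,486 | Most Recent Commit: 6 months ago | Open Issues: 38 | Language: Python
chatglm 6b finetuning and alpaca finetuning

Onediff
Stars: 787 | Most Recent Commit: 4 months ago | Open Issues: 27 | Language: Python
OneDiff: An out-of-the-box acceleration library for diffusion models.

Lorax
Stars: 719 | Most Recent Commit: 4 months ago | Open Issues: 45 | License: apache-2.0 | Language: Python
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Llm Finetuning
Stars: 582 | Most Recent Commit: 6 months ago | Open Issues: 1 | Language: Jupyter Notebook
LLM Finetuning with peft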