Instructllama

Name: michaelnny/InstructLLaMA
Brand: michaelnny/InstructLLaMA
SKU: project/michaelnny/InstructLLaMA
Rating: 4.42 (12 reviews)

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but on a much smaller scale.

Categories > Machine Learning > Ppo

Suggest Alternative

Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.