Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for language model rlhf
language-model
x
rlhf
x
11 search results found
Open Assistant
⭐
36,197
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Llama Factory
⭐
10,715
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Chatglm Efficient Tuning
⭐
3,130
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
Docta
⭐
2,472
A Doctor for your data
Textrl
⭐
519
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Pykoi
⭐
332
pykoi: Active learning in one unified interface
Llm Rlhf Tuning
⭐
225
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
Pretraining With Human Feedback
⭐
97
Code accompanying the paper Pretraining Language Models with Human Preferences
Alpaca Rlhf
⭐
42
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
Okapi
⭐
36
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Beavertails
⭐
12
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Awesome Rlaif
⭐
5
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
Related Searches
Python Language Model (692)
Pytorch Language Model (164)
Machine Learning Language Model (127)
Llm Language Model (82)
Artificial Intelligence Language Model (63)
Dataset Language Model (63)
Gpt 3 Language Model (44)
Language Model Gpt 2 (41)
Language Model Gpt (37)
Python Rlhf (34)
1-11 of 11 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.