Awesome Open Source
Search results for large language models rlhf
16 search results found
Llama Factory
⭐
10,715
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Llmsurvey
⭐
7,255
The official GitHub page for the survey paper "A Survey of Large Language Models".
Chinese Llama Alpaca 2
⭐
5,810
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models
Awesome Rlhf
⭐
2,376
A curated list of reinforcement learning with human feedback resources (continually updated)
Safe Rlhf
⭐
1,040
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Alpaca_eval
⭐
899
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Alignllmhumansurvey
⭐
368
Aligning Large Language Models with Human: A Survey
Medqa Chatglm
⭐
235
🛰️ Fine-tuning ChatGLM on real medical dialogue data with LoRA, P-Tuning V2, Freeze, RLHF, and other methods; our vision goes beyond medical Q&A
Step_into_llm
⭐
211
MindSpore online courses: Step into LLM
Cornucopia Llama Fin Chinese
⭐
178
Cornucopia (聚宝盆): LLaMA models fine-tuned on Chinese financial knowledge; covers SFT, RLHF, GPU training and deployment, and more
Chain Of Hindsight
⭐
171
Chain-of-Hindsight, a simpler and more effective alternative to RLHF
Remax
⭐
61
Alpaca Rlhf
⭐
42
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
Okapi
⭐
36
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Prompt Oirl
⭐
14
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
Lm Research Hub
⭐
14
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)
Related Searches
Python Large Language Models (393)
Llm Large Language Models (359)
Chatgpt Large Language Models (153)
Artificial Intelligence Large Language Models (123)
Machine Learning Large Language Models (121)
Gpt Large Language Models (118)
Large Language Models Llms (96)
Large Language Models Llama (85)
Langchain Large Language Models (46)
Large Language Models Llama2 (45)
Copyright 2018-2024 Awesome Open Source. All rights reserved.