Awesome Open Source

Programming Languages

Search results for language model rlhf

language-model x

11 search results found

Open Assistant ⭐ 36,197

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Llama Factory ⭐ 10,715

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Chatglm Efficient Tuning ⭐ 3,130

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Docta ⭐ 2,472

A Doctor for your data

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

pykoi: Active learning in one unified interface

Llm Rlhf Tuning ⭐ 225

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Pretraining With Human Feedback ⭐ 97

Code accompanying the paper Pretraining Language Models with Human Preferences

Alpaca Rlhf ⭐ 42

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Beavertails ⭐ 12

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Awesome Rlaif ⭐ 5

A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)

Related Searches

Python Language Model (692)

Pytorch Language Model (164)

Machine Learning Language Model (127)

Llm Language Model (82)

Artificial Intelligence Language Model (63)

Dataset Language Model (63)

Gpt 3 Language Model (44)

Language Model Gpt 2 (41)

Language Model Gpt (37)

Python Rlhf (34)

1-11 of 11 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.