Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python dpo
dpo
x
python
x
4 search results found
Medicalgpt
⭐
2,127
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏
Halos
⭐
339
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
Notus
⭐
123
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Llm_dpo
⭐
6
dpo finetuning
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
1-4 of 4 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.