Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for reinforcement learning ai safety
ai-safety
x
reinforcement-learning
x
4 search results found
Safe Rlhf
⭐
1,040
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Thought Cloning
⭐
217
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
Pretraining With Human Feedback
⭐
97
Code accompanying the paper Pretraining Language Models with Human Preferences
La Mbda
⭐
16
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
Ais
⭐
5
Common repository for our readings and discussions
Related Searches
Python Reinforcement Learning (2,612)
Jupyter Notebook Reinforcement Learning (761)
Pytorch Reinforcement Learning (681)
Reinforcement Learning Rl (649)
Artificial Intelligence Reinforcement Learning (623)
Dataset Reinforcement Learning (51)
Python Ai Safety (17)
Survey Reinforcement Learning (14)
Reinforcement Learning Adversarial Attacks (14)
Reinforcement Learning Robustness (13)
1-4 of 4 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.