Awesome Open Source

Programming Languages

Search results for reinforcement learning ai safety

reinforcement-learning x

4 search results found

Safe Rlhf ⭐ 1,040

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Thought Cloning ⭐ 217

[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking

Pretraining With Human Feedback ⭐ 97

Code accompanying the paper Pretraining Language Models with Human Preferences

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Common repository for our readings and discussions

Related Searches

Python Reinforcement Learning (2,612)

Jupyter Notebook Reinforcement Learning (761)

Pytorch Reinforcement Learning (681)

Reinforcement Learning Rl (649)

Artificial Intelligence Reinforcement Learning (623)

Dataset Reinforcement Learning (51)

Python Ai Safety (17)

Survey Reinforcement Learning (14)

Reinforcement Learning Adversarial Attacks (14)

Reinforcement Learning Robustness (13)

1-4 of 4 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.