Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for language model ai safety
ai-safety
x
language-model
x
2 search results found
Pretraining With Human Feedback
⭐
97
Code accompanying the paper Pretraining Language Models with Human Preferences
Toolemu
⭐
73
A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
Beavertails
⭐
12
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Promptinject
⭐
11
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks.
Related Searches
Python Language Model (540)
Jupyter Notebook Language Model (263)
Artificial Intelligence Language Model (113)
Dataset Language Model (63)
Chatbot Language Model (52)
Language Model Gpt (37)
Python Ai Safety (17)
Language Model Pre Training (15)
Language Model Llama (14)
Reinforcement Learning Language Model (12)
1-2 of 2 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.