Awesome Open Source

Programming Languages

Search results for language model ai safety

language-model x

2 search results found

Pretraining With Human Feedback ⭐ 97

Code accompanying the paper Pretraining Language Models with Human Preferences

A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use

Beavertails ⭐ 12

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Promptinject ⭐ 11

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks.

Related Searches

Python Language Model (540)

Jupyter Notebook Language Model (263)

Artificial Intelligence Language Model (113)

Dataset Language Model (63)

Chatbot Language Model (52)

Language Model Gpt (37)

Python Ai Safety (17)

Language Model Pre Training (15)

Language Model Llama (14)

Reinforcement Learning Language Model (12)

1-2 of 2 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.