Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for llm ai safety
ai-safety
x
llm
x
6 search results found
Safe Rlhf
⭐
1,040
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Tiger
⭐
337
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Awesome Ai Safety
⭐
64
A curated list of papers & technical articles on AI Quality & Safety 📚
Beavertails
⭐
12
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Llm Cooperation
⭐
9
Code and materials for the paper S. Phelps and Y. I. Russell, Investigating Emergent Goal-Like Behaviour in Large Language Models Using Experimental Economics, working paper, arXiv:2305.07970, May 2023
Universal Neurons
⭐
8
Universal Neurons in GPT2 Language Models
Related Searches
Python Llm (1,377)
Openai Llm (569)
Chatgpt Llm (533)
Artificial Intelligence Llm (445)
Natural Language Processing Llm (285)
Jupyter Notebook Llm (275)
Llm Llama (269)
Llm Gpt (260)
Typescript Llm (258)
Machine Learning Llm (214)
1-6 of 6 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.