Awesome Open Source
Search results for python llm evaluation
10 search results found
Deepeval
⭐
1,070
The Evaluation Framework for LLMs
Agenta
⭐
651
The all-in-one LLMOps platform: prompt management, evaluation, human feedback, and deployment all in one place.
Continuous Eval
⭐
78
Evaluation for LLM / RAG pipelines, ready for CI/CD
Commongen Eval
⭐
74
Evaluating LLMs with CommonGen-Lite
Athina Evals
⭐
45
Python SDK for running evaluations on LLM-generated responses
Just Eval
⭐
36
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Conner
⭐
24
Implementation of the EMNLP 2023 paper "Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators"
Dcr Consistency
⭐
16
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Leaf Playground
⭐
14
A framework for building scenario simulation projects in which human and LLM-based agents can participate, with a user-friendly web UI for visualizing simulations and support for automatic evaluation at the agent-action level.
Parea Sdk Py
⭐
13
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Copyright 2018-2024 Awesome Open Source. All rights reserved.