Awesome Open Source

Programming Languages

Search results for python llm evaluation

llm-evaluation x

10 search results found

Deepeval ⭐ 1,070

The Evaluation Framework for LLMs

The all-in-one LLMOps platform: prompt management, evaluation, human feedback, and deployment all in one place.

Continuous Eval ⭐ 78

Evaluation for LLM / RAG pipelines, ready for CI/CD

Commongen Eval ⭐ 74

Evaluating LLMs with CommonGen-Lite

Athina Evals ⭐ 45

Python SDK for running evaluations on LLM generated responses

Just Eval ⭐ 36

A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.

The implementation for EMNLP 2023 paper ”Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators“

Dcr Consistency ⭐ 16

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models

Leaf Playground ⭐ 14

A framework to build scenario simulation projects where human and LLM based agents can participant in, with a user-friendly web UI to visualize simulation, support automatically evaluation on agent action level.

Parea Sdk Py ⭐ 13

Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)

Related Searches

Python Django (28,897)

Python Machine Learning (20,195)

Python Flask (17,643)

Python Dataset (14,792)

Python Docker (14,113)

Python Tensorflow (13,736)

Python Command Line (13,351)

Python Deep Learning (13,092)

Python Jupyter Notebook (12,976)

Python Network (11,495)

1-10 of 10 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.