Quickstarts

LLM Tutorials

More examples

GitHub

Community

Evidently AI - Documentation

Docs

Metrics

Examples

F.A.Q.

Changelog

Old docs

Sign up

Website

Talk to us

Tutorials and guides

What is Evidently?

LLM Evaluation

Data and ML checks

How to capture LLM inputs and outputs and evaluate them.

Tracing

How to install the open-source Python library.

Installation

Evidently Cloud

How to self-host the open-source Evidently UI service.

Self-hosting

Core concepts and components of the Evidently Python library.

Introduction

Overview

Data definition

Descriptors

Report

Tests

Output formats

How to add metadata to the Report you upload.

Add tags and metadata

Set up an evaluation or monitoring Project.

Manage Projects

How to create, upload and manage Datasets.

Work with datasets

Synthetic data

Set up tracing

How to run evals and log them on the platform

Run evals via API

How to evaluate your data in a no-code interface.

No code evals

Reviewing the evaluation results on the Platform.

Explore view

Get a pre-built monitoring Dashboard using templates.

Dashboard tabs

Overview of the available monitoring Panels.

Dashboard panel types

How to design your Dashboard with custom Panels.

Add dashboard panels

How production AI quality monitoring works.

Batch monitoring

Running managed evaluations over traces on a platform.

Scheduled evals

Alerts

Available metrics, tests and how to customize them.

Evaluations

Reference page for all dataset-level evals.

All Metrics

Reference page for all row-level text and LLM evals.

All Descriptors

Text Evals

Data Drift

Data Summary

Overview of the Classification Quality Preset

Classification

Overview of the Regression Quality Preset

Regression

How to add a custom row-level text evaluator.

Custom Text Descriptor

How to run prompt-based evaluators for custom criteria.

Custom LLM Judge

How to use models from HuggingFace as evaluators.

Custom HuggingFace evaluator

How to change data drift detection methods and conditions.

Customize Data Drift

How to create a custom dataset or column-level Metric.

Custom Metric

Data drift

Open-source metrics for ranking and recommendations.

Ranking and RecSys metrics

LLM evaluations

How to run regression testing for LLM outputs. noindex: "true"

LLM regression testing

LLM as a judge

Frequently Asked Questions

Why Evidently?

Open-source vs. Cloud

How to migrate to the new Evidently version?

Migration Guide

What data is collected when you use Evidently open-source.

Intro

LLM Tutorials

Tutorials and guides

Quickstarts

LLM quickstart

ML quickstart

Tracing quickstart

LLM Tutorials

LLM judges

Regression testing

LLM evaluations

More examples

Intro

LLM Tutorials

​Quickstarts

LLM quickstart

ML quickstart

Tracing quickstart

​LLM Tutorials

LLM judges

Regression testing

LLM evaluations

​More examples

Quickstarts

LLM Tutorials

More examples