The official CircleCI Evals Orb makes it easy to integrate LLM evaluations into a CI pipeline and to review the results without context switching. The output of evaluations run through the Evals Orb is stored in CircleCI, where it is accessible both as a job artifact and as a PR comment that CircleCI adds automatically.
Currently, the Evals Orb exposes commands to run evaluations through two popular LLMOps tools: LangSmith and Braintrust. A sketch of what a pipeline using the orb looks like is shown below. If your evals rely on a different tool, let us know at ai-feedback@circleci.com. You can also contribute directly to the official Orb by opening a PR on the public repository.
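To illustrate where the orb fits in a pipeline, here is a minimal `.circleci/config.yml` sketch. The orb version, the command name, and the parameter names below are assumptions for illustration only, not the orb's documented interface; consult the Evals Orb page in the CircleCI Orb Registry for the exact commands and parameters.

```yaml
version: 2.1

orbs:
  # Pin the official Evals Orb to a published version (x.y.z is a placeholder).
  evals: circleci/evals@x.y.z

jobs:
  run-evals:
    docker:
      - image: cimg/python:3.12
    steps:
      - checkout
      # Hypothetical command and parameter names, shown only to convey the
      # shape of an eval step targeting one of the supported platforms.
      # The real interface is listed in the orb registry.
      - evals/run:
          platform: langsmith          # or: braintrust
          eval_command: python run_evals.py

workflows:
  evaluate:
    jobs:
      - run-evals
```

On a pull request, a job like this would run your evaluation suite, store the results as a job artifact, and post a summary comment on the PR, so reviewers can inspect eval outcomes alongside the code change.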
More resources on evaluating LLM-enabled applications are available in our documentation.