Baserun: Observability and evaluation platform for LLM apps.

Summer 2023

Active

AIOps

Artificial Intelligence

LLMs are incredibly powerful, but latency, cost, and unpredictable outputs have made productionizing LLM features challenging. Baserun is a testing and observability platform that helps AI teams streamline their development cycle from identifying an issue to evaluating their solution, so that teams ship faster with confidence.

Active Founders

Effy Zhang

Founder

Building something new

Company Launches

baserun.ai: Ship LLM features with confidence 💪💪💪

TL;DR: baserun.ai is a testing platform for LLM apps. From prompt playground to end-to-end tests, Baserun helps you ship your LLM apps with confidence and speed.

The problem:

Productionizing LLM features is hard. 🥲

It's difficult to judge which combination of model, configurations, and prompts performs better.
It's challenging to debug complex workflows that mix chained prompts and other 3rd party API calls.
It's hard to understand the progression of app performance over time.

Our solution:

Gain insights into your LLM features within seconds

Install the Baserun SDK to get immediate insight into your LLM features and agents during testing, and to monitor their behavior in production.

Full visibility into your end-to-end tests & user journey

Visualize the precise sequence of calls, duration, and cost, along with the inputs and outputs at each stage, encompassing both custom functions and third-party API calls.
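
To make the idea of a call trace concrete, here is a minimal sketch in plain Python. It is not the Baserun SDK: the `traced` decorator, the `TRACE` list, the stand-in functions, and the flat per-call cost are all hypothetical, chosen only to show what recording sequence, duration, cost, inputs, and outputs per step could look like.

```python
import functools
import time

# Hypothetical in-memory trace log; not part of the Baserun SDK.
TRACE = []

def traced(cost_per_call=0.0):
    """Record step name, inputs, output, duration, and an assumed flat cost."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            result = fn(*args, **kwargs)
            TRACE.append({
                "step": fn.__name__,
                "inputs": {"args": args, "kwargs": kwargs},
                "output": result,
                "duration_s": time.perf_counter() - start,
                "cost_usd": cost_per_call,
            })
            return result
        return wrapper
    return decorator

@traced()
def fetch_context(query):
    # Stand-in for a third-party API call.
    return f"docs about {query}"

@traced(cost_per_call=0.002)
def call_model(prompt):
    # Stand-in for an LLM call; the cost figure is made up for illustration.
    return prompt.upper()

def answer(query):
    # A two-step workflow: retrieve context, then call the model.
    context = fetch_context(query)
    return call_model(f"{query} | {context}")
```

After calling `answer("billing")`, `TRACE` holds one entry per step in call order, which is the raw material a trace view would render.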

Intuitive and flexible UI for evaluating and debugging

Effortlessly compare test runs side by side, directly edit prompts, and rerun tests from the UI.
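
As a rough illustration of a side-by-side comparison, the sketch below pairs the outputs of two test runs by test-case name and flags the cases whose output changed. The `compare_runs` helper and the dictionary shape of a run are assumptions made for this example, not Baserun's data model.

```python
def compare_runs(run_a, run_b):
    """Pair outputs by test-case name and flag the cases that differ.

    Each run is assumed to be a dict mapping test-case name to output text.
    A case missing from one run shows up with None on that side.
    """
    rows = []
    for name in sorted(set(run_a) | set(run_b)):
        a, b = run_a.get(name), run_b.get(name)
        rows.append({"case": name, "a": a, "b": b, "changed": a != b})
    return rows

# Two hypothetical runs of the same test set with different prompts.
run_a = {"greeting": "Hi", "refund": "No"}
run_b = {"greeting": "Hi", "refund": "Yes, within 30 days"}

for row in compare_runs(run_a, run_b):
    marker = "*" if row["changed"] else " "
    print(f"{marker} {row['case']}: {row['a']!r} -> {row['b']!r}")
```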

Collaborative workspace for teams

Review results, experiment and iterate on prompts, and build test datasets with your whole team. All prompts and test results are version-controlled.

Our Asks

Try baserun and give us some feedback! Book a 15-minute call with us. We’d love to hear about what you are building and help you get onboarded.
You can follow us on Twitter or LinkedIn, or email us at hello@baserun.ai