Raindrop: Sentry for AI Agents

Raindrop

Sentry for AI Agents

Winter 2024

Active

Artificial Intelligence

B2B

Monitoring

San Francisco

https://www.raindrop.ai

Sentry for AI Agents

Monitor your AI agents the right way. AI engineers use Raindrop to get alerts about silent failures in their AI agents. Raindrop sends you alerts when your AI misbehaves and links straight to the events, so you can dig into the conversations or traces, understand the root cause, and fix it, fast.

Active Founders

Zubin Koticha

CEO, Cofounder

Building Raindrop – Sentry for AI products. Prev. cofounder & CEO of Opyn (acq. Coinbase), the first and largest DeFi options platform, with $15 + billion volume. UC Berkeley

Zubin Koticha

CEO, Cofounder

Building Raindrop – Sentry for AI products. Prev. cofounder & CEO of Opyn (acq. Coinbase), the first and largest DeFi options platform, with $15 + billion volume. UC Berkeley

Alexis Gauba

Founder

Building Raindrop: Sentry for AI agents. Previously Co-Founder at Opyn (acquired by Coinbase), the first and largest DeFi options platform ($15b+ volume), inventing a new financial asset class known as the power perpetual (an option that has no expiry). Dropped out of UC Berkeley EECS

Alexis Gauba

Founder

Ben Hylak

Founder

building raindrop (sentry for ai products) I was previously on the Human Interface team at Apple for 4 years, building out visionOS. before that, dabbled with robotics + avionics

Ben Hylak

Founder

building raindrop (sentry for ai products) I was previously on the Human Interface team at Apple for 4 years, building out visionOS. before that, dabbled with robotics + avionics

Latest News

Want to Know Why Your AI Agent Failed? There’s AI for That.

Dec 01, 2025

Company Launches

Raindrop Deep Search

See original launch post

https://www.youtube.com/watch?v=pN82WxN-_G0

Today, we're excited to launch Raindrop Deep Search

It’s like Deep Research for your Production AI Data

Search for anything, and Raindrop automatically trains little models to accurately classify any topic or issue, across millions of events.

𝗧𝗵𝗲 𝗣𝗿𝗼𝗯𝗹𝗲𝗺

We’ve heard from thousands of AI engineers and they’re struggling to track issues with their agents.

Imagine a user reports a problem: your agent is saying it can’t search the web for documentation. You need to know if this is a one-off problem or a much bigger issue… but how? Keyword search, or even semantic search, doesn’t tell the full story.

𝗖𝗮𝗻’𝘁 𝘄𝗲 𝗷𝘂𝘀𝘁 𝘂𝘀𝗲 𝗧𝗿𝗮𝗱𝗶𝘁𝗶𝗼𝗻𝗮𝗹 𝗘𝘃𝗮𝗹𝘀?

Offline evals work well as unit tests. But since they’re running on preset data, you have no visibility into what’s actually happening in production.

Online evals just run these unit tests on a tiny sample of production data, leaving you blind to how widespread problems are.

𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗥𝗮𝗶𝗻𝗱𝗿𝗼𝗽 𝗗𝗲𝗲𝗽 𝗦𝗲𝗮𝗿𝗰𝗵

That’s why we built Deep Search. It’s like Deep Research for your production data.

How Deep Search works:

1. Describe the issue (eg. agent failing to search the web)

2. Deep Search finds examples out of millions of events

3. Refine search with feedback

4. Start tracking the issue

Deep Search runs across all of your production data to give you an accurate metric of issue frequency.

Traditional classification systems require humans to manually label thousands of data points. So to achieve this, Raindrop Deep Search introduces a new research breakthrough, bespoke few-shot classifiers, which only need a few examples.

It’s essentially bootstrapping weaker systems from stronger systems, ultimately training custom small models that analyze millions of events a day. You can think of it like creating materialized views for natural language.

Once you start tracking the issue you can use Raindrop to dive into traces and tool calls to find the root cause. And you can quickly confirm whether your fixes are effective by monitoring issue frequency and receiving real-time Slack alerts.

You can try out Deep Search at raindrop.ai.

We’re excited to hear what you think!

uploaded image