HomeCompaniesArga Labs
Arga Labs

The validation infrastructure for AI agents

Arga provides on-demand staging per PR, so anyone (human or agent) can test any change. Deploy a change into our sandbox and it runs against twins of any SaaS (Stripe, Google Drive, Slack, etc.), with no config required. These twins implement the same APIs, are fully compatible with any SDK, surface edge cases mocks or assertions miss, and you never hit rate limits. For AI agents, run red-teams with Arga: test tool calls, behaviours, and failure modes before anything reaches production.
Active Founders
Phillip Li
Phillip Li
Founder
Phillip is the Co-founder and CEO of Arga Labs. He previously built an internal dev tool at Amazon to automate complex engineering workflows. This resulted in 10+ recurring weeks/year of engineer hours saved across multiple teams, and prevented VP escalations due to bug issues. Before that, he was a fencer on the Canadian Junior National Team.
Akira Tong
Akira Tong
Founder
Co-founder, CTO @ Arga Labs Prev. SDE @ Stripe, quant @ GS Skipped high school, graduated @ 19 Pro player for Identity V, quit because I had to finish my finals :(
Company Launches
Arga Labs: on-demand, production-like staging for every PR
See original launch post

tl;dr: become a 100x tester with Arga by giving your agents staging environments scoped to a change. We spin up twins of your integrations (Slack, Stripe, etc.) and dependencies (DB, Redis), and deploy only the services that were changed. Everything else is routed to prod.

Launch Video: https://www.youtube.com/watch?v=OzEk5pYTvak

The problem: You’re a 1x tester

  • You don’t have staging :(
  • You have staging, but it’s different from prod, and things still break after testing
    • You mock integrations, which can be inaccurate
    • You use real accounts to test integrations, but you hit rate limits, can’t clear states between tests, and can’t test at scale
  • You have staging, but your agents can’t automatically test changes so you’re stuck at being a 1x tester despite being a 100x engineer
  • You have staging, but it’s difficult to merge concurrent changes without breaking something (for larger teams!)

Our Solution

We spin up a temporary staging of your app every time you open a PR. Only the services you change are redeployed; all other services are routed to prod. Your dependencies are deployed as in-memory sidecars, so nothing corrupts prod data. For example, you can configure the tests to read from the prod database and perform all write commands to the sidecar; you can also configure everything to be routed to the sidecar.

We also spin up twins of services your app uses: Stripe, Slack, GSuite, Hubspot, and more, so you don’t hit the real API. This lets you test more thoroughly: you can catch weird edge cases, simulate failure modes, and run tests in parallel without worrying about rate limits. All twins are fully compatible with any SDK, expose the same API endpoints, and support webhook events. You can seed any twin in natural language, or select from our pre-set scenarios.

uploaded image

Demo of Github, Slack, Stripe twins on the Arga app: https://www.youtube.com/watch?v=3ZPrIdGvPoo

You or your agent can prompt Arga via web, API, CLI, or MCP to automatically generate tests for your PR, run them, and stream the results and logs back. This allows them to know what was tested, what broke, and which calls/functions caused the failure. They can use this information to go back and fix the code, so you’ll have a fully-tested and passing PR before you look at it.

Validation report:

uploaded image

Claude Code using arga CLI to read logs & test results:

uploaded image

Our ask

Try us out! We’re live at www.argalabs.com and you can get access to limited features for free.
If you like our stuff, book a demo here (https://cal.com/phillip-li/30min).
If you know any teams that struggle with testing and use lots of external integrations, send them our way.

The Team

Akira and Phillip met each other in first-year calculus at UBC. At the time, Akira was 14 and Phillip was studying neuroscience to pursue research in human biology. Within a year of pivoting to CS, Phillip built a dev tool at Amazon that saved 10 weeks / year of engineering hours as an intern. While interning at Stripe, Akira realized that a high-fidelity “staging” was the key to unlocking dev productivity and was surprised that most companies don’t have proper staging that actually mirrored prod. Now they’re solving that problem.

uploaded image

YC Photos
Arga Labs
Founded:2025
Batch:Spring 2026
Team Size:3
Status:
Active
Location:San Francisco
Primary Partner:Pete Koomen