
Hey everyone, we’re Abdullah and Saad - cofounders of Zag 👋
TL;DR
Zag is a platform for building and running AI review agents. You describe agents in TypeScript or Swift as composable Tasks, and Zag runs them in real sandboxed environments on every pull request. Agents don't just read diffs. They build your project, run your tests, boot your app, and verify that it actually works.
We've opened early access to teams building for Apple platforms (iOS, iPadOS, macOS), running on dedicated Apple Silicon with Xcode, simulators, and the full native toolchain.
https://www.youtube.com/watch?v=MuIK1woknhc
The Problem
Coding agents are producing changes faster than humans can review them. Tools like Claude Code and Codex are accelerating how code gets written, but the review side hasn't kept up. Teams are merging changes they haven't fully validated, or creating bottlenecks where a handful of senior engineers review everything.
Static AI code review (inline comments on diffs) helps, but it requires more human-in-the-loop than teams can sustain. Reading AI-generated suggestions about AI-generated code doesn't scale. The kind of confidence that lets teams merge fast comes from actually building, testing, and running the code.
Our Solution
Zag lets teams build custom review agents that run in real environments on every PR.
We believe that as human engineering work moves up a layer of abstraction, a tech lead's primary job will be tuning better review agents. Not reading diffs line by line, but shaping the automated judgment that enforces their team's engineering culture at the scale of AI-authored code.
The primary unit of work is a Task: a composable, dependency-aware unit that describes an environment and the logic that runs inside it. Tasks can spin up Claude Code, Codex, or custom agents with access to real build tools, simulators, and full operating systems. Every Task runs in an ephemeral session on dedicated Apple Silicon that boots in seconds from pre-warmed snapshots.
Use built-in utilities for common checks (build verification, test execution, crash detection, App Store compliance), or write your own agents for the specific behaviors your team cares about. When applied to every PR, this helps teams merge changes as fast as they author them.
Why Apple Developers First
Apple developers need macOS to build and test their code. You can't meaningfully validate an iOS app without Xcode, a simulator, and Apple Silicon. Most CI providers treat macOS as an afterthought. Zag provisions dedicated M4 hardware with no queue at $0.015/minute, priced at AWS cost. If you already have GitHub Actions or Cirrus workflows, a one-line runner swap points them at Zag compute, even before you adopt the review agents.
Our Ask
The Team
Abdullah built ML infrastructure at Google powering core Search systems that directly drove $1B+ in revenue.
Saad spent three years training video models at Tesla Autopilot and later built training infrastructure with Ashish Vaswani (author of Attention Is All You Need).
Together, we’ve also collected $200k+ in bug bounties from security research.