Infrastructure Startups funded by Y Combinator (YC) in the San Francisco Bay Area that are currently hiring 2024

May 2024

Browse 24 of the top Infrastructure startups funded by Y Combinator. Headquartered in the San Francisco Bay Area, these are some of the hottest and fastest-growing startups. Their teams are well-funded and actively hiring.

We also have a Startup Directory where you can search through over 5,000 companies.

  • Apollo
    Apollo (s2011) • Active • 200 employees • San Francisco, CA, USA
    Apollo GraphQL is the leader in open-source and commercial GraphQL technologies. Apollo’s open-source GraphQL client, server, and gateway are downloaded more than 17M times per month and used in production by over 30% of the Fortune 500. Customers like Walmart, Expedia, Glassdoor, Audi, and PayPal use the Apollo Graph Platform to unify their GraphQL efforts, collaborate on graph development, and safely iterate on their graphs. Based in San Francisco, Apollo is backed by Insight Partners, Andreessen Horowitz, Matrix Partners, Trinity Ventures, Y Combinator, and individual investors.
    developer-tools
    open-source
    graphql
  • SingleStore
    SingleStore (w2011) • Active • 150 employees • San Francisco, CA, USA
    MemSQL is The No-Limits Database™, powering modern applications and analytical systems with a cloud-native, massively scalable architecture for maximum ingest and query performance at the highest concurrency. MemSQL envisions a world where every business can make decisions in real time and every experience is optimized through data. Global enterprises use the MemSQL distributed database to easily ingest, process, analyze, and act on data in order to thrive in today’s insight-driven economy. MemSQL is optimized to run on any public cloud or on-premises with commodity hardware. Visit www.memsql.com or follow us @memsql. Headquartered in San Francisco, CA, with offices in Seattle, WA, and Portland, OR, MemSQL has raised over $100M from top investors including GV, Accel Partners, and Khosla Ventures, among others. MemSQL is trusted by customers including Uber, Akamai, Dell EMC, Samsung, Comcast, Kellogg, and more. If you want to work at a company that celebrates diversity, innovation, leadership, and creativity every day, check out our openings at https://www.memsql.com/careers/.
  • Etleap
    Etleap (w2013) • Active • 11 employees • San Francisco, CA, USA
    Etleap is an ETL solution for creating perfect data pipelines from day one. Unlike other enterprise solutions, Etleap doesn’t require extensive engineering work to set up, maintain, and scale. It automates most ETL setup and maintenance work, and simplifies the rest into 10-minute tasks that analysts can own.
    data-engineering
  • Mezmo
    Mezmo (w2015) • Active • 172 employees • San Jose, CA, USA
    Mezmo, formerly LogDNA, is an observability platform to manage and take action on your data. It ingests, processes, and routes log data to fuel enterprise-level application development and delivery, security, and compliance use cases. Mezmo was brought to life by three-time co-founders Chris Nguyen and Lee Liu and included in the Winter 2015 batch of Y Combinator. In 2018 the company partnered with tech giant IBM to become the sole logging provider for IBM Cloud. Mezmo is on a mission to empower people who build solutions that shape the world. We’re doing this by delivering a platform that enables enterprises to get more value from their observability data in real time, regardless of source, destination, use case, or scale. We’re not the only ones working on this problem, but we have a few things the others don’t. We’re cloud-native and know how to make the most of modern technology like Kubernetes. We have scaled a solution from zero to petabyte scale in a short amount of time, while supporting thousands of active users across multiple environments. We are hungry for change and are surrounded by enterprises telling us they’re hungry, too. We have a kick-ass group of people who are thinking about the problem analytically and are excited to change the observability world for the better. Mezmo has helped some of the world’s most innovative companies transform how they manage their systems and applications. Still, we know that we can help them get more value from their observability data by providing more flexibility and control over how they use it. This will enable teams to spend less time switching between data silos so they can focus on shipping better, more resilient, and more secure products. We have momentum on our side. Last year we saw triple-digit revenue growth and added 800 new customers to our roster. Recent accolades include being named to YC’s Top Companies, CRN’s 10 Hottest DevOps Startups, and EMA’s Top 3 Observability Platforms.
    developer-tools
    devsecops
    saas
    kubernetes
    data-engineering
  • Bitmovin
    Bitmovin (s2015) • Active • 145 employees • San Francisco, CA, USA
    Bitmovin has been a first mover in almost every significant development in online video, from building and deploying the world’s first (and fastest) commercial adaptive streaming (MPEG-DASH/HLS) HTML5 Player, to being the first to achieve 100x realtime encoding speeds in the cloud. Bitmovin provides HEVC as well as VP9 live streaming at 60 FPS and 4K resolution, and built the first containerized video encoding solution with Docker and Kubernetes. Bitmovin products are developed entirely in-house, quick and easy to integrate, and highly customizable. In combination with our great support, documentation, and SLAs, this is a true enterprise offering. To find out more about Bitmovin’s video infrastructure solutions, or about any individual products, contact sales@bitmovin.com, or visit our website: bitmovin.com
    developer-tools
    video
  • Mux
    Mux (w2016) • Active • 140 employees • San Francisco, CA, USA
    Mux is an API-first video platform designed by experts to make world-class video streaming and analytics possible for every development team. Self-optimizing videos take the guesswork out of encodings, delivery, and renditions. Mux makes your video look beautiful on every device, every time. We offer two products, Mux Video and Mux Data. Mux Video is a simple API for advanced video streaming. Powered by data and designed by video experts, Mux Video makes beautiful video possible for every development team. Post a video or live stream with a single API call, and watch it within seconds. Mux Video handles the encoding, storage, and delivery. QoE and performance analytics provided by Mux Data are included by default for every Mux video stream. This real-time data is used to create the right renditions for every device and title, resulting in greater reliability and better viewer experiences. Mux Data is a Quality of Experience analytics platform that measures the performance metrics that actually matter to viewers: rebuffering, startup time, video quality, and errors. Our intuitive dashboard quickly gives users the insight to find issues and make improvements through features like A/B testing, industry-wide scores, alerts, and detailed information about every single video view. The product works out of the box on HTML5, iOS, Android, and major OTT platforms. All data is available via exports and APIs, making it easy to integrate Mux Data into other systems.
    developer-tools
    video
    api
  • Activeloop
    Activeloop (s2018) • Active • 15 employees • Mountain View, CA, USA
    We provide a simple API for creating, storing, versioning, and collaborating on multi-modal AI datasets of any size. With Activeloop's open-core stack, you can rapidly transform and stream data while training models at scale. Deep Lake powers foundational model training by acting as a vector database with significant benefits, such as (1) the ability to use multi-modal datasets to fine-tune your own LLMs, (2) storing both the embeddings and the original data with automatic version control, so no embedding re-computation is needed, and (3) a truly serverless service with no vendor lock-in. How cool is that? GitHub loves us - we're one of the fastest-growing libraries there, and we're used by little-known companies like Google, Waymo, and Intel. No big deal. Our founding team hails from places like Princeton, Stanford, Google, and Tesla, and we're backed by Y Combinator & other Silicon Valley heavyweights. Activeloop is hiring, and we want you! Check out our open roles on our YC page and join the fun. 10-min demo: https://activeloop.wistia.com/medias/aibvo0dst2 Whitepaper: https://www.deeplake.ai/whitepaper
    computational-storage
    deep-learning
    generative-ai
    computer-vision
    open-source
  • Wasmer
    Wasmer (s2019) • Active • 11 employees • San Francisco, CA, USA
    We are working on the software that will power the next generation of Cloud Computing (Edge Computing) platforms using WebAssembly. You can see us as a mix between Docker (but 100x more performant in startup time, and with 100x more lightweight containers), npm (we released the first WebAssembly package manager: wapm.io) and the JVM (but more universal).
    developer-tools
    open-source
  • Freshpaint
    Freshpaint (s2019) • Active • 45 employees • San Francisco, CA, USA
    Our customers use Freshpaint to make their websites and entire marketing stack HIPAA compliant. It's impossible to use modern analytics and marketing tools without sharing sensitive HIPAA-regulated data. Our platform does the hard thing of moving data from A to B. And then we do the really hard thing of controlling the flow of sensitive HIPAA-regulated data to various 3rd-party marketing tools. We're building the data infrastructure that safeguards patient privacy while also enabling marketing teams to promote access to healthcare.
    developer-tools
    saas
    b2b
  • Replicate
    Replicate (w2020) • Active • 27 employees • San Francisco, CA, USA
    artificial-intelligence
    developer-tools
    machine-learning
    community
    open-source
  • Cortex
    Cortex (w2020) • Active • 50 employees • San Francisco, CA, USA
    Cortex is an internal developer portal built to accelerate the path to engineering excellence. Companies like Docker, TripAdvisor, and Dropbox use Cortex to catalog, score, and assign actions to improve service quality and velocity, so devs can get back to work that drives the business forward.
    developer-tools
    saas
    b2b
  • Aquarium Learning
    Aquarium Learning (s2020) • Active • 12 employees • San Francisco, CA, USA
    ML models are only as good as the datasets they're trained on, and that means that most improvement to model performance comes from improvement to the quality and diversity of their datasets. Our tooling makes it easy for ML teams to find anomalies + failure patterns in their datasets and fix these problems by editing / adding the right data. So the next time you retrain your model, it just gets better.
    deep-learning
    developer-tools
    generative-ai
    machine-learning
    ai
  • Svix
    Svix (w2021) • Active • 10 employees • New York, NY, USA
    Svix is the enterprise-ready, open-source webhooks service. Webhooks are a pain. Developers need to worry about deliverability, retries, monitoring, and security, all of which work differently for webhooks than for the rest of the stack. We turn all of that into a simple API call. We are backed by Y Combinator, Andreessen Horowitz (a16z), Aleph, and founders and CTOs of companies such as GitHub, PagerDuty, Segment, and Lookout.
    developer-tools
    open-source
    api
  • Rootly
    Rootly (s2021) • Active • 52 employees • San Francisco, CA, USA
    At Rootly, we are on a mission to be the go-to way companies respond when things go wrong, helping every organization be more reliable. We do this by building an industry-leading incident management platform that allows companies around the world to consistently and quickly resolve incidents. We are not simply transforming an industry, we are carving an entirely new +$B segment ourselves and need incredible talent to achieve this ambitious goal together. Customers love Rootly. Some of the fastest-growing companies around the world, such as NVIDIA, Figma, Canva, Tripadvisor, Squarespace, and more, rely on Rootly to power their critical incident management process. They obsess over our delightful enterprise-ready platform and unique partnership model.
    developer-tools
    saas
    b2b
  • Sieve
    Sieve (w2022) • Active • 6 employees • San Francisco, CA, USA
    Sieve is a new kind of specialized cloud built specifically around video / audio understanding and generation use cases. We have a set of production-ready APIs available out of the box, and tooling that makes it easy to combine various models together in your apps — and share them with your team as an intuitive AI playground.
    artificial-intelligence
    video
  • FlowDeploy
    FlowDeploy (w2022) • Active • 3 employees • Mountain View, CA, USA
    FlowDeploy helps bioinformaticians manage their data analysis pipelines. We provide everything they need to try, run, develop, and share their pipelines. That includes integrations with AWS, Snakemake, Nextflow, GitHub, Slack, SSO, and more, as well as a clean API and web app for launching and monitoring pipelines and managing their data. FlowDeploy is built for bioinformaticians: it doesn't restrict how pipelines are built and managed, as long as a bioinformatics workflow manager like Nextflow or Snakemake is used. But it does eliminate several footguns like idle spend and accidental data egress, and it reduces the potential for users accidentally sharing credentials. FlowDeploy runs the pipelines in either our managed cloud or the customer's cloud, eliminating the need to transfer data externally. Non-computational biologists can use FlowDeploy, too: features like pipeline templates make launching a new pipeline less complex, which reduces user error and decreases the need for advanced cloud training for non-computational users.
    developer-tools
    drug-discovery
    data-engineering
  • Hydra
    Hydra (w2022) • Active • 6 employees • San Francisco, CA, USA
    Open source Snowflake alternative. Query billions of rows instantly on column-oriented Postgres. Hydra is available as open source, as a managed cloud, or deployed in the customer's own cloud infrastructure. Get parallelized analytics in minutes with no code changes.
    developer-tools
    analytics
    open-source
    data-engineering
  • Eventual
    Eventual (w2022) • Active • 5 employees • San Francisco, CA, USA
    Eventual is a data warehouse for ML/AI. We are building an integrated development experience for data scientists and engineers to build ML/AI applications which query and process multimodal data (images, video, audio and 3D scans). Daft (https://www.getdaft.io) is our open-sourced distributed data query engine. It has a Python Dataframe API and is built in Rust, providing both ease of use and performance for use-cases such as ML data engineering, training data ingestion and general data analytics. We are funded by investors such as Caffeinated Capital, Array.vc and top angels in the valley from Databricks, Meta and Lyft. Our team has deep expertise in high performance computing, big data technologies, cloud infrastructure and machine learning.
    computer-vision
  • Bitnami (w2013) • Acquired • 75 employees • San Francisco, CA, USA
    Packaged Applications for Any Platform. Bitnami has automated the ability to package, deploy and maintain applications, lowering the barrier to adoption for anyone to deploy and maintain a full spectrum of server applications, development stacks and infrastructure applications in virtually any format. Simply click to deploy any one of 120+ ready-to-run applications from the Bitnami Application Catalog or use Bitnami Stacksmith to package and migrate your custom in-house applications to the cloud. For more information, visit www.Bitnami.com, or follow us on Twitter (@Bitnami) and Facebook.
    developer-tools
  • Armory
    Armory (w2017) • Acquired • 90 employees • San Mateo, CA, USA
    Your software is your competitive advantage, and your customer experience is everything. Armory Continuous Deployment solutions empower you and your teams to confidently deploy every application, every time, safely and securely, so you can accelerate your time-to-market, increase stability, and decrease customer-impacting issues.
    developer-tools
    devsecops
  • Peer5
    Peer5 (w2017) • Acquired • 11 employees • Palo Alto, CA, USA
    Peer5's P2P CDN improves content delivery for live and on-demand video streams. As video streaming continues to become more mainstream and the streams themselves become HD and 4K, traditional CDNs struggle to deliver high-quality viewing experiences. Many publishers try to keep up with peak demand by paying for more servers, many of which are almost never used and struggle with video playback. Not only is this expensive, but it's also inefficient, which is why at Peer5, we've created a P2P CDN to boost your video streaming capabilities. Our CDN works with your servers to ensure that as your viewership increases, your network gets stronger by offloading bandwidth to our peer network. On average, this has been shown to increase server capacity by 20x, offload 70% of bandwidth, increase viewing time by 30%, and decrease instances of rebuffering by 40%. Our video streaming CDN uses emerging technologies, including HTML5 and WebRTC. They require no installation by end users and offer easy integration for content providers. We support many different streaming protocols, players, and media servers.
    video
  • Satsuma
    Satsuma (s2021) • Acquired • 5 employees • San Francisco, CA, USA
    Satsuma is a developer tool for building applications on top of real-time blockchain data. Our product lets developers take decoded data from multiple chains, customize it for their use cases, and access it through API endpoints. Blockchains serve as distributed databases for these products, holding their most important data. However, it’s difficult to access and query that data. We believe this friction is an enormous blocker for web3 developers and that better tooling will enable mass adoption for web3. We’re a founding team of engineers, having built data infrastructure and product as early employees at Airtable, Heap, and Y Combinator.
    developer-tools
    saas
    crypto-web3
    data-engineering