Data Engineering Startups funded by Y Combinator (YC) in New York 2024

May 2024

Browse 12 of the top Data Engineering startups funded by Y Combinator. Headquartered in New York, these are some of the hottest and fastest-growing startups.

We also have a Startup Directory where you can search through over 5,000 companies.

  • Narrator
    Narrator (s2019)Active • 8 employees • New York, NY, USA
    Narrator is an end-to-end platform built on top of the data standard, the Activity Schema and starting at $500/mo. Data analyst are able to build their definitions of their user journey, and use that journey to answer any question that comes up. From there, data can be visualized in a dashboard, used to build a story like analysis, exported and more. The biggest values of Narrator is Speed and Cost reduction. Small teams are able to move fast and answer questions in minutes allowing them to preform the work of very large data teams. All while Narrator is optimized to minimize compute cost of the warehouse.
    analytics
    big-data
    data-engineering
  • Gecko Robotics
    Gecko Robotics (w2016)Active • 230 employees • Austin, TX, USA
    The mission of Gecko Robotics is to improve the state of the world by helping the most important institutions ensure the availability, reliability and sustainability of critical infrastructure. Gecko's combination of wall-climbing robots, industry-leading sensors, and an AI-powered data platform give customers a unique window into the health of their physical assets allowing real-time decisions that prevent power outages, ensure military missions succeed, and help reduce energy costs.
    robotics
    energy
    big-data
    data-engineering
    ai
  • authzed
    authzed (w2021)Active • 17 employees • New York, NY, USA
    We build the tools companies need to provide performant and scalable authorization for their applications. We’re founded by 3 successful entrepreneurs with expertise in enterprise software, most recently as leaders at Red Hat. Jake and Joey met on the APIs team at Google in 2010. They went on to found Quay, where Jimmy joined as their first hire. Over the past decade, they’ve changed the landscape for building and deploying software.
    developer-tools
    saas
    security
    open-source
    data-engineering
  • Spruce Systems
    Spruce Systems (w2021)Active • 25 employees • New York, NY, USA
    Spruce lets users control their data across the web. We believe that the world is evolving toward one based on cryptography, networks, and digital economies that are user-controlled. Today, the dominant use case for user keys is the signing of blockchain transactions, but we think this barely scratches the surface of what is possible. Soon, the entirety of a user’s digital interactions will be based on their keypairs, and we’re unlocking this transition with our constellation of products. We are passionate about cultivating a thriving culture of diverse individuals who bring unique perspectives to our mission. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.
    crypto-web3
    identity
    open-source
    privacy
    data-engineering
  • Datafold
    Datafold (s2020)Active • 24 employees • New York, NY, USA
    Datafold exists to make working with data more enjoyable and productive. We are all about empowering data and analytics engineers. We find the most tedious, error-prone, and repetitive tasks and create tools to automate them. We make the world better by giving superpowers to data professionals who solve hard problems in various domains with data.
    saas
    analytics
    data-engineering
  • Dataland
    Dataland (s2020)Active • 2 employees • New York, NY, USA
    Dataland is the easiest way to deliver high-quality internal tools to your business users. It's secure, easy-to-use, and sets up in minutes. Dataland uses GenAI to enable business users to construct their own internal tools without blocking on engineering.
    saas
    b2b
    data-engineering
  • Avenue
    Avenue (w2021)Active • 8 employees • New York, NY, USA
    Avenue is a simple way for business teams to set up alerts from their database or data warehouse. Think Datadog / PagerDuty for operations teams. Operations teams create set-and-forget alerts on all their data, so they can be more proactive with their time (and monitor on more nuanced triggers than just what fits on their dashboard page). Avenue can improve response times to critical problems from several days to real-time by alerting directly on the data sources that customers already use.
    developer-tools
    saas
    data-engineering
  • Prequel
    Prequel (w2021)Active • 9 employees • New York, NY, USA
    Prequel makes it easy for companies to share data with their customers. It helps you export data directly to your customer's Snowflake, Redshift, BigQuery, Databricks, or other data warehouse on an ongoing basis.
    saas
    analytics
    data-engineering
  • violet
    violet (s2019)Active • 8 employees • New York, NY, USA
    be intentional with your time
    artificial-intelligence
    data-engineering
    ai-assistant
  • Operator Labs
    Operator Labs (w2020)Active • 6 employees • New York, NY, USA
    Toolkit for connecting AI agents to the decentralized web
    generative-ai
    crypto-web3
    data-engineering
  • Lariat Data
    Lariat Data (s2021)Active • 3 employees • New York, NY, USA
    Lariat is a Continuous Data Quality monitoring platform to discover data bugs before your consumers do. Ensure data products don’t break even as business logic, input data and infrastructure change. Use Lariat to define and then automatically extract, store and visualize data quality metrics on raw event-level data through to delivered data products.
    machine-learning
    big-data
    data-engineering
  • Yhat (w2015)Acquired • 17 employees • Brooklyn, NY, USA
    Yhat (YC W15, pronounced y-hat) was an end-to-end data science platform. Acquired by Alteryx (NYSE:AYX)
    artificial-intelligence
    machine-learning
    enterprise
    data-engineering