Data Engineering Startups funded by Y Combinator (YC) in New York 2026

April 2026

Browse 11 of the top Data Engineering startups funded by Y Combinator. Headquartered in New York, these are some of the hottest and fastest-growing startups. This doesn't include all companies originally founded in New York or by founders from there.

We also have a Startup Directory where you can search through over 5,000 companies.

  • sieve
    sieve
    Y Combinator LogoP2025
    Active • 2 employees • New York, NY, USA
    sieve solves data cleaning for hedge funds and investment firms by letting them get clean data in four lines of code. Currently, their data pipelines have conditions that raise for human review, which literally send an email to engineers with data that needs to be reviewed. We provide an API that integrates directly into their existing pipeline - instead of raising for human review, they can send all the same information to our API and get clean, high-quality data back. By using our AI agents built specifically for financial data collection, along with expert-in-the-loop review, we provide our clients with clean, validated data at a scale and level of quality that wasn't achievable before.
    investing
    data-engineering
    apis
  • Melder
    Melder
    Y Combinator LogoF2024
    Active • 2 employees • New York, NY, USA
    Melder is an Excel add-in that brings AI functions and document support into your spreadsheets. Upload files directly into cells, use smart formulas like =GEN, and build automations—all without leaving Excel. Core features: - File-to-Sheet: Drop PDFs directly into cells, then reference them in formulas. - AI-Powered Functions: Write formulas like =GEN() or =EXTRACT() to summarize, classify, and analyze content. - Chat Assistant: Use our AI assistant to help build your sheets or answer questions from your data, live in the workbook. Business users use Melder to: - Accelerate diligence by extracting insights from data rooms - Review contracts by identifying key terms and clauses instantly - Run market research by pulling information from competitor websites - Synthesize transcripts by generating summaries from interviews and calls Melder brings the power of structured spreadsheet logic to the messy, unstructured data world—no coding needed.
    artificial-intelligence
    generative-ai
    data-engineering
  • authzed
    authzed
    Y Combinator LogoW2021
    Active • 31 employees • New York, NY, USA
    We build the tools companies need to provide performant and scalable authorization for their applications. We’re founded by 3 successful entrepreneurs with expertise in enterprise software, most recently as leaders at Red Hat. Jake and Joey met on the APIs team at Google in 2010. They went on to found Quay, where Jimmy joined as their first hire. Over the past decade, they’ve changed the landscape for building and deploying software.
    developer-tools
    saas
    security
    open-source
    data-engineering
  • Prequel
    Prequel
    Y Combinator LogoW2021
    Active • 9 employees • New York, NY, USA
    Prequel makes it easy for companies to share data with their customers. It helps you export data directly to your customer's Snowflake, Redshift, BigQuery, Databricks, or other data warehouse on an ongoing basis.
    saas
    analytics
    data-engineering
  • Datafold
    Datafold
    Y Combinator LogoS2020
    Active • 30 employees • New York, NY, USA
    Datafold automates manual work in data engineering. We leverage agentic AI to automate both day-to-day tasks, such as testing and code reviews, and massive one-off projects, such as data platform code migrations. Companies from Perplexity to Disney use Datafold to unlock more value from their data by freeing up their data teams from manual work, accelerating developer velocity, and ensuring data quality.
    saas
    analytics
    data-engineering
    ai
  • Dataland
    Dataland
    Y Combinator LogoS2020
    Active • 2 employees • New York, NY, USA
    Dataland is the applied AI lab for complex operations and customer support. In 8 months, we've grown to multi-million dollar ARR as just two founders and are highly profitable. We’re partnering with some of the fastest growing startups and public companies in the world.
    b2b
    data-engineering
    data-visualization
    ai
  • Operator.io
    Operator.io
    Y Combinator LogoW2020
    Active • 6 employees • New York, NY, USA
    Operator helps you train autonomous agents until they actually work. Describe what you need, run variants, then deploy them.
    generative-ai
    data-engineering
  • Tarsal
    Tarsal
    Y Combinator LogoS2021
    Acquired • 10 employees • New York, NY, USA
    Tarsal is a data pipeline custom built for security teams. As security data grows 25% year over year, security teams desperately need access to best-in-class data infrastructure. Tarsal bridges the gap between the modern data stack and security teams, pioneering the modern security data stack.
    b2b
    cybersecurity
    big-data
    data-engineering
  • Bracket
    Bracket
    Y Combinator LogoW2022
    Acquired • 3 employees • New York, NY, USA
    Bracket is the two-way data pipeline between popular business tools and backend databases. When ops teams update data in Salesforce or Airtable, and engineers update data in the database, Bracket connects the two sources to reflect the same information.
    saas
    b2b
    data-engineering
  • Avenue
    Avenue
    Y Combinator LogoW2021
    Acquired • 8 employees • New York, NY, USA
    Avenue is a simple way for business teams to set up alerts from their database or data warehouse. Think Datadog / PagerDuty for operations teams. Operations teams create set-and-forget alerts on all their data, so they can be more proactive with their time (and monitor on more nuanced triggers than just what fits on their dashboard page). Avenue can improve response times to critical problems from several days to real-time by alerting directly on the data sources that customers already use.
    developer-tools
    saas
    data-engineering
  • Yhat
    Y Combinator LogoW2015
    Acquired • 17 employees • Brooklyn, NY, USA
    Yhat (YC W15, pronounced y-hat) was an end-to-end data science platform. Acquired by Alteryx (NYSE:AYX)
    artificial-intelligence
    machine-learning
    enterprise
    data-engineering