Datacurve

Frontier coding data for training and evaluating LLMs

Software Engineer - Full Stack

$120K - $180K / 0.50% - 2.00%
Location: San Francisco, CA, US
Job Type: Full-time
Experience: Any (new grads ok)
Serena Ge
Founder

About the role

About Datacurve

Datacurve supplies frontier coding data to top AI labs and enterprises to train and evaluate the next generation of coding LLMs. Datacurve scaled from $0 to multiple seven figures of ARR in 6 months with a team of 3 people, and it continues to grow rapidly.

Who you are

  • You’re a scrappy, hacker-type engineer who loves to ship and ships fast
  • You’re excited to take ownership and have a strong desire to create impact
  • You have a strong work ethic and work well under pressure

More on Datacurve’s mission

A lack of abundant post-training data is one of the biggest bottlenecks to achieving autonomous software engineers (SWEs) and to breaking through plateaus in coding LLMs’ capabilities. We built a gamified coding platform that attracts skilled engineers from all over the world to produce high-quality data.

Our goal is to enable the next generation of coding LLMs at the foundation-model level through an abundance of quality data. We will create a future where coding LLMs aren’t just productivity-boosting devtools but are capable of giving anyone in any industry the power to build and engineer solutions.

Who we are

Datacurve was founded by Waterloo CS dropouts. Serena (CEO) interned at Cohere with the CTO and pioneered early coding and synthetic data work there. Charley (CTO) interned at Google before dropping out. The best way to describe what it’s like to work here is a long hackathon with friends. We are extremely ambitious. We don’t care, we will just do it. Join us.

Backed by Y Combinator, Afore Capital, Pioneer Fund, Amjad Masad (Replit), Oriol Vinyals (Gemini Technical Lead), Cohere, and Vercel.

About the interview

  1. Short 15-minute call to determine vibe fit
  2. (Optional) Short 15-20 minute technical screen
  3. Take-home coding project
  4. Live coding interview (1-2 hours)
  5. Onsite

About Datacurve

Datacurve (YC W24) is a platform that produces high-quality coding data for foundation model companies. Datacurve's gamified coding platform pays elite engineers to solve fun LeetCode-style problems. Customers buy data from Datacurve to train better LLMs.

Datacurve
Founded: 2024
Team Size: 4
Location: San Francisco
Founders
Serena Ge
Founder
Charley Lee
Founder