Datafold

Deal with analytical data quality in the pull request

Senior Software Engineer

$175K - $230K
Location
US / Remote (US)
Job Type
Full-time
Experience
3+ years
Connect directly with founders of the best YC-funded startups.
Apply to role ›
Alex Morozov
Alex Morozov
Founder

About the role

About the role

As a Senior Software Engineer, you will be contributing to complex and innovative features in the Datafold application as well as help build the foundation of the Datafold platform. 

Datafold empowers its users – data engineers and analysts –  by abstracting them from the complexity of their data and automating the most tedious tasks. That requires us to solve various challenging problems under the hood.

Projects you will work on could include:

  • Developing API endpoints to implementing sophisticated features that may involve:

  • Evolving our cloud infrastructure in conjunction with the backend to make sure our product is maintainable, scalable, performant and reliable.

  • Logging into Linux systems to resolve problems, analyze performance issues.

You will partner closely with the CTO, Head of Engineering and product team to scope projects, and collaborate with other members of the engineering team to get projects shipped and solve our customers’ problems. Datafold’s frontend is written in React/TypeScript, our backend in Python, and we use PostgreSQL, ClickHouse, and Redis as datastores. Our product is hosted on AWS and Google Cloud.

About you

We are looking for engineers who have:

  • 4+ years of experience building web application backends and the infrastructure on which they run

  • Working knowledge of Python.

  • Working knowledge of either AWS or GCP.

  • Understanding of basic Devops & Linux skills

  • Worked with and knowledge of deployment frameworks like Terraform.

  • Working knowledge on Linux and docker administrative commands

  • Willingness and ability to learn the other technologies we are using.

  • Track record of taking ownership and delivering challenging projects

  • Ability to make technical and architectural decisions collaboratively

  • Affinity to high quality standards in code, architecture, and processes

  • Strategic approach to decision making with an ability to consider business context and needs of the users when making decisions to balance velocity, quality and impact.

  • Interest in the data domain and building tools that empower people.

Datafold is an equal opportunity employer and does not discriminate against any employee or applicant for employment based on race, color, religion, sex, national origin, age, disability, genetic information, sexual orientation, gender identity, marital status, military status, or any other protected characteristic. We are committed to providing equal employment opportunities to all individuals. We strive to create an inclusive and diverse work environment where all employees are valued and unique perspectives are respected and celebrated.

About Datafold

About Datafold

At Datafold, we build tools for data practitioners to automate the most error-prone and time-consuming parts of the data engineering workflow: testing data to guarantee its quality. While data quality (just like software quality) is a complex and multifaceted problem, we draw from decades of our team’s combined experience in the data domain to build opinionated tools our users love. Specifically, we believe that:

Data quality is a byproduct of a great data engineering workflow. That means, rather than building yet-another-app for data practitioners to switch to and from, we insert our tools in the existing workflows, for example, in CI/CD for deployment testing and IDEs for testing during development.

Data quality issues should be addressed before deploying the code. Most data quality issues are bugs in the code that processes data, and applying a proactive, shift-left approach is the most effective way to achieve high shopping velocity and data quality simultaneously. Read more

Lack of metadata (data about data) is the biggest gap in the data engineering workflow. We bring powerful tools such as data diffing and column-level lineage to every data engineer’s workflow to help them validate the code and underlying data and fully understand the dependencies in complex data pipelines.

Datafold is used by data teams at Patreon, Thumbtack, Substack, Angellist, among others, and raised $22M from YC, NEA & Amplify Partners.

Datafold
Founded:2020
Team Size:24
Location:New York
Founders
Gleb Mezhanskiy
Gleb Mezhanskiy
Founder
Alex Morozov
Alex Morozov
Founder