Pachyderm: Data Versioning, Data Pipelines, and Data Lineage

Python JupyterHub Developer at Pachyderm

San Francisco Bay Area OR Remote / Remote
Job Type
3+ years
Joe Doliner

About the role

About Pachyderm

At Pachyderm, we're building an open-source, enterprise-grade data science platform that lets you deploy and manage multi-stage, language-agnostic data pipelines while maintaining complete reproducibility and provenance. Our system, developed with open-source roots, shifts the paradigm of data science workflows by providing reproducibility, data provenance, and the opportunity for true collaboration. Pachyderm uses modern technologies like Docker and Kubernetes to build an entirely new method of analyzing data. Offered both as an in-house solution and as a hosted service, Pachyderm brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines, while empowering users to use any language, framework, or tool they want. If you want to learn more about our grand vision, read what has become our "manifesto."

Pachyderm is a rapidly growing, early-stage company funded by top VCs: Benchmark, Decibel, M12, and Y Combinator. Like many modern companies, Pachyderm embraces a “Remote-first” approach to growing our team. It gives us a huge advantage in hiring top, diverse talent across the country while giving our team members the flexibility to work from anywhere.

Our product is open source, so you can check it out on GitHub, and you can try our cloud service for free.

The Role

Pachyderm is looking for a Python & JupyterHub developer to join the team building our data science platform.

You'll be working closely with our design & product team to implement new features. Your main task will be integrating Pachyderm with JupyterHub. You will also be part of the team working on several exciting integrations with other tools and platforms. As an early member of the team, you will have a ton of ownership and impact on the product direction. You'll be required to collaborate closely with other engineers and product leaders.

While your primary focus will be building the product, you’ll also have direct exposure to users and enterprise customers. At Pachyderm, feedback from open-source users and customers is a major driver of our product roadmap, and we believe that everyone within the company should experience that first-hand.

In short, if you're looking to make a big impact on a small team that works on open-source software and delivers an enterprise-grade product, then this role is for you.

We offer significant equity, full benefits, and all the usual startup perks.


3+ years of experience as a Python Developer
2+ years of experience as a JavaScript/TypeScript Developer
Experience designing interactive web applications using Python or JavaScript/TypeScript
Experience with data analysis and data visualization in Python
Experience with widgets and interactive visualizations in Jupyter
Experience writing Jupyter Notebook and/or JupyterLab extensions
Familiarity with Docker, Kubernetes, and AWS
Bonus: Experience deploying and customizing JupyterHub
Bonus: Knowledge of TypeScript and AngularJS

Why you should join Pachyderm