Intelligent automation in Oncology Research and Care

Lead Research Scientist, NLP

$350K - $450K / 0.10% - 0.40%
San Francisco, CA, US / Remote (US)
Job Type
3+ years

About the role

We are building, from the ground up, the technology to activate electronic health record (EHR) data for clinical trials. Billions of dollars are spent every year to parse EMR data, interpret it, and enter it into forms with discrete fields so that researchers can analyze patient journeys. Triomics is pioneering a platform that automates this process, and we are looking for a natural language processing researcher to build this technology. If you want to solve decades of inefficiencies in clinical research with some of the latest tools at your disposal, this role is cut out for you.

Our mission involves constructing natural language processing solutions from the ground up to tackle the untapped reservoir of patient data currently locked within EMR notes. At Triomics, we're bullish on the potential of large language models as the solution to unlock this goldmine of information.

Our team is led by young, highly optimistic founders who are former MIT and Adobe researchers, bringing a wealth of complementary expertise to the table. Since our founding two years ago, we've secured over $14 million in venture capital from investors like Lightspeed, Nexus, Y Combinator, and General Catalyst.

Key Responsibilities:

  1. Framing healthcare data abstraction problems as grounded natural language processing problems and reviewing the literature to make an informed choice of approach.
  2. Conducting daring, cutting-edge research on large language models, including training them from scratch for the healthcare domain.
  3. Exploring the potential of large language models as large-scale healthcare data mining solutions and comparing them to existing problem-specific models.
  4. Working closely with the CTO/Founders to build these solutions from scratch, comparing them to human performance and publishing results in top-tier conferences and journals.
  5. Working in tandem with AI engineers for the implementation of the solutions to these problems and helping them train/optimize/validate models for all the identified NLP problems.
  6. Presenting the capabilities of everything we build at top conferences in the form of publications, research talks, and industry sessions.
  7. Collaborating with the oncology informatics team, product team, and business team to prioritize the solutions and help software developers take these technologies live in products.

We are seeking individuals who possess the following qualities:

  1. Consistent track record of publishing at high-impact conferences: Publications at conferences such as ICLR, NeurIPS, ICML, ACL, and EMNLP.
  2. First-principles thinking ability: The ability to approach problems and situations using fundamental principles and ideas rather than relying on conventions and precedents.
  3. High level of ownership: A proactive approach to responsibilities and a strong sense of accountability.
  4. Aptitude and analytical skills: An ability to quickly understand complex concepts and apply problem-solving skills to find solutions.
  5. Passion for healthcare and research: A genuine interest in and enthusiasm for healthcare and research, as well as a commitment to advancing the field.


What we offer:

  1. Opportunities for career advancement to Chief AI Scientist or another leadership role within 12-18 months
  2. Performance-based increase in equity ownership
  3. 20% variable pay structure
  4. Comprehensive employee benefits including health insurance, 401(k) matching, and paid leave
  5. Highly competitive salary and stock options grant

About Triomics

Triomics is building the modern technology stack for clinical trial sites and investigators that unifies the workflows of clinical care and clinical research, moving the healthcare industry closer to the vision of Clinical Research as a Care Option. Our platform eliminates the operational inefficiencies in data collection, patient recruitment, and other laborious tasks involved in clinical research, thus enabling the generation of high-quality data and speeding up the time to market.

Team Size: 50
Location: San Francisco
Sarim Khan
Sajjan Rajpurohit
Hrituraj Singh