We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
Remote New

Data Scientist, NLP

Datavant
United States
Dec 09, 2025

Datavant is a data platform company and the world's leader in health data exchange. Our vision is that every healthcare decision is powered by the right data, at the right time, in the right format.

Our platform is powered by the largest, most diverse health data network in the U.S., enabling data to be secure, accessible and usable to inform better health decisions. Datavant is trusted by the world's leading life sciences companies, government agencies, and those who deliver and pay for care.

By joining Datavant today, you're stepping onto a high-performing, values-driven team. Together, we're rising to the challenge of tackling some of healthcare's most complex problems with technology-forward solutions. Datavanters bring a diversity of professional, educational and life experiences to realize our bold vision for healthcare.

We are looking for a motivated Data Scientist to help Datavant revolutionize the healthcare industry with AI. This is a critical role where the right candidate will have the ability to work on a wide range of problems in the healthcare industry with an unparalleled amount of data.

You'll join a team focused on deep medical document understanding, extracting meaning, intent, and structure from unstructured medical and administrative records. Our mission is to build intelligent systems that can reliably interpret complex, messy, and high-stakes healthcare documentation at scale.

This role is a unique blend of applied machine learning, NLP, and product thinking. You'll collaborate closely with cross-functional teams to:



  • Design and develop models to extract entities, detect intents, and understand document structure
  • Tackle challenges like long-context reasoning, layout-aware NLP, and ambiguous inputs
  • Evaluate model performance where ground truth is partial, uncertain, or evolving
  • Shape the roadmap and success metrics for replacing legacy document processing systems with smarter, scalable solutions


We operate in a high-trust, high-ownership environment where experimentation and shipping value quickly are key. If you're excited by building systems that make healthcare data more usable, accurate, and safe, please reach out.

Qualifications



  • 3+ years of experience with data science and machine learning in an industry setting, particularly in designing and building NLP models.
  • Proficiency with Python
  • Experience with the latest in language models (transformers, LLMs, etc.)
  • Proficiency with standard data analysis toolkits such as SQL, Numpy, Pandas, etc.
  • Proficiency with deep learning frameworks like PyTorch (preferred) or TensorFlow
  • Industry experience shepherding ML/AI projects from ideation to delivery
  • Demonstrated ability to influence company KPIs with AI
  • Demonstrated ability to navigate ambiguity


Bonus Experience



  • Experience with document layout analysis (using vision, NLP, or both).
  • Experience with Spark/PySpark
  • Experience with Databricks
  • Experience in the healthcare industry


Responsibilities



  • Play a key role in the success of our products by developing models for document understanding tasks.
  • Perform error analysis, data cleaning, and other related tasks to improve models.
  • Collaborate with your team by making recommendations for the development roadmap of a capability.
  • Work with other data scientists and engineers to optimize machine learning models and insert them into end-to-end pipelines.
  • Understand product use-cases and define key performance metrics for models according to business requirements.
  • Set up systems for long-term improvement of models and data quality (e.g. active learning, continuous learning systems, etc.).


After 3 Months, You Will...



  • Have a strong grasp of technologies upon which our platform is built.
  • Be fully integrated into ongoing model development efforts with your team.


After 1 Year, You Will...



  • Be independent in reading literature and doing research to develop models for new and existing products.
  • Have ownership over models internally, communicating with product managers, customer success managers, and engineers to make the model and the encompassing product succeed.
  • Be a subject matter expert on Datavant's models and a source from which other teams can seek information and recommendations.


#LI-BC1

We are committed to building a diverse team of Datavanters who are all responsible for stewarding a high-performance culture in which all Datavanters belong and thrive. We are proud to be an Equal Employment Opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.

At Datavant our total rewards strategy powers a high-growth, high-performance, health technology company that rewards our employees for transforming health care through creating industry-defining data logistics products and services.

The range posted is for a given job title, which can include multiple levels. Individual rates for the same job title may differ based on their level, responsibilities, skills, and experience for a specific job.

The estimated total cash compensation range for this role is:
$136,000 $170,000 USD

To ensure the safety of patients and staff, many of our clients require post-offer health screenings and proof and/or completion of various vaccinations such as the flu shot, Tdap, COVID-19, etc. Any requests to be exempted from these requirements will be reviewed by Datavant Human Resources and determined on a case-by-case basis. Depending on the state in which you will be working, exemptions may be available on the basis of disability, medical contraindications to the vaccine or any of its components, pregnancy or pregnancy-related medical conditions, and/or religion.

This job is not eligible for employment sponsorship.

Datavant is committed to a work environment free from job discrimination. We are proud to be an Equal Employment Opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.To learn more about our commitment, please review our EEO Commitment Statement here. Know Your Rights, explore the resources available through the EEOC for more information regarding your legal rights and protections. In addition, Datavant does not and will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay.

At the end of this application, you will find a set of voluntary demographic questions. If you choose to respond, your answers will be anonymous and will help us identify areas for improvement in our recruitment process. (We can only see aggregate responses, not individual ones. In fact, we aren't even able to see whether you've responded.) Responding is entirely optional and will not affect your application or hiring process in any way.

Datavant is committed to working with and providing reasonable accommodations to individuals with physical and mental disabilities. If you need an accommodation while seeking employment, please request ithere, by selecting the 'Interview Accommodation Request' category. You will need your requisition ID when submitting your request, you can find instructions for locating it here. Requests for reasonable accommodations will be reviewed on a case-by-case basis.

For more information about how we collect and use your data, please review our Privacy Policy.

Applied = 0

(web-df9ddb7dc-h6wrt)