Job Description

Position Summary:

As a Data Science Intern, you'll be at the forefront of uncovering deep insights from text data. You'll use advanced machine learning technologies, specifically natural language processing techniques, to extract meaningful information from large and complex data sets. You'll have the opportunity to work with cloud-based data pipelines, a variety of analytical tools, and visualizations to deliver actionable insights and solutions in the healthcare industry. This internship is an opportunity to develop valuable data science skills while making a meaningful impact on healthcare.

General Duties/Responsibilities (May include but are not limited to):

  • Collaborate with key business leaders to understand their business problems and develop analytical solutions.
  • Understand healthcare domain concepts and build algorithms for them.
  • Work with large data sets and design analytical approaches using natural language processing techniques.
  • Apply coding skills and knowledge of data structures to develop projects in partnership with other scientists and engineers on the team.
  • Build end-to-end data science solutions that improve healthcare outcomes and reduce costs for our members.
  • Develop scalable and efficient machine learning and deep learning algorithms that can work in production systems.
  • Collaborate with the engineering team to build end-to-end cloud-based machine learning production pipelines.

Minimum Requirements:

To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Minimum Experience:

  • 2+ years of relevant experience in predictive modeling and natural language processing (can include academic research projects)


  • Current PhD student in Computer Science, Natural Language Processing, Mathematics, Statistics, or a related field, with a focus on machine learning and deep learning.


  • Solid data structures & algorithms background
  • Depth and breadth in state-of-the-art machine learning and deep learning technologies
  • Strong programming skills in Python or similar scripting language
  • Familiarity with natural language processing techniques, including text preprocessing, language modeling, sequence-to-sequence models, and word embeddings.
  • Ability to design, implement, and evaluate novel algorithms for natural language processing tasks such as text classification, sentiment analysis, language translation, and question answering.
  • Proficiency in open-source deep learning and machine learning libraries and toolkits

