Digital Lead Computational Chemistry and AI

Novo Nordisk A/S
Apply Now

Job Description

The Multi-modal Representation Learning scientists in Machine Intelligence focus on the analysis of various biological data, such as transcriptomic and genomic data, protein sequence and structure, images, and more. We specialize in representation learning, multi-modal diffusion models, and decoder-only generative models to integrate different data modalities and enhance our understanding and predictions. We collaborate closely with our experimental colleagues in early research to improve and accelerate the design of new drug candidates. With a team of scientists located in multiple countries, we strive to deliver the latest advancements in multi-modal representation learning to benefit patient health and global medicine.

The Position

We are seeking an exceptional Multi-modal Representation Learning Scientist to join our team as a Senior Data Scientist and contribute to cutting-edge research in the analysis of biological data and integration of different data modalities. As an expert in representation learning, multi-modal diffusion models, and decoder-only generative models, you will bring your expertise in training deep learning models on diverse biological data and connecting them to other data modalities. In this role, you will work within drug development projects in early research, supporting the development of new drug candidates and the initiation of new drug pipeline projects. This requires close collaboration with other research scientists in multidisciplinary teams and solving ambitious scientific goals. The position also requires close collaboration with our data science and machine learning colleagues across global sites and contributing to the onboarding and development of new computational drug design methods.

Relationships

The Multi-modal Representation Learning Scientist reports to the head of Machine Intelligence. Internal partners include data scientists, specialists, engineers, software developers, technology scouts & partnership developers, system engineers, designers, anthropologists, medical doctors, IT professionals, and others across the US and other countries. External relationships include commercial and academic collaboration partners.

Essential Functions

  • Develop state-of-the-art deep learning models for the analysis of various biological data types, such as transcriptomic and genomic data, protein sequence and structure, and images.
  • Apply representation learning, multi-modal diffusion models, and decoder-only generative models to integrate different data modalities, perform in silico experiments, and enhance the analysis of biological data.
  • Collaborate with multidisciplinary teams to design and execute computational experiments that support drug development/screening projects in early research.
  • Stay current with the latest research and advancements in deep learning, representation learning, and multi-modal data integration.
  • Communicate findings and results to stakeholders through presentations, reports, and scientific publications.

Physical Requirements

Up to 10% overnight travel required.

Qualifications

  • Master’s degree, or PhD is preferred. Bachelor’s degree required.
  • A master’s degree with 3+ years’ relevant experience, or PhD with little to no years’ relevant experience, OR bachelor’s degree with 5+ years’ relevant experience can be considered.
  • Background in deep learning, representation learning, and multi-modal data integration techniques.
  • Experience with training deep learning models on diverse biological data, such as transcriptomic and genomic data, protein sequence and structure, and images.
  • Experience with representation learning and generative models.
  • Proficiency in Python and experience with the PyTorch deep learning libraries such as Torchvision, Lightning, Hugging Face, etc.
  • Experience with modern version control and continuous integration/testing systems, such as GitLab/GitHub.
  • Excellent problem-solving skills and the ability to work independently and in a team environment.
  • Strong written and verbal communication skills.

Preferred Qualifications

  • Experience with gene knockdown or drug perturbation experiments and their relation to multi-modal data integration.
  • Experience in integrating and analyzing multi-modal biological data, such as single-cell RNA sequencing or protein sequences, with representation learning and generative models.
  • Familiarity with high-content screening and/or high-throughput data generation methods in the context of multi-modal data analysis.
  • Experience in writing well-tested and documented code, adhering to best practices in software development, particularly in the context of machine learning and representation learning.
  • Experience working with cloud compute services such as AWS, Azure, or Nvidia DGX Cloud for large-scale deep learning model training and deployment.

To apply for this position, please submit your CV, cover letter, and a list of relevant publications. We look forward to reviewing your application and exploring the potential for you to contribute to our dynamic and innovative team.

We commit to an inclusive recruitment process and equality of opportunity for all our job applicants.

At Novo Nordisk we recognize that it is no longer good enough to aspire to be the best company in the world. We need to aspire to be the best company for the world and we know that this is only possible with talented employees with diverse perspectives, backgrounds and cultures. We are therefore committed to creating an inclusive culture that celebrates the diversity of our employees, the patients we serve and communities we operate in. Together, we’re life changing.

Novo Nordisk is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, gender identity, sexual orientation, national origin, disability, protected veteran status or any other characteristic protected by local, state or federal laws, rules or regulations.

This position is part of a job family. As a result, this position can be filled with varying job titles depending on experience. The ranges for the job titles are the following - 

  • Data Scientist I/II - 105,000 - 136,000
  • Sr. Data Scientist I/II - $125,000 - 160,000

Base compensation is determined based on a number of factors. In addition, this position is part of the Annual Performance Incentive Plan. The role may also be eligible for a long-term incentive bonus depending on level and other Company factors.

Company Info.

Novo Nordisk A/S

Novo Nordisk A/S is a Danish multinational pharmaceutical company headquartered in Bagsværd, Denmark, with production facilities in eight countries, and affiliates or offices in five countries.

  • Industry
    Pharmaceuticals,Healthcare
  • No. of Employees
    48,478
  • Location
    2880 Bagsværd, Denmark
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

Novo Nordisk A/S is currently hiring Data Scientist Jobs in Seattle, WA, USA with average base salary of $125,000 - $160,000 / Year.

Similar Jobs View More