DATA ENGINEER, LLM

Genentech, Inc.

Job Description

The Engineering team within the Prescient Design group is looking for exceptional data engineers to design and build the data infrastructure foundation for our molecule design system. The data platform will play a key role in Prescient Design’s success. We are looking for someone with a background in software engineering, a passion for technical problem-solving, and a proven ability to build data infrastructure and pipelines for modeling.

The Data Engineer will work closely with ML researchers to enable end-to-end machine learning data workflows across various modeling efforts, including the training of protein language models, and will interface with other teams within Genentech Research and Early Development (gRED).

THE ROLE

  • Enable cutting-edge research in machine learning and applications to drug discovery, design, and development through the collection, design, and management of data pipelines and infrastructure.
  • You will collaborate closely with cross-functional teams across both Prescient Design and gRED to solve complex problems in the life sciences, including understanding and analyzing algorithm issues.
  • You will interface with other teams at gRED in developing a common data architecture and model and in formalizing best practices.
  • You will be expected to help develop, manage, and scale data pipelines and infrastructure for analysis and modeling in production.
  • You will be expected to solve core engineering challenges including the design and implementation of stable data architecture and models.
  • You will be expected to serve as an expert and resource for multiple, diverse groups at Prescient Design and gRED.

QUALIFICATIONS

  • B.S., M.S., or Ph.D. in Computer Science, Statistics, Applied Mathematics, Computational Biology, Physics, a related technical field, or equivalent practical experience.
  • At least one year of relevant work experience.
  • Advanced programming skills in languages like C++, Python, Java, Scala, or SQL.

PREFERRED QUALIFICATIONS

  • Experience with cloud computing and infrastructure including Amazon Web Services (AWS) and distributed computing libraries like Spark, Hive, Impala, and Kafka.
  • Experience with data modeling and schema design, including databases and file systems for scientific data.
  • Experience with containerization and orchestration tools like Docker, Singularity, Airflow, Luigi, and Kubernetes.
  • Experience developing and maintaining codebases and software libraries, following industry best practices.
  • Experience with CI/CD and automation tools like Terraform, CloudFormation, Jenkins, and Ansible.
  • Experience with tools and platforms for MLOps like Weights & Biases.
  • Intense curiosity about the biology of disease and eagerness to contribute to scientific and computational efforts.

gCS

Who We Are

Genentech, a member of the Roche group and founder of the biotechnology industry, is dedicated to pursuing groundbreaking science to discover and develop medicines for people with serious and life-threatening diseases. To solve the world's most complex health challenges, we ask bigger questions that challenge our industry and the boundaries of science to transform society. Our transformational discoveries include the first targeted antibody for cancer and the first medicine for primary progressive multiple sclerosis.

Genentech is an equal opportunity employer & prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity/expression, national origin/ancestry, age, disability, marital & veteran status. For more information about equal employment opportunity, visit our Genentech Careers page.

The expected salary range for this position, based on the primary location of New York City, is $125,000 - $232,100. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided.

Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.

Company Info.

Genentech, Inc.

Genentech, Inc. is a biotechnology company based in South San Francisco, California. It was founded in 1976 by venture capitalist Robert A. Swanson and biochemist Dr. Herbert W. Boyer. Genentech is considered one of the pioneers in the biotechnology industry and has been instrumental in the development of groundbreaking pharmaceutical products.

  • Industry
    Biotechnology Research
  • No. of Employees
    12,500
  • Location
    South San Francisco, CA, USA