What Is the Salary for a Data Scientist, Machine Learning at Insitro?

The average base salary for a Data Scientist, Machine Learning is $160,000 - $240,000 / Year at Insitro.

Data Scientist, Machine Learning Salary in South San Francisco, CA, USA

The average base salary for a Data Scientist, Machine Learning is $160,000 - $240,000 / Year in South San Francisco, CA, USA.

Technical Skills Required to Become a Data Scientist, Machine Learning?

To work as a Data Scientist, Machine Learning, you must have a degree in a relevant discipline, such as Computer Engineering, Computer Science, Data Science, Machine Learning, Mathematics, or Statistics, or a PhD degree.

Looking for a Data Scientist, Machine Learning in South San Francisco, CA, USA?

Insitro is currently hiring a Data Scientist, Machine Learning in South San Francisco, CA, USA, and is looking for candidates with relevant skills and 2-4 years of work experience.

Is Insitro hiring for Data Scientist, Machine Learning now?

Insitro seeks to hire a qualified Data Scientist, Machine Learning with 2-4 years of experience.

Posted on: 6 Dec 2022

(Senior) Clinical ML Data Scientist

Insitro

Job Type: Full Time
Experience: 2-4 years
Salary: $160,000 - $240,000 / Year
Location: South San Francisco, CA, USA
Job Function: Data Scientist, Machine Learning
Industry: Information Technology

Qualification

Degree in Computer Engineering
Degree in Computer Science
Degree in Data Science
Degree in Machine Learning
PhD Degree
Degree in Mathematics
Degree in Statistics

Key Skills

Java Programming, Python Programming, C++, C Programming, SQL, Cloud computing, Scala Programming, Machine learning techniques, Data science techniques, MATLAB Programming, PyTorch, TensorFlow, R Programming

Job Description

Key to insitro’s approach to rethinking drug development is leveraging disease models, genetics, and clinical datasets to link in vitro and cellular phenotypes with patient outcomes.

Multimodal clinical datasets are an essential component of modeling patient heterogeneity, disease progression, and phenotypic diversity. Our goal is to develop sophisticated models from patient clinical records, common lab biomarkers, and, when available, higher content multi-omic data to identify coherent patient segments to reveal novel genetic signals and opportunities for targeted therapies.

As a clinical machine learning data scientist, you will develop, productionize, and deploy cutting-edge ML approaches to analyze and integrate large-scale multi-modal phenotypic datasets, including electronic health records, physiological monitoring, longitudinal clinical data, diverse biomarker data, and multi-omic modalities. You will work with clinical data from large human cohorts such as randomized clinical trials, electronic health records, national biobanks, and other sources. You will contribute to developing models for understanding patient state and predicting outcomes and clinical endpoints from patient data. Via this collaborative effort, you will have the opportunity to contribute to developing models for understanding patient disease state and progression, predicting patient outcomes, identifying therapeutic targets, and developing drugs that have high efficacy and low toxicity.

In this role, your focus will be on developing end-to-end modeling capabilities for phenotypic clinical data. You will own the creation of scalable, reproducible pipelines that extract clinical data from diverse sources and normalize patient-level records into our standardized data schemas. You will also develop novel feature extractors that exploit the underlying clinical dataset structure, along with model architectures that generalize across held-out clinical trial sites and between datasets. The role will especially focus on developing multi-modal models, incorporating both longitudinal aspects of a patient's journey through various diagnostic ontologies, and progressively richer phenotypes as available in specific cohorts. You will work in collaboration with the software engineering team to ensure these pipelines are robust, reusable platform components that can be deployed on large-scale datasets in a portable way.

You will be joining a vibrant biotech startup that has long-term stability due to significant funding, yet is in a high-growth phase. A lot can change in this early and exciting phase, providing many opportunities for significant impact. You will work closely with a very talented team, learn a broad range of skills, and help shape insitro’s culture, strategic direction, and outcomes. Join us, and help make a difference to patients! This role is preferably based in the San Francisco Bay Area or Boston, but we are open to discussing other locations in the United States and the UK.

About You

Ph.D. in biomedical informatics, machine learning, computer science, or a related discipline, or equivalent practical experience (e.g., a Master's degree plus 2 years of relevant industry experience);
Demonstrated ability to use cutting-edge statistical and machine learning methods for analyzing clinical data;
Extensive hands-on experience working in several of the following areas: electronic health records; clinical trial data; disease progression modeling; multi-omic phenotypes; and biomedical or biophysical imaging modalities;
Demonstrated ability to rigorously identify and deal with confounders and complexities in human clinical data;
Experience using modern machine learning frameworks (PyTorch, JAX, XGBoost, etc.);
Proficiency in Python and working with large-scale clinical data;
Ability to communicate effectively and collaborate with people of diverse backgrounds and job functions;
Passion for making a difference in the world.

Nice to Have

Experience in probabilistic modeling and/or causal inference;
Experience working on decision making under uncertainty;
Experience working with EHR linked with genomic/molecular data;
Experience with genetic analyses (e.g., GWAS, rare variant analysis, etc.) and/or genomic data from different modalities (DNA sequencing, RNA-seq, proteomics, DNA accessibility assays, etc.);
Familiarity with cloud computing services (e.g., AWS or GCP) and workflow management tools or batch scheduling systems (e.g., SLURM);
Proficiency in a Linux environment (including shell/Bash scripting), experience with database languages (e.g., SQL), and experience with version control practices and tools (e.g., Git).

Benefits at insitro

Excellent medical, dental, and vision coverage; insitro pays 100% of premiums for employees
Excellent mental health and well-being support
Open vacation policy
Access to free onsite baristas and cafe with daily lunch and breakfast
Access to free onsite fitness center
Commuter benefits
Paid parental leave
Competitive pay and 401(k) matching
Flexible work schedule (on site and remote)

Company Info.

Insitro

insitro is a data-driven drug discovery and development company using machine learning and data at scale to transform the way that drugs are discovered and developed for patients. insitro is developing predictive machine learning models to discover underlying biologic state based on human cohort data and in-house generated cellular data at scale. These predictive models can be brought to bear on key bottlenecks in pharmaceutical R&D.

Industry: Biotechnology Research
No. of Employees: 207
Location: South San Francisco, CA, USA
Website: https://www.insitro.com

Insitro is currently hiring for Data Scientist, Machine Learning jobs in South San Francisco, CA, USA, with an average base salary of $160,000 - $240,000 / Year.

Similar Jobs

Software Engineering Intern

Insitro

South San Francisco, CA, USA

0-2 year

Apache Hadoop,AWS,Computational Algorithms,Dask - Python library,Django,Flask,Git,Google Cloud Platform (GCP),Linux Operating system,Machine learning techniques,Microscopy,NoSQL,NumPy,Pandas,Proteomics,Python Programming,PyTorch,Scikit-learn,SciPy,SPARK Programming,SQL,Statistical modeling

Clinical Machine Learning (Senior) Director

Insitro

South San Francisco, CA, USA

2-4 year

Java Programming,Python Programming,C++,C Programming,SQL,Cloud computing,Scala Programming,Machine learning techniques,Data science techniques,MATLAB Programming,PyTorch,TensorFlow,R Programming

Senior / Lead Genetic Data Scientist, Statistical Geneticist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,C Programming,C++,Cloud computing,Google Cloud Platform (GCP),Python Programming,R Programming,RNA-seq Data processing,SQL,Statistical modeling

(Senior) Clinical Machine Learning Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,C++,Deep Learning,Google Cloud Platform (GCP),Java Programming,JAX framework,Keras software library,Natural Language Processing (NLP),OpenCV,Python Programming,PyTorch,R Programming,SQL,XGBoost

(Senior) ML Scientist - Advanced ML

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Google Cloud Platform (GCP),Java Programming,JAX framework,NeurIPS,Python Programming,PyTorch,SQL

Lead ML Applied Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Git,Google Cloud Platform (GCP),Java Programming,JAX framework,NeurIPS,Python Programming,PyTorch,SQL

(Senior) Director: Genomic Data Science & Computational Biology

Insitro

South San Francisco, CA, USA

4-6 year

AWS,Deep Learning,Google Cloud Platform (GCP),Java Programming,JAX framework,Machine learning techniques,Python Programming,PyTorch,SQL

Data Science & Machine Learning Intern: Core Imaging

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Dask - Python library,Git,Google Cloud Platform (GCP),Java Programming,Mercurial,NoSQL,NumPy,OpenCV,Pandas,Python Programming,Scikit-learn,SciPy,SQL

(Senior) Genetic Data Scientist, Statistical Geneticist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Bash scripting,C Programming,C++,Database,EBI GWAS Catalog,eQTL mapping,ExAC,gnomAD,Google Cloud Platform (GCP),Java Programming,Large scale data processing,Linux Operating system,PheWAS,Python Programming,SQL,UK Biobank

Lead/Senior ML Scientist for Molecular Omics

Insitro

South San Francisco, CA, USA

4-6 year

AWS,C++,Database,Deep Learning,Git,Google Cloud Platform (GCP),Machine learning techniques,Nextflow,Python Programming,R Programming,Snakemake,SQL

Lead/Senior ML Scientist for Computer Vision, Microscopy

Insitro

South San Francisco, CA, USA

4-6 year

AJAX,AWS,Deep Learning,Google Cloud Platform (GCP),Java Programming,Keras software library,Python Programming,PyTorch,SQL,TensorFlow

(Senior) Genetic Data Scientist, Statistical Geneticist

Insitro

South San Francisco, CA, USA

4-6 year

AWS,C Programming,C++,EBI GWAS Catalog,gnomAD,Google Cloud Platform (GCP),Java Programming,PheWAS,Python Programming,SQL,UK Biobank

Data Science & Machine Learning Intern: Clinical Machine Learning

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Dask - Python library,Git,Google Cloud Platform (GCP),Java Programming,Mercurial,NoSQL,NumPy,Pandas,Python Programming,SciPy,SQL

Data Science & Machine Learning Intern: Research Engineering

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Dask - Python library,Git,Google Cloud Platform (GCP),Java Programming,Mercurial,NoSQL,NumPy,OpenCV,Pandas,Python Programming,Scikit-learn,SciPy,SQL

Machine Learning (Senior) Manager / Director - Imaging

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Deep Learning,Google Cloud Platform (GCP),Java Programming,JAX framework,Python Programming,SQL

(Senior) Computer Vision/Clinical ML Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,C++,Deep Learning,Google Cloud Platform (GCP),Java Programming,JAX framework,Keras software library,OpenCV,Python Programming,PyTorch,SQL

Data Science & Machine Learning Intern: Advanced ML Technologies

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Bayesian networks,Causal inference,Dask - Python library,Gaussian Process,Generalization,Generative AI,Google Cloud Platform (GCP),JAX framework,Linux Operating system,Machine learning techniques,NoSQL,NumPy,Pandas,Probabilistic Graphical Models,Python Programming,PyTorch,SciPy,SQL,Statistical modeling

Head of Statistical Genetics

Insitro

South San Francisco, CA, USA

4-6 year

AWS,Database,EBI GWAS Catalog,ExAC,gnomAD,Google Cloud Platform (GCP),Java Programming,Python Programming,SQL,Tornado,UK Biobank

(Senior) Director: Genomic Data Science & Computational Biology

Insitro

South San Francisco, CA, USA

6-8 year

AJAX,AWS,Deep Learning,Google Cloud Platform (GCP),Java Programming,Python Programming,PyTorch,R Programming,SQL

Data Science & Machine Learning Intern: Omics

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Dask - Python library,Git,Google Cloud Platform (GCP),Java Programming,Mercurial,NoSQL,NumPy,OpenCV,Pandas,Python Programming,Scikit-learn,SciPy,SQL

(Senior) Computer Vision/Clinical ML Applied Scientist

Insitro

South San Francisco, CA, USA

2-4 year

Java Programming,Python Programming,C++,C Programming,SQL,Cloud computing,Scala Programming,Machine learning techniques,Data science techniques,MATLAB Programming,PyTorch,TensorFlow,R Programming

Data Science & Machine Learning Intern: Small Molecule Machine Learning

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Dask - Python library,Git,Google Cloud Platform (GCP),Java Programming,Mercurial,NoSQL,NumPy,OpenCV,Pandas,Python Programming,Scikit-learn,SciPy,SQL

Data Science & Machine Learning Intern: Statistical Genetics

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Dask - Python library,EBI GWAS Catalog,ExAC,Git,gnomAD,Google Cloud Platform (GCP),Java Programming,Large scale data processing,Mercurial,NoSQL,NumPy,OpenCV,Pandas,PheWAS,Python Programming,Scikit-learn,SciPy,SQL,UK Biobank

Senior / Lead ML Scientist - Advanced ML

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Cloud computing,Google Cloud Platform (GCP),JAX framework,Python Programming,PyTorch,SQL

(Senior) Director, Clinical Machine Learning

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Cloud computing,Google Cloud Platform (GCP),JAX framework,Python Programming,PyTorch,SQL

Senior / Lead Clinical Machine Learning Scientist

Insitro

South San Francisco, CA, USA

2-4 year

Cloud computing,JAX framework,Metabolomics,Natural Language Processing (NLP),Proteomics,Python Programming,PyTorch,RNA-seq Data processing,SQL

Senior / Lead Computer Vision/Clinical ML Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,C++,Cloud computing,Computer Vision (CV),CUDA/GPU programming,Google Cloud Platform (GCP),JAX framework,OpenCV,Python Programming,PyTorch,SQL

(Senior) Director, Imaging Machine Learning

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Cloud computing,Google Cloud Platform (GCP),JAX framework,Python Programming,PyTorch,SQL

Senior / Staff Research Engineer - Data Science and ML

Insitro

South San Francisco, CA, USA

6-8 year

CUDA/GPU programming,Data science techniques,Design,Drug Discovery,Large Language Models - LLMs,Machine learning techniques,Python Programming

(Senior) ML Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Deep Learning,Effective communication skills,Google Cloud Platform (GCP),JAX framework,Machine learning techniques,NeurIPS,Python Programming,PyTorch

Senior Manager / Director, Small Molecules Machine Learning

Insitro

South San Francisco, CA, USA

4-6 year

AWS,C Programming,C++,Design,Effective communication skills,Google Cloud Platform (GCP),Java Programming,Leadership Skill,Machine learning techniques,Python Programming,PyTorch,Scala Programming

Scientist II, Computational Chemistry (Chemoinformatics)

Insitro

South San Francisco, CA, USA

4-6 year

Data Analysis,Data Mining,Drug Discovery,Effective communication skills,Machine learning techniques,Science,Teamwork

Associate Scientist I/II, Automation Operations

Insitro

South San Francisco, CA, USA

2-4 year

Computer Vision (CV),Design,Machine learning techniques,Operations,Optimization,Science

Compbio Data Scientist

Insitro

South San Francisco, CA, USA

2-4 year

AWS,Azure,Effective communication skills,Python Programming

Machine Learning Scientist, Omics

Insitro

South San Francisco, CA, USA

0-2 year

AWS,Azure,Effective communication skills,Machine learning techniques,Python Programming

Mid / Senior Software Engineer

Insitro

Kraków, Poland

0-2 year

AWS,Azure,C Programming,C++,Effective communication skills,GoLang,Postgres,Python Programming,SPARK Programming,TypeScript

Senior / Staff Research Engineer - LLM Tools

Insitro

South San Francisco, CA, USA

6-8 year

Large Language Models - LLMs,Machine learning techniques,Python Programming
