AWS, Biostatistics, Data Analysis, Data Analytics, Data pipelines, Effective communication skills, Github, Python Programming, R Programming
Position Focus:
This position will function as a lead data analytics and bioinformatics expert in the laboratory who will lead the processing, quality checks, all stages of data analyses, interpretation and visual representation of data that emerge from proteomics (mass spectrometry) AND transcriptomics (bulk and single cell RNAseq) studies performed in the laboratory. Bioinformatics analyses of these two types of data have distinct requirements and training, BOTH of which will be expected for this position.
The position will also interface with biologists in the team, to address their data needs, provide input in analytical plans and execute these plans in a timely manner. The position will require excellent multitasking abilities, and a hands-on and deep knowledge of programming languages not limited to R, Python, and SPSS.
The individual will be responsible for data management, data storage, cataloging of all data, data cleanup and will also ensure that datasets are appropriately disseminated to the public (via deposition in public repositories) after scientific publications.
The position will also participate in grant writing, manuscript preparation, scientific presentations and lab meetings. The individual needs to have a demonstrated ability to constantly improve and keep abreast with rapidly evolving fields of bioinformatics, systems biology and will therefore be expected to attend data science seminars, conferences and workshops.
The individual will also take mentoring roles focused on improving data analytical skills among all laboratory members, including senior members, technicians and trainees (pre- and post-doctoral).
Essential Duties
Required Education and Experience
Master’s Degree in statistics, bioinformatics, epidemiology or a related field and 5 years of related experience or an equivalent combination of education and experience.
Required Skill/Ability 1:
Familiarity with RNAseq pipelines (not limited to deseq2, EdgeR and Limma packages). At least completed 1 course on single cell RNAseq data analysis (with at least 8 contact-hours). Familiarity with proteomics pipelines (not limited to MAxQuant, Proteome Discoverer, MSFragpipe).
Required Skill/Ability 2:
Ability to independently troubleshoot analytic pipelines, and custom build pipelines to suit experimental needs. Ability to adopt existing code/pipelines from open-source forums (Ex.GitHub) and implement for in-house use.
Required Skill/Ability 3:
Demonstrated ability in Bioinformatics pipeline development, including RNAseq and proteomics analyses, including differential expression analysis, network analysis, gene set enrichment and pathway analysis. Quantitative academic focus and >2 years of data analysis/programming experience; or equivalent combination of education and experience.
Required Skill/Ability 4:
Extensive knowledge of and ability to apply standard software development principles, theories, concepts and techniques to data analysis. Strong programming skills in R, and/or Python. Demonstrated ability with data handing and processing using Amazon Cloud services (AWS). Managing large data sets.
Required Skill/Ability 5:
Excellent written and oral communication skills. Ability to effectively communicate with bioinformatics personnel/experts from other institutions to independently plan analytic pipelines. Ability to assemble high-quality figures for grants and publications. Ability to assemble slides and posters for talks/presentations
Preferred Education, Experience and Skills:
Master’s Degree in Biostatistics or Statistics AND two years of experience; or equivalent combination of education and experience, in the analyses of mass spectrometry proteomics and RNAseq data, including bulk RNAseq and single cell RNAseq.
Yale University is a private Ivy League research university in New Haven, Connecticut. Founded in 1701, Yale is the third-oldest institution of higher education in the United States and one of the nine colonial colleges chartered before the American Revolution.