Data Platform Architect

Invitae Corp.

Job Description

Invitae is dedicated to bringing comprehensive genetic information into mainstream medicine to improve healthcare for billions of people. Our team is driven to make a difference for the patients we serve. We are leading the transformation of the genetics industry by making genetic testing affordable and accessible for everyone, to guide health decisions across all stages of life.

Invitae needs engineers with diverse backgrounds to help us achieve our mission. We are a cross-functional team of scientific domain experts and dedicated, curious engineers. We build systems that take massive amounts of genomic data, combine it with the world's scientific literature, add to it years of rigorously curated results, and package it all neatly for our scientists to consume. It's a lot of information. As the data gets bigger, our systems need to get better and faster. That's where you come in.

The role focuses specifically on the Patient Data Network, which includes Ciitizen, a health technology platform that enables patients with cancer and rare neurologic disorders to collect, digitize, and share their health information. This includes a wide array of consumer and enterprise products used by a variety of customers including patients, providers, biopharma, advocacy, and HIEs. Specific examples of these applications include the Patient Portal, Clinical Trial Matching, Healthcare Services Marketplace, Patient Cohort Manager, and Prism Reporting.

We are looking for a reliable and motivated Data Platform Architect to join our Patient Data Network team and drive the architecture behind our Data Platform Team. You will help develop the data ingestion pipelines and data platform architecture that support the analytical and reporting needs of internal stakeholders, data scientists, and our machine learning team, as well as externally facing products.

What you’ll do:

  • Understand our complex data ecosystem
  • Act as a data domain expert and drive platform data architecture
  • Work with product managers and stakeholders to help define the data platform roadmap
  • Design and implement reliable, scalable, and efficient data infrastructure, data-driven products, and software solutions for external and internal customers
  • Provide expertise on data engineering best practices, standards, architectural approaches, and the resolution of complex technical issues
  • Identify data platform areas that can be improved, propose better solutions, and drive implementation
  • Stay current on the latest data technologies and how we can leverage them to improve the data platform's offering, efficiency, and scalability

What you bring:

  • 10+ years of relevant experience in data engineering
  • Extensive hands-on experience working with large datasets, pipelines, modern warehouse technologies, and data processing (real-time and batch) at large scale
  • Deep understanding of data and how it relates to architecture (data ingestion, real-time, batch, SQL, NoSQL, analytics, etc.)
  • Ability to model and design modern data structures in data lakes and data warehouses
  • Expert understanding of standard software design patterns and functional programming paradigms
  • Ability to think strategically and align design and architecture patterns to drive current needs as well as future growth
  • Strong technical communication skills
  • Self-starter attitude and ability to work towards a larger goal with minimal guidance
  • Advanced experience in data modeling, database technologies, SQL queries and performance tuning
  • Proficiency in Java, Python or Scala and a demonstrable ability to quickly learn
  • Focus on high quality code, including automated testing and coding best practices
  • Experience with messaging/queuing systems or stream processing systems
  • Experience in building distributed systems with infrastructure automation, monitoring and alerting

Additional Preferred but not Required Skills:

  • Experience with cloud data warehouse technology (e.g. Snowflake, Databricks)
  • Experience with data modeling/dimensional modeling
  • Experience with data transformation tools and BI tools
  • Experience using Kafka to implement streaming applications
  • Experience with workflow engines such as Netflix Conductor
  • Experience with data lineage/data governance tools like Atlan
  • Experience with CI/CD pipelines (e.g. GitHub Actions)
  • Experience with maintaining and administering Kubernetes clusters
  • Interest in working on related but separate projects in parallel

At Invitae, you’ll work alongside some of the world’s experts in genetics and healthcare at the forefront of genetic medicine. Our teams thrive in our dynamic organization, which has been designed to empower them to make the biggest impact they can for our patients. We give our employees the ability to explore interests and capabilities broadly within the organization. We prize freedom with accountability and offer significant flexibility. We also provide excellent benefits and competitive compensation in a fast-growing organization.

At Invitae, we’re changing healthcare to change lives. Join us.

Company Info.

Invitae Corp.

Invitae Corp. is a biotechnology company that was created as a subsidiary of Genomic Health in 2010 and then spun off in 2012. In 2017, Invitae acquired Good Start Genetics and CombiMatrix. In 2020, Invitae announced the acquisition of ArcherDX for $1.4 billion.

  • Industry
    Biotechnology Research
  • No. of Employees
    2,100
  • Location
    San Francisco, CA, USA

Invitae Corp. is currently hiring for Data Platform Architect roles in Austin, TX, USA, with an average base salary of $120,000 - $190,000 per year.
