Amazon Simple Storage Service (S3), AWS, AWS Aurora, Elasticsearch, Java Programming, Large data sets, Natural Language Processing (NLP), Python Programming, Text Processing
You are a Data Engineer with experience in processing terabytes of data. You have experience in creating and automating scalable, fault-tolerant and reproducible data pipelines using Amazon AWS technologies. You are interested in helping to create a platform completely built on top of AWS. You are eager to join a team of Life Scientists and Software Engineers that believe the brightest minds in research should have the best tools to drive innovation.
What You’ll Do:
What You Know:
Must Haves:
Nice-to-Haves:
What do we love in team members?
Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.
Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment. By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy. Our customers are