Senior Machine Learning and Data Engineer

IBM
Apply Now

Job Description

Your Role and Responsibilities

We are looking for a highly skilled and experienced Senior Machine Learning and Data Engineer to join our team of AI experts. You will be responsible for designing, implementing and optimizing our data and machine learning pipelines across a wide variety of applications. The ideal candidate is an experienced data and ML pipeline builder who enjoys optimizing data and machine learning systems and building them from the ground up. The ML Data Engineer will support our software developers and data scientists in embedding AI into our products and will ensure that our data and ML delivery architecture is consistent across projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems and products. The right candidate will be excited by the prospect of optimizing and re-designing our data and ML architecture to support our next generation of products and data initiatives.

Key Responsibilities:

  • Design and develop robust, scalable, and high-performance data and machine learning pipelines, ensuring data integrity and reliability throughout the process.
  • Collaborate with cross-functional teams, including data scientists, product managers, and software engineers, to understand data and machine learning requirements and deliver optimal solutions.
  • Experience delivering and maintaining productionized end-to-end Machine Learning solutions, form data preparation, experimentation, model training, model serving, and model update
  • Design solutions that ensuring efficient data storage, retrieval, and processing capabilities using modern data lakehouse and data fabric systems.
  • Develop and implement data transformation and cleansing processes to ensure data quality and consistency.
  • Optimize and fine-tune data and machine learning pipelines for improved performance, scalability, and reliability.
  • Collaborate with the DevOps team to ensure data security, privacy, and compliance with relevant regulations and best practices.
  • Document technical specifications, system architecture, and data flows to facilitate knowledge sharing and maintain a well-documented system.
  • Stay updated with emerging technologies, industry trends, and best practices in data engineering, and proactively introduce new tools, frameworks, and techniques to enhance data processing capabilities.

Required Technical and Professional Expertise

  • 5+ years as a Data and Machine Learning Engineer or similar role with a focus on designing and developing end-to-end enterprise data and machine learning pipelines
  • Knowledge of data best practices and MLOps/Dev operations in SaaS and hybrid environments
  • 5+ years of Python, Scala or Java development experience
  • Ability in managing and communicating data and machine learning pipeline plans to internal clients
  • Experience building, running, and maintaining productionized Machine Learning models using a well-designed MLOps approach
  • Experience working with both batch and streaming data
  • Experience with machine learning frameworks like PyTorch, Tensorflow and scikit-learn
  • Experience with cloud-based platforms and services (e.g., IBM, AWS, Azure, Google Cloud).
  • Strong expertise in working with big data technologies and frameworks like Hadoop, Spark, or Kafka.
  • Strong understanding of data governance, data security, and data privacy best practices.
  • Excellent communication skills with the ability to effectively collaborate with technical and non-technical stakeholders.

Preferred Technical and Professional Expertise

  • Experience designing, building, and maintaining data processing and machine learning systems working in containerized environments (Docker, OpenShift, k8s)
  • Experience working with both batch and streaming data ingestion
  • Experience building reusable and reliable workflows and pipelines using workflow engines (Argo, Tekton, etc.)
  • Experience developing or leveraging automated platforms for model observability, model quality and drift and systems to automatically identify and correct issues.
  • Experience developing or leveraging automated platforms for data observability, data quality and drift and systems to automatically identify and correct data issues.

Company Info.

IBM

IBM is a leading cloud platform and cognitive solutions company. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 290,000 employees serving clients in 177 countries. IBM Research provides unparalleled insight into business, industry and society by leveraging advanced computing architectures and methodologies to solve some of the world’s most pressing challenges.

  • Industry
    Information Technology,Computer software,Computer hardware
  • No. of Employees
    292,500
  • Location
    New Orchard Road, Armonk, New York, NY 10504, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

IBM is currently hiring Data Engineer Jobs in Bangalore, Karnataka, India with average base salary of ₹90,000 - ₹250,000 / Month.

Similar Jobs View More