Job Description

We are looking for a Staff Machine Learning Practitioner to join our team and help our clients in their innovative journeys.

Essential functions

  • Design, develop, and deploy multimodal machine learning models that integrate NLP and visualization techniques
  • Work with large datasets to build and train models that can interpret and generate human language, as well as create interactive visualizations
  • Collaborate with cross-functional teams to identify opportunities for NLP and visualization applications
  • Stay up-to-date with industry trends and advancements in NLP, visualization, and related areas
  • Develop and maintain technical documentation for models, algorithms, and visualizations
  • Work closely with data scientists, engineers, and stakeholders to ensure seamless integration of models into larger systems

Qualifications

PhD in Machine Learning, Computer Science, Computer Engineering, or equivalent experience.

Experience with AWS Cloud Platforms

Proficiency in using vision-language and generative models for a variety of multimodal tasks, including controllable image synthesis and visual question answering.

Familiarity with various vision-language models (such as CLIP, BLIP, Stable Diffusion, ControlNet), and DL libraries (e.g. Huggingface) for applying these models to real-world problems.”

Ability to use vision-language models (e.g. BLIP) for a wider range of tasks such as image captioning and visual question-answering.

Familiarity with the Stable Diffusion ecosystem, including ControlNet, Dreambooth, LoRA, and DL libraries such as Huggingface.

Good to Have :

In-depth knowledge of the architecture and inner workings of vision-language models.

Capability to train custom models tailored to specific image generation/analysis tasks.

In-depth knowledge of vision-language models and training techniques.

Experience in fine-tuning vision-language models on custom datasets, and deploying them for use in real-time or batch scenarios.

In-depth knowledge of diffusion models.

Experience in fine-tuning diffusion models for a variety of use cases on custom datasets, and deploying them for use in real-time or batch scenarios.

Would be a plus

  • Ability to push the boundaries of vision-language models by creating custom model architectures for uncommon problems, including other modalities such as 3D and video.
  • Expertise in ethical considerations and responsible use of generative AI for images, including addressing potential biases and misuse.
  • Ability to develop custom models for uncommon problems, including working with other modalities such as 3D and video.
  • Ability to modify/adapt diffusion models for uncommon problems. Experience working with other modalities such as 3D and video.

We offer

  • Opportunity to work on bleeding-edge projects
  • Work with a highly motivated and dedicated team
  • Competitive salary
  • Flexible schedule
  • Benefits package - medical insurance, sports
  • Corporate social events
  • Professional development opportunities
  • Well-equipped office

Company Info.

Grid Dynamics Holdings, Inc.

Grid Dynamics is a leading provider of technology consulting, agile co-creation and scalable engineering and data science services for Fortune 500 corporations undergoing digital transformation. We work in close collaboration with our clients on digital transformation initiatives that span strategy consulting, early prototypes and enterprise-scale delivery of new digital platforms.

Get Similar Jobs In Your Inbox

Grid Dynamics Holdings, Inc. is currently hiring Staff Machine Learning Engineer Jobs in San Ramon, CA, USA with average base salary of $121,500 - $248,500 / Year.

Similar Jobs View More