Machine Learning Data Linguist, Amazon Comprehend

Amazon Web Services
Apply Now

Job Description

Amazon Web Services (AWS) is looking for a Machine Learning Data Linguist to join the AI Data team. This role focuses on text annotation and data collection. The position may be remote, with a preference for Santa Clara, Seattle, or New York City. We work in distributed teams, and strengthen our connections by traveling approximately five days per quarter for in person meetings.

You will join a growing team of professional linguists. You will apply your analytic skills to help develop the high-quality language data needed to train innovative models in Machine Learning. You will start by learning about the products on Comprehend. You will then dive deep to learn about our annotation processes. You will help the annotation team overcome challenges. You will lead tasks and collaborate across functional areas to drive performance improvements.

The successful candidate must have a background in linguistics or localization, experience with language analysis, and a passion for efficiency and accuracy.

Key job responsibilities

  • Help define requirements (e.g., tools, training, data collection protocols, etc.) for multiple projects at a given time
  • Build a thorough understanding of annotation conventions and mentor junior Data Linguists on applying these in annotation tasks
  • Annotate text data, identifying linguistic categories based on detailed annotation guidelines
  • Collect and organize text data from online sources
  • Collaborate in defining data quality and tracking metrics to ensure team works efficiently and data are high quality
  • Perform error trend analysis and create action plans to improve data quality
  • Provide feedback to Language Engineers and Scientists on tool improvements and annotation processes
  • Dive deep into challenges and implement solutions independently
  • Contribute to process improvements to reduce handling time and improve team output

About the team

The AI Data Team at AWS is responsible for delivering high-quality annotated data and a variety of language artifacts to ensure the best performance of different AWS machine-learning language services. These ML-based language services enable customers to readily add intelligence to their business operations and AI applications to drive positive outcomes.

BASIC QUALIFICATIONS

  • Bachelor's degree in a relevant field, such as Linguistics, Communications, a foreign language, or other language or data-related disciplines or at least 1 year experience in localization or NLP
  • Native or near-native English speaker
  • Applied experience with data annotation, linguistic annotation and other forms of data markup
  • Experience identifying linguistic ambiguity and annotation inaccuracies in data
  • Depth and breadth of knowledge in linguistic theory and/or applied linguistics

PREFERRED QUALIFICATIONS

  • Master's degree in a relevant field, such as Linguistics, Communications, a foreign language, or other language or data-related disciplines.
  • Native or advanced proficiency in German, French, Spanish or another foreign language
  • Familiarity with common text processing tools
  • Familiarity with json, yaml, xml or other forms of text markup
  • Ability to work in different operating systems (Windows, MacOS, or Linux)
  • Ability to navigate a Unix terminal and use common command line tools
  • Ability to strictly adhere to annotation guidelines, think abstractly about language, and identify basic parts of speech
  • Excellent communication and organizational skills
  • Ability to work collaboratively with other data associates on a team
  • Ability to deliver high quality results under tight deadlines
  • Comfortable working in a fast paced, collaborative work environment
  • Passion for language, linguistics, human language technology and AI

Company Info.

Amazon Web Services

Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon providing on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered pay-as-you-go basis. These cloud computing web services provide a variety of basic abstract technical infrastructure and distributed computing building blocks and tools. One of these services is Amazon Elastic Compute Cloud (EC2).

  • Industry
    Information Technology
  • No. of Employees
    79,196
  • Location
    410 Terry Ave N, Seattle, WA, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

Amazon Web Services is currently hiring Machine Learning Data Linguist Jobs in Vancouver, BC, Canada with average base salary of Can$95,000 - Can$170,000 / Year.

Similar Jobs View More