ML Data Linguist (FTC) - Hindi and English, AWS AI Data | Transcribe

Amazon Web Services
Apply Now

Job Description

Job summary

Amazon Web Services (AWS) is looking for a Machine Learning Data Linguist to join the AI Data team. This role focuses on speech and language data in Hindi and English, primarily in the areas of speech transcription, text annotation, and other general development of high quality language data deliverables. The successful candidate must have background in analyzing the target languages and a passion for efficiency and accuracy.

Inclusive Team Culture

Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

Work/Life Balance

Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.

Mentorship & Career Growth

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future.

This is a Fixed Term Contractor role. Initial Term is 12 months with a possibility of extension once on-boarded.

Key job responsibilities

  • Build a thorough understanding of data collection and annotation guidelines and various annotation tools.
  • Transcribe and annotate natural language data accurately within deadlines, adhering to guidelines.
  • Dive deep into the data to perform qualitative error trend analysis.
  • Handle unique data collection and analysis requests for different NLP/NLU applications.
  • Collaborate with other ML Data Linguists to resolve data ambiguities and annotation disagreements.
  • Provide feedback to Language Engineers on annotation guidelines, tooling, and processes to drive improvements.

About the team

The AI Data Team at AWS is responsible for delivering high-quality annotated data and a variety of language artifacts to ensure the best performance of different AWS machine-learning language services. These ML-based language services enable customers to readily add intelligence to their business operations and AI applications to drive positive outcomes.

BASIC QUALIFICATIONS

  • Bachelor's degree in Linguistics, Speech Processing, Communication, Cognitive Science, or a related field, with background in phonetics, semantics, pragmatics, conversation analysis, and/or discourse analysis.
  • 1+ years of experience with natural language data labelling and other forms of data markup.
  • Experience transcribing language data orthographically and phonemically.
  • Practical knowledge in IPA, X-SAMPA or ARPABET.
  • Native or near-native proficiency in Hindi and English (US) (CEFR C1 or above).
  • Familiarity with command line interfaces and basic Unix commands.
  • Excellent communication, strong organizational skills with a keen eye for details.
  • Comfortable working in a fast-paced, highly collaborative, and dynamic work environment.
  • PREFERRED QUALIFICATIONS
  • Ability to quickly learn new guidelines, technical concepts, and softwares.
  • Flexibility to work in different operating systems (Windows, MacOS, or Linux) and collaborative productivity tools.
  • Working knowledge of a variety of file formats and mark up languages (e.g. JSON, XML, HTML).
  • Prior work experience at contact centers is a plus.
  • Willingness to support several projects at one time, and to accept reprioritization as necessary.

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Company Info.

Amazon Web Services

Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon providing on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered pay-as-you-go basis. These cloud computing web services provide a variety of basic abstract technical infrastructure and distributed computing building blocks and tools. One of these services is Amazon Elastic Compute Cloud (EC2).

  • Industry
    Information Technology
  • No. of Employees
    79,196
  • Location
    410 Terry Ave N, Seattle, WA, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

Amazon Web Services is currently hiring Machine Learning Data Linguist Jobs in Vancouver, BC, Canada with average base salary of Can$95,000 - Can$170,000 / Year.

Similar Jobs View More