Aartificial intelligence, Annotation, Design, Large Language Models - LLMs, Machine learning techniques, MLOps tools
At Datadog, we are building an internal AI platform that empowers our teams to train, evaluate, and deploy models at scale. The Annotation & Evaluation team plays a foundational role in ensuring our models are reliable, safe, and production-ready. We design the infrastructure and tooling for dataset labeling, model benchmarking, trust & safety evaluation, and performance diagnostics across a range of ML and LLM applications.
From interactive labeling pipelines to automated evaluation environments, our systems provide the core feedback loop that allows engineers and scientists to measure, compare, and continuously improve AI models. We work at the intersection of applied ML, data engineering, and platform infrastructure.
We’re looking for a Senior Software Engineer to help us scale our evaluation systems, develop benchmarking tools, and drive trust & safety observability across Datadog's AI product offerings.
At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do:
Who You Are:
Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.
Benefits and Growth:
Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.
Datadog is the essential monitoring platform for cloud applications. We bring together data from servers, containers, databases, and third-party services to make your stack entirely observable. These capabilities help DevOps teams avoid downtime, resolve performance issues, and ensure customers are getting the best user experience.