
Job Description
What You’ll Do:
- Act as a technical leader on the AI Platforms team, focused on building robust ML infrastructure and evaluation systems.
- Design and implement scalable, reproducible, and easy-to-use evaluation frameworks that will power the AI platform
- Build backend systems to power large-scale model experimentation leveraging Ray.io and Datadog in house tooling
- Develop tooling and infrastructure to support dataset versioning, model evaluation, and test set management across multiple use cases.
- Collaborate closely with AI researchers and application teams to enable rapid iteration and rigorous evaluation.
- Guide technical direction across multiple projects, ensuring scalability, reliability, and long-term maintainability.
- Mentor other engineers, contribute to design reviews, and help shape the culture of the AI Platforms team.
Who You Are:
- You have a BS/MS/PhD in Computer Science or a related field, or equivalent experience.
- 10+ years of relevant engineering experience, including backend systems and platform-level infrastructure.
- Deep experience building ML infrastructure or ML platforms that support training, evaluation, and deployment at scale.
- Strong understanding of machine learning principles and familiarity with model evaluation workflows and challenges.
- Proven ability to drive cross-functional initiatives and operate in high-ambiguity environments.
- Experience building and operating production-grade systems using modern cloud infrastructure (e.g., Kubernetes, GCP, AWS, etc.).
- You’re product-minded, collaborative, and thrive in fast-paced environments.
Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.
Benefits and Growth:
- Get to build tools for software engineers, just like yourself. And use the tools we build to accelerate our development.
- Have a lot of influence on product direction and impact on the business .
- Work with skilled, knowledgeable, and kind teammates who are happy to teach and learn
- Competitive global benefits
- Continuous professional development
Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.
Datadog offers a competitive salary and equity package, and may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan.
The reasonably estimated yearly salary for this role at Datadog is:
$234,000—$300,000 USD
Company Info.
Datadog, Inc.
Datadog is the essential monitoring platform for cloud applications. We bring together data from servers, containers, databases, and third-party services to make your stack entirely observable. These capabilities help DevOps teams avoid downtime, resolve performance issues, and ensure customers are getting the best user experience.
-
Industry
Information Technology
-
No. of Employees
3,400
-
Location
New York, NY, USA
-
Website
-
Jobs Posted
Get Similar Jobs In Your Inbox
Datadog, Inc. is currently hiring Staff Software Engineer Jobs in New York, NY, USA with average base salary of $234,300 - $300,000 / Year.