Filter By


Save Search

MORE FILTERS

LESS FILTERS

  • Job Types

28 Jobs in Site Reliability Engineer

___
Full Time

28 Dec 2024

Summary

Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners,...

Key Skills

Aartificial intelligence,AWS,Azure,Continuous Integration & Continuous Delivery - CI/CD,Data pipelines,DevOps,Github,Google Cloud Platform (GCP),Grafana,Helm,Kubernetes-K8s,Machine learning techniques,Prometheus,Terraform

Staff SRE Engineer

Vizio

Dallas, TX, USA; Denver, CO, USA

8-10 year

___
Full Time

20 Dec 2024

Summary

We are seeking a highly skilled and motivated Staff SRE Engineer. Join our team and play a pivotal role in ensuring the reliability and scalability of our critical database infrastructure while...

Key Skills

Aartificial intelligence,Amazon EMR,Apache Hadoop,AWS,Database,Databricks,JavaScript,Machine learning techniques,Python Programming

___
Full Time

3 Dec 2024

Summary

The Lead Support SRE will be responsible for the supporting and spearheading the development, and implementation of Site Reliability Engineering solutions for all applications within City National...

Key Skills

.NET,C++,Database,DevOps,Finance,GoLang,Grafana,JavaScript,Kibana,Linux Operating system,Python Programming,SDLC

Site Reliability Engineer - High Performance Computing / AI-ML

Twitter

Palo Alto, CA, USA; New York City, NY, USA; San Jose, CA, USA; Seattle, WA, USA; Austin, TX, USA

2-4 year

___
Full Time

19 Nov 2024

Summary

Are you prepared to join the X team and help build the ultimate real-time information-sharing app, revolutionizing how people connect? At X, we’re on a mission to become the trusted global...

Key Skills

Aartificial intelligence,C++,GPUs,High performance computing- HPC,Kubernetes-K8s,Linux Operating system,Machine learning techniques,Puppet,Python Programming,Sass,Scala Programming

Site Reliability Engineer (KR)

Gauss Labs

Yeoksam-dong, Gangnam-gu, Seoul, South Korea

6-8 year

___
Full Time

29 Aug 2024

Summary

Gauss Labs is seeking a highly skilled Site Reliability Engineer to join our team. As an SRE at Gauss Labs, you will play a critical role in ensuring our industrial AI platform's reliability,...

Key Skills

Aartificial intelligence,AWS,Azure,Google Cloud Platform (GCP),Grafana,Machine learning techniques,Python Programming,Site Reliability Engineering (SRE),User support and Troubleshooting

___
Full Time

17 Aug 2024

Summary

Accountabilities:

  • Automate data tasks on GCP
  • Work with data domain owners, data scientists and other stakeholders to ensure that data is consumed effectively on...

Key Skills

Aartificial intelligence,Apache Hadoop,Apache Kafka,AWS,Azure,Big Data Technology,Continuous Integration & Continuous Delivery - CI/CD,Data science techniques,Git,Google Cloud Platform (GCP),Java Programming,Machine learning techniques,Python Programming,Site Reliability Engineering (SRE),SPARK Programming,SQL

___
Full Time

10 Jul 2024

Summary

At Peloton, we view Platform as a Product. An extraordinary platform unlocks speed of development and learning. It allows us to scale easily, enabling our engineers to enhance attention on new...

Key Skills

GoLang,Linux Operating system,Machine learning techniques,Python Programming,Site Reliability Engineering (SRE),Terraform

___
Full Time

18 Jun 2024

Summary

We are seeking a Site Reliability Engineer (SRE) to join our team in Singapore.

WHAT YOU’LL DO:

  • Keeping your assigned site or service up and running or getting it back up and...

Key Skills

Azure,Bash scripting,Database,Docker,Effective communication skills,Infrastructure as code,Kubernetes-K8s,Linux Operating system,Machine learning techniques,Operations,PostgreSQL,Python Programming,Site Reliability Engineering (SRE),Software Development,Unix Operating system

___
Full Time

14 Jun 2024

Summary

The Identity and Access Management (IAM) organization provides a secure, integrated, efficient, enterprise-wide IAM Services to enable efficient and effective protection of P&G information assets...

Key Skills

Aartificial intelligence,AWS,Azure,DevOps,Java Programming,Management,Oracle,SAP,Site Reliability Engineering (SRE)

___
Full Time

26 May 2024

Summary

Syndigo is a Master Data Management (MDM) visionary and a Product Information Management (PIM) leader. We are a team of passionate people who are rethinking the way MDM and PIM work. We recently...

Key Skills

Amazon Elastic Compute Cloud-EC2,Amazon Simple Storage Service (S3),Apache Hadoop,Apache Kafka,Azure,Bash scripting,Big Data Technology,Databricks,Docker,Effective communication skills,HBase,Kubernetes-K8s,PowerShell Programming,Python Programming,Ruby on Rails,SaaS,Site Reliability Engineering (SRE),SPARK Programming