Apache Hadoop, C Programming, C++, Data science techniques, Java Programming, Machine learning techniques, MapReduce, Python Programming, PyTorch, R Programming, Scala Programming, SQL, TensorFlow
At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems, from security threat detection to cancer drug development. We do this by building and running the world's best data and AI infrastructure platform, so our customers can focus on the high-value challenges that are central to their missions.
Founded in 2013 by the original creators of Apache Spark, Databricks has grown from a tiny corner office in Berkeley, CA to a global organization with over 1500 employees. Thousands of organizations, from small to Fortune 100, trust Databricks with their mission-critical workloads, making us one of the fastest-growing SaaS companies in the world.
Our engineering teams build highly technical products that fulfill real, important needs in the world. We constantly push the boundaries of data and AI technology, while simultaneously operating with the resilience, security, and scale that are critical to making customers successful on our platform.
Do you dream about making infrastructure more efficient with data science? Databricks’s scale and range of initiatives present a tremendous opportunity for data science to tackle interesting, important challenges around optimizing our infrastructure. How do we better understand and forecast cloud computing costs and the input variables that drive costs? How can we better evaluate performance of our service? How do we better detect regressions hidden among billions of time series data points in our monitoring infrastructure? Can we detect server-side performance regressions in a high velocity engineering environment where there are strong trend and seasonality effects?
We’re looking for a leader to start a new Infrastructure Data Science group. The vision for the group is to use statistical and machine learning techniques to optimize the operations of Databricks’s infrastructure and enable its continued growth. It’s a group of “full stack” data scientists that partners with teams supporting all of Databricks’s products and infrastructure, focusing on long-term strategic initiatives that make Databricks infrastructure more efficient, reliable, and scalable.
The impact you will have:
What we look for:
Benefits
Databricks is an American enterprise software company founded by the creators of Apache Spark. Databricks develops a web-based platform for working with Spark that provides automated cluster management and IPython-style notebooks. Databricks grew out of the AMPLab project at the University of California, Berkeley, which created Apache Spark, an open-source distributed computing framework built atop Scala.
United States
2-4 year