Python Programming, Scala Programming, C++, Java Programming, Machine learning techniques, Data science techniques, SQL, Apache Hadoop, MapReduce, TensorFlow, PyTorch, R Programming
At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems, from security threat detection to cancer drug development. We do this by building and running the world's best data and AI infrastructure platform, so our customers can focus on the high value challenges that are central to their own missions.
Founded in 2013 by the original creators of Apache Spark, Databricks has grown from a tiny corner office in Berkeley, California to a global organization with over 1000 employees. Thousands of organizations, from small to Fortune 100, trust Databricks with their mission-critical workloads, making us one of the fastest growing SaaS companies in the world.
Our engineering teams build highly technical products that fulfill real, important needs in the world. We constantly push the boundaries of data and AI technology, while simultaneously operating with the resilience, security and scale that is critical to making customers successful on our platform.
We develop and operate one of the largest scale software platforms. The fleet consists of millions of virtual machines, generating terabytes of logs and processing exabytes of data per day. At our scale, we regularly observe cloud hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.
As a Data Scientist on the Data Team, you will help build a data-driven culture within Databricks by helping solve product and business challenges. The Data team also functions as a in-house, production customer that dogfoods Databricks and drives the future direction of the products.
If you are interested in machine learning infrastructure, please apply to the Software Engineer Backend job opening here.
The impact you will have:
What we look for:
Databricks is an American enterprise software company founded by the creators of Apache Spark. Databricks develops a web-based platform for working with Spark, that provides automated cluster management and IPython-style notebooks. Databricks grew out of the AMPLab project at University of California, Berkeley that was involved in making Apache Spark, an open-source distributed computing framework built atop Scala.
United States
2-4 year
San Francisco, CA, USA
4-6 year
San Francisco, CA, USA
4-6 year
San Francisco, CA, USA
4-6 year
Mumbai, Maharashtra, India
6-8 year
United States
6-8 year
United States
6-8 year
United States
6-8 year
United States
6-8 year
Washington D.C., DC, USA
6-8 year