How Data Science Roles Vary Across Top Tech Companies
Data Scientist- title and responsibilities can vary across companies
More than a decade ago, an article was published in Harvard Business Review (HBR) titled as “Data Scientist: The Sexiest Job of the 21st Century” by DJ Patil and Thomas Davenport. At the time, the title Data Scientist was defined as “a high-ranking professional with the training and curiosity to make discoveries in the world of big data.”
Fast forward today, the Data Scientist title has evolved in so many directions that it has become one of the most misused and confusing job titles in the data industry. The interview process for Data Science is the messiest of all time. One of the main reasons for not having a clear, concrete or streamlined interview process is the confusion around the job titles vs. responsibilities expected from the role.
In this article, I will try to unbox what the Data Scientist as a job title means in the top four tech companies- Amazon, Google, Microsoft and Meta (formerly known as Facebook). I will also highlight the top skills needed for those roles.
I have interviewed myself with all these companies, therefore I am writing this to help others.
Data Science Jobs in Google:
If you are going for the Data Scientist title, most of them are actually the re-branded titles that used to be called Quantitative Analysts in Google. Most of the data scientist roles are aligned to work with product teams, and it is more into the analytics side than the core ML (Machine Learning) side.
Google has another job title called ML Engineer, which is more aligned with ML focus, and they fall under SWE (Software Engineering). Most of their data science titles have the product team listed in the job title. For example- Data Scientist, Core Compute or Data Scientist, gTech etc.
Top skills asked are SQL, Python, Stats and Data Analysis skills. If you are more interested in the core ML work (building and deploying ML models), you want to consider ML Engineer titles instead.
Data Science Jobs in Amazon:
The data scientist titles in amazon are a little bit more well defined and structured. I want to give some credit to Amazon for that. For the analytics side, Amazon has other job titles like Data Analysts, Business Analysts and Business Intelligence Engineers. For the core ML, they have Applied Scientist titles. Therefore, the Data Scientist is the perfect balance between the above titles who will have analytics as well as an engineering mindset to build models. Top skills asked are Python, Coding (Data Structures & Algorithms, meeting the SWE level-1 bar), SQL, Data Visualization, Machine Learning concepts (more breadth than depth) and how to build the models. Deployment of models can be additional or preferred skills for data scientists, but for the Applied Scientists both build and deploy skills are required.
Data Science jobs in Meta:
The data scientist job in Meta is similar to Google, which means they are also embedded with product teams rather than engineering teams. Most of the DS (Data Science) roles in Meta are analytics heavy, working along with product teams to help with data analysis, defining, measuring and monitoring KPIs (Key Performance Indicators). Meta also has ML engineer roles which are focused more in the core ML.
Top skills asked for DS roles in Meta are SQL, Python, Stats, and Data Visualization. One additional skill they usually demand is experimental design (A/B testing). Therefore the DS interviews in Meta always include one analytical case study problem.
I created the diagram below to distinguish between the two major tracks for Data Science roles- one is Analytics side and the other is Core ML side. When you read the job description- if you see keywords are focusing on data analysis and SQL, statistical modeling, A/B testing, hypothesis testing, experimentation, visualization- then the DS role is analytics based. On the other hand, if you see the keywords are more focused on asking for python, big data tools like spark, machine learning frameworks like sklearn, deep learning frameworks like Pytorch, and Tensorflow, skills for doing ML in cloud, then the DS role is core-ML based. Therefore, regardless of the title, it is important to read and understand the job description before you make a decision to apply.
Data Science jobs in Microsoft:
One of the most confusing job titles when it comes to Data Scientist is from Microsoft. Being a part of Microsoft myself, I am not so proud on how they have defined the DS roles. Microsoft has the high-level job title to cover most of the data roles under the same title called ‘Data & Applied Scientist’. The good thing is their DS roles are mostly aligned with engineering, so the parity in pay is more comparable to SWE roles. You have to rely on job descriptions to read well and carefully to conclude if the role is more on the analytics side like Data Analyst, BI engineer, or is it more ML focused like ML engineer or Applied Scientist.
Top skills asked are Python, SQL, Stats, Big Data query, and Data Visualization. If the role is focused towards the research or deep learning side- they also ask DL (Deep Learning) frameworks like PyTorch, Keras , Tensorflow etc.
Lastly, I would like to wish you all the best if you are currently applying or interviewing for DS roles in any of these companies or others. If you’ve enjoyed reading this article, please subscribe for more contents. You can also follow me on linkedin.
Thank you for the amazing illustration of analytics vs. ML. And thank you for calling out the confusion in Microsoft DS roles. During my research for my mentees, it's the most confusing job posting ever with a blanket title covering numerous roles.