BEHIND THE

DATA SCIENTIST

MATÍAS ÁVILA

CATEGORY

Shapelets

CATEGORY

Shapelets

CATEGORY

Shapelets

 CATEGORY

  Interview

 DATE

   18 Apr

 TIME

   10 Minutes

BEHIND THE DATA SCIENTIST

By Shapelets – Introducing Matías Ávila

Welcome to this new series of interviews by Shapelets! 

The series allows us to explore different topics in the data science community. We will have experts on hand who can provide a comprehensive understanding of the role and experience of a data scientist.  

For this first interview, we are joined by Matías Ávila, who is a Data Scientist at IKEA and a Machine Learning Lecturer at the University of Navarra Big Data Science Master Degree.  

Today, we will discuss the different backgrounds of data professionals, the challenges they face, and the skills they use on a daily basis. 

How did you move from Economics to Data Science? 

Matías Ávila – When I was doing my undergrad at the University of Navarra, while studying Economics, I really enjoyed some lectures such as Econometrics, Time Series, … I was really interested in understanding how some features can explain or even predict behaviours and features values. So, during this time, one of my teachers, suggested me to do my senior undergrad thesis in a time when I had to deal with medical data. So I had to learn Python, and I had a great time gathering data, cleaning it, plotting, modelling… It was then when I discovered Data Science and when I decided to move to the States in order to pursue my career in AI and Big Data. So I started my master’s in Data Analytics at North Caroline State University. So, thanks to this professor, and this senior thesis project is why I moved from Economics to Data Science.    

What do you like the most about your job? 

M.A.  I always enjoy exploring uncharted patterns within data, that’s always fun. But something I really enjoy is that Data Science is a really eclectic domain. You derive ideas from abroad and huge range of sources. For example, you have different tools for survival analysis which is quite common in medical trials, models that are brought to the time series domain, algorithms from computer science, or even different metrics such as entropy, which is used in chemistry and physics. So I would say that something I really enjoy about Data Science is that is an area where you interact with very different domains 

What do you value the most in a data analysis tool or a data technology? 

M.A. Something I really value is open source technologies, because you have a huge community you can contact if you have any problem. Or just if you want to learn how other have solved the very same problem but in a different industry. It’s a great way to understand the tool you are using and get the most out of it 

From your professional experience, what do you find to be the main problems in the connection with the business area? 

M.A. So I would say that there are two main problems. The first would be dealing with expectations. Some people, especially managers, tend to think that Data Science is some kind of magic where you bring a bunch of nerds and blah blah blah they will make a ton of money and fix evrything with this algorithm. It doesn’t work like that. First, you have to be sure you have the needed data to solve this problem. Follow up the solution you’ve implemented. So there are many other things going on behind the scenes.   

 Another problem I’d say is that usually data scientists tend to lack soft skills. Data scientists have great backgrounds in computer science, mathematics, engineering,… but generally they struggle when communicating valuable insights or solutions.   

 So I’d say those are the main problems, dealing with expectations and the lack of soft skills of some data scientists when communicating some ideas to their managers.  

Could you please share what is your approach when solving any data analytics-based project?

M.A. There are different steps you should follow when doing and deploying a machine learning solution. You have to find a hypothesis, you have to find some metrics in order to track how good or bad is your solutions, you have to train the model, and before that you have to understand the data. But most people don’t know that. Something I really stress, or something that is really important and most people miss is that you have to get to know the business. Wonder do I know how the warehouse store the goods? Do I really understand what problems do the truck drivers face when dealing with a shipment? do I know how data is gathered? 

If you don’t know or you don’t understand the business, you won’t do your best when analyzing the data, when you are cleaning it, during some imputing values or even during modelling. So, I’d say that the very first stage that you should focus on is to really get to know the business and then follow the common steps of machine learning. 

What advice would you give to maybe your students or to other data scientists? 

M.A. Well, maybe you are from a business background, or maybe from a computers science degree, or you studied medicine. It doesn’t matter. Something you have to do is to take advantage of your knowledge and say, ok, now if I want to turn into a data scientist, what am I missing? If you have studied business, maybe you lack some statistical knowledge. Or even some programming languages.  

I strongly recommend you is to learn Python, R, SQL, Git…but you have to take advantage of the current skills that you have. Let’s say If you have studied computer science background, you already know how to program. You just have to focus on honing your skills in statistics and maybe in some business or soft skills.   

So what matters is how you use your skills. Let’s say you studied business so become a great data scientist in a bank. If I have studied medicine I’m gonna focus on becoming a great data scientist in the pharma industry. So that’s something I really recommend you, to take advantage of your background. Because that’s what makes you different, also what makes you stand out from the crowd.   

Another thing I would strongly recommend is that nowadays, cloud technologies such as Google, Azure or Amazon. That knowledge is quite important nowadays when working in a big company.  

What do you think are the key skills a data scientist need in 2022? 

M.A. I’d say you need three main skills. First, you need some knowledge or skills from computer science, algorithms, also databases. That would be the first one. Then you need some knowledge from statistics and mathematics which is essential in order to deal with some machine learning algorithms. And finally, but not least, you need some soft skills or business knowledge. The combination of these three sets of skills is hwat makes you a great data scientist 

 

Thank you Matías for taking the time to chat with us and for sharing your experience!  

This interview really helped us to deep into the professional background and day-to-day of a data scientist. It doesn’t matter where you come from or your bachelor degree, everything helps! As Matías advocates, take advantage of your expertise, your experience and push forward. Even if you don’t realise it yet, every field can help you become a greater data professional.  

Do you want to participate and share your experience? Contact us at marketing@shapelets.io or fill out this form, and tell us your story!  We want to help data scientists and data professionals build a strong personal brand and advance their career. Start now!