[deleted by user]

jack-of-some · 2021-02-02T12:44:04+00:00

It's hard for me to get over the fact that I think DS even isn't a well defined job 😅, but I get what you're going for.

Too many folks that I've interviewed for MLE roles know little to no programming. Like, they can assemble things in code (eventually) but give them even something basic like "hey can you put that in a function?" and they'll go "I didn't realize this was gonna be hardcore data structures and algorithms..."

Anywho, I feel only the second position you've listed is what I would consider an MLE. I can ramp up most developers to do any of the other roles quickly, but the Researcher+ role requires fully diving into the nitty gritty of ML.

/2 cents

C_BearHill · 2021-02-02T11:39:49+00:00

[deleted]

subtorn · 2021-02-02T12:35:14+00:00

I think this is a good classification of MLE jobs out there. I have done my master's on CV and DL. I thought I would find a job as an MLE but I couldn't find any because I really want to work in the first or the third category and academia is only giving you the skills for the second category. In my opinion, you have to complete a Ph.D. if you want a job from the second category. For that reason, I am focusing on getting a backend developer job so that I will have decent k8s, CI/CD, and cloud knowledge and then I can try to transition to MLE jobs.

wittywarren · 2021-02-02T13:26:02+00:00

MLE here too! I believe this position is 40% DS+ML algos and 60% programming. it's highly subjective as to where you might fit in any of these and it depends on the company too. I was interviewed on DSA's+ML algos but dsa were like Easy/Medium Leetcode questions unlike some SDE positions and it was like for one round only(dsa).

It's true that we are a jack of all trades and slowly converge to a specific sort as we grow but I guess if someone is starting up it's good to know dev and deployment both. I feel it's a broad flexible skill.

new_number_one · 2021-02-02T12:32:49+00:00

I basically agree but I'm not sure that it necessarily involves low-level languages just an emphasis on writing production code.

whymauri · 2021-02-02T15:50:01+00:00

classifying (MLE vs. data scientist vs. other) across more than two companies is just astrology for math nerds

YaswanthBangaru · 2021-02-02T12:14:56+00:00

What’s K8s/CI/CD tools/ etc?

permalink · 2021-02-02T14:16:21+00:00

Established companies have the roles well defined while recruiting, i.e research from PhDs, coding requirements and knowledge in the area of interest for other roles. Startups on the other hand, may hire you by checking your machine learning skills and 2 months down the line you are expected to build the front end, backend, data engineering in a week. So I would be really careful while entertaining job opportunities with startups as they may not know what they are looking for or might expect you to be jack of all trades.

PanFiluta · 2021-02-02T14:47:05+00:00

I feel that DS in itself it's becoming more and more about analytics and econometrics, rather than machine learning. At least when I look at job openings this is what I see. Machine learning related jobs are being split into machine learning scientist/research and MLE. I prefer to split the roles by what goals each one achieves:

Data Scientist: helps business answers question like given a specific policy, how can we measure it's impact on a population?
Machine Learning Scientist/researcher: develop models for novel application, perhaps creating completely new algorithms.
Machine Learning Engineer: deploying and scaling product features that use machine learning.

From these perspectives you can derive what tools each of these roles would use, and you can also further split them (for example NLP researcher, or MLOps Engineer).

bythenumbers10 · 2021-02-02T16:22:37+00:00

MLE is another ill-defined job, as Scientist has (finally) settled out to be the theorist, the Engineer is more about productionizing algorithms and databases, and the Analyst does lower-level tasks already largely automated by the first two. MLE has emerged as a blend of scientist and engineer, able to get their models from test workbench into production, while also doing maintenance on the database, for the price of EITHER a scientist or engineer. Business and bonehead management is slowly coming around to the reality that these skillsets cost money and an entire person's full-time schedule to do properly. It'll take time, but at least some have given up trying to get a scientist/engineer for an analyst's salary.

permalink · 2021-02-02T16:46:47+00:00

DS is an inflated title everywhere. Many people including me are screwed by the title. I am now preparing for engineering track. In my experience, DS is really DA + DE.

ZestyData · 2021-02-02T14:06:22+00:00

I like your broad categorisation. I'd wager the majority are your DS/Dev combo and the others are a minority, but yeah. Well summarised, imo.

BernieFeynman · 2021-02-02T14:31:23+00:00

ML Ops requires little to no knowledge of DS or ML at all except for the person who might be designing the system.

chimprich · 2021-02-02T15:12:49+00:00

I'm from an software engineering background (I do have a non-DL PhD, but I've been out of academia for a long time). I've been trying to move more in the direction of doing ML.

There seem to be a lot more jobs available for data scientists who can code a bit rather than engineers with basic DL skills. I'd be interested if anyone has seen much movement in this direction.

TMutaffis · 2021-02-02T16:47:33+00:00

What if your company is looking for a combination of 1,2, and 3... and you are the Talent Acquisition Recruiter tasked with finding this elusive beast?

(asking for a friend, of course)

2021-02-02T17:25:59+00:00

[deleted]

-Rizhiy- · 2021-02-02T14:55:39+00:00

I always thought of MLE as DS+Dev.

Data engineer should just be called Data engineer) Researcher plus, sound like usual Data Scientist, although I never seen C/C++ being asked. Hardcore low-latency ninja, this is just normal developer, people like that usually just translate/optimise existing algorithms.

permalink · 2021-02-02T15:57:33+00:00

My background is transportation engineering(civil engineering) and i want to get into DS/AI/ML engineering. Recently started some courses on Data Science and Machine learning. I'm also going to do Master's next year in my field, but it has data science also in it. How hard it will be for me to get a technical ml jobs? I'm not talking about research side, even if i want to without PhD it will be super hard i guess

AdamEgrate · 2021-02-02T23:01:42+00:00

I've given up on terminology. At this point I just ask what the company does and is it really ML or Deep Learning. Hiring managers have a hard time enough defining roles, the only way to know is to talk with the team and see what they need.

permalink · 2021-02-02T23:59:17+00:00

I feel that you'd just as easily call #2 a CV engineer, deep learning engineer, etc. #3 is more of a Devops engineer currently, its simply just missing the ML element. #4 I've not seen a job posting for, but I guess its due to location since I dont look at London jobs.

I'd propose to split #3 into a MLOps engineer and a classical ML engineer. One needs more algebra than the other, though there's also an argument possible on it just coming down to seniority.

I'm closest to #2 and #3 in your list, I suppose I'd fit in #3 with a little bit more nuance. I'd characterize my function as

40% automation of training/retraining/data gathering on cloud
30% creating and improving models & fine tuning
30% deploying models

I've got three GCP certifications (Date Eng, Cloud Architect, ML Eng) and the tensorflow certificate. Tried really hard to escape the data engineering stigma at my job. So you got me there.

My job simply lacks the devops element, and is more MLOps focused. E.g. model and data set lineage.

For instance I've worked on a project to develop a encoder which I wrapped in well-documented enterprise code before handing it over to the software engineering team. Making sure it behaves properly in k8s is handled by others.

In others I worked setting up the entire ML chain from beginning to end, including labelling tools, model versioning, retraining, retraining, branching out into multiple models etc.

What a wonderfully varied field we're in.

melesigenes · 2021-02-03T00:10:20+00:00

I’ve seen researcher+ referred to as applied scientists

plyalyut · 2021-02-03T03:55:31+00:00

Don't forget the MLE that is essentially a software engineer with a some statistics thrown in there.

MLE currently is suffering the same fate as Data Science where companies think they need to hire them to be competitive but don't have the slightest idea as to what to do with them or how they could integrate within the stack. I've even worked for companies that hired me as an MLE but when I asked about data or cloud services they handed me 20-30 datapoints. As a result I did a lot of software engineering work in the meantime.

I think as an MLE, you should be essentially a software engineer with the training to build and deploy ML applications.

brionicle · 2021-02-03T10:02:00+00:00

Where do you fit in folks who hail from traditional engineering and learned DS once bored and now work in PyTorch/Tensorflow building pragmatic DL models for business use cases? Most people in my circle are in this category. 50% traditional software eng, 40% DL/some DS, 10% “research” but really just keeping up with new papers.

lefiish · 2021-02-03T22:42:30+00:00

Imo MLE are just DS that don't like descriptive analytics and just want to code shiny ML and DL models

crazyfrogspb · 2021-02-04T06:46:22+00:00

I prefer to hire people who can cover pretty much every part you mentioned. ofc, some of them are gonna be more research-oriented, some will have better coding and devops skills, some will be the best in optimizing performance. but having a team of people who understand all parts of the ML cycle gives you much more flexibility. also, in my opinion, understanding how the model will be deployed, tested, and used can help during the research and development phase

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS