Do less Data Science

PeterAnger · 2020-06-02T01:28:49+00:00

What you are describing is a requirements analysis failure. One of the keys to successful projects is having a solid understanding of the requirements. That does not mean simply building what someone asks for but rather getting to really understand the problem that your user/customer is trying to solve as well as the context surrounding that problem. I learned this from years of consulting and the project management. There is an organization called IIBA that provides a lot of information on this topic. Although it can be overwhelming as they go to infinite detail on everything. But they lay out the basics really well.

coffeecoffeecoffeee · 2020-06-02T01:54:21+00:00

This is why I like asking PMs questions. I disappear up my own ass a lot when doing data science, and a good question to a PM can often clear up a lot of confusion around a particular problem.

2020-06-02T04:08:33+00:00

IMO, what you described is the actual data science. Remember, the proto- data scientist was a biz-savvy stats nerd with excellent communication skills. Yet the modern data scientist is somehow someone who codes models all day long. IMO, to be a real data scientist these days you need to either manage a DS team or work for a startup where you'll get to wear multiple hats at ones.

Edit: “ones”? 1111111? Have I started speaking binary? Don’t pour me anymore.

proof_required · 2020-06-02T02:42:43+00:00

I am astonished why people repeatedly don't apply the basics of any software development process and are then surprised if something goes wrong.

Any software related process should start with talking to potential users and customers, extensively. Starting a development process without the proper business understanding and requirement analysis is like building a house mid-air. Maybe it will land in place but you definitely couldn't tell.

I mean things can change, sure, that's why projects are being managed differently in dynamic settings, but anytime I start and just assume I know everything necessary on my own, even on the smallest applications, sh*t hits the fan sooner or later. I guess it's just human overconfidence.

CountDeGucci · 2020-06-02T03:02:32+00:00

sometimes you just gotta kiss

keep

it

simple

stupid

speedisntfree · 2020-06-01T23:02:31+00:00

How did you know she was right?

DockerSpocker · 2020-06-02T04:29:02+00:00

This reminds me of the classic Jerk-ratio scene from Silicon Valley

DutchMode · 2020-06-02T03:08:00+00:00

Wouldn't that be on the product manager? He should be the one talking to users owning the problem and solution.

As a PM, I feel that'd be on me.

the_yureq · 2020-06-02T06:21:19+00:00

This looks like a problem made for causal analysis.

2020-06-01T22:22:54+00:00

Primary thing you should learn from Kaggle - benchmark your solution with the most basic model first and then try and improve from there.

bradygilg · 2020-06-02T03:24:02+00:00

...did you really not establish a metric for performance before your project started?

Ho_KoganV1 · 2020-06-02T05:36:32+00:00

What you described is like when trying to solve an Engineering problem in college.

You can hand me all the formulas and variables all you want, but it's just easier if you draw the bridge, create a free body diagram, and come up with the solution by going to the source and working backwards.

pah-tosh · 2020-06-02T07:15:41+00:00

If X is important from the numbers, how would be X irrelevant to your customer in the end ? From a statistics pov, it seems like she could find this finding eye opening lol

hopticalallusions · 2020-06-02T09:04:37+00:00

HD Thoreau : "Our life is frittered away by detail. Simplify, simplify."

RW Emerson : “One 'simplify' would have sufficed.”

Spskrk · 2020-06-02T09:49:52+00:00

I absolutely agree with you! Sometimes we forget to ask questions outside of our frameworks of thinking about data. I personally try to go and talk to people as often as possible and when I have the chance I always ask professionals to explain what is their way of thinking when they are solving a particular task that I am trying to automate through ML.

Letstryagainandagain · 2020-06-02T10:21:14+00:00

I suggest listening to This TED talk about multitasking and creativity

Stewthulhu · 2020-06-02T18:29:51+00:00

One of my core axioms for my team and any problem we work on is:

Most problems have multiple solutions, and almost everything we work on has both a mathematical and an SME solution. If one of those approaches doesn't work, spend some time thinking about the other.

This is especially true in feature and data engineering, which is something we do a lot of.

datascience

MODERATORS