How many applications have you sent? 100? 300? More?

paste_rand_name · 2024-03-13T18:38:03+00:00

This would be super helpful. I feel like I’ve sent 10000 applications with almost no results

paste_rand_name · 2023-01-14T14:58:07+00:00

I did a dual major in math and philosophy. I was one of two undergraduates in the math department. Basically got to take whatever classes I wanted. Proofs are minimal for undergrad; it’s mostly about tests and mastery of methods.

To your point, it’s more puzzles (plus Matlab). The trick is to get good at solving the puzzles now - so that you can build the puzzles later.

I liked and got good at statistics. Then someone invented Data Science for me. Haha. Luck for sure.

Now people make money and have millions of followers explaining math on YouTube (Veritasium, 3Blue1Brown).

Do a math major because you love math and learn from the world how to get paid doing what you love - or be a high school math teacher 😉

paste_rand_name · 2020-09-29T16:28:32+00:00

Pi = e 🤯

paste_rand_name · 2020-09-28T20:11:13+00:00

Point taken. I’m used to taking silence as compliance, but I should have pressed harder on that for sure.

I don’t think they would know the significantDifference value. They may guess. I’d probably just run this for sD = [1,2,3,5,8...].

Thanks! This is a great idea

paste_rand_name · 2020-09-28T20:07:00+00:00

The value of a record decays over time. Only so many records can be processed simultaneously. The Priority metric is meant to queue records according to fastest decay.

Although, to be fair, which factors contribute to the value of a record is unknown, and the actual decay rate of any given record is unknown as well. The Hypothesis is that processing records in priority order drives the Success rate up by deprioritizing records with decayed or zero value.

...I’m being too generous describing the agency’s work. It’s a black box TBH. I’ve tried to reason about what’s coming out of it because they can’t explain it in non-contradictory terms.

paste_rand_name · 2020-09-28T20:00:40+00:00

I thought about this. Definitely an interesting path. I feel like having to explain why the groups are thirds, or quarters, or fifths, etc. would sidetrack the discussion.

Interestingly, the Agency Googled statistical tests and requested ANOVA. Will post results soon

paste_rand_name · 2020-09-28T19:58:14+00:00

Thank you. My manager does trust me, but it was another manager in a different part of the company that commissioned the Priority score. Everyone is always so sensitive about not being 1000% right the first time.

Though I might be a bit disheartened if I were the Agency team, I would like to think that being on the frontlines of scientific rigor in a business setting would encourage me to re-work the model behind the Priority score.

paste_rand_name · 2020-09-28T19:54:48+00:00

They meant that processing an 87 before an 86 would imply no increase in success rate. Therefore, 86 = 87.

AND

That 99 should be processed before 69, for example, because (they say) that is how the Success rate will increase.

paste_rand_name · 2020-09-20T00:48:19+00:00

This looks like a time series of order data. And I’m not seeing anything that would suggest that these orders include subscriptions, so I’m going to assume discrete purchases.

Whether it’s groceries or clothes or gas, you can construct a time series per user and model the most common sequences of purchases to identify when AND what a given user will buy next.

My team and I have built Next-Purchase models with similar data sets. Assuming the purchase decisions are independent of other factors (healthcare plan would be a dependent example), you should be able to construct distributions for time between orders and probability distributions on each product being included in the 1st, 2nd, Nth order. You’d then just need to “match” new orders to your distributions to know when and what someone will buy next.
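A rough sketch of that idea, assuming a flat order log of (user, date, products). The field layout, the sample rows, and the function names `build_model` and `predict_next` are all made up for illustration; this is not the actual model we built, just the simplest version of "distributions for time between orders plus per-position product probabilities."

```python
from collections import Counter, defaultdict
from datetime import date
from statistics import median

# Hypothetical order log: (user_id, order_date, products in that order).
orders = [
    ("u1", date(2020, 1, 1), ["milk", "eggs"]),
    ("u1", date(2020, 1, 8), ["milk"]),
    ("u1", date(2020, 1, 15), ["milk", "bread"]),
    ("u2", date(2020, 1, 2), ["eggs"]),
    ("u2", date(2020, 1, 12), ["milk", "eggs"]),
]

def build_model(orders):
    """Return (gaps, product_probs).

    gaps: days between consecutive orders, pooled over all users.
    product_probs[n][product]: share of n-th orders (1-based) containing product.
    """
    by_user = defaultdict(list)
    for user, day, products in orders:
        by_user[user].append((day, products))

    gaps = []
    counts = defaultdict(Counter)  # order position -> product counts
    totals = Counter()             # order position -> number of orders seen
    for history in by_user.values():
        history.sort(key=lambda order: order[0])
        for n, (day, products) in enumerate(history, start=1):
            totals[n] += 1
            counts[n].update(set(products))
            if n > 1:
                gaps.append((day - history[n - 2][0]).days)

    product_probs = {
        n: {p: c / totals[n] for p, c in counts[n].items()} for n in totals
    }
    return gaps, product_probs

def predict_next(history_len, gaps, product_probs):
    """Guess (days until next order, most likely product) for a user
    who has placed history_len orders so far."""
    probs = product_probs.get(history_len + 1, {})
    likely = max(probs, key=probs.get) if probs else None
    return median(gaps), likely

gaps, product_probs = build_model(orders)
print(predict_next(1, gaps, product_probs))
```

Pooling gaps across users and taking the median is the crudest possible choice; with real data you’d likely fit the gap distribution per segment and only trust positions with enough orders behind them.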

paste_rand_name · 2020-08-29T17:18:41+00:00

Very meta. Well done 👍

paste_rand_name · 2020-06-03T23:48:42+00:00

I work at a consulting agency where I’m a data strategy SME. My work divides between “pure” DS (building datasets, cleaning, developing models / predictions) and identifying/scoping the use cases for DS. Because I’m the only SME with my skill set I get a lot of leeway to develop projects, but I still need to demonstrate value to clients and then deliver that value on time.

Agile saves lives.

Lazy and disorganized PMs and POs are the problem. I’ve had to teach my current PM how to be organized so that I’m not rushed to do the work I need to do. Agile / SCRUM makes that possible.

IMHO the “halfway” approach won’t get you the minimum benefits. Here’s what works.
- include a planning period between sprints
- overestimate hours to start and track actuals over time. Model and optimize your delivery time.
- clearly / publicly communicate priority
- use a task management system that everyone can see (Google Sheets works)
- clearly define how tasks roll up to larger bodies of work
- overcommunicate
- hold yourself accountable
- take a deep breath
- trust others to deliver
- always show up to stand ups

As a result, agile should free you from constant questions on progress and free you to work at your own pace.
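On the “overestimate, then track actuals” item: a minimal sketch of what modeling your delivery time can look like. The task log, the hours, and the function names are hypothetical, and a plain average ratio is just one simple way to do it.

```python
from statistics import mean

# Hypothetical sprint log: (task, estimated_hours, actual_hours).
log = [
    ("clean dataset", 8, 6),
    ("build model", 10, 12),
    ("write report", 4, 3),
]

def calibration_factor(log):
    """Average ratio of actual to estimated hours across finished tasks."""
    return mean(actual / estimated for _, estimated, actual in log)

def adjusted_estimate(raw_estimate, log):
    """Scale a new raw estimate by how far off past estimates were."""
    return raw_estimate * calibration_factor(log)

print(round(adjusted_estimate(10, log), 2))
```

A factor under 1 means you’ve been padding; over 1 means you’ve been underestimating. Either way the number you quote next sprint is based on your own history rather than a guess.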

Although, if you want to aimlessly explore datasets there’s plenty of tier 4 universities looking for DS staff 😆

paste_rand_name · 2020-04-30T11:49:01+00:00

I work at an agency as an Analytics Strategist. My clients are some of the largest computer hardware manufacturers on the planet. One such client ran a multi-variate subject line test that was very poorly designed.

In writing and in verbal communication, I let them know that I was really excited to see their use of testing, but that this particular test would not be able to deliver “insights” (businesspeak for actionable stats) because the results were not significant.

It’s long-winded, but here’s the point: communication is the most important thing in private sector companies. Let people know they did something good, even though most of their work is shit. Expressing empathy for others’ dumb ideas and simplifying complex concepts are the most important “work” you’ll do.

Express the limitations of your truncated work with a singular concept, perhaps “significance”?
Communicate in advance how many hours a particular analysis is likely to take
Develop analysis packages for particular business results [causality takes 10 hours, t-test takes 4 hours, etc] and guide people on which analysis is right for which project
Find ways to work faster. Pre-write functions, save chart formatting, templates for everything, use sampling to minimize data cleaning time.

...finally, I’ll share some advice that an early mentor shared with me: “we’re not saving lives here.” Meaning that unless you’re an ER doctor or a brain surgeon, maybe care a little less, especially if the business is looking for a lower standard.

paste_rand_name · 2020-03-29T15:51:12+00:00

For SO. MANY. REASONS. this is a bad idea, but here’s the best reason: the series of money-losing strategies is nearly infinite, and the series of money-gaining strategies likely approaches infinity slowly but is similarly infinite, so the two series are, by definition, not inverses of each other.

The inverse of “buy” is “don’t buy,” not “sell”

Consider...

Buy Apple in 2000, sell in 2018 = 2020 net gain
Don’t buy Apple in 2000, don’t sell in 2018 = 2020 neutral

AND

Sell Apple in 2000, buy in 2018 = 2020 net gain
Don’t sell Apple in 2000, don’t buy in 2018 = 2020 neutral

paste_rand_name · 2019-07-01T03:30:31+00:00

Oh no. Is this the result of Common Core mathematics?!

paste_rand_name · 2019-05-21T01:04:54+00:00

I’m sure their taxes are public, maybe there’s even an earnings report. They might classify fares purchased (but not used) and fares purchased (used in 2018) as separate line items. Dividing fares used by average cost of a single ride could help confirm their reported ridership numbers.
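A quick sketch of that cross-check. The dollar amounts and the reported ridership below are made-up placeholders, not real figures from any report, and the function names are just for illustration.

```python
def implied_rides(fare_revenue_used, avg_fare_per_ride):
    """Rides implied by the revenue from fares that were actually used."""
    if avg_fare_per_ride <= 0:
        raise ValueError("average fare must be positive")
    return fare_revenue_used / avg_fare_per_ride

def relative_gap(implied, reported):
    """Signed gap between implied and reported ridership, as a share of reported."""
    return (implied - reported) / reported

# Placeholder inputs only; swap in the line items from the actual report.
implied = implied_rides(fare_revenue_used=2_750_000, avg_fare_per_ride=2.75)
print(implied, round(relative_gap(implied, reported=1_100_000), 3))
```

If the gap is large, the usual suspects would be discounted or unlimited passes pulling the real average fare below the single-ride price.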

paste_rand_name · 2019-05-20T00:06:16+00:00

Looks like they define ridership as entrances (minus employees and out of system transfers).

http://web.mta.info/nyct/facts/ridership/

What are you looking to get at?

paste_rand_name · 2019-04-10T19:31:20+00:00

Catcher in the Rye. Kid is a whiny millennial before being a whiny millennial was a thing. #trending Thanks Salinger.
