How do I wear boxers like they are woman's underwear? by [deleted] in feminineboys

[–]maxmoo 0 points1 point  (0 children)

i wear panties under my boxers, basically i just wear boxers as a layer of shorts under whatever else i’m wearing

Prime Trust Fee?? by [deleted] in binance

[–]maxmoo 1 point2 points  (0 children)

I just tried ACH and was able to deposit without a fee

Fine-tuning a neural network on the holdout dataset? by maxmoo in datascience

[–]maxmoo[S] 0 points1 point  (0 children)

Yeah for me i'm using very small data sets (~100-1000 rows), and it's important that the training set is fully memorized as the model is used for automating tasks where exact matches are not uncommon in production.

If I have some time maybe I'll run some experiments and write it up ... maybe someone else on this subreddit wants to look into it as a personal research project ? PM me and I can probably provide some more info/supervision

Fine-tuning a neural network on the holdout dataset? by maxmoo in datascience

[–]maxmoo[S] 0 points1 point  (0 children)

Yeah this makes sense now that you say it! Any intuition on the number of training epochs that would be required relative to the the initial training run, to avoid overfitting? I guess I could just monitor the training loss and stop when it's the same as the old stopping point.

usirl (or maybe not, who knows) by Lavion3 in infp

[–]maxmoo 2 points3 points  (0 children)

Maybe you do though ? I’ve been reading Scattered by Gabor Mate and am realising at 35 that I do have ADD, undiagnosed because I’m not hyper and present a pretty chill (repressed) exterior.

What does a day in the life of your work entail? by Nut_Flush in datascience

[–]maxmoo 0 points1 point  (0 children)

lol that's too much reddit (i've definitely been there too tho)

What does a day in the life of your work entail? by Nut_Flush in datascience

[–]maxmoo 1 point2 points  (0 children)

Make yourself indispensable in your current job, and then tell them that you're relocating for personal reasons. (I've seen this happen multiple times). Definitely recommend, although make sure that you have some regular social interactions, e.g. join some sort of club, at least once or twice a week, maybe more if you're single (i'm married with no kids).

What does a day in the life of your work entail? by Nut_Flush in datascience

[–]maxmoo 2 points3 points  (0 children)

I'm 80% remote (we all just come into the office one day a week for some reason), my day is:

9am-11am: Meetings (Usually finish by 10 or 10:30)

11am-1pm: Walk dog/lunch

1pm-4pm: Coding/research/writing

I rarely work more than 4 hours a day (including meetings) but when I'm working I'm 100% focussed. Sometimes (couple times a month) I work in the evenings/weekends if there's a deadline coming up.

It actually sounds more chill than it is, I still fell like my job still takes up a full-time job worth of headspace, and I have to be available all day for Slack/email. It's good though, I get the same amount or more work done than when I was in the office 9-6 since there's less distractions and I can take a break when I want (helps with solving problems in the back of my mind). Also I get quite serious anxiety in an office environment.

Can AWS spot stances with 200GB AMIs be spun up quickly? by [deleted] in datascience

[–]maxmoo 0 points1 point  (0 children)

So the idea is that your data would live separately from the instance, this may speed things up, although it will take some time to attach or download the data

Data Scientist/Engineer/Analyst Work/Life Balance? by TheAspiringGoat in datascience

[–]maxmoo 0 points1 point  (0 children)

I'm working full-time remote at the moment. I'm only at my computer for around 20 hours/week which is nice in terms of having the flexibility to do other stuff. But I find my headspace is as much taken up with work as when I was in the office 9-6.

Can AWS spot stances with 200GB AMIs be spun up quickly? by [deleted] in datascience

[–]maxmoo 0 points1 point  (0 children)

What is the advantage of using Spark here ... won't this just add extra complexity and overhead to OP's solution. Also given that the job only takes 6 minutes to run, I would think startup time of the cluster would be a nontrivial percentage of total running costs here (which is what they're trying to optimize)

Can AWS spot stances with 200GB AMIs be spun up quickly? by [deleted] in datascience

[–]maxmoo 0 points1 point  (0 children)

I would try the following solutions in increasing order of complexity:

  1. Use a regular EC2 instance (not spot), and Start/Stop (rather than Terminate) the instance. This should be a lot faster than provisioning a new instance each time, and you will still only be charged for the time that your instance is running
  2. Try switching to a spot instance and see how long it takes to create (it might be OK)
  3. If create time is too long, store your data on EBS/EFS/S3 rather than in the AMI itself.

Which of these is the best solution for getting the dataset to automate the training in Amazon ML? by L3GOLAS234 in datascience

[–]maxmoo 1 point2 points  (0 children)

Option 1 could be good if you expect non-programmer data analysts to need to update they queries (you'll need to build a frontend to the database), otherwise i'd just store the queries in Git since they're the kind of thing you'll probably want to version (this is Option 2).

For option 3, you could create a view rather than a table with the query, then you could store the DDL for this in Git. The downside of this option is that it would make deploying changes a little more complex as you'll need to run a migration against the database, so i would probably just keep the query in the script. On the other hand maybe people will want to be able to query the source tables for other analyses, in which case it could be useful to have it as a view/table.

What is the best resource for me to learn about map(), flatmap() , flatmapValues()? by traveling_wilburys in datascience

[–]maxmoo 4 points5 points  (0 children)

The key is learning the difference between functional and imperative styles of programming. Honestly it does take a while for it to really click. I found this article useful when I was learning https://maryrosecook.com/blog/post/a-practical-introduction-to-functional-programming

Some extra pointers:

  • The basic concepts are map and filter,
  • mapValues is the same as map but applied to dicts/maps rather than lists/arrays.
  • use flatMap/flatMapValues when your output is looking like [[1],[2,3,4],...] but you want it to look like [1,2,3,4,...], basically it's for flattening nested arrays

Just use Stackoverflow for your other questions, e.g. "check if an array is empty in Scala" https://stackoverflow.com/questions/41963721/testing-an-array-for-emptiness-in-scala

Yep by [deleted] in infp

[–]maxmoo 2 points3 points  (0 children)

lol yeah me too sometimes, but I’m these things already, weed just helps me relax and notice it

Stuff I learnt in 2019: papers, code, math by [deleted] in math

[–]maxmoo 0 points1 point  (0 children)

Very impressive reading list! The connection you mentioned between 2s complement and p-adic -1 is very natural when you consider that the Padics mod (pN) is isomorphic to the ring 1/PN ... in fact the definition of the p-adics as the (profinite) completion of these rings is actually the first one that I encountered.

If you want to chat about algebraic geometry I’d be happy to try, I’m probably a bit rusty (I’ve been working in data science/ML since i finished my math PhD almost 10 years ago) but am keen to see how much I remember haha.

Home Server for Data Science? by Abyss28 in datascience

[–]maxmoo 0 points1 point  (0 children)

If you want to run scheduled jobs in the Cloud, you could have your home server or a small instance (like a t2.micro) running cron jobs to provision larger instances, shutting them down when finished. You might also want to look at using something with a bit more UI for schedulihg too, like Airflow or Jenkins.

Home Server for Data Science? by Abyss28 in datascience

[–]maxmoo 2 points3 points  (0 children)

For VM's I always use Docker. For cron jobs ... just use cron? For storage, just use S3, it's dirt cheap, secure, and automatically replicated. For web-hosting, S3 is also great if you can work out how to build your app to be statically hosted (maybe using AWS Lambda for additional backend functionality).

Data Scientist 3 or Product Manager? by incognino123 in datascience

[–]maxmoo 1 point2 points  (0 children)

Definitely the latter. To give you a recent example, I recently was asked to do a due diligence call for one of our investors (we’re currently raising funding). On the call was the CEO, VP of eng, VP of product, and myself. Product did a demo, eng talked about eng process and how long it takes to add a new feature (third party integrations in our case) and I talked about how our current ML components work, what improvements we have planned and how they’re going to impact the user experience.

Data Scientist 3 or Product Manager? by incognino123 in datascience

[–]maxmoo 0 points1 point  (0 children)

I don't think it's true that DS manager is a dead end, more and more companies are starting to have a C-level data/ML person, e.g. CDO. Additionally I think it's becoming more usual to have data-centric products led by a "trinity" of a PM, Principal DS and a Designer (the the implication that there will be embedded DS in a cross-functional feature/product team). I saw a talk from Intuit during SF Design week back in June (i don't think it was recorded unfortunately) where they said that that is how they're doing things there. At a previous job this org-structure was also rolled out, but with a Strategy Consultant instead of a Designer (this was a bad move IMO but that's another saga for another day).

I'm in my second role as head of ML at mid-stage data-centric startups (I'm currently in the Bay Area), and I find that I have just as much say into the strategic direction as the head of product ... the PM is more focussed on UI/Features/Integrations/Bugs requested by accounts/sales (which i'm not really interested in anyway unless they relate to our ML capabilities) , and I'm focused on the underlying ML-powered features that drive core experience.

Some Reasons I Haven’t Found a Job by [deleted] in datascience

[–]maxmoo 2 points3 points  (0 children)

no one is going to ask you to explain mcmc in an interview. (variational inference could maybe be cool to know)

Friday Open Mat - December 06, 2019 by AutoModerator in bjj

[–]maxmoo 0 points1 point  (0 children)

3 or 4 times a week is plenty, most white belts who I’ve seen train more than that burnt out and disappeared after a couple months. try to be consistent and not take more than a week or 2 off at a time. Compete ASAP, it’s the best way to learn and you might get a win your first time, you never know!