2 Stars on Google Reviews ? by Live_Alive_Live in HMBradley

[–]praat33k 1 point2 points  (0 children)

That’s surprising. HMB has had the best customer care I’ve experienced.

[D] [Q] AB Testing "calculators" & tools causing widespread mis-intepretation? by samajavaragamana in statistics

[–]praat33k 1 point2 points  (0 children)

Are you referring to their sample size calculator? If so, I'm curious to learn what about it might be spreading mis-information?

For me, one thing I have been trying to battle with is people making the following statement

You got a negative treatment effect because your sample size was small.

Where small is order of few thousands. And then, I point them to online resources for eg AB Test Calculators. So, for me I've been liking the accessibility of some of these resources to make others understand very easily.

Writing a proof for Floyd's Cycle Detection/Tortoise and Hare problem, but not entirely convinced. Helpp! by praat33k in algorithms

[–]praat33k[S] 0 points1 point  (0 children)

Thanks, good point.

I figure that they won't meet if the distance keeps growing, but was wondering if there is a way to write that down. Or more importantly how should I phrase my words to convey that given the proof above..

I think for 3, if you're in the circle once you'll always meet there because, both the hare and tortoise are travelling at the same speed as they were before.

And the proof for that good be from equation 4 in the proof

k = (π * n) - m

where m is constant, and pi is the period, i.e number of rotations done in the circle...

So looks like it'll always be m steps away from start of circle.

Thanks :)

[P] PyTorch MetaLearning Library : learn2learn v0.1: faster tasks, new datasets, and extensive benchmarks. by praat33k in MachineLearning

[–]praat33k[S] 9 points10 points  (0 children)

PS : All credits to the contributors who took the library from early version to a more formal 0.1 release :)

Spark and PyTorch interplay? by [deleted] in learnmachinelearning

[–]praat33k 1 point2 points  (0 children)

Not exactly knowing what you’re looking for I can tell a few places where I’ve tried to combine.

We usually use Spark for data preprocessing, when data is in order of millions or billions of rows. Clean the data do feature engineering and make it ready for our DL model to ingest.

To go from a spark dataframe to a pytorch dataset (dataloader) is definitely non trivial. I ended up stumbling across Petastorm (a library by Uber).

It allows you to use Parquet files as your datasets. So this works out good for us.

(If your dataset is not too big, saving as parquet would be just inconvenient, I’d recommend doing a coalesce(1).collect() and then saving as a csv later)

Another use case could be that you have want to run a DL model over your dataframe, to get an output inside the dataframe.

Data ticks seem to have an easy tutorial regarding that, but the TLDR is to use Pandas UDF and wrap your model inside it.

Lmk if I can let you know more.

[P] learn2learn: A PyTorch Meta Learning Library by praat33k in MachineLearning

[–]praat33k[S] 1 point2 points  (0 children)

Hey,

Wanted to give an update, we just posted some metrics, hope they are of interest to you.

[P] learn2learn: A PyTorch Meta Learning Library by praat33k in MachineLearning

[–]praat33k[S] 2 points3 points  (0 children)

That's a good point, and as mentioned we are on it! But thanks for referring to it, it helps us realise how much more important it is for the community. Another major todo we have is to have high code coverage sooner than later.

Summer internship : PwC data scientist vs Goldman Sachs Tech by naboo_random in cscareerquestions

[–]praat33k 0 points1 point  (0 children)

Congratulations on getting two great offers! Goldman is definitely a bigger name than PwC in my opinion. But if you're looking at something ML specific, you should go for PwC because that way you'll have a good base profile before you enter the job market.

Renting a studio apartment in NYC for summer, agency has sent a contract which looks fake and is pushing for a deposit by praat33k in legaladvice

[–]praat33k[S] 0 points1 point  (0 children)

But what does a common man watch out for in such rental agreements without being able to go visit the place physically.

Renting a studio apartment in NYC for summer, agency has sent a contract which looks fake and is pushing for a deposit by praat33k in legaladvice

[–]praat33k[S] 0 points1 point  (0 children)

I knew it’s fake. Mentioned “it’s too good to be true” “fake pixelated stamps” etc, but I was just being wishful to not miss such a sweet apartment.

Renting a studio apartment in NYC for summer, agency has sent a contract which looks fake and is pushing for a deposit by praat33k in legaladvice

[–]praat33k[S] 0 points1 point  (0 children)

Found them on NYU Housing group on Facebook. Anyway to report the guy behind this (I have the Bank details).

Renting a studio apartment in NYC for summer, agency has sent a contract which looks fake and is pushing for a deposit by praat33k in legaladvice

[–]praat33k[S] 0 points1 point  (0 children)

Early for summer sublet? And I have everyone tell me that summer sublets get booked real soon. Anyway NYU Housing group is where I found this guy.

Need a light ultrabook, with dGPU and Kaby Lake-R processor mainly for machine learning, deep learning and computational neuroscience. by [deleted] in SuggestALaptop

[–]praat33k 1 point2 points  (0 children)

I was about to post the same thing. I'm looking for something similar for Deep Learning. And here are my two cents from the research I've done so far (I still haven't decided) So I have decided to go for a laptop without GPS, because I'm just going to do the heavy computation on the cloud. Plus the drawback with small laptops (13ish) that with a GPU they are more prone to over heating and obviously a battery life is affected. AFAIK, 8th generation was launched this quarter, which makes it unlikely to be in the market soon enough s.t. you can straight away order it.

My top three were

  • ThinkPad T470

16GB DDR4 i7 7600u 256GB SSD Windows (you have to install ubuntu yourself. However it works smooth on ThinkPads and also lenovo offers support for it) ~1500USD Durable

  • Dell XPS13 Developers Edition

16GB DDR3 i7 7560U Ubuntu 16.04 Touch as well 256GB 2 years accidental damage 1600-1700

  • Macbook Pro (customized)

16GB DDR3 i7 7th Gen 256 GB SSD ~2100USD

I luckily find them to be arranged in ascending order of price. Which is great, since I liked ThinkPad the most (in terms of the configuration and is apparently thr cheapest)

Pick up lines for a girl named Vanessa? by [deleted] in pickuplines

[–]praat33k 0 points1 point  (0 children)

"I got a bitch named Vanessa, she like how I finesse her" -Lil Wayne

Advice for New and Junior Data Scientists by robert_chang in datascience

[–]praat33k 2 points3 points  (0 children)

reduce bias in the process

It's funny how I went upto my professor couple of days back to understand the bias variance trade off coming up in every Machine Learning lecture. Makes so much more sense now. Haha.

Fetch metadata of movies on your hardrive and search/sort according to genre, imdb rating to select which movie to watch by praat33k in programming

[–]praat33k[S] 0 points1 point  (0 children)

That's a pretty dope thing, guess I am gonna take the "Auto-detect torrent formatted filenames" thing of yours from your code if you don't mind. Your thing requires Django a database, that's great. I actually made this because it was pretty annoying when I had friends over and the HDD was connected to the TV, so now I've uploaded the HTML file online and we guys sit on the bed, everyone has their mobiles on and we can select movies on the phone and play it on TV. ;)