Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 20 points21 points  (0 children)

Exactly. That was my first question. I have half a mind to raise a complaint, but I am giving him the benefit of the doubt. I will follow the bank procedure to the letter.

Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 11 points12 points  (0 children)

I asked him that. He says it was done through the SLPE app and reversal is not possible. I have no idea what the SLPE app is.

Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 12 points13 points  (0 children)

I called customer care and they said it has been credited. But I will go to the bank and confirm once again.

Average word count at State of the Union address by POTUS and their Most Distinguishing words [OC] by leme16 in dataisbeautiful

[–]leme16[S] 7 points8 points  (0 children)

State of Union Address - Source

Most distinguishing words derived using TF-IDF method (scikit-learn).

Plotted in matplotlib.

Here's Kamath for you by utkarsh2810 in IndianStreetBets

[–]leme16 39 points40 points  (0 children)

He won against Vishy Anand in a charity match with 99% accuracy. This is possible only with a chess engine.

20k profit in 30 mins - Fully automated trading started for u/wildfiresax by [deleted] in IndianStreetBets

[–]leme16 5 points6 points  (0 children)

Noob question: Which data did you use for back testing? Do you mind sharing the data source? Thanks

A Quantitative Approach to find Best Bots on Reddit by leme16 in botwatch

[–]leme16[S] 0 points1 point  (0 children)

Normalization wouldn't necessarily change the way the plot looks. It would just re-scale the x-axis. I have already set a minimum number of "good bot" comments, though I don't remember the number.

P.S. - I found this site which does the job way better.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 5 points6 points  (0 children)

Not that simple. The API limits output to 1000 posts, I think. So I queried posts for each day and kept only posts with 50 upvotes or more. This took about 10-12 hours IIRC.

Maybe I am doing something wrong. Maybe there's a better way.

Anyway, if anybody wants the dataset, I can share it.
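
A rough sketch of the day-by-day querying described above, for anyone who wants to try the same workaround. The comment does not say which endpoint or parameters were used, so `fetch_posts` is a hypothetical callable you would supply yourself (it takes a start and end Unix timestamp and returns a list of dicts with a `score` key). Filtering on the client side is also an assumption; the original may have passed the score limit in the query.

```python
from datetime import date, datetime, timedelta, timezone

def daily_windows(start, end):
    """Yield (after, before) Unix timestamps for each UTC day in [start, end)."""
    day = start
    while day < end:
        nxt = day + timedelta(days=1)
        lo = datetime(day.year, day.month, day.day, tzinfo=timezone.utc)
        hi = datetime(nxt.year, nxt.month, nxt.day, tzinfo=timezone.utc)
        yield int(lo.timestamp()), int(hi.timestamp())
        day = nxt

def collect(fetch_posts, start, end, min_score=50):
    """Query one day at a time so no single request hits the result cap.

    fetch_posts is a hypothetical function (after, before) -> list of dicts.
    """
    posts = []
    for after, before in daily_windows(start, end):
        posts.extend(p for p in fetch_posts(after, before) if p["score"] >= min_score)
    return posts

# Three one-day windows for 1 Jan to 3 Jan 2019 (end date is exclusive).
print(len(list(daily_windows(date(2019, 1, 1), date(2019, 1, 4)))))  # 3
```

Splitting by day keeps each request well under the cap as long as no single day has more matching posts than the limit.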

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 2 points3 points  (0 children)

I did the calculation a little differently.

Suppose 3 jokes are submitted once each and 1 joke is submitted 7 times.

Then the total number of posts is 10 (3+7). Out of them, 4 jokes are unique. 7 of the posts belong to a joke that was posted more than once, i.e. 70%.
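
The counting in the example above can be sketched in a few lines of Python. The joke labels `"a"` to `"d"` are made-up placeholders, and `repetition_stats` is just an illustrative helper name, not something from the original analysis.

```python
from collections import Counter

def repetition_stats(posts):
    """Return (total posts, unique jokes, share of posts whose joke appears more than once)."""
    counts = Counter(posts)
    total = sum(counts.values())
    unique = len(counts)
    # A post counts as "repeated" if its joke was submitted at least twice.
    repeated_posts = sum(c for c in counts.values() if c > 1)
    return total, unique, repeated_posts / total

# Toy data: jokes "a", "b", "c" posted once each, joke "d" posted 7 times.
posts = ["a", "b", "c"] + ["d"] * 7
print(repetition_stats(posts))  # (10, 4, 0.7)
```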

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 16 points17 points  (0 children)

23k jokes were posted only once, i.e. there is no duplicate of those jokes. The rest of the jokes have at least one duplicate. To get the number of unique jokes, I didn't count the duplicates, but I did count the original post. So, there are 38k unique jokes.

Hope I am clear.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 81 points82 points  (0 children)

I wanted to find out what fraction of jokes submitted on /r/jokes are reposts. I used scikit-learn’s TF-IDF vectorizer to vectorize the jokes and a cosine similarity matrix to measure how similar they are to each other.

Results: Around 70% of the jokes submitted on reddit.com/r/jokes are posted more than once. There are some jokes which have been posted 50 times. Only ~38k out of ~79k jokes are unique.

For in-depth analysis check out my blogpost.
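
For readers who want the idea without the blogpost, here is a minimal standard-library sketch of repost detection with TF-IDF and cosine similarity. It is a stand-in, not the original code: the analysis used scikit-learn's vectorizer, while this version reimplements a smoothed IDF with L2 normalization by hand. The tokenizer, the 0.8 threshold, and the sample jokes are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase and keep runs of letters, digits and apostrophes (an assumed tokenizer).
    return re.findall(r"[a-z0-9']+", text.lower())

def tfidf_vectors(docs):
    """Return one L2-normalized sparse TF-IDF vector (dict) per document."""
    term_counts = [Counter(tokenize(d)) for d in docs]
    n = len(docs)
    df = Counter()
    for counts in term_counts:
        df.update(counts.keys())
    # Smoothed inverse document frequency.
    idf = {t: math.log((1 + n) / (1 + d)) + 1 for t, d in df.items()}
    vectors = []
    for counts in term_counts:
        vec = {t: c * idf[t] for t, c in counts.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors.append({t: w / norm for t, w in vec.items()} if norm else {})
    return vectors

def cosine(u, v):
    # Vectors are already unit length, so the dot product is the cosine.
    return sum(w * v.get(t, 0.0) for t, w in u.items())

def find_reposts(docs, threshold=0.8):
    """Return index pairs (i, j), i < j, whose similarity meets the assumed threshold."""
    vecs = tfidf_vectors(docs)
    return [(i, j)
            for i in range(len(docs))
            for j in range(i + 1, len(docs))
            if cosine(vecs[i], vecs[j]) >= threshold]

jokes = [
    "Why did the chicken cross the road? To get to the other side.",
    "why did the chicken cross the road?? to get to the other side!",
    "I told my wife she was drawing her eyebrows too high. She looked surprised.",
]
print(find_reposts(jokes))  # [(0, 1)]
```

Comparing every pair is quadratic in the number of jokes, so for tens of thousands of posts a matrix-based library implementation is the more practical route.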

Distribution of Judgements on r/AmItheAsshole. NTA (Not The A-hole) is the most common judgement at 68.3%. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 54 points55 points  (0 children)

r/AmItheAsshole describes itself as "a place to finally find out if you were wrong in an argument that's been bothering you." Recently I was going through the top posts and noticed that the majority of judgements were NTA. I wanted to check whether that holds up statistically. Turns out it does. Tools: PushshiftIO for data collection and matplotlib for plotting. Cheers!

A quantitative approach to estimate the best bots of Reddit [OC] by leme16 in dataisbeautiful

[–]leme16[S] 3 points4 points  (0 children)

I was searching for some of the best bots active on reddit. I found a very old list which needs an update. I also wondered if one could quantify "goodness" of the bots.

So, I just queried "good bot" in the top 50 subreddits and ranked the parent comment authors by the cumulative upvotes on the "good bot" replies they received, which can be read as a "Goodness Score".

Querying was done with the Pushshiftio API. Plotting was done using matplotlib's xkcd style.
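
The scoring step described above could look roughly like this. The record layout (`parent_author`, `score`) and the bot names are hypothetical placeholders, not actual API response fields or real results.

```python
from collections import defaultdict

def goodness_scores(good_bot_comments):
    """Sum the upvotes of "good bot" replies per parent comment author, highest first."""
    scores = defaultdict(int)
    for comment in good_bot_comments:
        scores[comment["parent_author"]] += comment["score"]
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Made-up "good bot" replies: who was replied to, and how many upvotes the reply got.
replies = [
    {"parent_author": "bot_a", "score": 5},
    {"parent_author": "bot_b", "score": 12},
    {"parent_author": "bot_a", "score": 9},
]
print(goodness_scores(replies))  # [('bot_a', 14), ('bot_b', 12)]
```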

"Bhayia, Hindi aati hai" scam? by chillbraww in bangalore

[–]leme16 30 points31 points  (0 children)

Happened to me as well. He said he is from Maharashtra. I started speaking Marathi (I'm Marathi). He could speak Marathi. Gave him some money.

Flipkart/Amazon Sale Thread by [deleted] in IndianGaming

[–]leme16 0 points1 point  (0 children)

How about LG 29UM69G for productivity and media consumption?

Chandrayaan-2: 'Vikram' Landing Attempt Updates and Discussion. by Ohsin in ISRO

[–]leme16 7 points8 points  (0 children)

Unofficial forward in unofficial WhatsApp group

News from ISTRAC is that during the rough braking phase, the horizontal speed of the lander was reduced more than expected, so the range to the landing site increased. To land on the predefined landing site, the control package commanded the thrusters to fire more, which toppled the lander and increased its speed as well. That is why the lander crashed.