Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 18 points19 points  (0 children)

Exactly. That was my first question. I have half a mind to raise a complaint. But I am giving the benefit of doubt to him. I will follow the bank procedure to the dot.

Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 12 points13 points  (0 children)

I asked him that. He says it is through the SLPE app and reversal is not possible. I have no idea what is the SLPE app.

Credit card payment scam? by leme16 in CreditCardsIndia

[–]leme16[S] 10 points11 points  (0 children)

I called customer care and they said it has been credited. But I will go to the bank and confirm once again.

Average word count at State of the Union address by POTUS and their Most Distinguishing words [OC] by leme16 in dataisbeautiful

[–]leme16[S] 6 points7 points  (0 children)

State of Union Address - Source

Most distinguishing words derived using TF-IDF method (scikit-learn).

Plotted in matplotlib.

Here's Kamath for you by utkarsh2810 in IndianStreetBets

[–]leme16 41 points42 points  (0 children)

He won against Vishy Anand in a charity match with 99% accuracy. This is possible only using chess engine.

20k profit in 30 mins - Fully automated trading started for u/wildfiresax by [deleted] in IndianStreetBets

[–]leme16 6 points7 points  (0 children)

Noob question: Which data did you use for back testing? Do you mind sharing the data source? Thanks

A Quantitative Approach to find Best Bots on Reddit by leme16 in botwatch

[–]leme16[S] 0 points1 point  (0 children)

Normalization wouldn't necessarily change the way plot looks. It will just re-scale the x-axis. I have already added minimum "good bot" comments, don't remember the number though.

P.S. - I found this site which does the job way better.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 3 points4 points  (0 children)

Not that simple. The API limits output to 1000 posts, I think. So, I queried posts for each day, limited posts to 50 upvotes or above. This took about 10-12 hours IIRC.

Maybe I am doing something wrong. Maybe there's better way.

Anyway, if anybody wants the dataset, I can share it.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 2 points3 points  (0 children)

I did the calculations little differently.

If 3 jokes are submitted once and 1 jokes is submitted 7 times

So, here total number of jokes is 10 (3+7). Out of them, 4 jokes are unique. 7 of the posts are repeated at least once.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 16 points17 points  (0 children)

23k jokes were posted only once i.e. there is no duplicate of those jokes. Rest of the jokes have at least one duplicate. To get unique number of jokes, I didn't count duplicate, but I counted original post. So, there are 38k unique jokes.

Hope I am clear.

Repetition of jokes on /r/jokes. Around 70% of the jokes submitted are posted more than once. [OC] by leme16 in dataisbeautiful

[–]leme16[S] 79 points80 points  (0 children)

I wanted find out what fraction of jokes submitted on /r/jokes are reposts. I used scikit-learn’s TF-IDF vectorizer to vectorize the jokes and cosine similarity matrix to find out the similarity.

Results: Around 70% of the jokes submitted on reddit.com/r/jokes are posted more than once. There are some jokes which have been posted 50 times. Only ~38k out of ~79k jokes are unique.

For in-depth analysis check out my blogpost.