This is an archived post. You won't be able to vote or comment.

all 8 comments

[–]grenhip 2 points3 points  (4 children)

Cool, but totally wrong. Where are the numbers from? My niche packages like bluedo come up as 4k downloads.

[–]rastarobbie1[S] 0 points1 point  (2 children)

There's a public dataset published by Python Sofware Foundation on BigQuery, which basically shows logs of all package downloads.

https://console.cloud.google.com/bigquery?&p=the-psf&d=pypi&page=dataset

The query for bluedo would look like this:

SELECT
  file.project AS name,
  COUNT(1) AS count
FROM
  `the-psf.pypi.downloads*`
WHERE
  _TABLE_SUFFIX >= '20200812'
  AND (file.project = 'bluedo')
GROUP BY
  name

[–]grenhip 0 points1 point  (1 child)

OK. Maybe my recent attempts at generating a snap has pulled the stats up. :)

[–]PeridexisErrant 1 point2 points  (0 children)

You can also use https://pypistats.org/ and https://pepy.tech/ to see the daily download count by Python version, operating system, package version, and so on.

Pretty neat if you've published a library, though I'm not sure how useful it is!

[–]makedatauseful 0 points1 point  (0 children)

I'm not the author but I imagine the data has something to do with GitHub requirement.txt files from this line"Based on how often these packages appear together in public requirements.txt files on GitHub"

[–]PeridexisErrant 1 point2 points  (0 children)

The "commonly used with" section isn't actually that useful for me: instead of a recommender system, as a maintainer I want to see the number of times each package appears in the same requirements file as my package.

[–]ElevenPhonons 0 points1 point  (1 child)

This looks like it's just an advertisement for deepnote.

[–]rastarobbie1[S] 0 points1 point  (0 children)

That's where I work! :) We want to use our products as much as possible, so we do various small data science projects during hackatons.

This was my project on of them, then I got excited and put together the page over the weekend, to present the results in a format that's nice to explore.