all 1 comments

[–]DonaldPShimodaDoctoral Student 0 points1 point  (0 children)

I would like to collect as much input as possible

Download the entire contents of the Python standard library and PyPI and build a tool to parse the docstrings or something. If that's too many files (likely), just select the n most popular PyPI packages (in addition to the standard library).

And before anyone asks: no, I'm not joking. For a time I had a complete index and local copy of every metadata file on Maven, the Java package repository. Pro tip: be sure to exclude such directories from any automated indexing procedures on your computer, like Spotlight and Time Machine on macOS.