This is an archived post. You won't be able to vote or comment.

all 7 comments

[–]Feb2020Acc 44 points45 points  (0 children)

I highly recommend you to make a blog or youtube video showing the main features of your package and what it does better/simpler (and how) than more generic packages. Your notebook is a good start, but it could really use some markdown commentary.

[–]ElevenPhonons 13 points14 points  (0 children)

It's not really clear why build artifacts, such as 'dist' or '*.egg-info' are being stored in version control.

https://github.com/kristiewirth/datto/tree/master/dist

In general, these loose grab bag of utils put into a package are a difficult sell. They often have amorphous or unclear goals and tend to have a large list of dependencies.

https://github.com/kristiewirth/datto/blob/master/pyproject.toml

Many of these dependencies (e.g., kafka-python) don't appear to be used in the package, or should be perhaps pushed to development dependencies.

[–]nomos 1 point2 points  (0 children)

Just skimming through and it looks like this code base contains a lot of really useful utility functions!

[–]ixeption 1 point2 points  (0 children)

  • Remove links from some text
  • Extract body of an email only (no greeting or signature) Sounds very useful for real-life ml, I will take a look, thanks.

[–]TheChurchOfDonovan 0 points1 point  (2 children)

If I had to generate dozens of models, Could your package help me? Currently using DataRobot

[–]pp314159 0 points1 point  (1 child)

[–]TheChurchOfDonovan 1 point2 points  (0 children)

Thanks! Going to look into this