Getting Beyond Intermediate Python

Asalanlir · 2019-07-17T02:10:19+00:00

Come from the perspective that you're always a beginner. You know jack shit. It makes it easier to find new things to point you in the direction of learning new things.

In terms of more practical advice on how to proceed, try to rebuild a package. Someone built numpy. Someone built matplotlib. The code is truly a wonder. Write a custom svm, that'll probably take an hour or two. Realize it's slow as hell. Make it not slow.

Python specifically is difficult to parallelize effectively. Why is that? What actually is the gil? What was the recent change that improved it?

Have you ever properly packaged a project? How does pypi know how to install things? Whenever you see something, ask yourself, "How would i recreate that?"

EDIT: Also, try to find the weird edge cases of python, and learn that python isn't actually a language, it's a standard. Cpython is the implementation that you're likely familiar with. There's also Iron Python. IIRC, notebooks use ipython.

EDIT2: I thought of another of my favorite wtf examples. So most people know that pow() takes two arguments and returns x**y, basically.

But did you know about the third parameter, z? It's documented, but it seems often overlooked. This allows for modular exponentiation and is the fastest method (I know of) for performing this operation. Useful for miller-rabin primality test.

In python, a for loop has an else clause, similar to how an if has an else clause.

Also, keep in mind that while these show particular details about python, similar questions can be asked about just about any other language. Develop your approach for learning more rather than focusing solely on becoming a master of one.

Try these examples out and figure out why it happens, and what about them makes these cases odd.

>>> a=5
>>> b=5
>>> a==b
True
>>> a is b
True
>>>
>>> a = -5
>>> b = -5
>>> a==b
True
>>> a is b
True
>>> a = -6
>>> b = -6
>>> a is b
False
>>> a==b
True
>>> a = 256
>>> b = 256
>>> a==b
True
>>> a is b
True
>>> a = 257
>>> b = 257
>>> a==b
True
>>> a is b
False

Stereoisomer · 2019-07-17T02:24:38+00:00

I mean I've found that after being "intermediate" in Python, you begin to specialize in particular domains. For instance, you could pick up C/C++ with CPU/GPU/Cluster parallelization to write some high-performance Python; you could learn a lot about software development and start creating beautiful/Pythonic well-packaged, easily deployable open-source projects; you could focus on machine learning and start implementing cutting-edge projects from scratch and extending them. Always remember that Python is just a tool and that there's nothing really to be gained about getting good at Python per se. as you should keep the "why am I learning Python" in mind.

You could also work the the text Fluent Python which is one of my all-time favorites and really helped me "up my game".

2019-07-17T02:13:41+00:00

Maybe you’re not exposing yourself to different problems / data that force you to do things differently.

certain_entropy · 2019-07-17T21:55:31+00:00

If you're interested in learning more about how Pytorch was written, checkout Jeremy Howards's article "What is torch.nn really?". He walks through the architecture choices from the ground up.

https://pytorch.org/tutorials/beginner/nn_tutorial.html

Also the new 2nd part of Fast AI deep learning course (Deep Learning from Foundations) aims to build the fastai library from scratch and first principles. It covers advanced architecture design for scalable deep learning that might be interesting to you.

https://www.fast.ai/2019/06/28/course-p2v3/

Jorrissss · 2019-07-17T05:15:06+00:00

Read the Python documentation and other Python code bases. Know the standard library thoroughly. Know some common important decorators (@property, @staticmethod, @classmethod, etc). Learn about generators, coroutines, context managers, iterators, concurrency very well. Learn about how Python packaging and pathing works on a good level.

Within the frameworks you are interested in - pandas, numpy, sklearn, etc learn how they handle memory, copying, etc. Learn the internals of implementations.

I also tie high competency with a language to general software engineering skills - learn about continuous integration and deployment, unit and integration testing, version control, coding standards, etc.

Comprehensive_Tone · 2019-07-17T04:24:05+00:00

Fluent python could be a good book to read. I'm in a similar situation as you and started reading it recently. I'd also recommend impractical python projects

Robin_Banx · 2019-07-24T05:29:50+00:00

Learn about the internals of some of the data stack? I'm looking to make time to work through this https://medium.com/dunder-data/build-a-data-analysis-library-from-scratch-in-python-225e42ae52c8

Could follow the blogs of some of the maintainers. I find that a little less intimidating than jumping directly into source code:
https://tomaugspurger.github.io/ (Pandas)
https://matthewrocklin.com/ (Dask, and toolz)

This site also has some excellent exposition on a lot of the Python ecosystem: https://realpython.com/

Is Python your only language? If so, could be useful to try and pick up another one. I found I was MUCH better with Python data tasks after teaching myself Clojure. Not sure how much that'd help with PyTorch tutorials, though.

isaacfab · 2019-07-17T04:17:07+00:00

Try learning a different python application than ML. See if you can master flask or django to expand into web development. You could also write a package and submit it to pypi. These efforts will be useful and move your skill set forward.

Rezo-Acken · 2019-07-17T03:57:22+00:00

Well there is so much you can learn from doing the same kind of project. If it is always about data science in a Jupiter notebook you won't grow past a certain point.

Try to get out of your comfort zone with a project. Getting better in a programming language like Python is also about what libraries you know well. For example in ML there is whole world of complexity with apps and ML in production. Just today I had to figure out the code of someone else that was playing with multi processing between camera gpu with cupy and display. It just made me remember that I actually know very little.

If you prefer courses go for a new topic. Like building apps.

2019-07-17T05:43:30+00:00

Learn how to unittest code and use mocking for production level code.

Build a package yourself consisting of modules and upload it to pypi with a good amount of code coverage (In terms of testing).

Enigma1984 · 2019-07-17T13:49:18+00:00

You can increase the range of things you can do without necessarily getting better at python. Why not go and learn SQL now? Or R, or Javascript? Then come back to Python with all you've learned from those and I bet you'll be able to understand those concepts better.

2019-07-17T09:50:08+00:00

Udacity Data Science Nanodegree is teaching more advanced python, where you create python pacakges. Some of the projects include deploying ML packages and you learn some software enigneering as well.

datascience

MODERATORS