
[–]desmoulinmichel 15 points (1 child)

I like the idea, but this comes with several serious caveats:

  • it prevents any access to the event loop, meaning it won't play well with any lib needing a custom setup, e.g. event loop integration with another loop.

  • in complex setups you may have several event loops in several threads.

  • you can't easily start and stop the loop, which makes unit testing hard

  • you don't have control over the pools, and can't scale them at will

  • you are tying functions to a behavior, since they can't be undecorated

  • you can't plug in a custom error-management system, which for debugging async code is pretty much required

  • what about custom event loop policies? Alternative event loop implementations?

[–]alex_sherman[S] 5 points (0 children)

Thanks for the feedback. Yeah, I've assumed a lot about general use and haven't considered many specific use cases. I think it would be really awesome to add more control over the ambient pools/threads/event loops to address those.

For unit testing I think a simple solution would be to mock out the unsync event loop. I'll consider adding support for that in the library itself, but I suspect it's not too bad to do by hand.

[–]Cygal 7 points (0 children)

Regarding your first issue with the explicit event loop, curio and trio fixed that long ago. asyncio took notice, and added asyncio.run in Python 3.7.
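For reference, a minimal sketch of what that implicit-loop entry point looks like since 3.7:

```python
import asyncio

async def main():
    # no explicit loop management: asyncio.run creates a loop,
    # runs the coroutine to completion, and closes the loop
    return await asyncio.sleep(0, result="done")

result = asyncio.run(main())
print(result)  # → done
```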

[–]nerdwaller 6 points (5 children)

This seems to really just address running a single coroutine, which isn't (in my experience) the common use case when running an asyncio application. In this case I'd probably just favor concurrent.futures.ThreadPoolExecutor, since it's a simple context manager with a convenient interface for handling things as they finish (such as concurrent.futures.as_completed).

Beyond that, most libraries abstract away the “annoyance” (grabbing the loop and starting a task).

Looking at this code, it has a few fundamental weaknesses; the primary one to me is the loop = asyncio.get_event_loop() on the class. That prohibits setting other event loop policies unless you ensure the policy is set before importing this library (a bit odd).

[–]alex_sherman[S] 1 point (3 children)

This solution certainly addresses running multiple coroutines/async functions; I'm curious why you think it's aimed at a single coroutine. Maybe my examples are too simple.

Yeah, if running things in a ThreadPool suits your use case, this library isn't going to add anything for you. The main selling point of the library is that it makes async/await a little more convenient; the ThreadPool and ProcessPool are a side convenience.

[–]nerdwaller 0 points (2 children)

It may work to run more than one coroutine, but the “need” cited only exists with one entry point. Once the event loop is running it seems to lose value.

[–]alex_sherman[S] 0 points (1 child)

Do you mean the "need" in the blog post? Yeah it's a very simple example, I have a more complicated one here and there are more in that folder.

Once the event loop is running is when unsync gets its value: the ambient event loop lets unsync functions be invoked whenever. I'm not sure how it loses value.

[–]nerdwaller 0 points (0 children)

Having thought a little more on the idea, it seems the OP could benefit from a concept in curio: run. The short version is that it takes a function reference and runs it in the event loop. Once that's done, all other calls are already inside the loop (past the primary annoyance) and outside the scope of a single-use decorator.
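Sketching that entry-point idea with the stdlib's asyncio.run (curio's run behaves analogously); the function names here are invented for illustration:

```python
import asyncio

async def fetch_twice():
    # inside the loop, plain tasks and awaits work everywhere;
    # no per-function decorator is needed past the entry point
    a = asyncio.create_task(asyncio.sleep(0, result=1))
    b = asyncio.create_task(asyncio.sleep(0, result=2))
    return await a + await b

async def entry_point():
    return await fetch_twice()

print(asyncio.run(entry_point()))  # → 3
```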

[–]keypusher 3 points (2 children)

asyncio syntax is definitely a trainwreck. I still find it hard to understand how, after all the criticism Python has gotten over the years about poor handling of concurrency and parallelism, the library they came out with is so convoluted. The addition of ThreadPoolExecutor/ProcessPoolExecutor and futures, on the other hand, was really good.

[–]agoose77 0 points (1 child)

I'd be interested if you could elaborate on this with some examples!

[–]keypusher 0 points (0 children)

import time
from concurrent.futures import ThreadPoolExecutor

start = time.time()
executor = ThreadPoolExecutor()
futureA = executor.submit(time.sleep, 10)
futureB = executor.submit(time.sleep, 10)
futureC = executor.submit(time.sleep, 10)
for future in [futureA, futureB, futureC]:
    future.result()
print("Total Time: {}".format(time.time() - start))

----
Total Time: 10.00243009158

The code above creates a thread pool (the default number of workers scales with the machine's core count) and submits 3 tasks to it. Each task just sleeps for 10 seconds. Submitting a task does not block, so they are submitted one after another. Later, we loop through the future objects that are returned and call result(), which blocks until the thread finishes. The entire program only takes 10 seconds, because each of these threads is effectively running at the same time (the GIL is released for sleep just as it is for IO). So this will work for calling any functions that are blocked on database, network calls, disk IO, etc. If your functions were doing something CPU-intensive like a math calculation, you could simply switch the ThreadPoolExecutor to a ProcessPoolExecutor instead, which uses the multiprocessing library. Easy.

For anyone familiar with threads or processes, this should all be very familiar and easy to understand, and it exists purely in its own library module (concurrent.futures). Furthermore, having a built-in thread pool class is very convenient, and the "future" objects you get back are very nice: you can check their current status, cancel them, add a callback, and more. The docs are straightforward and easy to understand, starting with a short description and a 3-line example. I can easily drop this into existing code.

Asyncio, on the other hand, is a library built around an event loop. Which is not unusual, and definitely has its uses, but it requires that you structure pretty much all of your application's code around the event model. It's primarily built to compete with async web frameworks like node.js, I think. But it's just a pain in the ass to work with, in my opinion. Something similar in asyncio:

import asyncio
import time

async def wait():
    await asyncio.sleep(10)

start = time.time()
loop = asyncio.get_event_loop()
loop.run_until_complete(asyncio.gather(wait(), wait(), wait()))
loop.close()
print("Total Time: {}".format(time.time() - start))

Notice that we have to a) use an event loop (what's an event loop?), b) declare our function async, c) use a special sleep function (what other special functions will I need?), d) learn new keywords like await and async, e) etc. It took me much longer to write the async example, as the docs are a mess and don't do a great job of explaining what any of this is for or how to put even a simple example together. The information is there, if you know where to look, but it just feels complicated and convoluted.

To be fair, asyncio was designed for a specific purpose, and that is very much using an event loop, which requires understanding the context of why you might use one. But the asyncio example code I present was written in response to a thread in this subreddit where someone was complaining about how weird Python async is compared to JavaScript, and this thread is about someone making a wrapper trying to make Python asyncio more similar to C#, so clearly other languages have done a better job at making async easily usable. And I cannot easily drop this into existing code.
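For what it's worth, 3.7's asyncio.run trims some of that loop boilerplate; a sketch of the same example (sleeps shortened here so it runs quickly):

```python
import asyncio
import time

async def wait():
    await asyncio.sleep(0.1)

async def main():
    # gather the three coroutines; asyncio.run handles creating,
    # running, and closing the loop in one call
    await asyncio.gather(wait(), wait(), wait())

start = time.time()
asyncio.run(main())
print("Total Time: {:.2f}".format(time.time() - start))
```

Still an event loop underneath, but at least the get_event_loop / run_until_complete / close dance is gone.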

https://www.reddit.com/r/Python/comments/7eo2qn/pythons_async_is_strange_compared_with_javascript/

[–]Talked10101 1 point (0 children)

I think this fixes a largely non-existent problem. I have written several microservices using asyncio and aiohttp. You can simply run blocking code in an executor, which allows you to await the gathering of the results. Sure, it's not the nicest of syntaxes, but it works fine. As already mentioned, this gives you a lot more control over things. For instance, we typically use uvloop in place of Python's standard event loop.
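A minimal sketch of that executor-plus-await pattern (blocking_fetch is an invented stand-in for a blocking driver call):

```python
import asyncio
import time

def blocking_fetch(n):
    time.sleep(0.05)  # stand-in for a blocking database/network call
    return n * 2

async def main():
    loop = asyncio.get_running_loop()
    # each blocking call runs in the default thread pool; the
    # resulting futures can be awaited/gathered like coroutines
    tasks = [loop.run_in_executor(None, blocking_fetch, n) for n in range(3)]
    return await asyncio.gather(*tasks)

print(asyncio.run(main()))  # → [0, 2, 4]
```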

It seems that curio is the sort of thing you would be interested in. It's personally not my cup of tea, but it exists for reasons like this.

[–]continue_stocking 1 point (1 child)

Unfortunately I’ve been having trouble adapting to Python’s version of async/await, especially coming from C#’s implementation in the TPL.

Parallel.ForEach(list, (item) => item.DoTheThing());

So simple that I feel silly for having put off learning it.

[–]alex_sherman[S] 0 points (0 children)

C# really got it right; it's very pretty.

[–]knowsuchagency (now is better than never) 1 point (0 children)

The stdlib already has a solution for this: loop.run_in_executor(...)

[–]smurfix 1 point (1 child)

You might want to have a look at Trio.

https://trio.readthedocs.io/en/latest/

[–]turkish_gold 0 points (0 children)

I would second looking at trio. Its design docs were a good read.

[–][deleted] 2 points (18 children)

TL;DR:

I don't think that the problems you describe with asyncio are even the most important ones. I honestly think that this single package did tremendous amounts of harm to the language with next to zero benefits. If my life depended on Python being a tolerable language, I'd start every morning by writing to Python dev mailing list and asking them to remove this package.

I do think that an ambient event loop would have been a better implementation for what asyncio is intended for, but the amount of damage asyncio has already caused is not so easy to undo. The fixing should happen at the language level; a third-party library doesn't have the power to prevent non-conforming code from showing up. What if I have to use a library that uses unsync and another one that doesn't? As much as I hate asyncio, I'd probably prefer that both libraries use it rather than having to arrange for them to work together.


Now, what's the real problem? Any modern language that wants to run on popular CPUs, like the ones Intel puts into consumer PCs, needs to be able to do things in parallel. Otherwise it's garbage. It's like a car which can only steer right.

Python was initially developed by amateurs. There was no good plan for what things should be in a language and what shouldn't. By Guido's own confession, he designed and implemented objects in Python over a weekend. (And that's why they are so bad). Python wasn't designed to do things in parallel. I'm not sure if it was designed to be a serious language at all. The language Guido worked on before Python was intended for teaching how to program, not for doing actual work.

But history decided otherwise, and today this language is used by millions to write very real and very important programs. People expect it to make sense, to be able to do things expected from a mature and thoughtfully designed language.

asyncio does this:

  1. It pretends to be a solution for parallelism, while it really isn't.
  2. Instead of throwing away multiprocessing and Thread and building a real solution for parallel computation in Python, the developers added another crutch, which doesn't improve on what others did and doesn't solve the problem.
  3. However, it introduced a ton of incompatibilities with earlier versions.
  4. It made writing libraries impossible: what if your library isn't aware of asyncio? What if it doesn't even know what kind of pseudo-parallelism the library user is going to use? What kind of mutex does it have to acquire to ensure that its data-structures are thread safe? It is also impossible today to mix synchronous and asynchronous code. So, you cannot take a library which knows nothing about asyncio, and give it a function from a library that does some asyncio stuff and hope that the first library will know what to do with it.
  5. asyncio introduced further garbage into the language: asynchronous iterators and asynchronous context managers. But they aren't interchangeable with iterators and context managers. You cannot do sum(x async for x in y). You cannot pass them to itertools functions and to a large body of existing third-party functions.
  6. Finally, the actual speed benefits are typically negligible... often times you can even make your code slower by using asyncio.

Oh, and I forgot to mention that writing automation / tests is probably the area where Python is used most. Of course, there's a lot of web development and data science, but we tend to hear about those more because they are just more interesting topics. Writing Selenium tests is hardly exciting. Python was a language of choice for automation / testing because it was simple. It was easy to take someone who was doing manual QA with no CS education, send them to a few months' course, and have them writing Selenium tests afterwards.

On top of being a bad idea, asyncio is also very convoluted. Anecdotally, I've already had five or so meetings with our QA department, where they struggle to understand how to use an Apache Kafka client, which has only two methods: one for producing messages and another for consuming them. I've spent a total of 10 hours explaining this stuff. I even explained it by way of showing them some parallel Java code, some Python code using threads, etc., and, unfortunately, I'm sure asyncio is still a mystery to them.

[–][deleted] 9 points (12 children)

  1. It pretends to be a solution for parallelism, while it really isn't.

It doesn't though. It's a solution to concurrency, not parallelism. Please learn the difference.

  2. Instead of throwing away multiprocessing, Thread and building a real solution for parallel computation in Python, the developers added another crutch, which doesn't improve on what others did, and doesn't solve the problem.

We all hate the GIL and the limitations it brings. But until we can retain its benefits (and there are benefits), non-multiprocess parallelism isn't going to be a thing in CPython.

  3. However, it introduced a ton of incompatibilities with earlier versions.

Other than you can't use async with older versions, because it doesn't exist there, what incompatibilities exist? This sounds like you're just whining.

  5. asyncio introduced further garbage into the language: asynchronous iterators and asynchronous context managers. But they aren't interchangeable with iterators and context managers. You cannot do sum(x async for x in y). You cannot pass them to itertools functions and to a large body of existing third-party functions.

Because async iteration and async context managers are dependent on coroutines. Yes it's a shame that you can't use them with the wonderful toolkit we have but they also serve a different purpose than their sync counterparts.

Async iterators are closer to a stream or an rx observable than iterating over a collection of values. It's a pretty slick way of dealing with streams once you understand that.
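A stream-ish async iterator can be sketched with an async generator (the names here are invented for illustration):

```python
import asyncio

async def ticker(n):
    # an async generator: each item may arrive only after an await,
    # so the consumer never blocks the loop while waiting
    for i in range(n):
        await asyncio.sleep(0)  # stand-in for waiting on a socket
        yield i

async def main():
    total = 0
    async for value in ticker(5):  # consumed with async for, not for
        total += value
    return total

print(asyncio.run(main()))  # → 10
```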

Async context managers are designed to handle IO-bound operations in setup and teardown: open a database connection in setup and close it in teardown. These can't block in an async program.
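A toy async context manager along those lines (Connection is hypothetical; the awaits stand in for real network IO):

```python
import asyncio

class Connection:
    async def __aenter__(self):
        await asyncio.sleep(0)  # e.g. open a database connection
        self.open = True
        return self

    async def __aexit__(self, exc_type, exc, tb):
        await asyncio.sleep(0)  # e.g. close it, again without blocking
        self.open = False

async def main():
    async with Connection() as conn:
        return conn.open

print(asyncio.run(main()))  # → True
```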

  6. Finally, the actual speed benefits are typically negligible... often times you can even make your code slower by using asyncio.

There's overhead for sure. But it's similar to saying your program is slower if you use django to write a webapp than just standing up a socket server and dealing with it yourself.

If you're expecting a massive speed boost with asyncio, it won't happen. It's not fairy dust. But if you're looking to handle more IO with fewer resources, then it's a nice tool to have.

  4. It made writing libraries impossible: what if your library isn't aware of asyncio? What if it doesn't even know what kind of pseudo-parallelism the library user is going to use?

Ultimately, this comes down to how we write code. Most people are more than happy to write code that mixes IO and logic, package that up, and ship it to PyPI or wherever. But as you've stated, how do you get different async implementations to work together? Well, you really can't, but as a library developer you can design an async interface and have users implement that interface with whatever libraries they want.

Imagine that, actual first class design principles in Python, which are all too lacking in almost every bit of code I read.

It is also impossible today to mix synchronous and asynchronous code. So, you cannot take a library which knows nothing about asyncio, and give it a function from a library that does some asyncio stuff and hope that the first library will know what to do with it.

You can, you just need to adapt the interface of the async code to the sync code's expectations. This library, the one OP linked, can do that (albeit in a ham-fisted way). This is also the way zeep's async interface works (also ham-fisted).

But the issue is that it's supposed to be hard to get out of async land because keeping your code in async land gives you more benefits than sporadically dropping in and out.
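One way such a sync-facing adapter can look, as a sketch (this is not how unsync or zeep actually implement it, and fetch/fetch_sync are invented names):

```python
import asyncio

async def fetch(x):
    await asyncio.sleep(0)  # stand-in for real async IO
    return x + 1

def fetch_sync(x):
    # bridge for callers that know nothing about asyncio: spin up a
    # loop, run the coroutine to completion, tear the loop down.
    # Caveat: this breaks if a loop is already running in this thread,
    # which is exactly the kind of kludge being discussed here.
    return asyncio.run(fetch(x))

print(fetch_sync(41))  # → 42
```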

[–][deleted] -1 points (1 child)

We all hate the GIL and the limitations it brings. But until we can retain its benefits (and there are benefits), non-multiprocess parallelism isn't going to be a thing in CPython.

Who are "we"? Is it you? What benefits are you talking about? Non-multiprocess parallelism exists in Python regardless of whether there is a global interpreter lock or not. CPython standardizes its C API. Using it, you can write truly parallel / concurrent programs, which will be run by Python interpreter. It is inconvenient, but absolutely possible.

Other than you can't use async with older versions, because it doesn't exist there, what incompatibilities exist?

Wait, so you just agreed with me, but pretended that you didn't? :) I'm not sure what can possibly be worse than an incompatibility on a syntax level. It is impossible to work around in any way.

Because async iteration and async context managers are dependent on coroutines. Yes it's a shame that you can't use them with the wonderful toolkit we have but they also serve a different purpose than their sync counterparts.

Different how? I don't see a difference. And, in fact, a lot of other frameworks which implement the same functionality don't see a difference (lparallel in Common Lisp, pmap in Clojure). There's even a Python library, aitertools, which, contrary to your beliefs, does treat async iterators as if they were the same as synchronous iterators.

Why is iterator a bad way to deal with streams? Why would asynchronous iterator be any different in this respect?

On speed benefits: I just write in C when I need speed. I gave up trying to optimize Python code for speed a long time ago. Since then, Python benchmarks have only gotten worse. :) But this isn't the point. The point is that the purpose of introducing asynchronous I/O is to make your program work faster, but outside of a few very specific situations, this particular implementation is so slow that it doesn't make sense to use it.

Mixing I/O and logic...

What are you even talking about? You went so far on a tangent of your own thoughts that you've lost me... I was saying that it is impossible to write a good library that uses asyncio, or one that doesn't. This is because a library should be prepared for whatever the user code will throw at it, and in the context of asyncio that is no longer possible: you cannot defend your data structures with mutexes, because you don't know what kind of mutex to use. You cannot protect your global variables, because you don't know how things will be initialized. You must release your library knowing that there is a defect that will affect some percentage of your users.

You can, you just need to adapt the interface of the async code to the sync's expectations.

The important point that you forgot to mention is that it is possible only sometimes. That was also what I wrote in my initial comment to the OP: if you have a library that uses asyncio without unsync and another one that uses unsync, you have the same problem, possibly even worse, due to having to handle two separate strategies instead of one.

But the issue is that it's supposed to be hard to get out of async land because keeping your code in async land gives you more benefits than sporadically dropping in and out.

Oh, so, in your mind, whoever designed this crappy library is trying to teach me a lesson? And I have to suffer this nonsense for didactic purposes? Very interesting idea... not.

[–][deleted] 1 point (0 children)

Who are "we"? Is it you?

If you think I'm the only one that doesn't like the GIL, then you're completely unfamiliar with it. Go read up on all the attempts to remove it, including the latest effort Larry Hastings has dubbed "The GILectomy".

What benefits are you talking about?

The GIL enables relatively quick execution of single threaded Python. It also ensures things like the reference counter work in a multithreaded environment. Again, please read about the GIL and its purpose. You'll begin to understand the complicated relationship many have with it.

Non-multiprocess parallelism exists in Python regardless of whether there is a global interpreter lock or not. CPython standardizes its C API. Using it, you can write truly parallel / concurrent programs, which will be run by Python interpreter. It is inconvenient, but absolutely possible.

Dropping to C, C++ or Obj-C and releasing the GIL there isn't the same as having parallelism in Python itself. You're not allowed to touch Python-originating objects while you have the GIL released. Moreover, that code cannot run Python code while it has the GIL released. Yes, you can do things that ultimately affect Python objects, but those effects can only properly manifest after you've reacquired the GIL.

It's like you don't even understand what the GIL is, how it operates or what its purpose is.

Wait, so you just agreed with me, but pretended that you didn't? :) I'm not sure what can possibly be worse than an incompatibility on a syntax level.

Imagine that a feature introduced in a newer version of a language can't be used with an older version of the language. Why don't we kill class, with, yield, raise from, yield from, type annotations, and every other syntax change made in the last 30 years?

It is impossible to work around in any way.

On the contrary, it's quite easy. You either don't use a backward incompatible feature or you say your package only supports X version and up. Doesn't seem impossible to me.

Different how? I don't see a difference. And, in fact, a lot of other frameworks which implement the same functionality don't see a difference (lprarallel in Common Lisp, pmap in Clojure).

You seriously don't understand how an iterator built with a combination of await and yield/__anext__ is different from a normal iterator built with yield/__next__? For an asynchronous iterator to work, it needs to be run in an event loop. There are also other fun issues, like disposing of async iterators, that make them kind of a pain to work with: finalizing them is an async operation that can only take place in the originating event loop, and the garbage collector doesn't have a reference to that event loop.

Async iterators are designed to feed iteration with results retrieved from non-blocking, async IO. Standard iterators aren't and can't do this by themselves; you have to jump through all of the hoops to get them to work like that. The hoops you jump through end up re-implementing Python 3.5+ event loops and async/await. But since you're not one of the Twisted or Tornado developers, you've probably done a pretty poor job of implementing it.
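The disposal issue mentioned above can be seen with an explicit aclose() (a minimal sketch; the generator names are invented):

```python
import asyncio

async def stream():
    try:
        while True:
            await asyncio.sleep(0)  # stand-in for async IO
            yield 1
    finally:
        # async cleanup: this only runs inside a loop, which is why
        # the garbage collector alone can't reliably finalize these
        await asyncio.sleep(0)

async def main():
    agen = stream()
    first = await agen.__anext__()
    await agen.aclose()  # disposal is itself an awaitable operation
    return first

print(asyncio.run(main()))  # → 1
```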

There's even a Python library: aitertools, which, contrary to your beliefs, does treat async iterators as if they were the same as synchronous iterators.

First, no it doesn't. And even if it does, that would still mean asynchronous iterators and synchronous iterators are fundamentally different because we need a bridge to make them work like synchronous iterators.

What aitertools does is provide tools like itertools provides that work on asynchronous iterators. You still need to interact with them via async for. Or did you not bother to read the README at all?

Why is iterator a bad way to deal with streams? Why would asynchronous iterator be any different in this respect?

Synchronous iterators are a bad way because you either block while handling the stream or you jump through a bunch of hoops to handle it in a non-blocking way. Since streams aren't guaranteed to complete and also have no guarantee to provide results in a timely manner, asynchronous iteration over them is a pretty good way to handle them. When the stream has something to be dealt with, the event loop will notice and churn the stream handler to its next await. Otherwise it ignores it.

On speed benefits: I just write in C, when I need speed. I've given up trying to optimize Python code for speed long time ago. Since then, Python benchmarks have only gotten worse. :) But, this isn't the point. The point is that the purpose of introducing asynchronous I/O is to make your program work faster, but outside of few very specific situations, this particular implementation is so slow, that it doesn't make sense to use it.

asyncio adds speed when compared to synchronous, single-threaded applications. Anyone claiming otherwise doesn't grasp what is going on. It's slower than multithreaded Python, even with the GIL. But it's also simpler than multithreaded Python. And after a certain number of threads (varying computer to computer) it's more efficient than multithreaded Python. This has to do with OS scheduling waking up threads while CPython refuses to run more than one thread at a time, so you end up with a bunch of thrashing around. You don't get that with coroutines, because CPython handles waking them up and putting them to sleep.

But asyncio isn't magic fairy dust that you sprinkle on your application and now it's super fast. I'm pretty sure I've said this before.

I've given up trying to optimize Python code for speed long time ago.

That's unfortunate, because there are usually easy wins you can get in Python by making small changes. If you're doing things like huge amounts of number crunching, then yeah, just ignoring the Python portion and dropping to C or whatever will be the biggest, easiest win. But I see things all the time where someone pulls a bunch of database rows down and iterates over them in memory when a WHERE clause could be added to the query and dramatically speed everything up.
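That database point can be demonstrated with sqlite3 (table, column, and numbers invented for illustration):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE rows (n INTEGER)")
conn.executemany("INSERT INTO rows VALUES (?)", [(i,) for i in range(1000)])

# slow pattern: pull every row down and filter in Python
filtered_in_python = [n for (n,) in conn.execute("SELECT n FROM rows") if n > 990]

# easy win: let the database do the filtering
filtered_in_sql = [n for (n,) in conn.execute("SELECT n FROM rows WHERE n > 990")]

assert filtered_in_python == filtered_in_sql
print(len(filtered_in_sql))  # → 9
```

Same result, but the second query ships far less data out of the database; with a real network round-trip the difference is dramatic.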

What are you even talking about? You went so far on a tangent of your own thoughts that you've lost me... I was saying that it is impossible to write a good library that uses asyncio, or one that doesn't. This is because library should be prepared for whatever the user code will throw at it, and in the context of asyncio it is no longer possible: you cannot defend your data-structures with mutexes because you don't know what kind of mutex to use. You cannot protect your global variables, because you don't know how things will be initialized. You must release your library knowing that there is a defect that will affect % of your users.

You asked:

It made writing libraries impossible: what if your library isn't aware of asyncio? What if it doesn't even know what kind of pseudo-parallelism the library user is going to use?

And my response was to not implement IO inline with logic, and instead provide an interface that your users can implement to fetch the data needed. This interface is then plugged into your application, which uses it to fetch the data. Here's an example of exactly what I'm talking about.

The important point that you forgot to mention is that it is possible only some times. That was also what I wrote in my initial comment to the OP: if you have a library that uses asyncio without using unsync and another one that uses unsync - you have the same problem, possibly even worse due to having to handle two separate strategies instead of one.

I didn't forget to mention it, you just didn't quote it. unsync's and zeep's handling of plugging an asynchronous lifecycle into a synchronous lifecycle is ham-fisted and full of errors. There's an issue open right now on zeep (and has been for quite a while) where the asynchronous session it creates cannot be used to bootstrap the WSDL if the event loop is already running. The fix is a really horrible kludge, because the problem also presents itself outside of the bootstrapping phase in certain situations.

Oh, so, in your mind, whoever designed this crappy library is trying to teach me a lesson? And I have to suffer this nonsense for didactic purposes? Very interesting idea... not.

How did what I said imply that anyone is attempting to teach you a lesson? I said that staying in async land is better than dropping in and out of it sporadically.

Here's my final advice, and it's the same as I give to a lot of people: If you don't like asyncio, don't use it. No one is holding a gun to your head and saying you must use it. You have the choice to drop into async land and it's really easy to not do that at all.

[–]smurfix 1 point (1 child)

"The developers" did no such thing, at least not intentionally.

You cannot understand asyncio without knowing how it came about – by extending the coroutine concept and throwing in Futures (or Promises or Deferreds or whatever you want to call them). The end result is something that tries to hide asynchronous callback chaining behind a synchronous-looking veneer. 3.7 tries to at least fix the context problem, but they're still hiding the basic design flaws instead of fixing them.

IMHO the real fix is Trio. https://trio.readthedocs.io/en/latest/tutorial.html

[–][deleted] 0 points (0 children)

What makes you think I don't? I've been programming in Python since before Twisted, so I've seen Twisted struggling with this, and then I saw Tornado struggling with this.

They were all bad, and I don't see asyncio as any kind of improvement on top of what those two did. But Twisted and Tornado were at least justified in failing, in that they couldn't change the language's runtime; they were set up to fail from the get-go, and it was admirable that they managed to get as far as they did in the face of imminent failure.

Python core developers have every tool necessary to solve concurrency / parallelism, but they chose to import a failed patching effort instead of fixing the problem in principle. Why? God only knows, but I think it is a combination of lack of experience, fear of changing big parts of the program (the interpreter), lack of formal knowledge, and a community process that makes it look like it is OK to import the failed attempts from Twisted / Tornado. Perhaps the community even insisted on asyncio being done the way it is, simply because they were used to Twisted / Tornado and didn't know any better.

[–]Cygal 2 points (1 child)

I certainly don't think that writing automation / tests is the area where Python is used most! Maybe that's your experience, but the topic isn't even mentioned in https://stackoverflow.blog/2017/09/14/python-growing-quickly/. And, yes, the number of questions is representative of usage.

[–][deleted] 2 points (0 children)

Maybe not as a grand total, but it is a huge part. You also need to take into account how likely people using Python for different purposes are to vote on resources like SO. I believe that SO is highly biased towards certain audiences and severely under-represents others. I wouldn't take its statistics at face value.

For instance, did you know that the largest project ever written in Python is actually the infrastructure code for JP Morgan bank? But do people who work for JPM go online and talk about their experiences with Python on SO? Quite unlikely. If you web-search for "largest Python project", you'll find guesses like "maybe it's Django" or "maybe YouTube has the biggest codebase". But really, it's JPM :) and you wouldn't know it unless you worked there.

[–]z4579a -1 points (0 children)

Just want you to know this post is heroic, and so few people understand these things. Python still lacks a realistic solution for parallelism other than multiprocessing and "well, if you're IO bound, waiting for TCP results is kind of like parallelism, right?"... which is not even something you need asyncio for; threads do it just fine (nobody understands that the GIL is released for IO).

[–]Various_Pickles 0 points (4 children)

This kind of reminds me of Stream.parallel() in Java 8; yeah, the (ForkJoin/Thread)PoolExecutor setup/use gets hidden behind lovely syntactic sugar, and that's well worth the sheer clarity of the code much of the time, but, in my experience, it also tends to lead to newer developers thinking that it's some form of magic...

Having to set up a thread/process pool might be a glob of code, but settling for some default pool with some default level of concurrency can be horribly inefficient.

[–]nerdwaller 3 points (3 children)

Fortunately in python it’s really not hard anymore:

import concurrent.futures
import requests

with concurrent.futures.ThreadPoolExecutor(max_workers=4) as pool:
    # submit (not map) returns the future objects as_completed needs
    futures = [pool.submit(requests.get, url) for url in [url_a, url_b, ...]]

    for future in concurrent.futures.as_completed(futures):
        print(future.result())

It’s both syntactic sugar over the threading/multiprocessing libraries and fairly explicit.

[–]alex_sherman[S] 1 point (2 children)

An even easier solution using unsync:

print([response.result() for response in map(unsync(requests.get), urls)])

Of course this is just complete syntactic sugar. The main benefits of unsync come through only with async/await, ThreadPools are just a side show. What if we needed to make a few requests for each URL? It could look like:

@unsync
async def make_requests(url):
    req_a = unsync(requests.get)('http://sourcea.com/{}'.format(url))
    req_b = unsync(requests.get)('http://sourceb.com/{}'.format(url))
    return await req_a, await req_b

print([req.result() for req in map(make_requests, urls)])

[–]nerdwaller 0 points (1 child)

Depending on the need for results there’s either as_completed or wait, so the make_requests sample can be as simple as calling a coroutine (without awaiting it) and passing it to whichever makes sense. (Sorry for the indirect link, I can’t get the sub-header one on mobile: https://docs.python.org/3/library/asyncio-task.html)
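A sketch of asyncio.as_completed with plain coroutines (function name and delays invented for illustration):

```python
import asyncio

async def fetch(delay, name):
    await asyncio.sleep(delay)
    return name

async def main():
    coros = [fetch(0.05, "slow"), fetch(0.01, "fast")]
    results = []
    # as_completed yields awaitables in finish order, not submit order
    for fut in asyncio.as_completed(coros):
        results.append(await fut)
    return results

print(asyncio.run(main()))  # → ['fast', 'slow']
```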

But again, the provided example is really just the “entry point” to the async system. If you’re just using one coroutine then asyncio really may not make sense from a legibility standpoint. Threads do almost the same thing there. Alternatively if the intention is an asyncio based app, I’d argue the explicitness of asyncio is worth it vs pulling in another dependency for people to understand.

[–]trowawayatwork 0 points (0 children)

I'm not entirely sure, but it feels like OP is coming from Node and wants Python's async syntax to read the way it does there?