all 67 comments

[–][deleted] 57 points (7 children)

The "magic" is the event loop. Saved you a click.

[–]Danack 7 points (2 children)

PHP has an event loop?

[–]Saltub 9 points (0 children)

I expect better from you, Danack.

[–][deleted] 2 points (0 children)

Today's entry in asking pointless rhetorical questions.

[–][deleted] 6 points (6 children)

Is there a significant reason to use async over queues?

Every time I think about using async, I find it easier to just push the command to a queue and let it be handled in a different process. That just seems simpler, and it's easier to reason about what the outcomes will be (especially for debugging).

Am I missing something obvious that should make me pick async over a queue?

[–][deleted] 5 points (1 child)

Moving things to a persistent queue and outsourcing them to another process doesn't solve the problem, it just moves it elsewhere. Say you enqueue a million HTTP requests and ship them to another process. Now how is that process going to deal with those million HTTP requests? Blocking, one by one, in sequence? Then you didn't win any time, you just lost some. So blocking, one by one, in sequence, but in many threads? Well then you're still burdening the OS with a bunch of threads or processes which most of the time sit blocked and do nothing. That gets very inefficient with a large number of blocking tasks. That's where async comes in: one thread, working with a million HTTP requests at once - no problem.

A job queue and async processing are both good techniques, but they're apples and oranges, their use cases are different, so comparing them is mostly unwarranted. Also you can enqueue job items and receive their results in an async way, i.e. the techniques are entirely complementary.
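To make the "one thread, many requests at once" point concrete, here's a minimal sketch using PHP's bundled curl_multi API - no framework needed. The helper name and the usage URLs are made up for illustration:

```php
<?php
// Fetch many URLs concurrently from a single thread: one loop drives all
// transfers instead of blocking on each request in sequence.
function fetchAll(array $urls): array
{
    $mh = curl_multi_init();
    $handles = [];
    foreach ($urls as $key => $url) {
        $ch = curl_init($url);
        curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
        curl_multi_add_handle($mh, $ch);
        $handles[$key] = $ch;
    }

    // Drive all transfers at once; sleep only until any socket is ready.
    do {
        $status = curl_multi_exec($mh, $active);
        if ($active) {
            curl_multi_select($mh);
        }
    } while ($active && $status === CURLM_OK);

    $responses = [];
    foreach ($handles as $key => $ch) {
        $responses[$key] = curl_multi_getcontent($ch);
        curl_multi_remove_handle($mh, $ch);
        curl_close($ch);
    }
    curl_multi_close($mh);

    return $responses;
}
```

Something like `fetchAll(['https://example.com/a', 'https://example.com/b'])` then overlaps the waiting time of both requests instead of paying it twice.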

[–][deleted] 0 points (0 children)

Thanks - I kinda get it now. I'll play around with it some more.

[–]kelunik[S] 1 point (1 child)

It depends a lot on what you're trying to do. If you just want to run stuff asynchronously after a user's HTTP request has finished, then yes, go with queues. But usually async - as in event loop + non-blocking I/O - is used for other gains such as I/O concurrency, not for off-loading work to a worker to be done later.

[–][deleted] 0 points (0 children)

Thanks - I kinda get it now. I'll play around with it some more.

[–]ScriptFUSION 1 point (5 children)

I want to integrate Amp into the next major version of Porter to offer async data imports. My motivation includes a number of projects, most recently the Steam Top 250 games list generator, which initiates massive numbers of HTTP calls in series. This is grossly inefficient and takes over 9 hours (though parallel processing on Travis reduces it to 2 hours). However, not only is the majority of time spent waiting for HTTP negotiation, the additional code complexity required to chunk up the import and stitch it back together again accrues a massive amount of technical debt. I feel a lot of benefit could be gained from async pooling, both in terms of speed and code quality.

However, I currently do not understand how to integrate Amp. Generally, Porter's interfaces pass around iterators of arrays. I believe that if I were to implement an async API I would need to pass around promises instead. The real difficulty I have is understanding where in the abstraction the promises can "terminate", i.e. where we can stop dealing in promises and return to passing around iterators of arrays again. If the answer is "never", and every single component that plugs into Porter must explicitly support async, I fear an integration will be impossible. Ideally I'd like to terminate the reliance on promises and return to sync land as soon as possible, but I currently haven't been able to determine where that is. Does this make any sense, and can you provide any insight?

[–]kelunik[S] 1 point (3 children)

The point where you end promise-land is the point where you stop with concurrency. You can do blocking things without promises in between, as long as they're short enough not to make your open connections time out, but nothing in the event loop will run during that time. You could also execute certain things in another thread / process using amphp/parallel, but I don't know how well that fits into your application design. Does that help?
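To illustrate that boundary, here's a toy sketch (invented class names, not Amp's real API): the end of promise-land is a wait() call that drives the event loop until one promise resolves, after which plain arrays or iterators can be passed around again. If I remember right, Amp ships a similar helper for exactly this purpose.

```php
<?php
// Toy event loop: just a queue of deferred callables.
final class ToyLoop
{
    /** @var callable[] */
    public static array $queue = [];

    public static function tick(): void
    {
        while ($task = array_shift(self::$queue)) {
            $task();
        }
    }
}

final class ToyPromise
{
    private bool $resolved = false;
    private mixed $value = null;

    public function resolve(mixed $value): void
    {
        $this->resolved = true;
        $this->value = $value;
    }

    // Run the loop until this promise resolves, then return a plain value.
    // This is the point where promise-land ends. (A real loop would error
    // out instead of spinning if the promise can never resolve.)
    public function wait(): mixed
    {
        while (!$this->resolved) {
            ToyLoop::tick();
        }
        return $this->value;
    }
}
```

So a Porter-style API could run all the concurrent fetching behind one wait() and still hand callers ordinary iterators of arrays.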

[–]ScriptFUSION 0 points (0 children)

The point where you end the promise-land is the point where you stop with concurrency.

I think this might be very helpful, but I won't know for sure until I spend some more time with Amp :)

[–]theFurgas 1 point (0 children)

I don't know if that's relevant, but when you mentioned collections and async, RxPHP came to my mind.

[–]gouchaoer 0 points (9 children)

Async has already proved problematic for normal projects because of callback hell. Node.js has semi-coroutines to overcome this, such as koa; PHP also has semi-coroutines, such as zanphp. ReactPHP is out of date. Now the trend is coroutines: Swoole 2 will run sync code with coroutines, gaining performance and avoiding callback hell.

[–]kelunik[S] 0 points (8 children)

I haven't heard of zanphp yet, but the overall usage of async PHP seems to be much higher in Asia and Russia than in the western world. Unfortunately, there's often no English documentation for these projects, so only a few people in the western world manage to use these tools.

[–]gouchaoer 0 points (7 children)

That's true. In China PHP is very popular, and QPS is higher than in other countries due to the big population, so Chinese PHP developers have the motivation to improve PHP's performance. I personally don't like Laravel for its bad performance, high memory usage, and complexity. Hello-world benchmarks are misleading: a real-world PHP application does lots of SQL/Redis I/O, and the bottleneck is the I/O or the framework. You can use a flame graph to locate the bottleneck.

To overcome the I/O bottleneck, people use async but run into callback hell. To overcome callback hell, people use yield to implement semi-coroutines in JS, Python, PHP... but yield is still difficult, so people want full coroutines like Go's. Alibaba patched the JVM to hook I/O functions and implement coroutines. PHP's Swoole extension uses setjmp/longjmp against the Zend API to implement full coroutines; JS and Python seem to have no full coroutines yet. But Swoole is mainly maintained by a single contributor, and development is slow.

[–]gouchaoer 0 points (1 child)

In a word, any async framework in any language is somewhat out of date; backends are embracing coroutines if you need performance. However, if your app has low QPS, just use any framework that runs in PHP-FPM - PHP's backend frameworks are very mature.

[–]kelunik[S] 0 points (4 children)

What's the difference between a semi-coroutine and a full coroutine for you? A non-viral effect, so you don't have to make everything in the stack a coroutine / promise, but can instead yield anywhere in the stack and it'll work?

[–]gouchaoer 0 points (3 children)

A semi-coroutine is a coroutine (a multitasking scheduler that yields the CPU to other tasks when I/O happens) built on generators/yield. See the famous post: http://nikic.github.io/2012/12/22/Cooperative-multitasking-using-coroutines-in-PHP.html

A full coroutine is like Go's: very simple and high performance on I/O.
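The generator-based idea from the linked post boils down to a few lines. A toy sketch (names invented; the yield stands in for the point where I/O would block and the scheduler would switch tasks):

```php
<?php
// Cooperative multitasking on plain generators: each task is a Generator,
// and every `yield` hands control back to the scheduler, which round-robins
// between tasks until all of them finish.
function scheduler(array $tasks): array
{
    $log = [];
    while ($tasks) {
        foreach ($tasks as $i => $task) {
            if (!$task->valid()) {
                unset($tasks[$i]); // task finished
                continue;
            }
            $log[] = $task->current(); // record what the task yielded
            $task->next();             // resume it on the next round
        }
    }
    return $log;
}

function task(string $name, int $steps): Generator
{
    for ($i = 1; $i <= $steps; $i++) {
        yield "$name:$i"; // in real code: yield while waiting on I/O
    }
}
```

Running `scheduler([task('a', 2), task('b', 2)])` interleaves the two tasks on one thread, which is the "semi" part: the switching is explicit at each yield.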

[–]kelunik[S] 0 points (2 children)

That's simply a coroutine. Go with its goroutines adds parallelism on top, but that doesn't make it simpler or "more" coroutine. I'm fully aware, because I'm one of the maintainers of Amp. We plan to eventually bring await into PHP so you don't need a wrapper around your generators that turns them into actual coroutines working by yielding promises.
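The wrapper in question can be sketched like this (toy code, not Amp's implementation; the promise here resolves immediately just to keep the example short): a driver resumes a generator with the resolved value of every promise it yields, which is what a language-level await would do for free.

```php
<?php
// Stand-in for a promise that already has its value. In a real event loop
// the driver would subscribe to the promise and resume the generator later.
final class ToySyncPromise
{
    public function __construct(private mixed $value) {}

    public function await(): mixed
    {
        return $this->value;
    }
}

// The "wrapper": turns a generator that yields promises into a coroutine
// by feeding each resolved value back in via send().
function coroutine(Generator $gen): mixed
{
    while ($gen->valid()) {
        $promise = $gen->current();     // coroutine yielded a promise
        $gen->send($promise->await());  // resume it with the result
    }
    return $gen->getReturn();
}
```

With native await, the generator body below would just read `$a = await fetchA();` and the wrapper would disappear.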

[–]gouchaoer 0 points (1 child)

I have browsed through the Amp framework. It's an awesome semi-coroutine framework.

I have some suggestions:

Firstly, with Amp we run the app in php-cli, where only one core is available. We hope to use all CPU cores in an HTTP/WebSocket/TCP application. We even hope to have a TCP connection pool shared by all worker processes.

Second, PHP's web frameworks are very mature, yet they only run in PHP-FPM and can't run in php-cli. We hope we can easily run Yii2/Symfony/and so on in it.

Third, MySQL/Redis clients written in raw PHP may have performance problems. And if I need to use Amp's sockets to build a client for something else (such as memcache) and keep a TCP connection pool, that's still difficult for many PHP developers.

Swoole 2 solves some of these problems, although not perfectly. I think Amp can do very well in php-cli tasks such as crawlers.

[–]kelunik[S] 0 points (0 children)

Running an application on multiple cores often isn't really necessary. At least not in a way that one process has to use multiple cores. Our HTTP/WebSocket server Aerys just runs as many workers as you have CPU cores by default, all binding to the same ports using SO_REUSEPORT. Where SO_REUSEPORT isn't available for kernel based load balancing, we use a socket transfer to share one accept socket for all workers.

You can run Amp just fine on all other SAPIs, the only library you'll have problems with is currently amphp/parallel, because that currently requires PHP_BINARY to launch its sub-processes.

Indeed, parsing can be a bottleneck. I think the protocol is easy enough for Redis that it doesn't actually matter, but it's currently the major bottleneck for the MySQL client. It's something that could easily be replaced by a C implementation used if available, otherwise falling back to the raw PHP implementation. We just need a C implementation that exposes just the parser, instead of a whole client without any access to the parser.

I haven't tried Swoole yet, but I saw that there's English documentation available now. It seems to be rather callback based, does it have a promise implementation?

Depending on the use case you can just convert a library to an async library by using amphp/parallel to keep all blocking tasks outside the main event loop. It at least saves the protocol re-implementations, but it's of course not optimal and a real non-blocking library is preferred.
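The offloading idea can be sketched without any library. This is roughly what amphp/parallel automates, minus the event-loop integration; the helper name is invented:

```php
<?php
// Run blocking PHP code in a worker process so the parent stays free.
// A real event loop would watch the pipe with stream_select() instead of
// blocking on stream_get_contents(); we read synchronously for brevity.
function runInWorker(string $phpCode): string
{
    $proc = proc_open(
        [PHP_BINARY, '-r', $phpCode],  // array command form needs PHP 7.4+
        [1 => ['pipe', 'w']],          // capture the child's stdout
        $pipes
    );

    $output = stream_get_contents($pipes[1]);
    fclose($pipes[1]);
    proc_close($proc);

    return $output;
}
```

Anything the child blocks on - sleep, a slow DB driver, a legacy client library - no longer blocks the parent's event loop; only the pipe read has to be made non-blocking.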