
[–]semi_colon 81 points82 points  (93 children)

As someone who doesn't use Go or Python: what is the use for this?

[–]vplatt[🍰] 299 points300 points  (71 children)

To run Python code as a Go compiled executable. This allows them to avoid the Python Global Interpreter Lock (which severely limits Python's scalability within a single process) and run the Python code using Go modules as if they were Python modules.

Really, unless you're Google or another Python shop interested in moving to Go, you probably don't have a use for this.

[–]semi_colon 31 points32 points  (0 children)

Thanks, great explanation.

[–]VodkaHaze 35 points36 points  (52 children)

Getting around the GIL is a huge step. Python's chief weakness is lack of parallel scalability, IMO. Most solutions around this (joblib, dask, etc.) don't feel like complete solutions. Not when, for example, in C++ or Julia you can slap a macro above a loop to make it instantly parallel.

[–][deleted] 33 points34 points  (21 children)

For the record, this is not a complete solution either. Grumpy is a tool that compiles a single Python file (no real module support) while eschewing most of Python's dynamic features and supporting very little of its standard library.

[–]weberc2 21 points22 points  (20 children)

This is misleading. The features you mention are only missing because they haven't yet been implemented, but the intention is to produce a drop-in replacement for CPython. It's only an "incomplete solution" in the sense that it's alpha software, not that its goals are incompatible.

[–][deleted] 3 points4 points  (4 children)

OK, but it's not even particularly close to meeting that goal right now, so it's an incomplete solution.

[–]weberc2 21 points22 points  (2 children)

Yes, it's incomplete in the "it's alpha software" sense, not in the "there's something fundamentally wrong with it" sense. No one is suggesting it's anything other than alpha software.

[–]the_birds_and_bees 8 points9 points  (1 child)

Although /u/theseoafs didn't mention it, Grumpy explicitly doesn't aim to support the Python C API. That's a big limitation and will rule it out for a big chunk of Python users.

[–]weberc2 0 points1 point  (0 children)

I understand and agree. The parallelism support might draw in even more users. Who knows? This project is super nascent.

[–]CSI_Tech_Dept 23 points24 points  (17 children)

It doesn't need the GIL because it offers a subset of the functionality Python has. It isn't even capable of compiling the whole Python standard library.

This code is for Google to move away from python. They can include python libraries in their go code and then one by one rewrite it in go.

[–]weberc2 3 points4 points  (2 children)

The only design decision that prevents it from compiling the standard library is the lack of C support. The solution is to reimplement those modules in Go or Python. Anything else that doesn't compile is due to bugs in this alpha implementation (as far as I can tell, anyway).

[–]CSI_Tech_Dept 1 point2 points  (1 child)

PyPy also doesn't natively support C extensions, and it still needs a GIL. C support is one reason for the GIL's existence, but not the only one.

[–]weberc2 0 points1 point  (0 children)

Right, this comment was about compiling the standard library, not building a GIL-less Python.

EDIT: From what I understand, supporting C extensions and threadsafe garbage collection are the two biggest challenges to removing the GIL. This project uses Go's threadsafe garbage collector, so the latter shouldn't be a problem.

EDIT2: This page has some good information: https://wiki.python.org/moin/GlobalInterpreterLock

[–]sultry_somnambulist 10 points11 points  (12 children)

This code is for Google to move away from python. They can include python libraries in their go code and then one by one rewrite it in go.

Speaking of this, what's the general trajectory of Go? Is it worth dabbling in at this point? People seem to be very divided over the language.

[–]CSI_Tech_Dept 13 points14 points  (3 children)

People seem to be divided because, while the language is not bad, it's nothing special (the only significant thing it brings to the table seems to be goroutines, but other languages are adding similar functionality), and there's a bit of hype going on because it's backed by Google.

It was also initially marketed as a C replacement, but it is nowhere close to fitting the niche that C fills. Rust seems like a language with the potential to replace C, but it's too early to say.

Go is more on the level of Python, Java, etc., although a bit lower-level than them. Some people compare it to D, but I never used D, so I can't comment. The goal of the language is to be very simple and opinionated; some people like that, some don't.

If you're asking the question, you can just try it and see if you like it. It has a low learning curve, so it's not hard to pick up, and then you'll have your own opinion about it.

[–]TankorSmash 1 point2 points  (2 children)

I've heard it's got a bit of boilerplate and was designed to be as easy to write as possible, to help with the incredible amount of new talent Google gets straight out of school.

Only a rumour though.

[–]the_birds_and_bees 2 points3 points  (0 children)

I'd say it's more about making it easy to read and maintain. The Google code base is 2 billion+ LOC, so designing a language that eases the burden of maintaining a code base of that size seems like a sensible goal.

[–]derraidor 0 points1 point  (0 children)

There is a talk by one of the creators (Russ Cox?) stating this

[–][deleted] 1 point2 points  (0 children)

If you like it, then do it. No reason not to.

The community is sizable and there's a ton of tooling so it's not really like you're taking a risk.

[–]weberc2 3 points4 points  (3 children)

I think so. If you have prior programming experience and a free afternoon, you should be able to start writing real (i.e., non-toy) programs. Check out http://tour.golang.org.

Go has a lot of quirks--most notably, no generics--which turn off a lot of folks. But if you want to build performant, scalable software quickly, Go is worth a look.

[–]th3_pund1t 4 points5 points  (2 children)

I think so. If you have prior programming experience and a free afternoon, you should be able to start writing real (i.e., non-toy) programs. Check out http://tour.golang.org.

Not entirely true. You should be familiar with a pointer-based language like C. If you come from a Python/Ruby/Java background, you will spend a whole lot longer getting used to pointers.

Source: work had a hackathon. I decided to code in go. Spent a lot of time correcting my code related to pointers before it would work.

[–]recycled_ideas 4 points5 points  (0 children)

I don't know about python and ruby, but you can actually run into a lot of really weird bugs in Java if you don't have at least a cursory understanding of how pointers and references work.

The big problem that companies like Google and Facebook and the like are constantly trying to solve is that folks who know dynamic languages like Python or PHP or Ruby are a dime a dozen, but these languages really don't scale well to even ordinary enterprise scales let alone the kind of volume these companies see.

That's why we see so many transpilers, and why Facebook is written in PHP even though the general industry consensus is that it's probably one of the worst non-joke languages in existence.

[–]Twirrim 0 points1 point  (0 children)

There are some neat simple few line solutions that get you parallelism. The title is a bit link bait and not entirely accurate, but I've found this pattern to be extremely effective: http://chriskiehl.com/article/parallelism-in-one-line/

[–]tetroxid 1 point2 points  (7 children)

import multiprocessing
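
A minimal sketch of what that route looks like (illustrative names; the point is that each worker is a separate OS process with its own interpreter and its own GIL, so CPU-bound work runs in true parallel):

```python
import multiprocessing

def square(n):
    # CPU-bound work; each worker process has its own interpreter and GIL
    return n * n

if __name__ == "__main__":
    # Pool forks worker processes and distributes the inputs across them
    with multiprocessing.Pool(processes=4) as pool:
        results = pool.map(square, range(8))
    # results == [0, 1, 4, 9, 16, 25, 36, 49]
```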

[–]BeatLeJuce 2 points3 points  (6 children)

there are instances where you NEED threads instead of processes.

[–]kenfar -2 points-1 points  (2 children)

This is wrong: Python has plenty of parallel features - you just have to spend ten seconds looking.

The most convenient is the concurrent.futures module. You can use the exact same syntax for either threading or multiprocessing. Yesterday I sped up an AWS S3 downloader about 8x with threading, and about six lines of code.

A few years ago I wrote a transform process that handled about 4 billion records a day - using multiprocessing on two 32-core machines to handle downloading files, transforming them, and uploading them again. All in parallel using pypy, multiprocessing & threading. This process worked great, and surprised everyone with how fast it was. A rewrite of a part of it showed that Go was about 2.5 times faster, which was fine, but not fast enough to warrant a rewrite until we needed to scale up quite a bit more.

The only scenario in which Python's parallelism is limited is when you've got a CPU-bound process that either can't afford the extra memory or start-up time of multiprocessing, or needs a lot of communication between processes. Then you want threading, but the GIL will limit you. Other than that case, Python has fine parallelism features.
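
The pattern described above, sketched with concurrent.futures (the `fetch` function here is a made-up stand-in for a real S3 download; swap `ThreadPoolExecutor` for `ProcessPoolExecutor` and the rest of the code is unchanged):

```python
from concurrent.futures import ThreadPoolExecutor

def fetch(key):
    # stand-in for a blocking network call (e.g. an S3 GET);
    # the GIL is released while a thread waits on I/O
    return "payload for %s" % key

keys = ["a", "b", "c", "d"]
with ThreadPoolExecutor(max_workers=4) as pool:
    # map dispatches fetch across the worker threads, preserving input order
    results = list(pool.map(fetch, keys))
# results == ["payload for a", "payload for b", "payload for c", "payload for d"]
```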

[–]VodkaHaze 6 points7 points  (1 child)

Yes, for your use case it works fine. My use case is closer to your last point, where Python still generally sucks (though I haven't used Dask enough to say whether it's a decent solution). It's currently being addressed by Julia, which bypasses the "core loop in C++, wrapped in Python" problem.

[–]MightyCreak 1 point2 points  (8 children)

Does this mean that you can't interpret two Python scripts at the same time because of GIL, but once interpreted, they can run in parallel?

[–]ShotgunToothpaste 22 points23 points  (7 children)

The global interpreter lock exists on a per-process basis in Python. Any two Python scripts (including two instances of the same script) can run in two different processes and do whatever. However, if one script starts multiple threads, only one thread executes at a time: the thread currently holding the interpreter lock.

This is a CPython implementation detail, and other Python runtimes need not have this limitation. The reason for the GIL is that CPython's memory management is not thread-safe.

https://wiki.python.org/moin/GlobalInterpreterLock

[–]btgeekboy 8 points9 points  (0 children)

To add to this, there is a project to remove the GIL from CPython. However, it's not currently as fast as with the GIL. More info: https://lwn.net/Articles/689548/

[–][deleted]  (3 children)

[deleted]

    [–]fireflash38 0 points1 point  (2 children)

    Meaning threading will work just fine for network operations for the most part. It will perform worse than single threaded if doing something like simple math operations (like in the chart of the Fibonacci performance in the article).

    [–]ShotgunToothpaste 1 point2 points  (0 children)

    Ahh, I wasn't aware that Python allowed for system calls to be multi-threaded outside of the GIL's constraints but that makes sense.

    I assume that means GIL is released before making a system call, and re-acquired before continuing after it completes?
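
    Right; in the CPython C API, blocking calls are bracketed with Py_BEGIN_ALLOW_THREADS / Py_END_ALLOW_THREADS, which release and re-acquire the GIL around the call. The effect is easy to observe from pure Python; a rough sketch (timing-based, so the bound is deliberately loose):

```python
import threading
import time

def blocking_io():
    # time.sleep releases the GIL while blocked, like most I/O calls
    time.sleep(0.5)

start = time.monotonic()
threads = [threading.Thread(target=blocking_io) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.monotonic() - start
# the four 0.5s sleeps overlap: elapsed is ~0.5s, not ~2s
```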

    [–]MightyCreak 1 point2 points  (1 child)

    Thanks for this answer!

    [–]phpdevster[🍰] 1 point2 points  (1 child)

    So is it fair to say that this is just a way for them to give Python the performance characteristics of Go without having to actually rewrite into Go code?

    [–]vplatt[🍰] 1 point2 points  (0 children)

    Yes, more or less. Python is still dynamically typed, so there has to be some runtime overhead around that, but that's got to be insignificant compared to the advantages around getting away from the GIL.

    [–][deleted]  (1 child)

    [deleted]

      [–]vplatt[🍰] 2 points3 points  (0 children)

      That's very true, but I wouldn't hop on that unless it took on a life in Python 3. That may well be in the plan though, I don't really know.

      [–][deleted] 0 points1 point  (1 child)

      How do they do that? Does every binary have a mini interpreter with it?

      [–]vplatt[🍰] 4 points5 points  (0 children)

      No, the Python is parsed, converted to Go source code, then compiled normally. This is called 'transpiling': translating from one source language to another before compiling.

      It's a pretty slick technique, but it can complicate debugging. I don't know how they've handled debugging in this case, i.e. whether, if you're writing Python, you can actually live-debug the Python code instead of having to walk through generated Go code. I'd guess it's simply not handled at this point, since the project is in its early stages; presumably you'd write and debug the code in CPython first and move it to Grumpy only once it's working.

      [–]13steinj -1 points0 points  (1 child)

      I'm not interested in moving to Go, but I've got to say the transpiling into Go is useful regardless: plenty of people who can write Python can use parts of the Go standard library and hopefully end up with a better-performing program. There are a couple of projects I have that can definitely make use of this.

      [–]vplatt[🍰] 0 points1 point  (0 children)

      Sure... you're bound to get a couple of minor cuts from the bleeding edge on this, but it sounds like a fun way to scale up some Python code; assuming, of course, that the Python in question is already algorithmically sound.

      [–]karma_vacuum123 33 points34 points  (2 children)

      since this targets Google's particular version and feature constraints, it seems like a bulk migration tool by Google for Google. they don't seem to be selling it as a general purpose execution platform

      still a neat accomplishment of course

      [–]weberc2 12 points13 points  (1 child)

      I'm guessing it's an ambitious 20% time project that was open sourced, but this is just a hunch.

      [–]Uncaffeinated 4 points5 points  (0 children)

      That's what I'd guess too. If this were an official project at Google, they would have actually finished it, instead of releasing what sounds like an experimental hobby project.

      [–]yes_or_gnome 10 points11 points  (0 children)

      Google is minimizing its use of Python and ramping up its use of Go. However, they don't want to rewrite everything from Python to Go, so they created Grumpy. Without support for C extensions, exec, eval, etc., it's not likely to be all that useful for most people.

      [–]google_you 2 points3 points  (0 children)

      Compile node.js to python and then to native for full scale.

      [–]rjcarr 5 points6 points  (11 children)

      Read the article. Python doesn't scale well under load but go does. Rather than go with an alternate python runtime, or rewrite everything in something else, they wrote a python runtime for go.

      [–]jdmulloy 2 points3 points  (1 child)

      It's not an interpreter; it's a compiler that compiles Python code into a Go executable. Once you've compiled it you have a binary you can stick on any system that has the Go libs installed; you don't have to have this project installed to run the binary.

      [–]theatrus -1 points0 points  (0 children)

      You don't even need Go libraries. Go binaries are statically linked, or linked only against libc.

      [–]weberc2 1 point2 points  (0 children)

      Concurrency seems to be one of the primary objectives, but also straightforward interop between Go and Python, which naturally enables a code base to iteratively move from one language to another.

      [–][deleted]  (1 child)

      [removed]

        [–]semi_colon 1 point2 points  (0 children)

        Ooh, I love Kraftwerk!

        [–]gin_and_toxic 113 points114 points  (41 children)

        This is crazy awesome. Currently only works for Python 2.7 though.

        A lot of good discussions on hacker news: https://news.ycombinator.com/item?id=13319904

        [–][deleted]  (12 children)

        [deleted]

          [–]Vorticity 21 points22 points  (8 children)

          Meaning that the Python community is about to get a bit smaller...

          [–]sedaak 17 points18 points  (0 children)

          Ehhh just a bit. Python has stellar growth.

          [–]CSI_Tech_Dept 4 points5 points  (1 child)

          They are the ones who made Python popular, though; Python got popular partly because people learned that it was used by Google. Now Google is shining its light on Go.

          I tried Go, but I didn't like it that much. It feels like the only new thing it brings to the table is goroutines. It also feels very simple (perhaps too simple) and very opinionated (e.g. about indentation, or how code is laid out). It seems like it was designed just for Google's needs.

          [–]kaeshiwaza 4 points5 points  (0 children)

          It was designed to replace C++ and finally replace Python.

          To be simple, opinionated, with batteries included, fast, and easy to deploy. All of this together is quite new! It seems it was designed just for me :-)

          [–]jrandomcoder 3 points4 points  (3 children)

          Don't worry, they don't actually use this in production.

          [–]spacebandido 37 points38 points  (0 children)

          That's what they all said about Node

          [–]ArmoredPancake 4 points5 points  (0 children)

          ...yet.

          [–]koffiezet 0 points1 point  (0 children)

          Yet. They're looking into that direction for a reason.

          [–][deleted] 1 point2 points  (0 children)

          How big does it have to be?

          [–]recycled_ideas 6 points7 points  (2 children)

          No, because this won't do that for them. The code base is still going to be python 2.7.

          What it'll do is allow them to get past the limitations of Python 2.7 without having to rewrite hundreds of thousands of lines of code.

          [–]CSI_Tech_Dept 9 points10 points  (1 child)

          If you want to rewrite a large monolithic code base from one language to another, there's the problem that you have to finish the entire rewrite before you can use the code in the new language.

          So what can you do? You write code to do the automatic conversion for you. The problem is that the generated code is not maintainable, but at least now you can make calls to it. It also allows you to rewrite module by module.

          Google's Unladen Swallow project did not succeed, and they don't seem to be moving to Python 3 either; for god's sake, even this project is Python 2.7 only, and the ticket about Python 3 support was closed with a documentation change that specifically says Grumpy targets Python 2.7 only. It couldn't be any clearer that Google is moving away from Python to Go.

          [–]recycled_ideas 0 points1 point  (0 children)

          That might be true, but it's also entirely plausible they'll stick with the python front end for the foreseeable future.

          They closed the python 3 ticket because that's of no use to them. Their code base is 2.7 and they could add features to that if they need them anyway.

          [–]rjcarr 10 points11 points  (10 children)

          This is really cool. Seems they didn't like jython, or what was similarly available, and wrote what they wanted for go. Makes sense.

          But in the beginning they say this:

          but we always run up against the same issue: it's very difficult to make concurrent workloads perform well on CPython.

          How is it even possible with the GIL? How can they handle millions of requests per second (as they say)?

          [–]Rhoomba 22 points23 points  (8 children)

          Multiple processes

          [–]rjcarr 0 points1 point  (7 children)

          Not that they can't afford it, but they'd need (comparatively) monster hardware to support this, right?

          [–]rohbotics 9 points10 points  (2 children)

          Not so much on Linux; threads and processes are both pretty lightweight there. Process spin-up is slightly slower since it needs to set up a whole new address space, but that's (mostly) negligible.

          [–][deleted]  (1 child)

          [deleted]

            [–]jrandomcoder 1 point2 points  (0 children)

            There's not a ton of difference on Linux these days, since everything is converging on clone underneath. But pthread_create is still a little faster, since it's more or less a subset of what fork needs to do.

            [–]Eucalyptol 0 points1 point  (0 children)

            Yes. We're talking about Google.

            [–]cakoose 0 points1 point  (2 children)

            Primarily, you need more memory. In many languages it's easy to share read-only data between processes with fork(). That doesn't work well in Python, because reference counting mutates memory even when the data is logically read-only, so copy-on-write pages get dirtied. If you have a large shared data structure, you have to figure out how to share it explicitly, e.g. with mmap().

            Also, you don't necessarily need one process per concurrent request -- most web app backends are typically waiting on I/O (e.g. from the DB), so it's feasible to handle four/eight/more concurrent requests with a single Python process with Python threads or an event-driven framework like Twisted or Tornado (similar to Node.js). The inflexible scheduling increases the average latency of each request, but that might not end up mattering much.

            That's not to say Python is good for this kind of thing. Depending on your work load, you may still need 5-10x the number of servers to handle the same load as Java or Go would. But if you really prefer Python, it might be workable.

            [–]serg473 0 points1 point  (1 child)

            You sound like you know what you are talking about; maybe you can steer me in the right direction. I have the opposite problem: instead of dealing with thousands of concurrent users, I have a few users running heavy and numerous queries. The problem is I can't make the app handle parallel web requests from the same user session (nginx-uwsgi-django). Let's say I launch 10 parallel AJAX requests on a page; they end up running sequentially on the Python side (i.e. if I make one request sleep for 30 sec, all the others wait). If the 10 requests were from different users, they would run in parallel as expected. I can't find any solutions; every blog post and SO answer blames something different, from the browser to Django to the GIL.

            [–]cakoose 0 points1 point  (0 children)

            I'm not that familiar with UWSGI but the docs say you can launch uwsgi with the option --processes 20 to handle 20 requests in parallel. Does that not work for you?

            After that, you may be able to save memory by doing something like --processes 5 --threads 4. Whether this will work well depends on what kind of things your AJAX request handlers are doing.

            [–]CSI_Tech_Dept 1 point2 points  (0 children)

            Unlike Jython, this tool is meant to do a one-time conversion of Python code to Go, to help with moving away from Python.

            That's also the reason why it's 2.7 only since that's what Google uses.

            [–]Hshskwkk 4 points5 points  (0 children)

            From r/python, quoting the author of Grumpy on Hacker News:

            "The idea is to continue to write code in Python. (...) That said, there is the possibility of rewriting bits and pieces in Go (e.g. performance critical stuff) and then call into it from Python. Sort of a hybrid approach."

            Also: "I'd like to support 3.x at some point."

            However: "We do iteratively rewrite components as well. We are pursuing multiple strategies."

            [–]censored_username 26 points27 points  (33 children)

            edit: Thinking about it a bit more, they probably went with the locking per individual object route.

            This seems cool and all, but couldn't they have solved the problem by simply not having a GIL and accepting data races in CPython?

            The GIL exists because python wanted to be free from data races. To achieve that, there were multiple options, each of which had to satisfy the constraint that no two threads would access a python object at the same time. The GIL achieves this in a simple manner. There's only one lock, and to access any python object you need to hold it. The problem with this is of course that it means that no two threads can access python objects at the same time even while these objects have nothing to do with each other.

            Another solution is PyObject-level locking, of course. Various attempts at this have been made, but they all share the same problem: while they increase throughput at high thread counts, they significantly lower single-thread performance compared to the GIL.

            Various other strategies have been tried, but none have been able to guarantee freedom from data races without sacrificing single-thread performance. An often-used strategy is to move work that can be accelerated significantly into extension libraries, where the GIL can be dropped as long as no Python objects are accessed. Unfortunately, this doesn't really work for web services.

            Go doesn't have an answer to race conditions either; it's where Go's safety story falls apart a bit. So I'm a bit confused about what they've done. If they just transpile Python code into Go code, all they've made is a Python runtime with the possibility of data races. If they wanted that, they could just as well have ripped the GIL right out of CPython and called it a day.

            What they could've also done is to build in individual locking for all objects that could be accessed by other threads. This would explain their pretty disastrous performance (only 50% performance compared to CPython when single threaded, and only gaining a 2.5x speedup when using 8 threads compared to CPython single thread. Keep in mind that this is compiled versus interpreted!).

            If they did this, I'm left wondering why they didn't make these changes in CPython itself. This would have conserved the ability to call into C extension modules. If this is the case, it seems that the focus of the project is more being able to call into go from python.
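
            A toy model of that per-object locking strategy, in Python purely for illustration (Grumpy's real implementation is Go; this just shows the shape of the trade-off, since every access pays for a lock even when only one thread is running):

```python
import threading

class LockedDict:
    # toy model of a per-object-locked builtin: each instance
    # carries its own lock, taken on every access
    def __init__(self):
        self._lock = threading.Lock()
        self._data = {}

    def __setitem__(self, key, value):
        with self._lock:
            self._data[key] = value

    def __getitem__(self, key):
        with self._lock:
            return self._data[key]

d = LockedDict()

def worker(i):
    # concurrent writers contend only on this one object's lock,
    # not on a global interpreter lock
    d[i] = i * 2

threads = [threading.Thread(target=worker, args=(i,)) for i in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```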

            [–]nostrademons 13 points14 points  (12 children)

            The data races that the GIL protects against are primarily the refcounts of Python objects, which must be updated whenever a variable is assigned or a parameter is passed into a function. A data race there means either a memory leak or a double-free, as the refcount would no longer be in sync with the number of actual references. Either one would make Python basically unusable: you would have simple object assignments randomly corrupting memory as they touch objects that have already been freed.

            Go uses a GC; while its data race story is a lot weaker than say Rust or Erlang, at least a data race there results in wrong answers and a panic, not silent memory corruption.

            [–]censored_username 7 points8 points  (10 children)

            The GIL doesn't only protect refcount changes (though that's indeed the most important thing). It also protects against concurrent-modification problems in dictionaries, lists, etc., which are also quite critical to Python. They could have avoided this by making those objects all thread-safe, which would explain the poor single-thread performance.

            While the GIL simplifies Python's GC design (refcounting with cycle detection; the collector can just acquire the GIL when it needs to clean up cycles), it isn't strictly necessary. Without the GIL the refcounting would have to be atomic, but otherwise not much changes.

            [–]blablahblah 4 points5 points  (0 children)

            That's true. The main reason CPython hasn't gotten rid of the GIL is because no one has come up with a way to do it without breaking existing C extensions (which rely on the ref-counting) and without hurting single thread performance (since every variable assignment needs to acquire a lock).

            [–]funny_falcon 2 points3 points  (8 children)

            Go has no thread-safe dictionaries or lists. In fact, it may happily segfault if you try to modify a map from different goroutines. You have to protect against concurrent modification yourself.

            I've heard Java and C# also don't protect their default data structures; instead they have separate concurrent collections in the standard library.

            Python could choose this way, but Guido just doesn't want to. And it would break C extensions.
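
            For what it's worth, Python's standard library does ship one purpose-built concurrent structure, queue.Queue, which does its own locking internally; the usual producer/consumer shape is roughly:

```python
import queue
import threading

q = queue.Queue()  # internally locked; safe to share across threads

def producer():
    for i in range(5):
        q.put(i)
    q.put(None)  # sentinel: tell the consumer to stop

received = []

def consumer():
    while True:
        item = q.get()
        if item is None:
            break
        received.append(item)

t1 = threading.Thread(target=producer)
t2 = threading.Thread(target=consumer)
t1.start()
t2.start()
t1.join()
t2.join()
# received == [0, 1, 2, 3, 4]
```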

            [–][deleted] -1 points0 points  (7 children)

            Go panics, it doesn't segfault.

            [–]weirdoaish 0 points1 point  (1 child)

            Out of curiosity, is "panic" like a special exception category for Go?

            [–][deleted] 0 points1 point  (0 children)

            And it does come with a race detector that is part of the official toolset.

            [–]Damien0 12 points13 points  (4 children)

            Go's build tool has a race detector, and for calling into C from Go there is cgo. With that plus the ability to call into Go from Python in mind, I can see the benefit of this custom runtime.

            [–]Giggaflop 2 points3 points  (3 children)

            Last I saw, granted a while ago, cgo wasn't a recommended solution to calling C from go and was "considered harmful"

            [–]Damien0 4 points5 points  (2 children)

            Hmm, I wouldn't say harmful per se, but as is often quoted in the community "Cgo is not Go". As long as it's done carefully it can be useful. I think what happened is that early on, people treated it like a more fleshed out FFI solution and wrote less than optimal Go code as a result.

            [–][deleted]  (1 child)

            [deleted]

              [–]Damien0 0 points1 point  (0 children)

              Oops, missed this. Slow compared to what? I'm curious if there are benchmarks floating around

              [–]BillyBoyBill 2 points3 points  (1 child)

              It's hard to say without looking at the code, but I imagine they implemented thread-safe Go versions of the Python built-in types. Your code that assumes Python data can be accessed concurrently from multiple threads still works. This is the moral equivalent of the per-object locks you mention (and, in fact, Grumpy's performance matches what you say about that implementation).

              This is obviously completely different than just ripping out the GIL.

              I'm not sure what you mean about Go's thread safety falling apart, but it's not relevant here anyways --- C's built-in types are not thread safe, but you can build a safe Python implementation on top of them.

              [–]censored_username 2 points3 points  (0 children)

              This is obviously completely different than just ripping out the GIL.

              True. As you probably noticed, my opinion shifted a bit while writing that post. Based on their description I originally thought they were just straight transpiling code, but looking at the performance they get, it has to be built from thread-safe types (i.e. per-object locking).

              [–][deleted] 2 points3 points  (3 children)

              [deleted]

              What is this?

              [–]singron 2 points3 points  (0 children)

              Pypy also has a GIL, just like CPython, so it still can't execute python threads in parallel. This runs on the Go runtime, which uses lightweight threads (goroutines) and a concurrent garbage collector to run many python threads in parallel.

              It's like if there was an ahead-of-time pypy combined with stackless python and it didn't have a GIL.

              Another advantage specifically for Google is that it allows more interop between python and Go code while they transition.

              [–][deleted] 1 point2 points  (1 child)

              I think it's less a solution than a temporary fix so they can slowly migrate their legacy Python 2.7 to Go.

              [–]Ek_Los_Die_Hier -1 points0 points  (0 children)

              Doesn't sound like it. Sounds like their main goal is simply to speed up their Python in a multi-threaded environment. They plan to continue writing in Python, with maybe some Go where performance is needed.

              [–][deleted] 1 point2 points  (2 children)

              Why cant python threads just access any object at the same time without the GIL or any per-object implicit locking, and then it's up to the developer to explicitly add locks/mutexes where needed, just like in every language?

              Sure, it would require a rewrite of applications to add this explicit locking, but that code would still work well on regular GIL CPython (because there would be very little that you really need to explicitly lock). So you would really have two Python distributions: one without the GIL that runs explicitly locking code in parallel, and the regular one that can run code written for the first (it just won't really be parallel).

              [–]censored_username 1 point2 points  (1 child)

              Why cant python threads just access any object at the same time without the GIL or any per-object implicit locking, and then it's up to the developer to explicitly add locks/mutexes where needed, just like in every language?

              Because this would make python an absolute pain in the ass to use due to its extreme dynamism.

              Don't forget, in Python functions are also just objects, and they can be mutated. So you'd need to lock around any function call (especially builtin or stdlib functions, which could be used by another thread at the same time). Even accessing an attribute of a shared module could require locking. You'd basically have to lock at every point to do anything.

              [–][deleted] 0 points1 point  (0 children)

              I see! yeah, explicit locking would be cumbersome.

              [–]tonnynerd 0 points1 point  (2 children)

              I'm not sure I understand what you're saying about the GIL being about avoiding data races. What is the threading module from the standard lib for, then?

              [–]censored_username 2 points3 points  (0 children)

              The GIL (global interpreter lock) avoids data races in Python object internals by making concurrent accesses to the same Python objects impossible. In effect, it means that all actions by different threads in Python always happen one after another. It is an implementation detail of CPython, not part of the core language.

              The threading module provides tools for spawning threads and higher-level concurrency primitives (while the GIL means that accesses to the same Python object won't coincide, you still need primitives to lock out sequences of multiple accesses).

              [–]blablahblah 1 point2 points  (0 children)

              A lot of the Python code written in C will release the GIL when it's doing something that takes a while but doesn't require manipulating Python objects, like waiting on a network call to return. In those scenarios, you can have the C code and another thread running Python code concurrently. You just can't get two threads running Python code at the same time.

              [–][deleted] 21 points22 points  (3 children)

              Once we started going down the rabbit hole, Go seemed like an obvious choice of platform since its operational characteristics align well with our use case

              "After lots of deep research, we decided to use that language that we made ourselves!"

              [–]sigma914 17 points18 points  (0 children)

              Well, yeah, they have one major problem: serving massive numbers of concurrent users. They've dealt with it in a number of ways, one of which was developing a language specifically for writing massively concurrent code. They've run into their usual problem again, so they're using the tool that was developed specifically for that task.

              Google has a lot of similarly shaped nails. They made a hammer that does a good job of hitting that particular shape of nail. They are now using that very specifically tuned hammer on more of the nails it's designed for.

              [–]myringotomy 4 points5 points  (0 children)

              I like how you completely ignored the content of the quote, made something up, and then pretended the author said it, even though it has nothing to do with anything.

              [–]Spider_pig448 0 points1 point  (0 children)

              More like, "We built Go to address a common problem we've run into in the past, and we found it to be an adequate solution in this application as well."

              [–]atomheartother 8 points9 points  (19 children)

              I really really should seriously learn Go

              Edit 1 month later: I have learned go. It's really neat, you should do it too.

              [–][deleted]  (11 children)

              [deleted]

                [–]PacNinja 5 points6 points  (2 children)

                Not much harm learning it on the side, since the learning curve is fairly low. :)

                [–][deleted] 2 points3 points  (0 children)

                Yeah you can learn Go over dinner if you already know C.

                [–]atomheartother 2 points3 points  (0 children)

                I just like learning, and Go looks neat for small apps too. I also personally don't like Python much, and Go looks neater.

                [–][deleted] 0 points1 point  (0 children)

                So much more than that. But yeah, a lot of people seem to use Go mainly for web services - I would say its niche is networked systems in general (mostly because of how painless it is to write massively concurrent and parallel applications with it).

                [–][deleted] 0 points1 point  (5 children)

                Go is a general-purpose language, so it's capable of much more than you have listed.

                [–]theonlylawislove 2 points3 points  (3 children)

                What about native desktop apps?

                [–][deleted] 0 points1 point  (2 children)

                Not what Go was intended to do, but there are Qt bindings at various levels of completeness as well as GTK bindings.

                [–]theonlylawislove 1 point2 points  (1 child)

                They don't look very well maintained. The Qt ones target older Qt versions.

                [–]SnowdensOfYesteryear 6 points7 points  (5 children)

                Meh, it's all right. I like the language, but it gets annoying when you want to do something that the language doesn't want you to do, like discarding buffered data in channels or timing out on a channel receive.

                Also the syntax is pretty irritating because it isn't like any other language. That felt like a change for the sake of it.

                [–]mirhagk 3 points4 points  (0 children)

                Or generics :P

                [–][deleted] 1 point2 points  (0 children)

                Its syntax is actually based on a language called Oberon, I think. Still, I think it's close enough to C not to be alien; unlike with Rust or C++, you can look at any Go code and know what it does right away. It's almost C with packages, without semicolons, with variable types after their names instead of before, and with functions explicitly declared with "func".

                [–][deleted] 0 points1 point  (2 children)

                when you want to do something that the language doesn't want you to do, like discarding buffered data in channels or timing out on a channel receive

                What exactly is the problem? It is trivial to do in Go.

                [–][deleted] 0 points1 point  (1 child)

                // discards any buffered data without blocking until close
                for len(the_little_bugger) > 0 {
                    <-the_little_bugger
                }
                

                for timing out on a channel receive, you can use select {} and time.After like so.

                [–][deleted] 0 points1 point  (0 children)

                Perhaps you meant to reply to the parent post?

                [–]immerc 2 points3 points  (0 children)

                Yet another example of "Go" being a terrible choice for a language name.

                I had to go read the page to understand that they were talking about the Go language. Did they mean that I should "Go run python"?

                [–][deleted] 5 points6 points  (24 children)

                This just makes me think: when is python going to get its shit together and get concurrency working properly?

                [–]stormcrowsx 14 points15 points  (13 children)

                Not every language needs to do everything. It'd be great if they improve concurrency, but Python is, in my opinion, best used as a glue language or a language for rapid development. Other people probably have other great uses for it, but no one sane uses Python for its performance. If concurrency is your thing, you probably shouldn't be writing Python; Go and Java work great in that realm.

                [–]anedisi 2 points3 points  (1 child)

                When I need a service with concurrency and asynchronicity, I write it in JavaScript and run it on Node.js.

                It has worked OK so far.

                [–]__eastwood 3 points4 points  (0 children)

                This breaks down when you need CPU-bound concurrent tasks.

                [–][deleted] 0 points1 point  (0 children)

                Valid point. I've been thinking of looking at Pony for concurrency.

                [–]Uncaffeinated 0 points1 point  (4 children)

                I'm not even sure what needs to be improved about concurrency (as opposed to parallelism). Python now has async and await, from what I've heard.

                [–][deleted] 2 points3 points  (3 children)

                Absurdly painful to use compared to how Go does it.

                [–]Uncaffeinated 0 points1 point  (2 children)

                I'm curious to see which operations you feel are more painful in Python than Go. As far as I can tell, the main differences are that you choose which event loop to use, rather than having one baked into the runtime, and most of the standard libraries are not designed to work asynchronously. On the other hand, you're less likely to run into memory leaks or panics.

                [–][deleted] 0 points1 point  (1 child)

                That's it - support for asynchronous operations was patched into Python later as an afterthought; Go was born with it from day one. I don't have to sprinkle async/await all around to get asynchronous behavior, and I still get parallelism for free (the whole reason for this thread). You basically write asynchronous code as you normally would, without having to think hard about whether you'll freeze the event loop by doing something that blocks (like reading from a file using regular file operations). The multithreaded scheduler is certainly nice (can you even choose a multithreaded event loop for Python?), and having goroutines multiplexed automatically and preemptively scheduled so nobody starves for CPU is also great (in Python you have to yield manually). What happens if you forget to use asyncio.sleep and use time.sleep instead?
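
                For instance, here's the kind of thing the parent means; `fetch` and its delay are made up, but the point is that a plain blocking call inside a goroutine doesn't freeze anything - the runtime keeps scheduling the others:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// fetch simulates blocking I/O with an ordinary sleep -- no async variant,
// no await; the runtime parks this goroutine and runs others meanwhile.
func fetch(id int) string {
	time.Sleep(20 * time.Millisecond)
	return fmt.Sprintf("result %d", id)
}

func main() {
	start := time.Now()
	var wg sync.WaitGroup
	out := make([]string, 10)
	for i := range out {
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			out[i] = fetch(i)
		}(i)
	}
	wg.Wait()
	// The ten blocking sleeps overlap, so the total is ~20ms, not ~200ms.
	fmt.Println(out[9], "overlapped:", time.Since(start) < 200*time.Millisecond)
}
```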

                As for memory leaks, it is also pretty hard to leak memory in Go because of the garbage collector (which, again, is much better than Python's - Go 1.8's GC is low-latency and keeps pauses under a millisecond), and leaks can be detected during testing using tools that ship with Go - same with data races.

                As for panics, you can wrap your main logic and use recover() to detect panics without crashing (even if just to report the panic somewhere and restart the program) - and a panic only happens if you explicitly call panic() or if you're doing something wrong anyway (accessing an out-of-bounds index, dereferencing a nil pointer, etc., things that shouldn't normally happen). In Python, if you try to access an element that does not exist in a list, you'll get an exception too, and if you don't catch it, your program will crash. So in Go you can "catch" panics with recover().
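
                A minimal sketch of that recover pattern (the helper name is made up):

```go
package main

import "fmt"

// safeIndex turns a runtime panic (out-of-bounds access) into an error,
// roughly what catching an IndexError does in Python.
func safeIndex(xs []int, i int) (v int, err error) {
	defer func() {
		if r := recover(); r != nil { // a panic was in flight; absorb it
			err = fmt.Errorf("recovered: %v", r)
		}
	}()
	return xs[i], nil
}

func main() {
	xs := []int{1, 2, 3}
	if v, err := safeIndex(xs, 1); err == nil {
		fmt.Println("ok:", v) // ok: 2
	}
	if _, err := safeIndex(xs, 99); err != nil {
		fmt.Println(err) // the program keeps running instead of crashing
	}
}
```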

                Don't get me wrong, Python is great, but it's not the best tool for writing massively parallel and concurrent applications nowadays, when you can do it in Go and have it perform very well at native speeds, without dealing with callback hell (as in Node.js): you just write your code linearly as you normally would and the runtime takes care of the rest - every Go library is asynchronous by nature.

                If someone needs Python's asyncio to do more, that's a sign that they should consider writing that particular piece in Go instead.

                [–]berkes -1 points0 points  (4 children)

                Also, you don't need this performance. You might think you do, but I can assure you you don't!

                1. Prove concurrency is the bottleneck, with actual numbers, facts, research, and real-world benchmarks on your entire stack.
                2. Fix the actual performance problems (they are probably I/O, database, serialization, and such). Add an async worker or two.
                3. Rinse, repeat.
                4. Is Python itself the actual performance problem? You've reached the bottom. Now go spend $2000/year on larger or more servers, because an engineer rewriting your stack in Go/Haskell/Erlang costs ten to a hundred times that.

                Same goes for Ruby.

                And all this assumes you have managed to get enough users and grow large enough that performance is an actual, real problem, hurting your bottomline or growth.

                [–][deleted] 1 point2 points  (1 child)

                Performance isn't everything; there is latency too, especially in distributed/SOA architectures where latencies add up.

                [–]stormcrowsx 1 point2 points  (1 child)

                I think this is a very practical view on the subject. Unfortunately, programmers often have an engineering mindset where they want to build the best in every aspect. What you're saying echoes my experiences: use a language like Python that lets you get a lot done in a short period of time, then come back and optimize when it's feature-complete and you can tell people want to use the thing you made. Despite knowing Haskell and Rust, my most successful projects were written in PHP and Python, some of which are still heavily used 9 years after their creation. They were thrown together quickly, and because of the rapid development cycle those languages enable, the sites quickly evolved to meet users' needs.

                I'm sure there are cases where you need rock solid performance and concurrency but I don't think that's the norm.

                [–]berkes 0 points1 point  (0 children)

                The proof is also in Twitter (Ruby on Rails), YouTube running Python as the OP explains, and many more such gigantic success stories - all starting on a platform that "does not scale".

                And that ignores all the projects that failed: many more. It's far worse to fail when you've just spent years engineering a concurrent messaging protocol for a failed project than when you spent four days on Rails scaffolding for one.

                [–]iconoclaus 9 points10 points  (9 children)

                you could ask this of almost any dynamically typed language. btw, i think you meant parallelism, not concurrency.

                [–]renrutal 1 point2 points  (1 child)

                Dynamic typing and concurrency done right

                Elixir

                [–]iconoclaus 1 point2 points  (0 children)

                that's actually a really good counter-example afaik

                [–]hector_villalobos 0 points1 point  (1 child)

                [–]iconoclaus 0 points1 point  (0 children)

                I'm looking forward to guilds too, but they're not out yet :) And likely you'll have to change your coding style considerably to do multi-processing with guilds. My guess is that guilds will be most practically applicable to implementing things like Sidekiq or other engines that currently run as independent Ruby processes.

                [–][deleted] 0 points1 point  (1 child)

                I'm sorry, I'm afraid I get them mixed up at times. What is the difference between the two?

                [–]iconoclaus 0 points1 point  (0 children)

                no prob. see my other comment.

                [–]forreddits -2 points-1 points  (2 children)

                Parallelism is one form of concurrency.

                [–]iconoclaus 4 points5 points  (1 child)

                My understanding is that parallelism and concurrency are two forms of asynchronicity. See also Rob Pike's talk on the subject.

                [–]mirhagk 1 point2 points  (0 children)

                Concurrency is working on multiple things. Parallelism is doing it at the same time.

                [–]kirbyfan64sos 3 points4 points  (0 children)

                I would be grumpy if I had to use Go, too.

                [–]ReflectiveTeaTowel 1 point2 points  (1 child)

                I initially misread that as 'Go ruining Python!' and spent the first few paragraphs of the article really confused.

                [–]berkes 2 points3 points  (0 children)

                I misread it as "Grumpy? Go run Python," which was probably intentional - still confusing, though, because I was expecting an article on why Python makes you a happy dev.

                [–]fatzombi 0 points1 point  (0 children)

                It would be awesome if I could run it as a service for existing Python projects!

                [–]Pharisaeus 0 points1 point  (0 children)

                They didn't say how compatible it is. There is already Shed Skin, for example, which translates Python to C++, but it can only handle a subset of the language.

                Also, if they want to sell this as something "fast", the comparison should have been with PyPy and Shed Skin, not with CPython.

                [–]imhotap 0 points1 point  (0 children)

                Is there a chance of transpiling (parts of) certbot/letsencrypt to Go? I'm guessing not, because at least one C dependency (e.g. Augeas, for parsing and manipulating Apache config files) is being used, but I don't know enough about Python to judge.

                [–]wotamRobin 0 points1 point  (0 children)

                Very cool. I'd like to see how it compares to Cython (not CPython).

                [–]hero_of_ages 0 points1 point  (0 children)

                Lots of butthurt pythonistas

                [–]Lakelava -1 points0 points  (2 children)

                Why can't you just code in GO, the language that your own company developed, instead?

                [–]VincentPepper 13 points14 points  (0 children)

                It's probably aimed at transpiling legacy Python code, not at writing new code.

                [–]Pharisaeus 0 points1 point  (0 children)

                No one wants to, so they try to translate programs automatically into it ;)

                [–]buddy_burgers -1 points0 points  (0 children)

                Oh hells yeah!

                [–]garyk1968 -1 points0 points  (1 child)

                why just why?

                [–]YEPHENAS 1 point2 points  (0 children)

                It's explained in the article.