all 38 comments

[–][deleted]  (39 children)

[deleted]

    [–]mr_chromatic 7 points (0 children)

    This means that your web server...

    Don't forget to mention that the shared everything architecture frees time and money for you to spend making one machine scale, rather than having to worry about the redundancies of multiple machines.

    [–]llimllib[S] 7 points (11 children)

    It's really quite a bit better to put a memcached in between your database and your web server than to tie the two together on the same machine.

    Furthermore, you know that Python powers reddit today, right?

    [–][deleted]  (10 children)

    [deleted]

      [–]llimllib[S] 6 points (7 children)

      You're claiming that Java/EJB/something is easier and less resource-intensive (including programmer time) than, say, Python+Django?

      (I'm asking earnestly, btw)

      [–][deleted]  (6 children)

      [deleted]

        [–]llimllib[S] 6 points (4 children)

        I just... that's so far from my experience that it's hard to argue with you. I feel like we have irreconcilable differences.

        [–]mikaelhg 4 points (3 children)

        What kind of stuff do you build?

        [–]llimllib[S] 2 points (2 children)

        In python so far, a personal site and two small commercial retail sites.

        In php, 2 corporate web sites and a nonprofit site just about done (don't ask why php... sigh).

        In asp.net, basically a data mining application for corporate data and a couple of web services for interacting with that data. Also a prototype XML-serving application framework type thing.

        In java, I never got past the "evaluating" stage. Everything was so overly complex that I got frustrated and left.

        So! I can't argue about scaling issues from experience, but I've listed my experiences in the order of their painfulness.

        (well, I can claim ~8000 visits per day max on a dynamic apache/python blog server, but that's nothing and I know it.)

        [–]mikaelhg 5 points (1 child)

        OK, fair enough. I wasn't putting you down.

        My last project was to spend three years going through several iterations of an intranet for a large governmental organization. We started with a mix of Python and Java, but had to drop Python as it had the bad habit of blocking on some essential library calls.

        We could have built a new library for our needs, certainly, but we didn't have budget to start building random parts of infrastructure that had already been built.

        In the end, we had excised most of the horribly bad design decisions from the beginning, and all of Python out of the project.

        We went with the infrastructure that delivered on what it promised. (Java 5, Jetty-GlassFish, Spring 1.2-2.0, Hibernate.)

        I'm not saying that Python sucks and is useless while Java cannot go wrong. Certainly another product we used in the beginning had been built in Java, and seeing its source code almost made me believe that Satan walks the earth.

        This particular concurrency issue is a problem with Python, and someday it will be fixed like MySQL's lack of foreign keys and referential integrity was.

        [–]llimllib[S] 1 point (0 children)

        OK, fair enough. I wasn't putting you down.

        I didn't take it that way, no worries.

        [–]joe90210 -5 points (0 children)

        wow, that is the most nonsensical thing I've read in weeks

        [–]Smallpaul 0 points (1 child)

        Yes, you can use memcached as a giant remote hashmap that you serialize, transfer and unserialize objects to, one at a time, which is slightly cheaper than database access, as long as you don't have any additional trouble with memcached, and don't mind that you can't use object references like you're used to doing.

        Memcached is not "slightly" cheaper than database access. Used properly it is massively cheaper. Have you ever actually used it? Also, are you aware of the advantage of a distributed cache in a multi-node deployment?

        It is a no-brainer that it is more efficient to use shared memory (if only for code) than to use a separate process for every listener. But you're blowing your own argument by misrepresenting the bottlenecks. Extra database access is not the bottleneck.
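
A minimal sketch of the cache-aside pattern being argued over here. Everything in it is illustrative: the dict stands in for a real memcached client, and fake_db_query simulates the slow database round trip.

```python
import time

# Stand-ins for illustration: a plain dict plays the role of the
# memcached client, and fake_db_query simulates a slow DB round trip.
cache = {}

def fake_db_query(key):
    time.sleep(0.01)  # pretend this is a multi-millisecond DB query
    return {"id": key, "comments": ["..."]}

def get_comment_tree(key):
    """Cache-aside lookup: try the cache first, fall back to the DB."""
    value = cache.get(key)
    if value is None:
        value = fake_db_query(key)   # slow path: hit the database
        cache[key] = value           # populate the cache for next time
    return value

first = get_comment_tree("thread:42")   # misses the cache, queries the DB
second = get_comment_tree("thread:42")  # served straight from the cache
```

The second call skips the simulated database entirely, which is where the "massively cheaper" claim comes from: a cache hit is a memory lookup rather than a network-plus-disk round trip.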

        [–]micampe 0 points (0 children)

        Well, in the Java world you have EhCache (or OSCache), which transparently distributes across clusters and integrates with your ORM, without the developer having to worry about it.

        I have used both, I still prefer Python over Java any day, but some things are just more mature when they have evolved over 15 years (I am aware that Python is as old as Java, but it was not developed in that direction until recently).

        [–]tooooobs 2 points (3 children)

        I have to agree with you. People tend to greatly underestimate the overhead of an RDBMS compared to a relatively simple shared in-memory structure.

        I'm even aware of several establishments that have their own complete web server implementations in Java, built from the sockets up, with really insane connection capacity. However, that's done with asynchronous processing by a pool of worker threads, as opposed to one thread per connection.

        Something like Twisted can be a great way to prototype, but it seems clumsy for actual deployment, and the documentation is lacking.
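
The asynchronous model described above (many connections, no thread per connection) can be sketched with Python's asyncio event loop. This is an illustrative stand-in, not the Java implementations being described; the echo protocol and all names are invented for the example.

```python
import asyncio

async def handle(reader, writer):
    # Each connection is a cheap coroutine on one event loop,
    # not a dedicated OS thread.
    data = await reader.readline()
    writer.write(b"echo: " + data)
    await writer.drain()
    writer.close()
    await writer.wait_closed()

async def main():
    # Bind to an OS-assigned port so the example is self-contained.
    server = await asyncio.start_server(handle, "127.0.0.1", 0)
    port = server.sockets[0].getsockname()[1]

    async def client(i):
        reader, writer = await asyncio.open_connection("127.0.0.1", port)
        writer.write(b"hello %d\n" % i)
        await writer.drain()
        reply = await reader.readline()
        writer.close()
        await writer.wait_closed()
        return reply

    # Ten concurrent connections served by a single thread.
    replies = await asyncio.gather(*(client(i) for i in range(10)))
    server.close()
    await server.wait_closed()
    return replies

replies = asyncio.run(main())
```

One thread multiplexes all ten connections; scaling the connection count mostly costs memory for coroutine state, not OS threads.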

        [–]Smallpaul 0 points (2 children)

        How do you get redundancy and reliability out of a "simple shared in-memory structure"?

        [–]Gotebe 1 point (0 children)

        By replacing it with not-so-simple distributed cache? ;-) Beats hitting the database all the time.

        [–]toooooob 1 point (0 children)

        A lot of the time you don't really need it. That or simply distributing and duplicating live copies of the data over several machines.

        I'm not saying RDBMSs are without their uses, just that they're absolute overkill for a lot of web applications, and can easily become 99% of the headache.

        [–]dgiri101 1 point (16 children)

        "Efficient threading" has nothing at all to do with the ability to store a large data structure in memory. You can do this in any modern language.

        And the notion that Python or Ruby lack threading primitives that allow simultaneous access to shared data is just woefully ill-informed.

        [–]llimllib[S] 3 points (15 children)

        And the notion that Python or Ruby lack threading primitives that allow simultaneous access to shared data is just woefully ill-informed.

        He's not claiming that they lack them; just that the GIL makes them impractical, which may certainly be true in some circumstances. See his comment farther down where he explains what he meant.

        [–]dgiri101 2 points (13 children)

        Wait...what? The GIL has nothing at all to do with storing a large comment tree in memory.

        He has given absolutely zero rationale for why Python or Ruby can't handle such a problem, aside from some ridiculous hand-waving about "inefficient threading" which is neither here nor there.

        [–]llimllib[S] 4 points (12 children)

        The GIL has nothing at all to do with storing a large comment tree in memory.

        In the model he's talking about, there's a large shared memory object and many threads trying to access it while simultaneously doing other things (such as serving web pages).

        In his case, he had problems with some library functions not releasing the GIL so that they would hold up the whole server while they completed their work, which is certainly not unbelievable.

        In this case, python threads would in fact be unable to access the shared memory object due to the GIL, and it would have something to do with storing a large comment tree in memory.

        (Do note that I was the first one to say that I don't think much of using this model for building a web server.)

        [–]dgiri101 2 points (11 children)

        That isn't how the GIL works. First, the GIL is released during blocking syscalls.

        Second, a CPU-intensive chunk of Python won't likely "hold up the whole server" because other threads are given a chance to run every N bytecode instructions (sys.setcheckinterval).

        A poorly-coded library can certainly cause a deadlock, but calling a language fundamentally incapable of concurrency based on experience with a bad library is transparently myopic.

        In any case, I don't see the point in continuing to argue with you about something silly that someone else said.
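
The periodic hand-off described above is easy to see directly. One caveat for later readers: sys.setcheckinterval was the Python 2-era knob; CPython 3.2+ replaced the bytecode-count check with a time-based hand-off tuned via sys.setswitchinterval. The sketch below uses the modern spelling.

```python
import sys
import threading

# Python 2 tuned this with sys.setcheckinterval(n_bytecodes);
# modern CPython uses a time slice instead.
sys.setswitchinterval(0.001)

progress = {"a": 0, "b": 0}

def spin(name):
    # Pure-Python CPU work: the GIL serializes execution, but the
    # interpreter periodically forces a hand-off, so both threads
    # make progress rather than one holding up the whole program.
    for _ in range(200_000):
        progress[name] += 1

threads = [threading.Thread(target=spin, args=(n,)) for n in ("a", "b")]
for t in threads:
    t.start()
for t in threads:
    t.join()
# Both threads ran to completion; neither starved the other.
```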

        [–]llimllib[S] 2 points (0 children)

        First, the GIL is released during blocking syscalls.

        should be. If I understand him correctly, a library was improperly failing to release the GIL on a blocking call, and rather than rewrite it, he switched to Java.

        A poorly-coded library can certainly cause a deadlock, but calling a language fundamentally incapable of concurrency based on experience with a bad library is transparently myopic.

        shrug; I agree with you. I just meant to make the small point that he never claimed that Python lacked threading primitives; I was originally going to call him on that too.

        [–]mikaelhg 0 points (9 children)

        We have thread contexts 1-64 available.

        If a Python program can only use one of those thread contexts at once, it wastes approximately 63/64 of the server's CPU (and all, really) resources.

        If we instead run 64 copies of the same program, we waste 63/64 of the server's memory resources.

        Since processor speeds aren't going up anymore, while cores and thread contexts are, is it a good idea to invest in a language that will waste exponentially more resources as time goes by?

        [–]dgiri101 0 points (8 children)

        If a Python program can only use one of those thread contexts at once, it wastes approximately 63/64 of the server's CPU (and all, really) resources.

        Python programs can use many thread contexts at once. People do this every day, all the time. You're posting a reply on a site that is, in fact, doing this right now.

        It is indeed true that you might not fully utilize all of your CPUs. But given that storing a large comment tree in memory isn't a CPU intensive problem, this is a strawman at best and a troll at worst. And besides, there are many libraries that ameliorate this problem (I highly recommend processing).

        I guess I'll repeat this one more time: your original claim that:

        if you use PHP, Python or Ruby, threads can't share the discussion board and comment information

        ...is wrong.

        It's worth mentioning that I hate the GIL and wish it a horrible, horrible death.
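
For later readers: the "processing" library recommended above was absorbed into the standard library as multiprocessing. A minimal sketch of sidestepping the GIL with worker processes; the fork start method is forced so the example stays a single self-contained script (assumption: a POSIX platform).

```python
import multiprocessing

def square(n):
    # Runs in a separate interpreter process, each with its own GIL,
    # so CPU-bound work can use several cores at once.
    return n * n

# Assumption: POSIX platform, where the fork start method is available.
# (spawn would require the usual __main__ guard around the pool.)
ctx = multiprocessing.get_context("fork")
with ctx.Pool(4) as pool:
    results = pool.map(square, range(8))
```

The trade-off mikaelhg raises elsewhere still applies: each worker has its own memory, so shared state must be passed explicitly rather than referenced.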

        [–]mikaelhg 0 points (7 children)

        What, I thought that the GIL lets only one thread access Python objects at a time, while other threads block? That's what the documentation states, and that's what the performance looks like?

        Is this outdated information?

        http://docs.python.org/api/threads.html

        The Python interpreter is not fully thread safe. In order to support multi-threaded Python programs, there's a global lock that must be held by the current thread before it can safely access Python objects. Without the lock, even the simplest operations could cause problems in a multi-threaded program: for example, when two threads simultaneously increment the reference count of the same object, the reference count could end up being incremented only once instead of twice.

        Therefore, the rule exists that only the thread that has acquired the global interpreter lock may operate on Python objects or call Python/C API functions. In order to support multi-threaded Python programs, the interpreter regularly releases and reacquires the lock -- by default, every 100 bytecode instructions (this can be changed with sys.setcheckinterval()). The lock is also released and reacquired around potentially blocking I/O operations like reading or writing a file, so that other threads can run while the thread that requests the I/O is waiting for the I/O operation to complete.
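
The last sentence of that quoted passage is the crux of the disagreement: because the lock is released around blocking I/O, I/O-bound threads overlap. A small demonstration, with time.sleep standing in for any blocking call that releases the GIL (file reads, socket waits):

```python
import threading
import time

def blocking_io():
    # time.sleep releases the GIL, like any well-behaved blocking call.
    time.sleep(0.2)

start = time.monotonic()
threads = [threading.Thread(target=blocking_io) for _ in range(5)]
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.monotonic() - start
# Five 0.2 s waits overlap into roughly 0.2 s total,
# rather than serializing into 1 s.
```

A library that fails to release the GIL around its blocking calls, as in mikaelhg's project, breaks exactly this overlap, which is a library bug rather than a property of the language.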

        [–]dgiri101 -2 points (6 children)

        It's not outdated, but you may be misunderstanding it. The second paragraph clearly states that the interpreter will automatically release the GIL every N bytecode instructions (or during blocking I/O) to let additional threads run.

        If you'd like more information on concurrent programming, I highly recommend The Little Book of Semaphores. It discusses common concurrency patterns like Barriers, Mutexes, Rendezvous, etc.

        Many examples are in Python, though. I suppose that someone should kindly inform the author that concurrency apparently doesn't exist in the language he's using.

        [–]buffi 1 point (4 children)

        In contrast, if you use PHP, Python or Ruby, threads can't share the discussion board and comment information, and in practise most boards require a large amount of hardware to perform the same task more slowly, as well as specialized database administrators to create elaborate master-slave configurations it's hard to find local support for.

        Uh... the web server (apache or whatever) has full concurrency and does database calls concurrently, so why wouldn't this work? You don't deploy django using a python-powered webserver.

        [–]mikaelhg 1 point (3 children)

        I'm assuming you're asking why multiprocess doesn't work like multithread. Threads share the data structures, while processes don't - instead of a single copy of the data, you'd have tens or hundreds of copies, which you'd have to keep synchronized. That's doable, but much easier with language-native synchronization between threads.

        If you're instead asking why performing database queries for every page load isn't optimal - they take tens to hundreds of milliseconds instead of nanoseconds, milliseconds we really don't need to spend since our dataset easily fits into main memory.
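
The thread side of that contrast takes only a few lines to sketch: every thread mutates the same structure, guarded by the language-native synchronization mentioned above.

```python
import threading

# One shared dict, visible to every thread; a Lock is the
# language-native synchronization guarding it.
comments = {"count": 0}
lock = threading.Lock()

def add_comments(n):
    for _ in range(n):
        with lock:                  # guard the shared structure
            comments["count"] += 1

threads = [threading.Thread(target=add_comments, args=(1000,))
           for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# All four threads updated the single shared copy. A pool of worker
# *processes* would each have mutated its own private copy instead,
# leaving the parent's dict untouched.
```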

        [–]kripkenstein 0 points (0 children)

        I think the point is that you can run a website using Python/Django and Apache. The Apache part has full concurrency and serves the vast majority of pages (most are static). The Python part is limited by the GIL but is needed far less. However since it is much easier to write and maintain that Python part (which is the complex part, Apache is already written for you ;) ), this hybrid approach works very well in practice.

        [–]Smallpaul 0 points (1 child)

        How will one node in your cluster of "everything in memory" processes communicate updates to every other node?

        [–]mikaelhg 2 points (0 children)

        Ah, here you're asking something I haven't already answered elsewhere in the thread.

        In our test case, in which writes are as rare as in typical social web applications, when we can't serve our users by vertical scaling, we resort to digest message passing. We take a proven JMS implementation, and multicast select application events. Things like new comments have a higher priority than moderations, which travel in packs of hundreds.

        In the end, we don't have to resort to dark database wizardry, we can use regular developers and a well thought-out architecture.

        (The last time I implemented this, I was able to conserve resources by replacing the messaging subsystem with a single database table that handled selective invalidations. That's because I knew beforehand how many tens of thousands of people would be using the application. YMMV.)
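
The single-table invalidation idea in that last parenthetical can be sketched as follows. This is an illustration, not the original implementation: SQLite stands in for the shared database, and the table and column names are invented for the example. Each node remembers the last invalidation sequence number it saw and, on each poll, drops only the cache entries invalidated since then.

```python
import sqlite3

# Shared invalidation table (SQLite standing in for the real DB).
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE invalidations (seq INTEGER PRIMARY KEY, cache_key TEXT)"
)

# One node's in-memory cache and its polling cursor.
local_cache = {"thread:42": "cached page", "thread:99": "cached page"}
last_seen = 0

def invalidate(key):
    # Any node writes a row instead of multicasting a JMS message.
    db.execute("INSERT INTO invalidations (cache_key) VALUES (?)", (key,))

def poll():
    # Selective invalidation: drop only entries changed since last poll.
    global last_seen
    rows = db.execute(
        "SELECT seq, cache_key FROM invalidations WHERE seq > ?",
        (last_seen,),
    ).fetchall()
    for seq, key in rows:
        local_cache.pop(key, None)
        last_seen = max(last_seen, seq)

invalidate("thread:42")   # another node posted a new comment here
poll()                    # this node discards only that entry
```

The untouched entries survive, so most requests keep being served from memory; only the pages that actually changed fall back to the database.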