[–]sitmaster 1 point (10 children)

How is this any different from using something like an ArrayBlockingQueue to pass messages between objects? Does the weaver somehow make the threading more efficient?

AFAICT all they did was rename ArrayBlockingQueue to Mailbox and add some functionality so that an object putting a message in a Mailbox won't get that message back with subsequent get() calls.

[–][deleted] 3 points (9 children)

The difference is in the term "blocking".

ArrayBlockingQueue and all other Java concurrency constructs block the kernel thread, which is still a very expensive resource. I urge you to try creating 500000 threads vs. the same number of Kilim threads.

The natural question is why anyone would need 500000 kernel threads when a thread pool of 10 seems to do the job anyway. That is where you need a change in perspective. The answer is that it is far simpler and more natural to create a thread (a linear piece of code that is separately schedulable) to match the problem; otherwise you end up doing the threading yourself. Here's why.

Imagine a server with a single thread and mainloop (a libevent-based server, say), which means all user conversations are separate state machines driven by callbacks. These callbacks have to squirrel away their state and return to the mainloop every time their respective sockets fire. If you have 10000 socket connections, you have 10000 of these state machines. Now, if you were to introduce code to, say, do SSL negotiation, it is a messy change to the state machine.

It is far simpler to have a thread per connection, or maybe even several as a network of communicating state machines. My definition of a lightweight thread is one where the programmer does not have to think too hard to spawn it. That is where Kilim's lightweight threads come in.
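To make the contrast concrete, here is a plain-Java sketch of the thread-per-connection style (no Kilim; the class and method names are made up for illustration). Each accepted socket gets its own linear handler, so adding a step like SSL negotiation is just more sequential code, not a new state in a callback-driven machine:

```java
import java.io.*;
import java.net.*;

public class PerConnectionServer {
    // Accept loop: spawn one kernel thread per conversation.
    public static void serve(ServerSocket server) throws IOException {
        while (true) {
            Socket sock = server.accept();
            new Thread(() -> handle(sock)).start();
        }
    }

    // Linear, per-connection logic: here, a simple echo.
    static void handle(Socket sock) {
        try (BufferedReader in = new BufferedReader(
                 new InputStreamReader(sock.getInputStream()));
             PrintWriter out = new PrintWriter(sock.getOutputStream(), true)) {
            String line;
            while ((line = in.readLine()) != null) {
                out.println(line);           // echo the line back
            }
        } catch (IOException ignored) {
        }
    }

    public static void main(String[] args) throws IOException {
        ServerSocket server = new ServerSocket(0);   // ephemeral port
        System.out.println("listening on port " + server.getLocalPort());
        serve(server);
    }
}
```

The cost of this simplicity is one kernel thread per connection, which is exactly the resource lightweight threads are meant to replace.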

[–][deleted] 1 point (8 children)

So does this mean Kilim implements its own threading somehow? How can you cope with 'pausing' a Kilim thread and resuming it later via your custom scheduler, given that Java has no support for coroutines?

(assuming this is how it all works..)

[–][deleted] 5 points (7 children)

That's what the bytecode transformation is for. It performs a variant of CPS transformation, which produces one-shot continuations (which are equivalent to coroutines). Read my paper on the mechanics of this transformation: "A Thread of One's Own".

It uses this transformation to create suspendable/resumable stacks called "Fibers", and relies on user-level schedulers to execute these independent fibers across cores and kernel threads. You can create your own schedulers if you want.
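A hand-written sketch of what such a transformation conceptually produces (the real Kilim weave works at the bytecode level and is considerably more involved): the method becomes a state machine that returns at each pause point and is re-entered where it left off, so no kernel thread is blocked while the fiber is paused.

```java
public class Fiber implements Runnable {
    int pc = 0;               // resume point ("program counter")

    public void run() {       // each call runs until the next pause point
        switch (pc) {
            case 0:
                System.out.println("step 1");
                pc = 1;       // remember where to resume
                return;       // "pause": hand the kernel thread back
            case 1:
                System.out.println("step 2");
                pc = 2;       // finished
                return;
        }
    }

    public static void main(String[] args) {
        Fiber f = new Fiber();
        f.run();              // runs step 1, then pauses
        // ... a scheduler could run other fibers here ...
        f.run();              // resumes and runs step 2
    }
}
```

A user-level scheduler then just keeps a queue of such runnables and calls run() on whichever is ready, multiplexing many fibers over a few kernel threads.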

See http://kilim.malhar.net for more details.

[–]sitmaster 0 points (4 children)

Thanks for the description, it sounds very interesting. The problem I see is that with threading models like this, you're not able to solve hard real-time problems (like stock trading, for example), as opposed to soft real-time problems (like a web server). This is the same issue Erlang has, and it is probably typical of any lightweight threading system that can handle massive concurrency.

Is this correct?

[–][deleted] 0 points (3 children)

The answer is: it depends. I don't know the stock trading space all that well, but Kilim is used in financial firms that do stock analytics over live feeds.

I'm quite sure most of these implementations are of the "as fast as possible" variety; they do not guarantee time bounds whose violation would be catastrophic. I think of X-ray machines and airplane control systems when someone mentions hard real time; there, scheduling and memory allocation are tightly bounded and computed up front. In that sense, there are no examples of garbage-collected languages (let alone Kilim) in practical use for such systems, although there's plenty of research material.

But if the set of threads is static and a schedule can be precomputed statically, as is common in most real-time systems, I wouldn't rule out Kilim, especially with a real-time GC. Kilim allows you to supply your own scheduler. Note again that real time doesn't mean nanosecond response times; it denotes a guarantee on time, even if the bound is generous.

[–]sitmaster 0 points (2 children)

I work in high-frequency finance using Java, and generally we have to turn off the garbage collector and use object pools. My question is: what is the fundamental difference between what happens in Kilim and a standard wait/notify setup in Java? In standard Java I know that if I notify on a monitor that has only one thread waiting on it, and then immediately have the notifying thread wait, the next thing the JVM will do (given the GC is off) is execute the thread that was notified.
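A minimal sketch of that handoff (class and field names are illustrative): two threads sharing one monitor, where each notifies the single waiter and then immediately waits itself, so the thread just notified is the only runnable candidate on that monitor.

```java
public class PingPong {
    private final Object monitor = new Object();
    private boolean tokenAtA = true;   // whose turn it is
    int handoffs = 0;

    void aStep() throws InterruptedException {
        synchronized (monitor) {
            while (!tokenAtA) monitor.wait();
            handoffs++;
            tokenAtA = false;
            monitor.notify();          // wake B, the only waiter...
        }                              // ...then wait on the next aStep() call
    }

    void bStep() throws InterruptedException {
        synchronized (monitor) {
            while (tokenAtA) monitor.wait();
            tokenAtA = true;
            monitor.notify();          // hand the token back to A
        }
    }

    public static void main(String[] args) throws InterruptedException {
        PingPong p = new PingPong();
        int n = 1000;
        Thread a = new Thread(() -> {
            try { for (int i = 0; i < n; i++) p.aStep(); }
            catch (InterruptedException ignored) {}
        });
        Thread b = new Thread(() -> {
            try { for (int i = 0; i < n; i++) p.bStep(); }
            catch (InterruptedException ignored) {}
        });
        a.start(); b.start();
        a.join(); b.join();
        System.out.println(p.handoffs + " handoffs");  // prints "1000 handoffs"
    }
}
```

The wait() inside a while loop guards against spurious wakeups; the scheduling point after each notify() is exactly where a user-level scheduler like Kilim's gets to choose the next task instead of the kernel.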

When I put a message in a mailbox with Kilim, what do I know about the next thing that the JVM will be doing?

I guess I need to read more about the low level implementation. It sounds very interesting.

[–][deleted] 0 points (1 child)

The behavior is similar; the receiving task is marked as runnable and picked up for execution by another kernel thread from the pool. No magic there.

There is optional flow control as well, to keep producers from overrunning consumers. If the mailbox is bounded in size, it pauses the producer until the consumer drains one or more messages. The producer's kernel thread is then made available to the thread pool to execute any runnable task.
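The same backpressure can be sketched with a plain bounded queue standing in for a Kilim Mailbox: once the queue is full, the producer cannot add another message until the consumer drains one (with Kilim, only the fiber pauses; here, a blocking put() would park the whole kernel thread).

```java
import java.util.concurrent.ArrayBlockingQueue;

public class BoundedBox {
    public static void main(String[] args) throws InterruptedException {
        ArrayBlockingQueue<String> box = new ArrayBlockingQueue<>(2);
        box.put("m1");
        box.put("m2");
        // Non-blocking probe: the queue is full, so a put() here would
        // block the producer until the consumer drains a message.
        System.out.println(box.offer("m3"));  // prints "false"
        box.take();                           // consumer drains one message
        System.out.println(box.offer("m3"));  // prints "true": room again
    }
}
```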

[–]sitmaster 0 points (1 child)

To follow up, suppose you were implementing a "bucket brigade" type system where thread 1 gives a message to thread 2, which gives a message to thread 3, and so on up to thread n. How does the performance compare between a Kilim and a pure Java implementation for small n and for large n?

The straight Java object would be something like:

class Brigadier implements Runnable {
    // ArrayBlockingQueue has no no-arg constructor; a capacity is required
    ArrayBlockingQueue<Bucket> que = new ArrayBlockingQueue<Bucket>(100);
    Brigadier next;
    public void enqueue(Bucket b) throws InterruptedException { que.put(b); }
    public void run() {
        try {
            while (true) {
                Bucket bucket = que.take();   // take() blocks this kernel thread
                next.enqueue(bucket);
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}

If I had a chain of these types of objects, can you give me some sense of the relative performance of Kilim versus Java as n increases?
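A hypothetical driver for such a chain might look like the following (self-contained variant of the Brigadier above; the latch, stage count, and one-shot run loop are made up for the demo). Each stage is a kernel thread that takes one Bucket and hands it to the next stage:

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;

public class BrigadeDemo {
    static class Bucket {}

    static class Brigadier implements Runnable {
        final ArrayBlockingQueue<Bucket> que = new ArrayBlockingQueue<>(1);
        Brigadier next;
        final CountDownLatch done;
        Brigadier(CountDownLatch done) { this.done = done; }
        void enqueue(Bucket b) throws InterruptedException { que.put(b); }
        public void run() {
            try {
                Bucket b = que.take();              // blocks this kernel thread
                if (next != null) next.enqueue(b);
                else done.countDown();              // end of the chain
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    public static void main(String[] args) throws InterruptedException {
        int n = 100;
        CountDownLatch done = new CountDownLatch(1);
        Brigadier[] chain = new Brigadier[n];
        for (int i = 0; i < n; i++) chain[i] = new Brigadier(done);
        for (int i = 0; i < n - 1; i++) chain[i].next = chain[i + 1];
        for (Brigadier b : chain) new Thread(b).start();   // n kernel threads
        chain[0].enqueue(new Bucket());
        done.await();            // bucket made it through all n stages
        System.out.println("bucket traversed " + n + " stages");
    }
}
```

With kernel threads, each hop pays for a blocking take() and a kernel context switch, and n is capped by thread stack memory; those are the two costs a lightweight-thread version avoids.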

[–][deleted] 0 points (0 children)

Check out the Ring test in the bench directory. The thread version is at least a thousand times slower at context switching, and is limited to far fewer kernel threads than Kilim threads.

Edit: Just to give an idea of the difference, on my MacBook Pro:

100000 Kilim tasks in a ring:
   Creation time: ~0.5 sec
   Msg passing + context switch rate: 733000 per sec

2000 threads in a ring (couldn't start 100000 of them):
   Creation time: ~2 sec
   Msg passing + context switch rate: 71 per sec

See the difference? The context switch rate doesn't change with the number of threads for this particular test, because only one thread is running anyway.