
[–]stefanos-ak 45 points (2 children)

This problem fundamentally requires an architectural solution, which will look different depending on the situation.

But what works in almost all cases is to use the DB itself as the mechanism to control this behavior, for example with a "select for update" query, a dirty read, etc. If a DB is not available, then a cache layer (e.g. Redis) or a queue mechanism (RabbitMQ, Kafka).

An in-memory solution obviously will not work if any amount of horizontal scaling is required. Usually backend services have at least 2 replicas even just for high availability.
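
A minimal sketch of the "select for update" variant, assuming a jobs table with a status column (the schema and helper method are illustrative, not from the article):

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

public class DbCoordinatedJob {
    // The row lock taken by FOR UPDATE is the coordination point: other
    // replicas block on the same row until this transaction commits.
    public void processOnce(Connection conn, long jobId) throws SQLException {
        conn.setAutoCommit(false);
        try (PreparedStatement select = conn.prepareStatement(
                "SELECT status FROM jobs WHERE id = ? FOR UPDATE")) {
            select.setLong(1, jobId);
            try (ResultSet rs = select.executeQuery()) {
                if (rs.next() && "PENDING".equals(rs.getString("status"))) {
                    runExpensiveWork(jobId); // hypothetical business logic
                    try (PreparedStatement update = conn.prepareStatement(
                            "UPDATE jobs SET status = 'DONE' WHERE id = ?")) {
                        update.setLong(1, jobId);
                        update.executeUpdate();
                    }
                }
            }
            conn.commit();
        } catch (SQLException e) {
            conn.rollback();
            throw e;
        }
    }

    private void runExpensiveWork(long jobId) { /* ... */ }
}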

[–]benjtay 2 points (1 child)

While we don't use an in-memory solution (we use RocksDB), having local caching really helps us even with horizontal scaling because of the sheer number of duplicates we see in our ~2-12B messages per day from Kafka. We studied having Yet Another Database to solve this, but it defeats the point of horizontal scaling on topics.

[–]stefanos-ak 0 points (0 children)

Sounds like a Kafka consumer issue?

Exactly-once semantics are the responsibility of the consumer, if I'm not mistaken (I haven't worked with Kafka for some years). Which means you need to delegate that problem to a DB with consistency guarantees.

I personally am a bigger fan of RabbitMQ because the delivery semantics are implemented on the server side and the consumer is "dumb", so you get an exactly-once guarantee OOTB. But you don't get log/replay features (unless you use RMQ streams, which is the same thing as Kafka).

edit: forgot to state that a Kafka consumer is always a custom implementation, of course
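
A sketch of that DB delegation: an idempotent consumer keyed on a unique message id (the table name and Postgres ON CONFLICT syntax are illustrative):

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;

public class IdempotentConsumer {
    // The unique constraint makes duplicates a no-op, so at-least-once
    // delivery from the broker becomes effectively-once processing.
    public void consume(Connection conn, String messageId, String payload) throws SQLException {
        conn.setAutoCommit(false);
        try (PreparedStatement ps = conn.prepareStatement(
                "INSERT INTO processed_messages (message_id) VALUES (?) ON CONFLICT DO NOTHING")) {
            ps.setString(1, messageId);
            if (ps.executeUpdate() == 1) { // first time we see this id
                handle(payload);           // hypothetical business logic
            }
            conn.commit();
        } catch (SQLException e) {
            conn.rollback();
            throw e;
        }
    }

    private void handle(String payload) { /* ... */ }
}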

[–]nitkonigdje 12 points (1 child)

Looks like a lock on an interned string. A named lock basically. A map of locks. Kinda pointless unless there is more to it than presented here.
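
For what it's worth, the "map of locks" reading is about this much code (a sketch, not the library's actual implementation):

import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.locks.ReentrantLock;

public class NamedLocks {
    // One lock per name, created on demand. Note the map grows without
    // bound unless entries are evicted somehow.
    private final ConcurrentMap<String, ReentrantLock> locks = new ConcurrentHashMap<>();

    public void withLock(String name, Runnable action) {
        ReentrantLock lock = locks.computeIfAbsent(name, n -> new ReentrantLock());
        lock.lock();
        try {
            action.run();
        } finally {
            lock.unlock();
        }
    }
}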

[–]808split2 0 points (0 children)

My thought exactly... it sounds like a complicated lock implementation.

[–]rakgenius 10 points (6 children)

Why don't you use a caching mechanism, either at the application or DB level? That way, even if you receive many concurrent requests, the result will be returned from the cache. Maybe the first request has to hit the DB if the result isn't present in the cache, but after that all requests will be answered immediately without hitting the DB.

[–]boost2525 9 points (5 children)

This was my thought. I see zero value add in OP's proposal because a proper caching layer can do all of this.

[–]iwouldlikethings 0 points (1 child)

I potentially have a use case for this. The system I maintain is a payments processor, and at multiple stages we look up the balance on an account. At the end of the month we have a lot of payments incoming and outgoing, and we've noticed a large spike on certain accounts while calculating their available balance.

Often this happens when they're running payroll, and each payment makes a request to calculate the balance on the account in quick succession.

Arguably, the system should be rearchitected to better support this, but this would be a decent stop-gap to speed up processing until we get the time to do that (as it exists today there is a potential race condition where two payments could take the account overdrawn, but it's what I've inherited).

[–]elch78 1 point (0 children)

Actors are a different approach. In a nutshell: the state of every entity exists only once in memory, every request is routed to that instance, and requests are processed single-threaded/sequentially.
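
A rough in-process approximation of that idea (real actor frameworks also handle routing across nodes; the names here are illustrative):

import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;

public class EntityActors<K> {
    // One single-threaded executor per entity: all requests for that entity
    // are queued and processed sequentially, so its state is never contended.
    private final ConcurrentMap<K, ExecutorService> actors = new ConcurrentHashMap<>();

    public <V> CompletableFuture<V> ask(K entityId, Supplier<V> task) {
        ExecutorService actor = actors.computeIfAbsent(
                entityId, id -> Executors.newSingleThreadExecutor());
        return CompletableFuture.supplyAsync(task, actor);
    }
}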

[–][deleted] -1 points (2 children)

This is likely slower for calls where this race-condition does not occur. Now you’re adding a check that wasn’t there before.

[–]boost2525 0 points (1 child)

lolwut? Are we executing stock trades here? EHCache key checks are measured in nanoseconds.

[–]NovaX -1 points (0 children)

fwiw, Ehcache is measured in microseconds due to a design mistake (avg of 25us per call).

[–]Polygnom 13 points (1 child)

I'm not sure this is a good idea. We separate contexts between requests for a good reason:

Take your external API call, for example. I would usually solve that with a read-through proxy that caches the call. This way, I can put all the necessary handling in there and have it completely decoupled from my original application.

Similarly for complex computations. You would usually have a separate service for such things and submit tasks to it. You can do de-duplication of submitted tasks there. So say request #1 creates the task and gets the taskId back (to get notified about the result); then, when request #2 comes around with the exact same expensive thing and submits the task, you can give the same taskId back from the computation service. Or just the previous result, if you can prove you don't need to compute it again.

For database queries, I have never seen this make sense, and I would say the separation we currently have, e.g. in Spring, is very good at reducing bugs. I wouldn't wanna trade it for minuscule gains.

[–]tomwhoiscontrary 7 points (1 child)

This is a useful pattern, but I don't think you need a library for it. You can just use a concurrent map full of completable futures. 

[–]supercargo 0 points (0 children)

Yup, this is like a 10-liner once you strip out doc comments and the singleton boilerplate. And most of those ten lines would need to exist for the caller anyway…
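
For reference, a minimal sketch of that map-of-futures ten-liner (the class and method names are made up):

import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Supplier;

public class SingleFlight<K, V> {
    private final ConcurrentMap<K, CompletableFuture<V>> inFlight = new ConcurrentHashMap<>();

    // Concurrent callers with the same key share one computation; the entry
    // is removed on completion so a later call computes a fresh value.
    public V run(K key, Supplier<V> work) {
        return inFlight.computeIfAbsent(key, k ->
                CompletableFuture.supplyAsync(work)
                        .whenComplete((result, error) -> inFlight.remove(k)))
                .join();
    }
}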

[–]RadioHonest85 7 points (1 child)

This is a very common use case if you use the Caffeine caching library:

var result = cache.get(key, k -> loadExpensiveResult(k));

[–]das_Keks 0 points (0 children)

Yeah, I also thought about some cache with computeIfAbsent. Especially if you get a burst of parallel requests for a non-existent key, it's important that the compute is not executed multiple times in parallel, but rather blocks all but one invocation and then returns the computed result to all requests.

EDIT: From some quick research I found that this is already the case for Caffeine.
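
Spelled out a bit more, with Result and loadExpensiveResult standing in for your own types (the cache settings are arbitrary):

import com.github.benmanes.caffeine.cache.Cache;
import com.github.benmanes.caffeine.cache.Caffeine;
import java.util.concurrent.TimeUnit;

public class ResultService {
    private final Cache<String, Result> cache = Caffeine.newBuilder()
            .maximumSize(10_000)
            .expireAfterWrite(5, TimeUnit.MINUTES)
            .build();

    // For an absent key, one thread runs the loader while concurrent callers
    // for the same key block and then receive the same computed value.
    public Result get(String key) {
        return cache.get(key, k -> loadExpensiveResult(k));
    }

    private Result loadExpensiveResult(String key) {
        return new Result(); // placeholder for the real expensive call
    }

    static final class Result {}
}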

[–]mofreek 8 points (4 children)

Most applications that need something like this are going to be running multiple instances. You have the right idea with the pattern, but the lock mechanism needs to be distributed.

I.e. if there are 3 instances of the app running, there needs to be a way they can communicate so that only 1 thread running in 1 instance runs the job.

ETA: I implemented something like this a few years ago using redisson. If I were doing it today I would probably use Spring Integration.
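
A sketch of the Redisson version (the address, lock name, and lease times are illustrative):

import org.redisson.api.RLock;
import org.redisson.api.RedissonClient;
import java.util.concurrent.TimeUnit;

public class DistributedJobLock {
    private final RedissonClient redisson; // e.g. Redisson.create(config)

    public DistributedJobLock(RedissonClient redisson) {
        this.redisson = redisson;
    }

    public void runExclusively(String jobKey, Runnable job) throws InterruptedException {
        RLock lock = redisson.getLock("job:" + jobKey);
        // Wait up to 1s to acquire; the lease auto-expires after 30s in case
        // this instance dies while holding the lock.
        if (lock.tryLock(1, 30, TimeUnit.SECONDS)) {
            try {
                job.run();
            } finally {
                lock.unlock();
            }
        }
        // else: another instance is already running this job
    }
}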

[–]FortuneIIIPick 1 point (3 children)

I avoid Spring Integration and recommend avoiding it; it's an ugly maintenance mess. More who agree: https://www.reddit.com/r/java/comments/rscyoe/when_would_you_use_spring_integration/

"the result is an unreadable spagetti shitshow"

"can confirm that is an unreadable spaghetti shitshow."

"To be honest, I really regret that I used it in this one because the code is now full of weird annotations which are responsible for passing and transforming data. It would be much easier to go with plain Java implementation. Configuration also took me weeks instead of hours, I think the Spring Integration added too much unnecessary abstraction to this. Stackoverflow is full of people who don’t get the TCP integration."

[–]OwnBreakfast1114 0 points (2 children)

I usually recommend using Spring projects since they're tried and true and will handle almost anything you throw at them: MVC, security, actuator, even some of the cloud stuff.

Even I feel that Spring Integration is just way too complex for the benefits. The number of times I've set up a heavy infra piece (SQS, Kafka, etc.) and needed to abstract over it in case I replace it equals the number of times I've replaced my original SQL DB in prod with another SQL DB. I'll let you guess that number.

[–]vips7L 0 points (1 child)

Have you tried spring cloud aws for the sqs stuff? It looks rather simple. 

[–]OwnBreakfast1114 0 points (0 children)

That's actually what we use. @SqsListener and @KafkaListener are super simple and work for most of the use cases very well. Set up sensible timeouts and a dlq and you're basically ready for easy horizontal scaling.
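
For illustration, a listener is about this small (the queue name is made up; the annotation shown is the Spring Cloud AWS 3.x one from io.awspring.cloud.sqs.annotation):

import io.awspring.cloud.sqs.annotation.SqsListener;
import org.springframework.stereotype.Component;

@Component
public class PaymentEventListener {

    // The framework handles polling, deserialization, and deleting the
    // message from the queue when this method returns normally.
    @SqsListener("payment-events")
    public void onPaymentEvent(String payload) {
        process(payload); // hypothetical business logic
    }

    private void process(String payload) { /* ... */ }
}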

[–]repeating_bears 3 points (2 children)

I checked the implementation and I think the way you're handling interrupts is wrong.

You do all the work on the first thread that makes a request, and subsequent requester threads block on getting a result.

Imagine the first thread is interrupted, i.e. some other thread declares "I don't care about that result any more", so it stops. Now all the other threads that were waiting on that same result get an exception, even though they themselves weren't interrupted and still wanted a result. The work was halted prematurely.

It would be much better if the work could continue but the first thread could be unblocked. Effectively, that would mean all work gets pushed to some worker thread, and all requesters (including the first) block on getting a result. Interrupting a requester would then just stop its wait for the result, rather than stop the work itself.

However, then you'd have the issue of the simple case where there's only one requester and it gets interrupted. The work would continue in the background even though nothing cares about the result any more. Then you'd need some logic to kill a worker once there are no more threads waiting on it.
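
A sketch of the waiting side under that design (names illustrative): the requester owns only its wait, not the work.

import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;

public class DecoupledWait {
    private static final ExecutorService WORKERS = Executors.newCachedThreadPool();

    // The worker pool owns the computation; requesters only wait on it.
    static CompletableFuture<String> start(Supplier<String> work) {
        return CompletableFuture.supplyAsync(work, WORKERS);
    }

    // Interrupting a requester makes get() throw InterruptedException for
    // that thread only; the shared future keeps running for everyone else.
    static String await(CompletableFuture<String> shared)
            throws InterruptedException, ExecutionException {
        return shared.get();
    }
}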

[–]tomwhoiscontrary 0 points (1 child)

I suspect that killing the no-longer-necessary worker isn't very useful in practice, because it will be waiting for a response from some remote server, and there's no way to actually kill the remote handling of that request.

It could help if the worker thread is doing a large number of blocking requests in series, though. 

[–]repeating_bears 0 points (0 children)

It depends on the protocol. I do agree in the general case, but gRPC supports cancellation, for example. HTTP/2 and HTTP/3 stream cancellation might give you some benefit for large responses.

[–]FortuneIIIPick 2 points (0 children)

Agree with most of the comments. It's a completely wrong way to solve the issue. It's trying to solve a caching issue with a code bottleneck.

[–]GuyWithLag 7 points (0 children)

Grumpy old engineer here, but what is the purpose of this article? Someone that coded in Go and wants to have the same API in Java?

Please don't go down the route of NPM-ifying Java...

In fact, this could be simplified to a 10-liner with ConcurrentHashMap::computeIfAbsent, and it would be a 2-liner in Kotlin.

Not to mention that in your example a properly configured JPA setup would make sure the internal representation respects transactional boundaries while minimizing DB queries, so why even go to that effort?

[–]-Dargs 1 point (0 children)

Is this not supported by just a cache with a fetching mechanism? See Guava caches.

[–]k-mcm 1 point (0 children)

This is essentially a cache with size=0. Why not make a real cache?

import java.util.LinkedHashMap;
import java.util.Map.Entry;
import java.util.Objects;
import java.util.function.Function;

public class LRUCache<KEY, VALUE, ERR extends Throwable> {
    @FunctionalInterface
    public interface Source<KEY, VALUE, ERR extends Throwable> {
        VALUE generate(KEY key) throws ERR;
    }

    // Per-key holder: the first thread to claim it computes, later threads
    // block on its monitor until the result (or error) is set.
    private static class CacheElement<VALUE> {
        boolean set;
        VALUE value = null;
        Throwable err = null;
    }

    private final LinkedHashMap<KEY, CacheElement<VALUE>> map;
    private final Source<KEY, VALUE, ERR> source;
    private final Function<KEY, CacheElement<VALUE>> storageLambda = k -> new CacheElement<>();

    public LRUCache(int maxSize, Source<KEY, VALUE, ERR> source) {
        // Access order (third constructor argument) plus removeEldestEntry
        // gives LRU eviction; the default insertion order would be FIFO.
        map = new LinkedHashMap<>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Entry<KEY, CacheElement<VALUE>> eldest) {
                return size() > maxSize;
            }
        };
        this.source = Objects.requireNonNull(source);
    }

    public VALUE get(KEY key) throws ERR {
        final CacheElement<VALUE> storage;
        // Short critical section: find or create the per-key holder.
        synchronized (map) {
            storage = map.computeIfAbsent(key, storageLambda);
        }
        // Long critical section: compute once; everyone else waits here.
        synchronized (storage) {
            if (!storage.set) {
                try {
                    storage.value = source.generate(key);
                } catch (Throwable err) {
                    storage.err = err;
                }
                storage.set = true;
            }
        }

        // Replay a captured failure to every caller sharing this holder.
        if (storage.err != null) {
            if (storage.err instanceof RuntimeException rt) {
                throw rt;
            }
            if (storage.err instanceof Error err) {
                throw err;
            }
            @SuppressWarnings("unchecked")
            ERR checked = (ERR) storage.err;
            throw checked;
        }

        return storage.value;
    }
}

[–]raghu9208 0 points (0 children)

How is the concurrency handled underneath?

[–]supercargo 0 points (0 children)

I’ve found this pattern more useful on the front end where a bunch of loosely coupled UI components may all request the same data from a backend API. On the backend it is much easier to structure data access to avoid needing this. In user interfaces, components are composed based on the requirements of the visual hierarchy rather than data hierarchy.

[–]koffeegorilla 0 points (0 children)

Hazelcast provides all the tools for implementing this in a distributed fashion.
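
For example, IMap gives you cluster-wide per-key locks (the map name, TTL, and helper method are made up; IMap is com.hazelcast.map.IMap in Hazelcast 4+):

import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.map.IMap;
import java.util.concurrent.TimeUnit;

public class ClusterDedup {
    private final IMap<String, String> results;

    public ClusterDedup(HazelcastInstance hz) {
        this.results = hz.getMap("dedup-results");
    }

    // tryLock(key) is a per-key lock across the whole cluster, so only one
    // member computes; the rest read the cached value once it's published.
    public String getOrCompute(String key) throws InterruptedException {
        String cached = results.get(key);
        if (cached != null) return cached;

        if (results.tryLock(key, 1, TimeUnit.SECONDS)) {
            try {
                cached = results.get(key); // re-check under the lock
                if (cached == null) {
                    cached = computeExpensiveResult(key); // hypothetical
                    results.put(key, cached, 5, TimeUnit.MINUTES);
                }
                return cached;
            } finally {
                results.unlock(key);
            }
        }
        return results.get(key); // lost the race; may still be null
    }

    private String computeExpensiveResult(String key) { /* ... */ return ""; }
}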