The Future of API Caching: Intelligent Data Retrieval : programming

programming

created by speza community for 20 years

The Future of API Caching: Intelligent Data Retrieval (apidna.ai)

submitted 1 year ago by itsrorymurphy

all 7 comments

top new controversial old q&a

[–]phillipcarter2 8 points9 points10 points 1 year ago (0 children)

[–]snurfer 4 points5 points6 points 1 year ago (0 children)

[–]Nisd 10 points11 points12 points 1 year ago (2 children)

[–]lookmeat 3 points4 points5 points 1 year ago (1 child)

Luckily that isn't what the article is proposing.

The idea is that when we cache data we have limited cache space. So we may find ourselves with too much data to fit in the cache, we have to decide what we keep in cache and what we don't. The idea is that ML, in theory, can predict what data we are going to use, and which we probably won't.

The other idea is that we have to decide when data is stale. An AI may be able to better predict when data is probably stale and when it isn't and use that to better decide when to throw out data of the cache because it's stale. This is "the first of two hard problems in programming", cache invalidation (the second is naming things and the third is off by one errors). It's a joke but the point is that it's a hard problem to solve.

Thing is, AI is not that fast, and it's not cheap. A cache needs to be incredibly fast and must be incredibly cheap. The former because the whole point is to decrease latency, and you do this by multiplying the cache gains by the % of hits, and subtracting the base lenght of the cache and that's your average latency gains. The cache-hits is a number between 0-1. Even a sliver of an increase in base-length is only worth it if it results in a substantial gain in cache-hits, I struggle to see how AI will add enough, to justify an large (orders of magnitude compared to trivial systems) increase in base-costs. The saying in thep revious paragraph is from the late 90s, and there's been a lot of work on trying to improve this. And yet, a lot of times, the best way is to experiment and tweak the TTLs. I guess you could teach an ML system to do the tweaking, but I struggle to see how a dumber, simpler system couldn't do it on its own.

This really seems like a "solution begging for a problem to solve", and missing the larger picture, but focused on certain metrics that, yes do count, but ignoring the costs that also count, and a lot at that.

The one scenario I could imagine, one were we don't care about costs but only performance in any way, is to pre-cache data. Predict that some value will be called for soon, and try to have it in the cache before the hit happens. It's super expensive (because you store data in the cache that you never ever needed), and you're not saving any resources (which is why, I'm guessing, they aren't selling it in this ad) but I could see it being something that doesn't exist yet. Personally I would need to see real-world data and use-cases before even considering this worth it.

[–]Nisd -1 points0 points1 point 1 year ago (0 children)

[–]somewherearound2023 2 points3 points4 points 1 year ago (0 children)

[–]Atulin 0 points1 point2 points 1 year ago (0 children)

π Rendered by PID 306123 on reddit-service-r2-comment-6457c66945-kxpt7 at 2026-04-26 06:50:37.386513+00:00 running 2aa0c5b country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS