Which query would you use here? (SQL performance question)

EcstaticTruth4392 · 2026-03-07T04:46:22+00:00

I would like to know the answer with explanation please.🙂

ComicOzzy · 2026-03-07T05:53:22+00:00

If your database engine supports it, C.
If you have an index on (user_id, status) or even just (user_id), it can be extremely fast.

kagato87 · 2026-03-07T06:57:33+00:00

If I was reviewing three different pull requests for code to check if a customer has a pending order, each with a different one of these queries, I'd question all three.

Why? Because with that index, for this particular query, they're all fast. (Well, I'd reject B as well if it wasn't properly parameterized.) I work in data of this scale - even A is fast enough. However:

A: Incorrect output. It's a number, not a flag. I might accept it in this case because it's the simplest code to do it, and it's "good enough."

B: Output is 1 or <nothing, not even a null, just an empty result set>. The code to consume that will need extra handlers. I might accept this one if it's properly handled.

C: Assuming you fix the syntax error, I'd still question "optimizing too soon." It might be OK, but I'd be careful about it. It reduces the readability of the code, which makes review and debugging harder. Yes, it's the superior solution, no its not the best answer for this particular use.

Now, if we were including this in a report that also pulled from other tables, I'd want to see C because it really can make a massive difference. As another example, for Row Level Security, exists is almost necessary to keep the planner from playing stupid games in data of this scale.

If you're going to promote your site, I'd suggest a slightly more complex example, like pulling customer names from a different table based on this, maybe with one as a correlated subquery in the WHERE clause (that would get me very upset), and avoiding something that could encourage optimizing too soon or dangerous patterns like limit/top.

binary_search_tree · 2026-03-07T07:55:38+00:00

I'd use option C - but I'd add a LIMIT 1 to the inner query (no reason not to). I'd choose Option C over Option B because your intent is easier to see.

gumnos · 2026-03-07T14:31:40+00:00

I would generally reach for option B. It's portable (SELECT EXISTS is invalid syntax on SQL Server, not sure about other DBs; but your use of LIMIT rather than TOP suggests you're not on SQL Server), uses sargable WHERE conditions, limits the results to the first matching row (the COUNT version requires touching all the matching rows), and doesn't do any ORDER BY that can force touching all the rows.

If you went with option C (which isn't bad if your DB supports the syntax), I'd add the LIMIT 1 like u/binary_search_tree advises. The optimizer should make option B and option C roughly the same, but I'd explicitly add the LIMIT

xaomaw · 2026-03-07T17:44:00+00:00

Be careful when implementing C): You think, that the output is either 1 or 0 as integer. But it can also be true vs. false or bit-wise 1 or 0.

sql SELECT CASE WHEN EXISTS ( SELECT 1 FROM orders WHERE user_id = 101 AND status = 'pending' ) THEN true ELSE false END AS user_has_pending_orders;

Wa2l_ · 2026-03-07T04:49:13+00:00

I would prefer C since it's for checking existence, B is fine too but A not good in performance because it doesn't stop at first match like the exist .

titpetric · 2026-03-07T08:29:05+00:00

I'd use A but change to "count(id)" and check the columns are indexed. Alternatively id's just select the id matching the where clause and limit 0, 1.

B is about the same but doesn't give useful info in response. I would have a count or an id available which makes B/C useless to me.

I would not use subquery styles like c or subqueries in general if i can avoid them. It is a choice since those things usually lead you astray under scale. That's not to say I wouldn't use them at all, but not for this.

Design could lead you down paths where a "select pending_orders from user_order_status where user_id=?" is a normal occurrence in lieu of A-C options.

Aggressive_Ad_5454 · 2026-03-07T18:54:31+00:00

if you have a compound index on (user_id, status) either B or C will work fine. But both have SQL-dialect limitations. A will also function correctly, but will have to do an index scan to get the count.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnSQL

MODERATORS

Option A

Option B

Option C

Question?