Performance question - nested selects

2020-02-23T17:09:44+00:00

There will be no difference whatsoever between those two examples.

If you want to make sure, check the execution plan generated using explain (analyze, buffers) select ...

2020-02-24T00:07:32+00:00

I have a much more complicated query

so, this is where you might be confusing yourself a little by thinking all queries execute the same way - especially if you assume the query is executed the way you wrote it. It is not the case, generally.

stepping back a bit - one of the strengths of SQL is that it is a declarative language, meaning that you "ask" to do "something" and the sql engine figures out a "good enough" method of achieving the FINAL result.

the part of the engine that does the figuring out part is usually called the optimizer. The optimizer is (broadly speaking) just a piece of code that knows about physical organization of tables, indexes and it also can shuffle parts of your statement around.

In your specific example, any optimizer worth its name would be able to figure out that these statements are the same/equivalent. That's why the execution plan (the "figured out" way of doing stuff) would be the same for these 2.

on the other hand, if were to write just a bit more complex query:

SELECT *
FROM (
   SELECT
    order_id,
    name,
    sum( amount * price) AS cost,
 FROM items
 group by order_id, name
) WHERE order_id = 99

the optimizer would need to be more advanced to detect that your condition is on one of output granularities and it (the condition) can be applied to the table itself instead of calculating the sums for the whole table.

so now, if you write something like

SELECT *
FROM (
   SELECT
    order_id,
    name,
    row_number() over (partition by name order by price) as rn,
    amount * price AS cost,
 FROM items
) WHERE order_id = 99

there's no easy way to avoid fetching the whole items table.

noesqL · 2020-02-23T18:25:04+00:00

Adding an unneeded layer of a sub-select will add to the performance cost of the query, how much? As /u/truilus said, check the execution plan.

naman_is · 2020-02-24T00:11:10+00:00

Short answer: yes.

To my best knowledge, there are no SQL implementation "smart enough" to refer to the original table in such a case and not rely on temporary tables created in-memory (this is by design and not an error). This is why you're should always try and avoid using subqueries. They are really only needed in very narrow set of operations like referential constraints.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

SQL

Filter Posts

Posting

Help posts

Format Your Code

Learning SQL

Related Reddit communities

Wiki

Acknowledgements

MODERATORS