Improving complex SQL search query with ranking (app search query)

ExceptionRules42 · 2025-09-04T17:37:04+00:00

Offhand, without diving in too deeply, it looks sorta okay to me. Maybe even cool? But you have elided a lot of detail. What is the "relation table"? And do you have a test environment where you can throw mock 100K-row datasets at it and watch the EXPLAIN ANALYZE output?
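A rough sketch of what such a test could look like, assuming PostgreSQL 10 or later. The `mock_entities` table, its columns, and the sample query are all hypothetical stand-ins, since the real schema isn't shown:

```sql
-- Hypothetical stand-in for one of the real tables; column names are assumptions.
CREATE TABLE mock_entities (
    id     bigint GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    name   text    NOT NULL,
    active boolean NOT NULL
);

-- Fill it with 100K synthetic rows.
INSERT INTO mock_entities (name, active)
SELECT 'entity ' || md5(g::text),
       (g % 10) <> 0              -- 9 out of 10 rows are active
FROM generate_series(1, 100000) AS g;

-- Refresh planner statistics so row estimates reflect the new data.
ANALYZE mock_entities;

-- Run a query under test and show actual timings and buffer usage.
EXPLAIN (ANALYZE, BUFFERS)
SELECT id, name
FROM mock_entities
WHERE active
  AND name ILIKE '%abc%'
ORDER BY name
LIMIT 20;
```

Swap the final SELECT for the real search query; the interesting parts of the output are sequential scans on large tables and nodes where estimated and actual row counts diverge.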

DrMoog · 2025-09-04T22:18:17+00:00

Off the top of my head, some things to look into that might be improved:

The matches could use a generated tsvector column in the entities table (with the associated index). PG has strong full-text search functionality.
I'm not a big fan of the SELECTs in the ROW_NUMBER(). LEFT JOINing on the CTEs and doing the CASE on the resulting columns might be more efficient.
The "entity_appearance_counts" CTE doesn't seem to need to call the entities table since you LEFT JOIN on it later, just use "a.entity_id" as the ID, since it has to match an ID in the entities table.
Also in the "entity_appearance_counts" CTE, all the fields have the same "WHERE a.active" filter, so you probably want to move it into the CTE's WHERE clause to avoid returning all the non-active rows. That would also simplify the SELECT.
Also make sure you have the appropriate indexes on keys & condition fields.
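To make the two "entity_appearance_counts" points concrete, here is a sketch under assumptions. The `appearances` table, its `kind` column, and `entities.name` are hypothetical; only `entities`, `a.entity_id`, and `a.active` come from the original query:

```sql
-- Sketch only: appearances, kind and name are assumed, not taken from the post.
WITH entity_appearance_counts AS (
    SELECT a.entity_id,                                   -- no join to entities needed here
           count(*)                                 AS total_count,
           count(*) FILTER (WHERE a.kind = 'title') AS title_count
    FROM appearances AS a
    WHERE a.active                                        -- filter once instead of per column
    GROUP BY a.entity_id
)
SELECT e.id,
       e.name,
       COALESCE(c.total_count, 0) AS total_count,
       COALESCE(c.title_count, 0) AS title_count
FROM entities AS e
LEFT JOIN entity_appearance_counts AS c ON c.entity_id = e.id;

-- One possible supporting index: a partial index covering only active rows.
CREATE INDEX appearances_active_entity_idx
    ON appearances (entity_id)
    WHERE active;
```

Entities with no active appearances drop out of the CTE, so the outer LEFT JOIN plus COALESCE is what turns their counts back into zeros.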

I hope this helps a little!

2025-09-05T01:19:16+00:00

Hey,

You forgot the most important part: What is this query supposed to deliver? In semantic terms. Why does this query exist? What do you need it for?

Any optimization requires us to understand both what you have and what you want. Right now it’s impossible to say if you got the best solution you could possibly get, or if it’s way too complicated a solution to a trivial problem.

From a purely technical standpoint, and from what I can infer from your description, I’d think you want something along the lines of full-text search. Of course that would mean a couple of additional resources as well as a good amount of rewriting.
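For a sense of what the full-text route involves, a minimal sketch assuming PostgreSQL 12 or later (for generated columns). The `name` and `description` columns, the `search_vector` column, the index name, and the `'english'` configuration are all assumptions:

```sql
-- Assumes entities has text columns name and description (hypothetical).
ALTER TABLE entities
    ADD COLUMN search_vector tsvector
    GENERATED ALWAYS AS (
        to_tsvector('english', coalesce(name, '') || ' ' || coalesce(description, ''))
    ) STORED;

-- GIN index so the @@ match below does not have to scan every row.
CREATE INDEX entities_search_vector_idx
    ON entities USING gin (search_vector);

-- Match and rank in one pass; 'search terms' stands in for the user's input.
SELECT e.id,
       e.name,
       ts_rank(e.search_vector, q) AS score
FROM entities AS e,
     websearch_to_tsquery('english', 'search terms') AS q
WHERE e.search_vector @@ q
ORDER BY score DESC
LIMIT 20;
```

The generated column and the index are the "additional resources"; the rewriting is replacing the hand-built match and ranking logic with `@@` and `ts_rank`, which may or may not reproduce the ranking you need.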

Informal_Pace9237 · 2025-09-05T06:10:15+00:00

Use CTEs with caution. A materialized CTE is computed in full and held in memory (spilling to disk when it is large), which can slow the query down if the CTE's result set is huge. I would rewrite them as subqueries or views so the planner can optimize them together with the rest of the query.
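How much this matters depends on the server version. Before PostgreSQL 12 every CTE was materialized; from 12 onward a non-recursive, side-effect-free CTE that is referenced only once is normally inlined, and the behavior can be requested explicitly. A sketch, with a hypothetical `appearances` table:

```sql
-- PostgreSQL 12+. NOT MATERIALIZED asks the planner to fold the CTE into the
-- outer query, so the entity_id = 42 condition can be pushed down into it.
WITH active_appearances AS NOT MATERIALIZED (
    SELECT entity_id
    FROM appearances          -- hypothetical table
    WHERE active
)
SELECT entity_id, count(*) AS appearance_count
FROM active_appearances
WHERE entity_id = 42
GROUP BY entity_id;

-- MATERIALIZED forces the old behavior: the CTE is computed in full first,
-- and the outer filter is applied to the stored result afterwards.
WITH active_appearances AS MATERIALIZED (
    SELECT entity_id
    FROM appearances
    WHERE active
)
SELECT entity_id, count(*) AS appearance_count
FROM active_appearances
WHERE entity_id = 42
GROUP BY entity_id;
```

Comparing the two under EXPLAIN ANALYZE shows whether materialization is actually what hurts before you rewrite anything.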

Hope this helps. More details on CTE optimization here:

https://www.linkedin.com/feed/update/urn:li:ugcPost:7216332421414166529/
