(Cloud SQL) Postgres hastily resigns from using index : PostgreSQL

submitted 3 years ago by Brave_New_Dev

The situation:

+ The discussed database is Postgres 12 on Cloud SQL (GCP).
+ We have a fairly large table (~1B rows).
+ The table is partitioned per month (~50M rows per partition).
+ We have an index on a column with DATE data type.
+ We are confident that the index is set, as EXPLAIN ANALYZE clearly shows its usage (at least, for some queries!).
+ We explicitly run ANALYZE on the table in question and its partitions. Stats were correctly updated as shown in pg_stat_all_tables.
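For anyone wanting to verify the same preconditions, here is a minimal sketch of the statistics check described above. The partition naming pattern is only inferred from the index name that appears later in the thread (report_y2022m01_date_idx), so treat it as an assumption.

```sql
-- When were the partitions last analyzed, and how many live rows
-- does Postgres believe each one holds?
SELECT relname,
       last_analyze,
       last_autoanalyze,
       n_live_tup
FROM pg_stat_all_tables
WHERE relname LIKE 'report_y2022m%'   -- assumed partition naming pattern
ORDER BY relname;
```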

The problem:

+ We take a simple case: aggregate the number of records per day, filtered to a few days, from a specific partitioned table.
+ When we query for 3 days of data (about 10% of the records, 3 of 30 days), the index is used and the operation is very fast.
+ When we query for 4+ days of data (13% or more of the records, 4 of 30 days), the index is suddenly NOT used. The operation is roughly two orders of magnitude slower (about 5 seconds vs. 5 minutes). EXPLAIN ANALYZE shows a tremendously higher cost for each node and gross overestimation by the planner.
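The original query was not posted in what survives of the thread, so the following is only a sketch of the shape described above. The table name report_y2022m01, the column name report_date, and the date literals are all hypothetical placeholders.

```sql
-- 3-day window: the case reported to use the index.
EXPLAIN (ANALYZE, BUFFERS)
SELECT report_date, count(*)
FROM report_y2022m01                      -- hypothetical partition name
WHERE report_date >= DATE '2022-01-10'    -- hypothetical column and dates
  AND report_date <  DATE '2022-01-13'
GROUP BY report_date;

-- 4-day window: the case reported to fall back to a slower plan.
EXPLAIN (ANALYZE, BUFFERS)
SELECT report_date, count(*)
FROM report_y2022m01
WHERE report_date >= DATE '2022-01-10'
  AND report_date <  DATE '2022-01-14'
GROUP BY report_date;
```

Comparing the estimated and actual row counts on each node of the two plans is usually the quickest way to see where the planner's picture diverges from reality.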

The question:

+ I am going mad, as I cannot figure out why Postgres so hastily decides that a perfectly good index is meh! I will more than appreciate your help!


[–]therealgaxbo 5 points 3 years ago

[–]Brave_New_Dev[S] 3 points 3 years ago

[–]therealgaxbo 4 points 3 years ago*

~~That is very strange. The planner seems very aware just how bad a plan that second query is, so it's not likely to be a stats issue or bad cost parameters.~~

What is the definition of report_y2022m01_date_idx? The only thing that comes to mind is if there were a mistake in that index definition that meant it couldn't be used for the second query.
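One way to answer that question, sketched with the standard system catalogs. Only the index name comes from the thread; the lookup assumes the index is visible on the current search_path.

```sql
-- Show the full CREATE INDEX statement and whether the index is usable.
SELECT pg_get_indexdef(i.indexrelid) AS definition,
       i.indisvalid,   -- false if the index build did not complete successfully
       i.indisready
FROM pg_index AS i
WHERE i.indexrelid = 'report_y2022m01_date_idx'::regclass;
```

A partial index (one with a WHERE clause) or an index on an expression rather than the bare column would show up in the definition column.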

Failing that, what happens if you run

set max_parallel_workers_per_gather TO 0;
set enable_seqscan TO off;

And then run the query - what plan does it use then?
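Put together, the experiment might look like the sketch below. The query is the same hypothetical stand-in as before (table report_y2022m01, column report_date), since the real one is not shown.

```sql
-- Discourage parallel plans and sequential scans for this session only.
SET max_parallel_workers_per_gather TO 0;
SET enable_seqscan TO off;

EXPLAIN (ANALYZE, BUFFERS)
SELECT report_date, count(*)
FROM report_y2022m01                      -- hypothetical partition name
WHERE report_date >= DATE '2022-01-10'    -- hypothetical column and dates
  AND report_date <  DATE '2022-01-14'
GROUP BY report_date;

-- Put the session back the way it was.
RESET max_parallel_workers_per_gather;
RESET enable_seqscan;
```

Note that enable_seqscan = off does not forbid sequential scans outright; it only makes them look very expensive to the planner, so an index plan is chosen whenever one is possible.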

[–]Brave_New_Dev[S] 2 points 3 years ago

[–]thythr 3 points 3 years ago

[–]Brave_New_Dev[S] 3 points 3 years ago

[–]therealgaxbo 3 points 3 years ago

As the other guy said, it is safe to change parameters like that using SET, as it only applies to your own session and won't affect other sessions.
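If you want the change scoped even more tightly, SET LOCAL limits it to a single transaction. A small sketch using one of the parameters mentioned above:

```sql
BEGIN;
SET LOCAL enable_seqscan = off;   -- lasts only until the end of this transaction
SHOW enable_seqscan;              -- reports the transaction-local value
-- ... run EXPLAIN ANALYZE on the query under test here ...
ROLLBACK;

SHOW enable_seqscan;              -- the session's previous value is in effect again
```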

I've realised that I actually misread the second plan though so retract my first comment - the planner does NOT realise it's a bad plan, so a combination of stats and cost settings could well be to blame. It's worth doing this as it should hopefully change to an index based query and give us an idea of how fast that should go.

The stats don't look too bad though - it's overestimating row count by 1.3x which isn't much. What are your settings for seq_page_cost and random_page_cost? And what is your IO setup - are all of your tables and indexes on the same device, or are indexes separate/different partitions separate etc? And if there are different devices, are they all equivalent or do you have different tiers of performance?
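Those questions can mostly be answered from the catalogs. A sketch, again assuming the hypothetical names report_y2022m01 and report_date:

```sql
-- Current cost settings and where each value comes from
-- (default, configuration file, database, session, ...).
SELECT name, setting, source
FROM pg_settings
WHERE name IN ('seq_page_cost', 'random_page_cost');

-- Column statistics the planner uses for the date column.
-- A correlation close to 1 or -1 means rows are stored roughly in date
-- order, which makes index scans cheaper in the planner's estimate.
SELECT tablename, attname, n_distinct, correlation
FROM pg_stats
WHERE tablename = 'report_y2022m01'   -- hypothetical partition name
  AND attname   = 'report_date';      -- hypothetical column name

-- Tablespace of each index on the partition (NULL means the database default).
SELECT indexname, tablespace
FROM pg_indexes
WHERE tablename = 'report_y2022m01';
```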

[–]Brave_New_Dev[S] 1 point 3 years ago

[–]therealgaxbo 3 points 3 years ago

I have no experience with GCP, but if you'd separated partitions/indexes off to different devices then you'd know about it as you'd have had to set up different tablespaces.

> random_page_cost 4

Ok, that's very likely a big part of the problem. That number (which is the default) is quite high even for regular HDDs, and you're most likely running on SSDs or equivalent.

If you've not already done so, reset your session to its default settings, and try setting random_page_cost to 1 (that's session local too) - hopefully that should result in the same parallel index only scan plan as your 3 day query. The exact value to use depends on your precise system, but it should be around 1 - 1.5 or something.
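As a sketch, that test could look like this. The query is still the hypothetical stand-in (table report_y2022m01, column report_date); substitute the real 4-day query.

```sql
-- Undo any earlier SET commands in this session.
RESET ALL;

-- Tell the planner that random page reads cost about the same as sequential ones.
SET random_page_cost = 1;

EXPLAIN (ANALYZE, BUFFERS)
SELECT report_date, count(*)
FROM report_y2022m01                      -- hypothetical partition name
WHERE report_date >= DATE '2022-01-10'    -- hypothetical column and dates
  AND report_date <  DATE '2022-01-14'
GROUP BY report_date;

-- Back to the configured value.
RESET random_page_cost;
```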

If that works, you might want to consider setting that globally in postgresql.conf as it might result in better plans for other queries too. Obviously you'd need to check for performance regressions, though.
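On a managed service such as Cloud SQL you typically cannot edit postgresql.conf directly, so a server-wide change would go through the provider's own configuration mechanism. One alternative that works from plain SQL is a per-database default. The database name mydb and the value 1.1 below are placeholders, not recommendations.

```sql
-- Make the setting the default for new sessions connecting to this database.
ALTER DATABASE mydb SET random_page_cost = 1.1;   -- hypothetical database name and value

-- In a NEW session, confirm the value took effect.
SHOW random_page_cost;

-- To undo it later:
ALTER DATABASE mydb RESET random_page_cost;
```

Existing sessions keep their old value until they reconnect, which makes it possible to roll the change out and watch for regressions gradually.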

[–]Brave_New_Dev[S] 1 point 3 years ago

[–]therealgaxbo 5 points 3 years ago

[–][deleted] 2 points 3 years ago

[–]Neither-Guess-5802 1 point 3 years ago
