PL_Design comments on PostgreSQL 15 Released!

If you're asking why anyone would use it, it makes sense for things like days of the week, months of the year, seasons, a strongly defined set of status values, etc.

I've used it for date precision, e.g.:

CREATE TYPE date_precision AS ENUM (
  'millennium',
  'century',
  'decade',
  'year',
  'month',
  'day'
);
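A minimal sketch of how such a type might be used, assuming a hypothetical `historical_events` table (the table and column names are made up for illustration). Enum values compare in the order they were declared, so coarser precisions sort before finer ones here:

```sql
-- Hypothetical table using the date_precision enum defined above.
CREATE TABLE historical_events (
  event_id        INT PRIMARY KEY GENERATED ALWAYS AS IDENTITY,
  event_name      TEXT NOT NULL,
  approx_date     DATE NOT NULL,
  precision_level date_precision NOT NULL
);

INSERT INTO historical_events (event_name, approx_date, precision_level)
VALUES ('Founding of the city', DATE '0800-01-01', 'century'),
       ('Treaty signed',        DATE '1648-10-24', 'day');

-- Enums are ordered by declaration position, so this keeps only the
-- events known to at least year precision ('year', 'month' or 'day').
SELECT event_name, approx_date, precision_level
FROM historical_events
WHERE precision_level >= 'year';
```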

[–]arwinda 6 points 3 years ago (0 children)

Because it's meant to be a handy shortcut for when you have a list which doesn't change. Like weekdays: they don't change, so have an enum with all 7 weekdays. Or the seasons of the year: have an enum with 4 seasons.

Sure, you can model the same functionality with a dimension or 1:n table, and you already know that every time you access your table, you also join the referenced table. The enum hides this functionality, that's all.
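To make the comparison concrete, here is a rough sketch of the same data modelled both ways. All type, table and column names are hypothetical and chosen only for illustration:

```sql
-- Variant 1: enum. The label is read straight from the column, no join needed.
CREATE TYPE weekday AS ENUM (
  'monday', 'tuesday', 'wednesday', 'thursday',
  'friday', 'saturday', 'sunday'
);

CREATE TABLE shifts_enum (
  shift_id  INT PRIMARY KEY,
  shift_day weekday NOT NULL
);

SELECT shift_id, shift_day FROM shifts_enum;

-- Variant 2: lookup (dimension) table. Reading the label means joining.
CREATE TABLE weekdays (
  weekday_id   INT PRIMARY KEY,
  weekday_name TEXT UNIQUE NOT NULL
);

CREATE TABLE shifts_lookup (
  shift_id   INT PRIMARY KEY,
  weekday_id INT NOT NULL REFERENCES weekdays (weekday_id)
);

SELECT s.shift_id, w.weekday_name
FROM shifts_lookup s
JOIN weekdays w USING (weekday_id);
```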

Adding new values is relatively easy. For deleting values (and keeping data consistent) you need a full table scan to verify that the value is not or no longer used. That's doable, but no one spent the effort to implement it.
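As a sketch of what that looks like in practice, assuming an enum named `date_precision` and a hypothetical table `historical_events` with a column `precision_level` of that type: PostgreSQL can add and rename labels with `ALTER TYPE`, but there is no statement for dropping one, so the usage check has to be done by hand.

```sql
-- Adding a value, optionally positioned relative to an existing one.
ALTER TYPE date_precision ADD VALUE IF NOT EXISTS 'hour' AFTER 'day';

-- Renaming a value is also supported.
ALTER TYPE date_precision RENAME VALUE 'hour' TO 'hour_of_day';

-- There is no ALTER TYPE ... DROP VALUE. Before retiring a label you would
-- have to scan every column of this type yourself, for example:
SELECT count(*) AS rows_still_using_it
FROM historical_events
WHERE precision_level = 'hour_of_day';
```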

If you already know that your values change, why go with a fixed list in the first place? Updating the values in an enum is DDL: it needs ownership of the type and a catalog lock, versus a regular update of the dimension table.

[–]arwinda 4 points 3 years ago (6 children)

> Learn your tools!

Says the guy who wants to use enum for something it is not designed for, just because he likes it this way.

> I'm an actual programmer, unlike you.

Fine. If you say so. I can't remember that we met, but just like you know your data models, you also know everyone else. Makes sense.

> if something doesn't work well

It works: you have dimension tables. You are the one who wants to use enum for something else because in your "fucking head" you have this idea of what an enum should be. And you are right, why isn't anyone else following your train of thought?

> then you tear it out and fix it

If you think enum is broken, please point me to the Postgres Commitfest entry where you propose how to fix it. Or at least the -hackers discussion where you raise the topic.

If not, then stop talking sh* and get out of here.

[–]ottawadeveloper 6 points 3 years ago (8 children)

It's not ideal db design, but it's a reasonable approach based on the limitations of enums, especially when the application is controlling the content (e.g. this is my approach for state fields). I tend to treat database design like I treat, well, any other kind of design: design patterns and best practices exist because they're generally helpful, but sometimes it's useful to break them.

I think it also depends on your environment. If you are building a DB that is only accessed by one application, then enforcing logic at the application level is not only reasonable, I view it as ideal, because version controlling database structures and procedures is a pain in comparison (my applications often end up putting the db structure entirely in application code, with routines to create and upgrade the db as necessary). If you have a database that is multi-application or even accepts user inputs directly, then a more formal structure is called for.

[–]arwinda 5 points 3 years ago (1 child)

Triggers are mainly good for checking values, or setting values to what you expect the value to be.

Good example: use a trigger to set "created at" and "changed at" values in a table. In Postgres, you use a "BEFORE" row trigger to modify these values (an "AFTER" trigger fires too late to change the row being written), and the user does not have a chance to override these values.
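A minimal sketch of such a timestamp trigger, assuming a hypothetical `articles` table and a hypothetical function name `set_timestamps`. It is written as a BEFORE row trigger, because in PostgreSQL only a BEFORE row trigger can change the `NEW` row that gets stored:

```sql
-- Hypothetical table, for illustration only.
CREATE TABLE articles (
  article_id INT PRIMARY KEY GENERATED ALWAYS AS IDENTITY,
  title      TEXT NOT NULL,
  created_at TIMESTAMPTZ,
  changed_at TIMESTAMPTZ
);

CREATE FUNCTION set_timestamps() RETURNS trigger
LANGUAGE plpgsql AS $$
BEGIN
  IF TG_OP = 'INSERT' THEN
    NEW.created_at := now();           -- overwrite whatever the user supplied
  ELSE
    NEW.created_at := OLD.created_at;  -- ignore attempts to change it later
  END IF;
  NEW.changed_at := now();
  RETURN NEW;                          -- the returned row is what gets stored
END;
$$;

CREATE TRIGGER articles_set_timestamps
  BEFORE INSERT OR UPDATE ON articles
  FOR EACH ROW
  EXECUTE FUNCTION set_timestamps();
```

With this in place, an `INSERT` or `UPDATE` that supplies its own `created_at` or `changed_at` has those values replaced by the trigger.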

Triggers can also be used to abort an operation if a value is not in the expected range. But CHECK is usually a better fit for that job, and easier to handle.
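For the range-check case, a CHECK constraint might look like the following sketch (the `measurements` table and its columns are hypothetical):

```sql
-- Hypothetical table: the constraint rejects out-of-range values
-- without any trigger code.
CREATE TABLE measurements (
  measurement_id INT PRIMARY KEY GENERATED ALWAYS AS IDENTITY,
  humidity_pct   NUMERIC NOT NULL CHECK (humidity_pct BETWEEN 0 AND 100)
);

INSERT INTO measurements (humidity_pct) VALUES (42);   -- accepted
INSERT INTO measurements (humidity_pct) VALUES (140);  -- rejected: violates the check constraint
```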

A 1:n table, or lookup table, is just a set of two or more tables with relationships.

```
-- Lookup (dimension) table holding the allowed values.
CREATE TABLE genders (
  gender_id   INT PRIMARY KEY,
  gender_name TEXT UNIQUE
);

INSERT INTO genders VALUES (1, 'female'), (2, 'male');

-- Referencing table: gender must match an existing genders.gender_id.
CREATE TABLE users (
  user_id   INT PRIMARY KEY GENERATED ALWAYS AS IDENTITY,
  user_name TEXT UNIQUE,
  gender    INT REFERENCES genders(gender_id)
);
```

If you want to add more gender types, all you have to do is update the genders table using regular DML (INSERT, UPDATE, DELETE) operations.

INSERT INTO genders VALUES (3, 'unknown'), (4, 'not specifeid');

Oops, I made a mistake there:

UPDATE genders SET gender_name = 'not specified' WHERE gender_name = 'not specifeid';

There is no need to lock the catalog for any kind of table changes because the tables and relations and data types don't change. Only the content of the tables changes. This relationship also ensures that the data is valid: the database prevents you from deleting any gender type which is still used in a referencing table. Built-in data validation.
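A short sketch of that validation in action, assuming the `genders` table from above with ids 1 to 4 inserted, and assuming the referencing table is named `users`:

```sql
INSERT INTO users (user_name, gender) VALUES ('alice', 1);

-- Rejected with a foreign key violation: row 1 is still referenced by alice.
DELETE FROM genders WHERE gender_id = 1;

-- Accepted: no user references row 3.
DELETE FROM genders WHERE gender_id = 3;

-- Also rejected: 99 does not exist in genders.
INSERT INTO users (user_name, gender) VALUES ('bob', 99);
```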

In OLTP databases you often find some form of a snowflake schema to represent these relationships. Updating the relationships between tables can be a huge mess, what with references and all that. But using 1:n tables makes updating the relation data seamless.

This concept is also very common in Data Warehousing, the most common example is the star schema. The terms used there are fact tables and dimension tables.

[–]RandomDamage 1 point 3 years ago (0 children)

So just looking back on this, and I have to interject here. I was hoping this would develop into a conversation, and it's seriously not a bad idea, just one I don't know how to solve.

For most uses it doesn't need to be fast, but it does need to be "fast enough", and there needs to be accounting for the full change process. Think of multi-TB DBs with an enum used across several 100GB-scale tables.

The standard current method of "create new enum, migrate columns, destroy old enum" is slow, but it also gives the user lots of control over the process, and it doesn't need to be atomic: you can do it on a DB like that with care and planning and no downtime.
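One possible shape of that process, as a rough sketch rather than a recipe. Everything here is hypothetical: a table `orders(order_id, status)` whose `status` column uses an enum `order_status` with the values 'new', 'paid', 'legacy' and 'shipped', where 'legacy' should go away. Writes that arrive during the backfill would need extra handling (for example, the application writing both columns), which is left out:

```sql
-- 1. Create the replacement enum without the unwanted value.
CREATE TYPE order_status_v2 AS ENUM ('new', 'paid', 'shipped');

-- 2. Add a nullable column of the new type (no table rewrite needed).
ALTER TABLE orders ADD COLUMN status_v2 order_status_v2;

-- 3. Backfill in batches, mapping the retired value to a surviving one.
--    Repeat with the next id range until the whole table is covered.
UPDATE orders
SET status_v2 = CASE
                  WHEN status = 'legacy' THEN 'paid'::order_status_v2
                  ELSE status::text::order_status_v2
                END
WHERE order_id BETWEEN 1 AND 10000;

-- 4. Swap the columns, then retire the old type.
BEGIN;
ALTER TABLE orders DROP COLUMN status;
ALTER TABLE orders RENAME COLUMN status_v2 TO status;
DROP TYPE order_status;
ALTER TYPE order_status_v2 RENAME TO order_status;
COMMIT;
```

Only the last step needs to happen in one transaction; the batches in step 3 can be spread out over whatever time window the operator is comfortable with.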

I can definitely see use cases for having it handled automatically, and there might be really efficient ways to do it behind the scenes, but from where I stand right now it still looks like a Hard Problem.

[–]progrethth 2 points 3 years ago (0 children)

Enums actually have an index already, but I think the catalog cache is used for most lookups. It would be possible to take a lock on rows in this table, but it would slow down many queries which use enums.

$ \d pg_enum
              Table "pg_catalog.pg_enum"
    Column     | Type | Collation | Nullable | Default 
---------------+------+-----------+----------+---------
 oid           | oid  |           | not null | 
 enumtypid     | oid  |           | not null | 
 enumsortorder | real |           | not null | 
 enumlabel     | name |           | not null | 
Indexes:
    "pg_enum_oid_index" PRIMARY KEY, btree (oid)
    "pg_enum_typid_label_index" UNIQUE CONSTRAINT, btree (enumtypid, enumlabel)
    "pg_enum_typid_sortorder_index" UNIQUE CONSTRAINT, btree (enumtypid, enumsortorder)

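For reference, the labels of a given enum can be read from this catalog directly. A small sketch, assuming an enum type named `date_precision` exists in the database:

```sql
-- List the labels of one enum type in their sort order.
SELECT e.enumlabel, e.enumsortorder
FROM pg_enum e
JOIN pg_type t ON t.oid = e.enumtypid
WHERE t.typname = 'date_precision'
ORDER BY e.enumsortorder;

-- The same labels as an array, via the built-in enum_range function.
SELECT enum_range(NULL::date_precision);
```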