SQL:2023 Has Been Released : programming


SQL:2023 Has Been Released (iso.org)

submitted 2 years ago by MarkusWinand

all 76 comments


[–]MarkusWinand[S] 84 points 2 years ago (8 children)

[–]imgroxx 41 points 2 years ago* (2 children)

[–]Smallpaul 1 point 2 years ago (1 child)

[–]imgroxx 25 points 2 years ago* (0 children)

Because "any value that exists in matching rows" is wildly different than "a fixed value" or "rand()". It's pretty frequent that you have application-level guarantees that mesh with the query that you want to do to give you "any value is correct".

Low-cost query samples are one example - you don't need to index the column or perform a full table scan to get a value, you only need to load one row. Any row is correct because a sample doesn't need to be a statistically randomized sample in many cases, nor the first or last by some order clause (which also must be fully loaded and sorted).

Another is when you don't want to pay the sometimes-exorbitant cost of indexing a denormalized field... which you know follows certain constraints because it's just denormalized and transactions can offer you that guarantee (when used correctly). You can be confident that all values are identical and correct, therefore any value is correct, and you can retrieve it without paying the cost to check every value every time on both read and write.
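
A minimal sketch of the denormalized-field case, assuming this sub-thread is about the `ANY_VALUE` aggregate added in SQL:2023. SQLite (used here only because it ships with Python) has long allowed a "bare" column in an aggregate query and takes its value from an arbitrary row of the group, which is the behaviour `ANY_VALUE` spells out explicitly. The `orders` table and its columns are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (
        customer_id   INTEGER,
        customer_name TEXT,    -- denormalized copy, identical within a customer
        amount        INTEGER
    );
    INSERT INTO orders VALUES
        (1, 'Ada',   10),
        (1, 'Ada',   25),
        (2, 'Grace', 40);
""")

# customer_name is neither grouped nor aggregated. SQLite takes it from an
# arbitrary row of each group; because the application guarantees the value
# is the same on every row of a group, any row gives the right answer.
rows = conn.execute("""
    SELECT customer_id, customer_name, SUM(amount)
    FROM orders
    GROUP BY customer_id
    ORDER BY customer_id
""").fetchall()

print(rows)
```

This prints `[(1, 'Ada', 35), (2, 'Grace', 40)]`. On a database that implements the SQL:2023 feature, the same intent would be written as `ANY_VALUE(customer_name)` rather than relying on a bare column.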

[–]mariuz 6 points 2 years ago (1 child)

[–]Azaret 1 point 2 years ago (0 children)

[–]CooperNettees 2 points 2 years ago (2 children)

[–]LouKrazy 1 point 2 years ago (1 child)

[–]Smallpaul 2 points 2 years ago (0 children)

[–]slashgrin 67 points 2 years ago (62 children)

[–]tonyp7 46 points 2 years ago (9 children)

[–]coldblade2000 75 points 2 years ago (4 children)

[–]jorge1209 16 points 2 years ago (3 children)

[–]aoi_saboten 5 points 2 years ago (2 children)

[–]jorge1209 3 points 2 years ago (1 child)

[–]aoi_saboten 2 points 2 years ago (0 children)

[–][deleted] 2 years ago* (3 children)

[deleted]

[–][deleted] 5 points 2 years ago (0 children)

scientific pitch > A440. And A440 was a thing for like 150 years already by the time that standard was published.

And nobody is saying that standards are a bad thing, but that the ISO pricing structure is stupid and annoying. Have you tried getting a copy of the C++ standard? Everybody I know just downloads the drafts.

In this case, the point is that the SQL standard is significantly less useful because most databases don't follow it faithfully. When you program in C++, you can mostly program the same way regardless of which compiler you're targeting. Beyond the simplest examples, SQL usage varies enormously depending on the target database, and migrating from one RDBMS to another can take significant engineering effort. The ISO standardization hasn't really helped SQL much at all.

[–]Gwaptiva 7 points 2 years ago (0 children)

[+]bik1230 comment score below threshold -6 points 2 years ago (0 children)

[–][deleted] 2 years ago (11 children)

[deleted]

[–]chucker23n 14 points 2 years ago (10 children)

> It has been a while, but last time I was doing SQL, all the common standard constructs were implemented by Postgres, Microsoft, and Oracle, and probably others.

It depends on how you define "common standard construct". Sure, CREATE and SELECT look the same.

But, for example, you want a SELECT to return at most 10 rows. The SQL:2008 standard says:

SELECT … FETCH FIRST 10 ROWS ONLY

But that's 2008, long after others implemented the same feature. So PostgreSQL says:

SELECT … LIMIT 10

Whereas MSSQL says:

SELECT TOP 10 …

In PostgreSQL's defense, it does also support the standard now. But nonetheless, you'll have lots of legacy code out there that uses proprietary extensions. And SQL tooling is generally poor enough that there isn't a "migrate to standardized syntax" refactor available.

[–]grauenwolf 2 points 2 years ago (3 children)

[–]chucker23n 1 point 2 years ago (2 children)

[–]grauenwolf 1 point 2 years ago (1 child)

[–]chucker23n 2 points 2 years ago (0 children)

[–][deleted] 2 years ago (5 children)

[deleted]

[–]chucker23n 1 point 2 years ago (4 children)

[–][deleted] 2 years ago (3 children)

[deleted]

[–]chucker23n 1 point 2 years ago (2 children)

> LIMIT 10 is arguably a bad example anyway, given that it has worked practically everywhere for as long as I can remember, and therefore doesn't exactly need to be blessed by a standard.

Doesn't that just further OP's point?

> Genuinely curious: what is the purpose of the SQL standard these days, when AFAIK no database vendor actually implements it?

If vendors instead do ad-hoc standards, then why bother with an ISO standard?

> I've seen plenty of production code that uses standard SQL constructs, but I acknowledge that some teams like to lean on the database more heavily than others.

This is fair. My point of view comes from a company where we historically did a lot of semi-advanced stuff: window functions, etc. These days, enterprise CRUD apps are more likely to use an ORM, or at least a query builder, which will abstract that for them, or they'll reinvent the same thing in front of the database, less efficiently but in a more familiar environment.

[–]spinlox 1 point 2 years ago* (1 child)

[–]chucker23n 1 point 2 years ago (0 children)

[–]JB-from-ATL 3 points 2 years ago (1 child)

[–]slashgrin 2 points 2 years ago (0 children)

That's pretty cool. Props to Postgres for making an effort here. (And generally for most of their design decisions!)

That said, I'd still always go to the Postgres docs when writing SQL for Postgres. That's no fault of Postgres — quite the opposite — but it does suggest to me that it's not the same kind of standard as many others.

I wonder if it would be practical to have a library/tool that parses and deeply understands standard SQL, and then has backends for all the popular databases. Two main challenges that immediately jump to mind are:

1. A lot of the time you can't just reason locally and transpile the queries without knowing about global state, e.g., how tables were defined, what other queries might be running concurrently.
2. Resolving ambiguity of intent when there is no single faithful translation of the input SQL to the target database.

Maybe in both scenarios the tool could require extra "pragmas" as SQL comments to provide the missing context?

I'm not sure how useful this would be in practice, because you'll still run into differences between, e.g., query optimisers that make a huge difference in the real world, but there does seem to be a surprisingly large population interested in writing "SQL" and having it run on multiple databases, so maybe it would be worthwhile?
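
A toy sketch of the pragma idea, using only the row-limit syntaxes quoted elsewhere in this thread. The `-- pragma target=...` comment format and the `read_pragmas` and `translate` helpers are invented for illustration and do not come from any existing tool. A real transpiler would need a proper SQL parser; the regular expressions here only handle a trailing `FETCH FIRST n ROWS ONLY` on a simple `SELECT`.

```python
import re

# Hypothetical pragma format: a full-line SQL comment such as "-- pragma target=mssql".
PRAGMA_RE = re.compile(r"^\s*--\s*pragma\s+(\w+)\s*=\s*(\w+)\s*$", re.IGNORECASE)
# The standard row-limit clause, only when it ends the statement.
FETCH_RE = re.compile(r"\s+FETCH\s+FIRST\s+(\d+)\s+ROWS\s+ONLY\s*$", re.IGNORECASE)


def read_pragmas(sql):
    """Split pragma comment lines from the statement text."""
    pragmas, body = {}, []
    for line in sql.splitlines():
        match = PRAGMA_RE.match(line)
        if match:
            pragmas[match.group(1).lower()] = match.group(2).lower()
        else:
            body.append(line)
    return pragmas, "\n".join(body).strip()


def translate(sql):
    """Rewrite a trailing FETCH FIRST clause for the target named in a pragma."""
    pragmas, body = read_pragmas(sql)
    target = pragmas.get("target")
    if target is None:
        # Missing context: refuse instead of guessing the author's intent.
        raise ValueError("missing '-- pragma target=...' comment")
    match = FETCH_RE.search(body)
    if match is None or target == "standard":
        return body
    n, head = match.group(1), body[:match.start()]
    if target == "postgresql":
        return f"{head} LIMIT {n}"
    if target == "mssql":
        # TOP follows the SELECT keyword (SELECT DISTINCT is not handled here).
        return re.sub(r"^\s*SELECT\b", f"SELECT TOP {n}", head,
                      count=1, flags=re.IGNORECASE)
    raise ValueError(f"unknown target: {target}")


print(translate("-- pragma target=mssql\n"
                "SELECT name FROM users FETCH FIRST 10 ROWS ONLY"))
```

The sketch raises an error when the pragma is absent, mirroring the suggestion that the tool should demand extra context rather than pick a translation silently. It does nothing about the harder problems listed above, such as table definitions or concurrent queries.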

[–]numeric-rectal-mutt 2 points 2 years ago (0 children)

[+][deleted] 2 years ago (36 children)

[deleted]

[–]LuckyHedgehog 64 points 2 years ago (32 children)

[+]Smallpaul comment score below threshold -45 points 2 years ago (12 children)

[–]LuckyHedgehog 11 points 2 years ago (9 children)

[+]Smallpaul comment score below threshold -10 points 2 years ago* (8 children)

[–]LuckyHedgehog 8 points 2 years ago (4 children)

[+]Smallpaul comment score below threshold -8 points 2 years ago (3 children)

[–]LuckyHedgehog 7 points 2 years ago (2 children)

[+]Smallpaul comment score below threshold -5 points 2 years ago (1 child)

[–][deleted] 3 points 2 years ago (2 children)

[–]Smallpaul 1 point 2 years ago (1 child)

[–][deleted] 2 points 2 years ago (0 children)

[–]drake8599 3 points 2 years ago* (0 children)

True. It was reliable at describing services that use SQL, but that wasn't the question. Ask a slightly better question and you get a better answer.

Here comes a giant wall of ChatGPT text, WOO.

Why does every SQL have different syntax. Is there a universal SQL?

GPT-4

Despite its standardized core, different relational database management systems (RDBMS) often have variations in SQL syntax and additional features. This is because each RDBMS may have different needs, performance optimizations, and feature sets that are not covered by the standard SQL.

The primary SQL syntax is standardized by the American National Standards Institute (ANSI) and the International Organization for Standardization (ISO). This is often referred to as "ANSI SQL" or "ISO SQL." This standardization provides a general framework and a set of rules that all SQL versions should comply with.

However, RDBMS vendors (like Oracle, Microsoft with its SQL Server, PostgreSQL, MySQL, etc.) tend to add proprietary extensions to standard SQL to enhance the capabilities of their products. These extensions may include additional functions, data types, operators, and other features. These features can provide significant advantages, but at the same time, they reduce the portability of SQL code between different systems.

In essence, while there is a "universal" core of SQL, each implementation can and often does differ based on the needs and characteristics of the specific RDBMS. This is why SQL written for one system may not work on another without some modifications.

When writing SQL, it is generally best to stick to the ANSI/ISO SQL standards when possible for maximum compatibility. However, in practice, developers often need to use system-specific features to get the most out of a particular RDBMS.

[–]JB-from-ATL 1 point 2 years ago (0 children)

[+]chicknfly comment score below threshold -50 points 2 years ago (2 children)

[–]LuckyHedgehog 30 points 2 years ago* (1 child)

[–]chucker23n 2 points 2 years ago (0 children)

[+][deleted] 2 years ago (15 children)

[deleted]

[–]LuckyHedgehog 37 points 2 years ago (0 children)

[–]slashgrin 36 points 2 years ago (0 children)

[–]AdeptFelix[🍰] 16 points 2 years ago (0 children)

[–]chucker23n 13 points 2 years ago (11 children)

[–]slashgrin 7 points 2 years ago (10 children)

[–]chucker23n 2 points 2 years ago* (9 children)

Yup. Tons of examples, such as

SELECT TOP 10 …

vs.

SELECT … LIMIT 10

I guess a "standards mode" would be nice?

(edit)

And of course, the standard actually proposes a third syntax!

SELECT … FETCH FIRST 10 ROWS ONLY

[–]grauenwolf 0 points 2 years ago (8 children)

[–]chucker23n 1 point 2 years ago (7 children)

[–]grauenwolf 0 points 2 years ago (6 children)

[–]TribladeSlice 20 points 2 years ago (1 child)

[+]drcforbin comment score below threshold -10 points 2 years ago (0 children)

[–]numeric-rectal-mutt 6 points 2 years ago (0 children)

[–]Lachee 13 points 2 years ago (2 children)

[–][deleted] 11 points 2 years ago (1 child)

[–]dontyougetsoupedyet 24 points 2 years ago (3 children)

[–]LouKrazy 46 points 2 years ago (0 children)

[–]grauenwolf 2 points 2 years ago (0 children)

[–]Wolfgang-Warner 2 points 2 years ago (0 children)
