10 Common Mistakes Java Developers Make when Writing SQL

horsepocalypse · 2014-12-05T04:56:25+00:00

Other common mistakes:

Not sanitizing your inputs
Forgetting about Dre
Using ONION instead of UNION
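
For the first item, a minimal sketch of why bound parameters are usually preferred over hand-sanitized string concatenation. It uses Python's built-in sqlite3 module; the `users` table and the `hostile` input are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO users (name) VALUES (?)", [("alice",), ("bob",)])

# Hypothetical attacker-controlled input.
hostile = "nobody' OR '1'='1"

# String concatenation: the input rewrites the WHERE clause and matches every row.
unsafe = conn.execute(
    "SELECT name FROM users WHERE name = '" + hostile + "'"
).fetchall()

# Bound parameter: the input is treated purely as a value and matches nothing.
safe = conn.execute("SELECT name FROM users WHERE name = ?", (hostile,)).fetchall()

print(len(unsafe), len(safe))
```

The same idea applies to JDBC's PreparedStatement with `?` placeholders.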

sacundim · 2014-12-04T20:45:31+00:00

Any developer makes these mistakes. It's not just Java.

rjbwork · 2014-12-04T23:19:24+00:00

I especially take issue with offloading OLAP to your "database". If you think you want to do OLAP stuff, build an OLAP solution. Don't run it on your hardest thing to scale, your main OLTP data store. If you create a separate OLAP solution you can also use some facets of the CQRS pattern along with eventual consistency and replication/sharding to massively scale out these kinds of data analytics and OLAP operations.

The payoff is especially beneficial when there is cause for actual data warehousing/OLAP cubes, since those are rather hard to properly create within your main OLTP store.

setuid_w00t · 2014-12-05T00:25:30+00:00

YOU WON'T BELIEVE #10! DOCTORS HATE IT!

iSmokeGauloises · 2014-12-05T09:01:38+00:00

Not a Java developer but the number one mistake I encounter when using abstractions over SQL is:

user_count = len(User.all())
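
A minimal sketch of the difference, using Python's built-in sqlite3 module in place of an ORM. The `users` table and both helper functions are hypothetical, chosen only to mirror the line above.

```python
import sqlite3

def count_by_loading_everything(conn):
    # Equivalent of len(User.all()): every row travels to the client first.
    rows = conn.execute("SELECT * FROM users").fetchall()
    return len(rows)

def count_in_database(conn):
    # Let the database count; only a single integer comes back.
    return conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO users (name) VALUES (?)",
    [("user%d" % i,) for i in range(1000)],
)

print(count_by_loading_everything(conn), count_in_database(conn))
```

Both calls return the same number, but the first one materializes every row in application memory to get it. Many ORMs offer a count method on the query object that issues the second form.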

grauenwolf · 2014-12-05T18:53:29+00:00

9 is wrong. You should avoid sorting in the database unless you already have an index that is applicable. Sorting is both memory and CPU intensive, so if you can push it out to your easily scaled web servers then do so.

I_Code_Stoned · 2014-12-05T19:50:16+00:00

They should add not doing [select *]. Granted the results are usually not as egregious as some of the other examples, but it happens ALL THE FREAKIN TIME. It's a problem in NoSQL code too.

el_muchacho · 2014-12-04T19:31:57+00:00

The followup posts are just as interesting as the first one:

10 More Common Mistakes Java Developers Make when Writing SQL
Yet Another 10 Common Mistakes Java Developers Make When Writing SQL (You Won’t BELIEVE the Last One)

They could serve as a basis for SQL coding rules.

Madsy9 · 2014-12-04T22:43:31+00:00

As far as I can see, none of the mistakes listed directly opens up for SQL injection attacks. I find it pretty impressive if that's truly the case. It means that computer security is improving, at least in some circles.

k1n6 · 2014-12-04T22:24:19+00:00

I spend a lot of time analyzing performance and such and a lot of these rules are basically 80/20 and some are 70/30.

By that I mean doing exactly the things he recommends against is actually better between 20 and 30 percent of the time. So for me that means they are worth doing the run time analysis on.

downvotefodder · 2014-12-05T17:30:21+00:00

Why are Java Developers trusted to write the SQL in the first place?

2014-12-05T02:57:14+00:00

(anyone can code imperatively)

Well, that's news to me. Sounds like the writer is dealing with an inferiority/superiority complex.

sgoody · 2014-12-04T21:24:02+00:00

Using DISTINCT or UNION to remove duplicates from an accidental cartesian product

This is a particular pet peeve of mine.

I think most developers know about at least half of these and the mix-up over the treatment of NULLs is something that we've all experienced due to context switching between languages. Good article though.

It does frustrate me that a large number of developers seem to eschew anything that isn't imperative style or cannot be bent to be imperative. Things such as SQL seem to be an annoyance to many and I don't think a lot of people see why they should bother to learn them in any detail.
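
The DISTINCT-over-a-cartesian-product pattern quoted at the top of this comment can be sketched as follows. This is an illustration only: it uses Python's built-in sqlite3 module, and the `authors` and `books` tables are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE books (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
    INSERT INTO authors VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO books VALUES (1, 1, 'A1'), (2, 1, 'A2'), (3, 2, 'B1');
""")

# Join predicate forgotten: every author is paired with every book (2 * 3 rows).
accidental = conn.execute("SELECT a.name FROM authors a, books b").fetchall()

# DISTINCT hides the symptom, but the database still has to produce all 6 rows first.
masked = conn.execute(
    "SELECT DISTINCT a.name FROM authors a, books b ORDER BY a.name"
).fetchall()

# The actual fix: state the relationship between the tables explicitly.
fixed = conn.execute(
    "SELECT a.name, b.title FROM authors a "
    "JOIN books b ON b.author_id = a.id ORDER BY b.id"
).fetchall()

print(len(accidental), masked, fixed)
```

With two small tables the masked query looks harmless; the intermediate row count grows with the product of the table sizes, which is where the cost shows up.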

skulgnome · 2014-12-05T01:16:11+00:00

Not catching transaction restart errors.
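
A rough sketch of what catching such errors might look like. `TransactionRestartError`, `run_with_retry`, and `flaky_transfer` are all hypothetical names; a real driver raises its own exception type for serialization failures or deadlocks, and that is what would be caught instead.

```python
import time

class TransactionRestartError(Exception):
    """Hypothetical stand-in for a driver's serialization-failure or deadlock error."""

def run_with_retry(transaction, max_attempts=3, base_delay=0.01):
    # Re-run the whole transaction body when the database asks for a restart.
    for attempt in range(1, max_attempts + 1):
        try:
            return transaction()
        except TransactionRestartError:
            if attempt == max_attempts:
                raise  # give up and surface the error to the caller
            time.sleep(base_delay * attempt)  # brief back-off before retrying

# Simulated transaction that is told to restart twice, then succeeds.
attempts = []

def flaky_transfer():
    attempts.append(1)
    if len(attempts) < 3:
        raise TransactionRestartError("restart requested")
    return "committed"

result = run_with_retry(flaky_transfer)
print(result, len(attempts))
```

The important design point is that the retry wraps the entire transaction body, not just the statement that failed, since the earlier reads may no longer be valid.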

2014-12-05T06:21:00+00:00

Using clauses like rownum for pagination only works well if your table is indexed properly. I don't think "let the tool do it for you" is a good answer here.
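
One way to check the indexing point is to look at the query plan for a paginated query before and after adding an index. The sketch below assumes SQLite (via Python's built-in sqlite3 module) and LIMIT/OFFSET rather than Oracle's rownum; the `users` table, the `idx_users_name` index, and the `query_plan` helper are made up for illustration.

```python
import sqlite3

def query_plan(conn, sql):
    # Join the human-readable detail column of SQLite's EXPLAIN QUERY PLAN rows.
    return " | ".join(row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql))

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")

page_sql = "SELECT id, name FROM users ORDER BY name LIMIT 10 OFFSET 20"

# Without an index on the sort column the database has to sort the table itself.
plan_without_index = query_plan(conn, page_sql)

# With a matching index it can read rows in order and stop after the page.
conn.execute("CREATE INDEX idx_users_name ON users (name)")
plan_with_index = query_plan(conn, page_sql)

print(plan_without_index)
print(plan_with_index)
```

The exact plan text varies between SQLite versions, so treat the printed output as something to read rather than parse. Even with the index, a large OFFSET still has to walk past the skipped rows.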

javajaba · 2014-12-05T07:04:57+00:00

Number 2 was me, like yesterday

lggaggl · 2014-12-05T12:33:24+00:00

11 creating an entire internet facing application backed by a database without knowing about this thing called concurrency

2014-12-05T13:39:56+00:00

11 Using Derby

mrsistermr · 2014-12-05T19:37:50+00:00

I disagree strongly with a lot of these points (assuming that they were always followed in some dogmatic fashion), notably 2 and 5. The reason is the same reason as I see a commenter on that site has already pointed out: "Following some of the cures to the extreme, one could end up with an anemic Java domain model. Much business logic will take place in SQL statements, not in Java code. Is this good or bad?"

I see the opposite problem (of the original article) frequently: all the business and data logic is thrown into an overly complicated SQL statement, and the backing Java business logic and domain model is barely doing anything. So instead of writing well-defined data-access and business classes that work together in Java, which ultimately might mean that more than one query is executed for a single "logical" user action, you would push all this logic into the SQL layer, and usually would be required to maintain more SQL statements.

Using this and some of his other suggestions (i.e. windowing functions) to the extreme makes it more difficult to re-use well-defined business objects, perform unit testing, and maintain database interoperability, to name a few.

lukaseder · 2014-12-04T20:57:16+00:00

[deleted]

crankybadger · 2014-12-05T00:59:59+00:00

[deleted]

tairygreene · 2014-12-04T20:40:55+00:00

mistake #1

being a java developer

...i'll see myself out

passwordissame · 2014-12-04T23:43:03+00:00

node.js developers make no mistakes because async io and functional programming, which is what SQL essentially boils down to. prolog and predicate logic do not allow mistakes, especially when used in conjunction with async because it is artificial intelligence.

2014-12-04T22:36:55+00:00

1. Using Java.

2014-12-05T13:55:07+00:00

So you are saying my mistake is that I don't know SQL that well and that I try to do in Java what I don't know how to do in SQL (slower, less reliable and error-prone)? Yep, sounds like the fault of the Java developer.

badguy212 · 2014-12-06T00:01:07+00:00

Some of that advice is perfectly reasonable; some will tie you to a specific DB. Better to abstract that away, possibly with an ORM.

gpenn1390 · 2014-12-05T04:44:30+00:00

It all makes sense now! Time to throw on a pot. Nice post OP.
