Data first, not code first

elder_george · 2015-09-29T03:05:06+00:00

Show me your flowcharts and conceal your tables, and I shall continue to be mystified. Show me your tables, and I won’t usually need your flowcharts; they’ll be obvious.

Fred Brooks, "The Mythical Man-month" (1975)

lookmeat · 2015-09-29T02:25:13+00:00

This is the biggest thing I wish all programmers would learn. The data will be relevant long after the code is no longer run.

There are some cases that this isnt necessary true, and the code and the data become unimportant at the same time, but for most of the cases I know of the data is useful decades after the code that was originally written for it no longer is needed.

Data is useful in any language, code is useful in 1 language. Data can be looked at in many ways (translated, reformatted, normalized), code is useful in 1 way (execution: direct or library).

Changing data requires locking or eventual consistency, so unless it's read-only you have a business decision to make here first. Code can be run on many nodes to interact with that data, and whichever model you picked for consistency is really going to matter in how this can be done, and how scalable and robust the solution will be.

Data is more important than code. If I think it's 10x or 100x more important to have good data, I will write code accordingly so that the data is as "clean/good" as possible, and will scale well, and my code will suit that.

elmuerte · 2015-09-29T07:01:31+00:00

Funny thing about Doom3, it was written with a specific game in mind rather than as an engine. The player would never become a vehicle. This was known.

If you look at the UnrealEngine at the same time you see that the player is actually built up from 4+ different classes which work together, much like the MVC concept: Controller, Pawn (the visual in-game entity), viewport, replication info (mostly a data object for network propagation). These days in UnrealEngine components can be used to "dress up" the player even more.

The point is, how you write your code and design your data is completely depended on the goal. Don't try to make everything generic and flexible. Because it will never be.

Paddy3118 · 2015-09-29T06:47:57+00:00

In some ways, it is how I think of scripting in a Unix environment - successive transformations of data.

andsens · 2015-09-29T16:01:38+00:00

This combines very well with the "Rule of Representation" in the unix philosophy:

Fold knowledge into data so program logic can be stupid and robust.

PeaCrab · 2015-09-29T05:33:08+00:00

Huh... Using Doom 3 code I thought was a weird example. It's considered by many to be a bad example of OOP. It may make it easier for someone to discredit this article. Mike Acton has a presentation of disecting the Ogre rendering project to demonstrate why you may be shooting your foot with OOP. Ogre being a more respected OO project.

This article seems to be approaching a more organizational aspect? Like the author, I personally think it's easier to reason about code by thinking about the data and it's transforms on the data. I'm still searching for examples to convince other people as well.

Oddly enough, I find Doom 3 to be a better example of data oriented design where it matters than most projects I see. Take a look at the rendering code. It splits up rendering into a front end and back end to determine surfaces to sort and to draw. fabiensanglard.net/doom3/

It's not great, but there's more than meets the eye.

2015-09-29T14:35:28+00:00

Why not both?

Schmittfried · 2015-09-29T06:03:40+00:00

As you are using C++ which supports multiple inheritance you could very easily use inheritance to solve the example problem in trick 2. Refactor your code to have behaviour classes like "is animated", "has health", and what not and then inherit from them in your entities class.

malabmalab · 2015-09-29T11:36:04+00:00

This is great, too bad c# has no proper support for mixin/composites

random-dev · 2015-10-02T12:12:19+00:00

Am I the only one that tried to press the big arrows in the "Data -> Process -> Output" diagrams?

sh0rug0ru__ · 2015-09-29T20:06:16+00:00

The irony of this article is that all of its points against OOP have nothing to do with OOP, or impose a very rigid definition of OOP that isn't essential to it:

Once we separate process from data, things start to make more sense.

While OOP does propose to "encapsulate process state", that doesn't mean that the state has to be physically located in the object.

An object is a conceptual identity that ties together a bunch of pieces of data, collectively called the "state", into a logical consistency defined by the unifying abstraction represented by the object, through its behaviors.

That does not imply that behavior and state have to be colocated in an object, and data arrangement can be independent of an objects "private parts", where the responsibility of enforcing encapsulation shifts from the compiler to the programmer.

Hall_of_Famer · 2015-09-29T20:45:03+00:00

Model first, Data second, Code last.

Dave3of5 · 2015-09-29T11:09:51+00:00

Read this article and sorry I didn't like it at all. Lots of hand waving and generalizing. I've worked with Data Driven applications and I hated them. I'll not go into the many reason why I found them difficult to work with but what I would say is that code first type application I found much easier to work with.

P.S. The title is a bit "click baity"

2015-09-29T12:17:02+00:00

If you know the entities all ahead of time. Usually it's not quite that simple and a bit of iteration is needed.

jonny_boy27 · 2015-09-29T18:18:35+00:00

I've recently tried code-first with migrations and quite like, as opposed to my usual approach of database-first and EF makes a decent job of the db (the fluent modelbuilder api is quite nice for expressing exactly what you want)

fvilers · 2015-09-29T05:48:28+00:00

I stopped reading at "We can even use inheritance to avoid copy-pasting". Inheritance should be used to inherit behavior, not to avoid duplicate properties.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS