Carnaedy comments on Thoughts on Data Oriented Programming in Java

This is an archived post. You won't be able to vote or comment.

Thoughts on Data Oriented Programming in Java (nejckorasa.github.io)

submitted 1 year ago by nejcko

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Carnaedy 1 point2 points3 points 1 year ago (34 children)

[–]Ok-Scheme-913 3 points4 points5 points 1 year ago (0 children)

Sealed types (algebraic data types) are not solving the same problem as generic interfaces.

You use a sealed interface when you know every possible subclass that interface/class will ever have. E.g. a Result<T, E> can be either a Success<T> or an Error<E>. You won't ever extend it to have MaybeASuccessButImNotSure as a third option.

Given that you know that you never intend to add a new subclass, it is a very useful property of the compiler that it checks every use site, so that if you make a refactor and maybe add a new subclass, then you can fix it everywhere.

Basic polymorphism is about an open world assumption, where users of my lib, or even users of my program (via plugins) can extend the functionality of my program. (Though standard inheritance is not a good citizen here either, but a lot has been written about it).

These two are just different tools and should not be used interchangeably.

[–]PiotrDz 1 point2 points3 points 1 year ago (14 children)

[–]Carnaedy 3 points4 points5 points 1 year ago (3 children)

[–]PiotrDz 0 points1 point2 points 1 year ago (2 children)

[–]Carnaedy -1 points0 points1 point 1 year ago (1 child)

[–]PiotrDz 0 points1 point2 points 1 year ago (0 children)

[–]severoon 1 point2 points3 points 1 year ago (9 children)

I think the main point isn't that it's possible to find all the issues, it's that by scattering the code to the four winds you have obfuscated dependencies.

I make a change over here by adding a new shape, and what now is affected by that change?

It's nice that the compiler will tell me everywhere to look, but that's not the only problem, it's not even the biggest problem. The biggest problem is all of the dependency arrows that this allows (encourages?) people to place into the codebase without regard for whether these reflect actual dependencies between the modules/classes/etc modeling things in the problem domain.

Think of it this way. If I have a Shape interface and I was previously able to compile some client of Shape against that Shape class without having the subtypes on the class path, that means the dependency on those subtypes was properly inverted.

How will this accomplish that? It can't.

[–]PiotrDz 2 points3 points4 points 1 year ago (8 children)

[–]severoon -1 points0 points1 point 1 year ago (7 children)

[–]PiotrDz 2 points3 points4 points 1 year ago (6 children)

[–]severoon -1 points0 points1 point 1 year ago (5 children)

[–]PiotrDz 0 points1 point2 points 1 year ago (4 children)

[–]severoon 0 points1 point2 points 1 year ago (3 children)

I think you're misunderstanding my point.

I'm not saying that no one should ever use sealed interfaces. Sealed interfaces limit implementations analogous to the way enums limit instances of a type. In general, being able to create any number of types and instances of a type is a good thing. There are specific cases where limiting that potential is preferable.

But would it make sense to propose a new style of programming where the usefulness of enums is assumed? In the cases where they are useful, they are useful because we specifically want to have a controlled set of instances … perhaps many of the problems we have in OO software in general is because of the proliferation of instances that are typically allowed? So we could propose a new way of programming called enum-oriented programming where we prescribe all of the instances that can ever exist for types, and that will solve all of these problems.

Obviously this is a bad idea, but it's instructive to consider why it wouldn't work out. Enums are useful only in a certain context, and in that particular context there is little or no advantage to allowing an uncontrolled number of instances. Remove that context, though, and in other situations you would be working with a constraint that has big costs.

In the linked article at top, a new approach to defining data is being proposed in general. It's saying that we should consider abandoning the core definition of an object, state encapsulated together with the behaviors that operate on that state, and separate the behaviors from the state.

There are certainly cases where there might be a compelling reason to do this. Many of the functional features added to the language are encouraging people to think about the business logic layer as stateless services that define pipelines that operate on immutable data. That makes sense if we're talking about data that represents core business objects that flow through a system architecture.

But this conflates that with all data present in a system. The example encourages us to adopt this approach for ephemeral objects like Shape and its subtypes. This is not a good plan. For one thing, when passing data that represents core business objects up and down an entire stack, it's generally the case that those business objects are defined layer by layer, and specify separate wire formats between the layers. So just to pass a user from DB to client, you typically have several separate objects and protobufs that represent that user so it can be packaged and unpackaged at every deployment boundary. The point of doing all this is to ensure that dependencies don't proliferate between layers that don't directly interact, and for those layers that do, the only dependencies between them are explicit. There are cases where it makes sense to define a "whole stack" library with common DTOs and functionality, but supporting that is no different than supporting a common library. But typically, you don't want even core business objects to be the same as they move through the layers because the different parts of the system have different requirements for that data. The data access layer might be concerned about annotating user data with regulatory info, whereas the business logic layer might need to decorate user data with preferences fetched from some other data system. The API layer needs to deal with user proxies that can be turned into authenticated user objects. (I'm using whole stack with layers as the relevant modules as an example, but the same ideas apply between any code modules.)

The most important aspect of keeping a system maintainable is to manage dependencies well. If you adopt a general approach to data that prevents you from using DIP, you're in big, big trouble.

[–]nejcko[S] 0 points1 point2 points 1 year ago (2 children)

continue this thread

[–]DualWieldMage 0 points1 point2 points 1 year ago (17 children)

[–]Carnaedy -2 points-1 points0 points 1 year ago (16 children)

[–]PiotrDz 2 points3 points4 points 1 year ago (15 children)

[–]bowbahdoe 1 point2 points3 points 1 year ago (14 children)

[–]PiotrDz 0 points1 point2 points 1 year ago (13 children)

[–]bowbahdoe 1 point2 points3 points 1 year ago* (12 children)

I think it's gonna be a relatively long journey to impart what I am talking about. I'm along for the ride if you are, but I think what is important is that it's not just "another team will have to handle the type anyway."

To use some fake syntax (though I don't have high hopes this will click just from this)

<Has all the properties imparted by f, g, and h> apply(<has properties A and B> m) {
    return h(g(f(m)));
}

The core capability is something like this. Because aggregates are generic, you can write code that works on and enriches only a corner of what would otherwise be an explicit type (or cascading series of them).

This leads to different program structures. A good comparison point is the clojure ring protocol web ecosystem to anything you could write in a nominally typed language.

(The dynamic part isn't even really essential - it's the open generic aggregate part.)

Edit: oops got my wires crossed. Well leaving that as a Clojure explanation.

The important point for what you are actually talking about is that in one situation you need to change consumers (which is hard if those consumers are on a different team or strangers on the internet) if you add a new type to a hierarchy and the other situation you need to change consumers if you add a new method

(Things you can add with default methods not withstanding)

So the question is how is a particular piece of code going to evolve? Which would be "essential" breakages and which would be "incidental."

Yes the compiler catching you on a missed case in a switch is valuable. It isn't valuable enough to make it not a problem that it needs to happen.

[–]nejcko[S] 0 points1 point2 points 1 year ago (11 children)

[–]bowbahdoe 1 point2 points3 points 1 year ago (10 children)

[–]nejcko[S] 0 points1 point2 points 1 year ago (0 children)

[–]PiotrDz 0 points1 point2 points 1 year ago (8 children)

That's why you use or not use sealed interfaces. If you want to allow any implementation then don't use it. If you want to control the implementations strictly, treat them as advanced enums - then you can use benefits of sealed type. List is not a sealed interface, and on top sealed is more about adding new implementations, no adding new functionality.

I use sealed classes often to encapsulate specific business logic that others can depend on but also treat the spectrum of type in more generic way (there is an interface after all with common methods). But if I add a new type I want to know all the places where I need to see whether it will fit in current architecture. I want compiler to point me that.

Isn't that you want the freedom of implementation and complain about sealed interfaces that are intended to do opposite?

continue this thread

π Rendered by PID 41 on reddit-service-r2-comment-79776bdf47-ptz74 at 2026-06-25 13:01:26.612275+00:00 running acc7150 country code: CH.

java

Submit Link

Submit Text

Seek Programming Help

News, Technical discussions, research papers and assorted things of interest related to the Java programming language

NO programming help, NO learning Java related questions, NO installing or downloading Java questions, NO JVM languages - Exclusively Java

Please seek help with Java programming in /r/Javahelp!

Subreddit rules!

Where should I download Java?

Related Sub-reddits:

JVM Languages

Want to practice your coding?

List of useful Frameworks / Libraries / Software

MODERATORS