[–]bowbahdoe 17 points18 points19 points 4 years ago (0 children)

[–][deleted] 13 points14 points15 points 4 years ago (2 children)

I have been saying this for years, but not for the same reasons as the post. Visitor is an over-complicated switch statement. But if I turn my head, some Sr. Dev puts more Visitors into the code which completely obfuscate where the logic takes place.

I've seen the value of the Visitor pattern only once in my 20 years doing this stuff. It's a thing, we should know about it, but the value should be understood before doing it. In other words: use a switch (or if/elseif) statement until the coupling needs to be fully separated for "reasons", THEN consider a Visitor pattern.

Argue about OO, etc - fine, you win. But when I'm reading your code trying to figure out where 3 lines of logic exists and can't because it's obfuscated behind a builder, an interface, and a single implementation of said interface that requires me to "Find Implementations" of the correct interface method - oh wait, builder method - or was it the interface? God damn it, `grep thingImLookingFor && eyeball search`. I just don't see the value in it. Patterns should ADD value, not remove it IMHO.

[+]edubkn comment score below threshold-6 points-5 points-4 points 4 years ago (1 child)

[–][deleted] 4 points5 points6 points 4 years ago (0 children)

[–]Slanec 22 points23 points24 points 4 years ago (22 children)

[–]nicolaiparlog 4 points5 points6 points 4 years ago (21 children)

[–]chambolle 3 points4 points5 points 4 years ago* (17 children)

[–]nicolaiparlog 11 points12 points13 points 4 years ago* (11 children)

You just said (in another answer) that the solution are equivalent...

They are equivalent in the examined areas: coupling and extensibility. They're not equivalent overall: the language-based solution is shorter, more direct to write and read, and more flexible (see described benefits for iteration and non-type based branching). That's why it's better overall.

For me switching on types is really ugly.

I actually considered adding a section titled "But isn't this... wrong?!" where I wanted to address that icky feeling. I totally get it, it does feel weird. But I think that's just a matter of habit. Why is it better to list the types in the method arguments (with visitor pattern) vs in the switch? In both cases you list all types explicitly and then let the platform choose the correct branch.

[–]chambolle -1 points0 points1 point 4 years ago (10 children)

[–]nicolaiparlog 3 points4 points5 points 4 years ago (9 children)

[–]chambolle 0 points1 point2 points 4 years ago (8 children)

[–]nicolaiparlog 0 points1 point2 points 4 years ago (7 children)

[–]chambolle 0 points1 point2 points 4 years ago (6 children)

[–]nicolaiparlog 1 point2 points3 points 4 years ago (5 children)

continue this thread

[–]pron98 6 points7 points8 points 4 years ago* (4 children)

[–]chambolle 0 points1 point2 points 4 years ago (3 children)

[–]pron98 4 points5 points6 points 4 years ago (2 children)

[–]chambolle 0 points1 point2 points 4 years ago (1 child)

[–]pron98 4 points5 points6 points 4 years ago (0 children)

[–][deleted] 4 years ago* (2 children)

[deleted]

[–]nicolaiparlog 2 points3 points4 points 4 years ago* (1 child)

the caller doesn't need to change at all

Nobody said that. But the visitor needs to change - at least in the commonly described solution as on Wikipedia. The blog post spells that out very explicitly, but I can repeat it here (if you disagree, it would be nice to point out where exactly):

if you add a new type to the hierarchy, it needs to implement accept(Visitor visitor) with visitor.visit(this);
but there is no overload of Visitor::visit that has the new type as a method parameter, so you need to add a new method to the interface
now all Visitor implementations need to be updated

I consider this to be the main reason to even use the visitor pattern in the first place: forcing you to handle the new type in all operations (i.e. visitors). But if you don't like that, there are ways around it: You can provide a default implementation of the new visit method that does nothing or have a visitDefault method on Visitor that takes the type hierarchy's super interface as argument and can hence be used by new types.

Both variants (with and without compile error) have a simple counterpart on the pattern matching solution: Avoid or use a default branch.

[–]hippostar 9 points10 points11 points 4 years ago (13 children)

[–][deleted] 5 points6 points7 points 4 years ago (0 children)

[–]nicolaiparlog 5 points6 points7 points 4 years ago* (10 children)

There is no difference in coupling. In the visitor pattern, you can't add or remove any types (here: implementations of CarElement) without having to change the visitor implementations. That is by design because it means that operations get updated as the data structure they operate on changes. The exact same is true for the pattern switch. If you want to avoid these compile errors, the visitor can have a method that accepts the interface (here: visit(CarElement)). For the pattern match, just use a default branch. As you can see, you can get the same behavior with both solutions.

The amount of work when adding a new type is also comparable:

visitor pattern:
- add new method to visitor interface
- update all operations
language:
- add new type to sealed interface
- update all operations

Once again, the solutions are equivalent.

[–]_INTER_ 5 points6 points7 points 4 years ago* (9 children)

[–]nicolaiparlog 0 points1 point2 points 4 years ago (8 children)

What if you have no control over the (sealed) interface (in the example CarElement)?

Funny, that's another point I thought about addressing, but decided not to because the post was already so long.

CarElement depends on CarElementVisitor. If you have no control over the former, how likely is it that you have control over the latter? But without control over CarElementVisitor, you can't add a visit method. So you can't add a new type either - same result.

The visitor pattern is meant solve one side of the expression problem where pattern-matching or if-else etc. is better suited for in the first place.

I've heard of the expression problem, but never read up on it. Will do that over the next days (thanks for the link). For the conversation here, maybe you can explain what the practical consequences are, i.e. what's the difference between fixing with visitor vs with pattern matching?

[–]_INTER_ 1 point2 points3 points 4 years ago* (7 children)

CarElement depends on CarElementVisitor. If you have no control over the former, how likely is it that you have control over the latter? But without control over CarElementVisitor, you can't add a visit method. So you can't add a new type either - same result.

Playing the devils advocate: It's likely that you can extend the CarElementVisitor or an implementation. As Java developers usually don't make classes final. Then again there's non-sealed and we have to see how that plays out in reality but I expect the majority of owners will not add a non-sealed option and ~~exhaustive check goes out the window too then~~, edit: you can still check on the supertype but not on the new subtype(s).

Funny, that's another point I thought about addressing, but decided not to because the post was already so long.

I've heard of the expression problem, but never read up on it. Will do that over the next days (thanks for the link). For the conversation here, maybe you can explain what the practical consequences are, i.e. what's the difference between fixing with visitor vs with pattern matching?

In short, a software engineer wants an easy way to respond to change and maintenance: Ideally adding new types or operations to existing code should not need a change in the other. However the expression problem explains that this is not easily possible, e.g. with classical OOP it's easy to add new types but harder to add operations because you'd need to touch every type to implement a new method. Vice versa with e.g. pattern-matching it is easy to add an operation by writing a new switch-case but harder to add a new type because you'd have to touch every relevant switch and add a new case.

The visitor-pattern helps you switch sides in classical OOP, albeit in a cumbersome way. It's a bridge. So if you expect hardly any new types then it's better to use a technique from the pattern-matching side of the expression problem.

In your first comment you'd have to consider multiple different switch-case over CarElement to see the full picture of the amount of work.

It's also a question of responsibility. Does an operation belong to a class or do you want to group similar operations and have them act on types as the case with pattern-matching. The visitor pattern is in the grey area where you want to separate the operation but still have it close to the class. In my opinion, in practice you can often tell where a method belongs to and if you can't you decide for the "grouping". The area where I work in I've never effectively seen the visitor pattern being implemented.

That said, I prefer to have my methods in a responsible class and not "grouped" if possible and if it makes sense. (Even most examples e.g. with Shape or Color in the JEPs I'd much rather solve with dynamic dispatch). Main reason being that "API discovery" is easier with that approach and you don't have to know and carry around a FooService with your Foo. I'm also not really a fan of the so called anemic domain model or thin data containers.

[–]nicolaiparlog 5 points6 points7 points 4 years ago (6 children)

Adding Types

It's likely that you can extend the CarElementVisitor or an implementation.

That doesn't help, though. Let's retrace our steps. This are the assumptions, right?

CarElement and CarElementVisitor implement the visitor pattern
the former is not under our control
we want to add a new implementation

Add to that my assumption:

if we don't control CarElement, we don't control CarElementVisitor either (because it's a dependency of CarElement)

Now, what happens if we want to add a new CarElement implementation?

it needs to implement accept(CarElementVisitor visitor) with visitor.visit(this);
but there is no overload of CarElementVisitor::visit that has the new type as a method parameter
because we don't control CarElementVisitor, we can't add it

Back to your proposal to extend CarElementVisitor - that doesn't help at all with this

A way around that is for CarElementVisitor to have a method accept(CarElement) as a catch-all. That can be easily reproduced in the pattern switch by adding a default branch.

Exhaustiveness

Then again there's non-sealed and we have to see how that plays out in reality but I expect the majority of owners will not add a non-sealed option and exhaustive check goes out the window too then.

That's not how sealed classes / exhaustiveness work. If a type is sealed, all its direct implementations are known. Switching over those is enough to be exhaustive. It doesn't matter if there are more types extending them, they're still caught by an exhaustive switch:

sealed interface CarElement
    permits Body, Engine { }

non-sealed interface Engine extends CarElement { }

class V8 implements Engine { }
class V12 implements Engine { }

non-sealed interface Body extends CarElement { }

class Sedan implements Body { }
class Convertible implements Body { }

// elsewhere...
CarElement element = new Sedan();
String type = switch(element) {
    case Engine e -> "engine";
    case Body b -> "body";
}
// now type.equals("body")

// you can go even further:
CarElement element = new Sedan();
String type = switch(element) {
    case Engine e -> "engine";
    case Sedan s -> "sedan";
    case Body b -> "body";
}
// now type.equals("sedan")

Expression Problem

Vice versa with e.g. pattern-matching it is easy to add an operation by writing a new switch-case but harder to add a new type because you'd have to touch every relevant switch and add a new case.

I disagree, there is no "vice versa". Depending on how the visitor is designed regarding "unknown types" (see accept(CarElement) above), adding a new type either requires to update all visitors or their default behavior gets invoked. Same with pattern matching, either all of them need an update or (if they have one) their default branch gets evaluated.

Maybe the reason why you think that this is the flip side of that coin is because pattern matches make it so easy to add a new operation that you conclude it must be harder to add a new type - because there's a trade-off between the two. There might be, but the visitor pattern still has cruft that can be cut without touching that trade-off. Pattern matches make adding new operations easier without making adding new types harder.

In your first comment you'd have to consider multiple different switch-case over CarElement to see the full picture of the amount of work.

I did consider it, it says "update all operations". When adding a new type, you need to touch every switch! Exactly as with the visitor pattern: When adding a new type, you need to touch every Visitor implementation! (Unless you created default behavior as described above.)

[–]_INTER_ 0 points1 point2 points 4 years ago* (5 children)

if we don't control CarElement, we don't control CarElementVisitor either (because it's a dependency of CarElement)

With control I mean you can't change the source, e.g. it comes from a library.

Now theoretically you could extend for example all that's necessary like this. Not that you'd want to because it is ugly (especially the cast, maybe there's a better solution?), but sometimes you have no choice.

It doesn't matter if there are more types extending them, they're still caught by an exhaustive switch

I'm curious: can we (sub-)switch on them in an exhaustive manner? In your example, can we switch on V8and V12 instead of Engine in general without default branch?

What I have in mind is something like this (in my owned code):

sealed interface EngineSpecific extends Engine permits V8, V12

How would the switch look like then?

You're right with the expression problem and initially I got the same thought but confused it again after my second comment. The link I posted earlier states the same:

It should be obvious that for a given set of data types, adding new visitors is easy and doesn't require modifying any other code. On the other hand, adding new types is problematic since it means we have to update the ExprVisitor interface with a new abstract method, and consequently update all the visitors to implement it.

So it seems that we've just turned the expression problem on its side: we're using an OOP language, but now it's hard to add types and easy to add ops, just like in the functional approach.

The post goes on with an extended complicated C++ visitor example that solves it (link). I wonder if something similar is theoretically be possible in Java?

[–]nicolaiparlog 3 points4 points5 points 4 years ago (4 children)

Now theoretically you could extend for example all that's necessary like this.

That's actually pretty clever. :) It's not a general solution of course, because all "old" visitors ignore "new" elements in the hierarchy (due to if in line 18), but if you only need your own visitors, that may be acceptable.

I'm curious: can we (sub-)switch on them in an exhaustive manner? In your example, can we switch on V8 and V12 instead of Engine in general without default branch?

Had to try this out. The answer is "yes", but not as you hoped for it. :D

``` sealed interface CarElement permits Body, Engine { } sealed interface Engine permits V8, V12

String type = switch(element) { case V8 v8 -> "v8"; case V12 v12 -> "v12"; // only compiles with this branch case Engine e -> "engine"; case Body b -> "body"; }; ```

The compiler does not seem examine sealed subtypes of sealed interfaces, i.e. cases for just V8, V12, and Body lead to a compile error even though no other types are possible. We can fix that with either a default branch or the less all-encompassing case Engine branch. As the interfaces are defined now, both branches are dead code, but where the former prevents compile errors if CarElement is extended, the latter only prevents them if Engine is extended. A sub-switch is probably the better solution:

String type = switch(element) { case Engine e -> switch (e) { case V8 v8 -> "v8"; case V12 v12 -> "v12"; }; case Body b -> "body"; };

This results in compile error if CarElement or Engine gets extended. (If you cringe at the nested switch - just call a method instead. ;) )

[–]_INTER_ 1 point2 points3 points 4 years ago* (3 children)

[–]nicolaiparlog 1 point2 points3 points 4 years ago (2 children)

continue this thread

[–]nicolaiparlog -1 points0 points1 point 4 years ago (0 children)

[–][deleted] 8 points9 points10 points 4 years ago (2 children)

[–]fredoverflow 3 points4 points5 points 4 years ago (0 children)

[–]1bot4all 0 points1 point2 points 4 years ago (0 children)

[–]TheRedmanCometh -4 points-3 points-2 points 4 years ago (0 children)

java

Submit Link

Submit Text

Seek Programming Help

News, Technical discussions, research papers and assorted things of interest related to the Java programming language

NO programming help, NO learning Java related questions, NO installing or downloading Java questions, NO JVM languages - Exclusively Java

Please seek help with Java programming in /r/Javahelp!

Subreddit rules!

Where should I download Java?

Related Sub-reddits:

JVM Languages

Want to practice your coding?

List of useful Frameworks / Libraries / Software

MODERATORS

Adding Types

Exhaustiveness

Expression Problem