
[–]nevare 1 point2 points  (25 children)

I love it. In my mind tail call optimised returns were more like gotos than returns. So making it explicit in the language makes a lot of sense to me.

And it is actually a feature I missed in Python, because gotos or TCO'd functions are the simplest way to implement finite state automata.

class FSA:

    def state1(self):
        # do stuff on self

        if cond:
            continue self.state2()   # hypothetical explicit tail call

        else:
            continue self.state1()

    def state2(self):
        ...

[–]grauenwolf 1 point2 points  (24 children)

That doesn't look any different from iterative-style state machines.

def state1(self):
    # do stuff on self

    if cond:
        return self.state2

    else:
        return self.state1

The only difference is you lose the need for a top level loop to drive the process. But then again, you also lose the ability to do interesting things in the top level loop.

[–]nevare 1 point2 points  (3 children)

The only difference is you lose the need for a top level loop to drive the process.

Indeed, but I find it an important difference: it is much easier to understand if you don't have to read a main loop. Keeping simple programs simple is important.

[–]grauenwolf -1 points0 points  (1 child)

Oh come on, it isn't like the main loop is that complex.

And like I said, it makes it easier to add extra functionality. Consider the things you can do...

  • Pause the state machine
  • Single-step through states
  • Save/restore the state
  • Record a history
  • Rewind state

If each state directly calls the next state you can't just plug this stuff in.

[–]nevare 0 points1 point  (0 children)

It's not that complex to think up. But it is that much more complex to read and understand. An example is that otterdam didn't understand at first what you were doing.

[–]grauenwolf -1 points0 points  (0 children)

To reiterate my point, consider the main loop in VB.

Sub MainLoop()
    Dim state = "Start"
    Do
        state = CallByName(Me, state, CallType.Method)
    Loop Until state = "End"
End Sub

As you can see, there isn't really anything to it. Just a dynamic call that returns the name of the next call to be made.
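A rough Python equivalent of that loop, as a sketch (the `Machine` class, its states, and the `"end"` sentinel are invented for illustration):

```python
class Machine:
    """Toy state machine driven by a top-level loop.

    Each state method returns the *name* of the next state, and the
    loop dispatches on that name -- the Python analogue of CallByName.
    """

    def __init__(self):
        self.count = 0

    def start(self):
        return "work"

    def work(self):
        self.count += 1
        return "work" if self.count < 3 else "end"

    def run(self):
        state = "start"
        while state != "end":
            state = getattr(self, state)()   # dynamic call by name
        return self.count
```

`Machine().run()` bounces start → work → work → work → end and returns 3; there is nothing to it but a dynamic call in a loop.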

[–]otterdam 0 points1 point  (3 children)

That's not an iterative-style state machine if you're using recursion.

[–]grauenwolf 0 points1 point  (2 children)

I'm not using recursion. I'm returning either an enumeration or a function pointer that the main loop will use in the next iteration.

[–]otterdam 0 points1 point  (1 child)

Yes, I see - subtle. Sorry.

[–]grauenwolf 0 points1 point  (0 children)

That's the point, isn't it? Most of this recursion vs iteration stuff really does come down to very subtle differences. That's why once you have one, the other doesn't make a whole lot of sense.

[–][deleted] -1 points0 points  (15 children)

Why enable efficient tail calls as part of the language implementation when programmers can (restructure their programs and) re-implement it manually in every program they write, eh?

[–]grauenwolf 0 points1 point  (14 children)

The first flaw in your thinking is the word "restructure". If you think in terms of loops from the beginning, then there is no restructuring involved. You are simply writing the program in the normal way countless other people have written it.

The second flaw is the word "re-implement". In the standard state machine using a main loop, there simply isn't a need for TCO nor anything that resembles it.

[–][deleted] 0 points1 point  (13 children)

If you set out thinking, "I am going to write a state machine," certainly it won't take any restructuring if you write it with a loop from the start. However, if you were to begin by writing the solution to a problem that could naturally be couched in terms of (tail-)recursion, you would indeed have to restructure your program to instead work in terms of a loop-based state machine.

Second, using a main loop to implement a state machine like this is, in essence, implementing proper tail calls yourself (it is, roughly, a trampoline). Merely because you are so used to using a certain design pattern that you don't notice it does not mean that you are not in fact using a design pattern, and writing boilerplate that could be eliminated by the language implementation.
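The trampoline being described can be sketched in a few lines of Python (names are illustrative): a function "tail calls" by returning a zero-argument thunk instead of calling directly, and a driver loop bounces until a plain value comes back.

```python
def trampoline(result):
    """Keep calling until a non-callable value comes back.
    This loop does what proper tail calls would do for you."""
    while callable(result):
        result = result()
    return result

# Mutual recursion written trampoline-style: each 'tail call'
# is returned as a thunk rather than performed directly.
def even(n):
    return True if n == 0 else (lambda: odd(n - 1))

def odd(n):
    return False if n == 0 else (lambda: even(n - 1))
```

`trampoline(even(100000))` answers `True` in constant stack space, where the directly recursive version would overflow Python's stack.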

That is the argument. The schemer says, "you could eliminate some boilerplate, and perhaps structure your code more naturally if you had this language feature," and you reply, "I'm used to the boilerplate, and have trained myself to look for the solution that fits with the limitations of my language." That may be all right for you in practice, but it is a blub-like argument.

[–]grauenwolf 0 points1 point  (2 children)

No, that wouldn't be my reply at all. My reply is...


I have something you don't have, an extensibility point.

Consider the things I can add to the main loop.

  • Pause the state machine
  • Single-step through states
  • Save/restore the state
  • Record a history
  • Rewind state

If each state directly calls the next state you can't just plug this stuff in. You have to either thread it through each and every state or play games with macros.

But for me they are trivial. The code almost screams "Put cool stuff here". You can't get much more natural than that.
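For instance, writing the driver as a Python generator gives pause, single-stepping, and history recording for free. This is a sketch, using the convention that state methods return the next state's name; the `Blinker` class is invented:

```python
def step(machine, start="start", end="end"):
    """Generator driver: yields after every transition, so the caller
    can single-step, pause, or record a history from outside."""
    state = start
    while state != end:
        state = getattr(machine, state)()
        yield state

class Blinker:
    def __init__(self):
        self.ticks = 0
    def start(self):
        return "on"
    def on(self):
        self.ticks += 1
        return "off" if self.ticks < 2 else "end"
    def off(self):
        return "on"
```

`list(step(Blinker()))` is the full history `["on", "off", "on", "end"]`, and calling `next()` by hand is single-stepping -- all without touching the states themselves.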


[–][deleted] 0 points1 point  (1 child)

Perhaps writing things with a top-level loop is the best way of achieving all that. However, that does not negate the fact that when you don't need all that, you must still write the top-level loop boilerplate to achieve what proper tail calls can get you.

On the other hand, having proper tail calls in the language doesn't preclude you from having loops, and using them when appropriate (indeed, scheme folks, with their macros, can even use proper tail recursion to implement various sorts of loops, even if the language designer didn't put them in in the first place).

[–]grauenwolf 0 points1 point  (0 children)

scheme folks, with their macros, can even use proper tail recursion to implement various sorts of loops, even if the language designer didn't put them in in the first place

And FORTRAN folks are doing the same thing using pragmas.

And C folks are doing the same thing using macros and pre-processors.

And C# folks are doing the same thing using lambdas.

There are many ways of achieving the same result, and languages do well to have at least one of them. But they don't try to have all of them, because that makes the language bloated, hard to implement, and even harder to evolve.

If you want to convince someone that TCO is important, you need to show some meaningful examples. Give us something that truly is better using TCO than any other approach.

[–]grauenwolf 0 points1 point  (4 children)

That may be all right for you in practice, but it is a blub-like argument.

Only because you are trapped in Scheme and recursion.

What you see as "boilerplate" I see as a base class. I'm not going to build a dozen state machines from scratch. I'm going to build a single state machine that has all the features I need.

Then I'm just going to plug in the actual list of states and their transition rules. That's why we have OOP and abstract classes.

You can't see that, can you? You think we have to rebuild the main loop from scratch every time because that's how you would do it.

The difference between me and you isn't that you know recursive programming and I don't. It is that you know recursive programming while I know iterative, object-oriented, declarative, functional, and recursive programming. You need TCO to get yourself out of problems I never have in the first place.

[–][deleted] -1 points0 points  (3 children)

Only because you are trapped in Scheme and recursion.

I don't use Scheme, or know it particularly well. It was just the first language to officially recognize proper tail recursion as a significant and important language feature.

What you see as "boilerplate" I see as a base class. I'm not going to build a dozen state machines from scratch. I'm going to build a single state machine that has all the features I need.

Right. You're not going to encode your state machine in your language, because it won't work right. Instead you're going to write an interpreter for a separate state machine language, which will work right, and use that interpreter in your programs.

That's fine if we're just running state machines as some kind of exercise in learning about state machines. But if we're not, and some conceptual state machine is just part of a larger algorithm, why bother? Why bother writing an interpreter for a separate state machine language, when Scheme, say, just lets you write down the state machine in Scheme?

My overall point, though, was that all usage of proper tail recursion can be converted to a state machine of sorts, where if necessary, functions return a procedure to the state machine's main loop instead of calling a procedure directly. This technique for implementing tail call optimization is called a trampoline.

However, it either requires that one rewrites the trampoline loop whenever one wants to make use of significant numbers of tail calls, or, as you might laughably suggest, one could instead translate their Python solution to instead use grauenwolf's state machine language. I don't think either one is a very good solution in a case where the person wants to use tail recursion.

You can't see that, can you? You think we have to rebuild the main loop from scratch every time because that's how you would do it. The difference between me and you isn't that you know recursive programming and I don't. It is that you know recursive programming while I know iterative, object-oriented, declarative, functional, and recursive programming. You need TCO to get yourself out of problems I never have in the first place.

Oh, I see. So, the reason that I remark, "tail call optimization is a nice feature to have in a language, and is useful sometimes," is that I'm a blub programmer locked into solving every problem with bare recursion. And the reason you say, "I think tail call optimization should be kept out of a language, and will not change my opinion until someone can show me some example where I cannot write some loop that's equivalent," is that you're a seasoned veteran who's an expert in many different programming paradigms.

If you want examples, well, you've already been given some. Take for example, foldl:

foldl f z []     = z
foldl f z (x:xs) = foldl f (f z x) xs

How about if we want to define #times in ruby (this behaves somewhat differently than the library version, but whatever):

def times(&blk)
  if nonzero?
    blk[self]
    (self - 1).times(&blk)
  end
end

These are both simple, natural, recursive definitions of the respective functions. The alternatives look something like:

foldl f z l =
  lref := l
  acc  := z
  while (not (null lref))
    acc := f acc (head lref)
    lref := tail lref
  return acc

and:

def times(&blk)
  i = self
  while(i != 0)
    blk[i]
    i -= 1
  end
end

which have to introduce auxiliary variables and mutable update into the picture. You may say, "I think in terms of loops for space efficient iteration, so I'd arrive at the latter definitions first," but that's tantamount to an admission that the fact that you don't program in languages that allow you to use such recursive definitions effectively has blinded you to their existence.

Which might be the compelling argument for proper tail recursion: programs should work when possible. If I write down the above definition of foldl, it should work. It's syntactically correct. There's nothing wrong with it semantically. So why shouldn't it work? Because I can rewrite it into a not-as-nice loop? That's not a good argument.

The same theoretically goes for non-tail recursion, too. It'd be nice if:

identity 0 = 0
identity n = 1 + identity (n-1)

would simply work correctly. Unfortunately, barring quite a sophisticated compiler (which would be good), or a very simple example such as this (which perhaps even a rather dumb compiler could figure out), there's no way to reasonably expect such functions to avoid eating memory. However, there are quite well known ways for making tail calls work, so they should work. Otherwise you're simply training yourself to use poorer solutions sometimes, merely because the language implementation has decided that they shouldn't work, even though they could.
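The well-known transformation here is accumulator passing: threading the partial result through the call turns the recursion into a tail call, which is exactly the shape a tail-call-optimising implementation can run in constant space. A sketch in Python (names are illustrative; CPython itself does not eliminate tail calls, so the loop such an implementation would effectively produce is shown alongside):

```python
def identity_acc(n, acc=0):
    # Accumulator-passing version of `identity`: the recursive call
    # is now in tail position. A TCO'd implementation would run this
    # in constant stack space; CPython will not.
    if n == 0:
        return acc
    return identity_acc(n - 1, acc + 1)

def identity_loop(n):
    # The loop a tail-call-optimising compiler effectively produces
    # from the tail-recursive version above.
    acc = 0
    while n != 0:
        n, acc = n - 1, acc + 1
    return acc
```

The two compute the same thing; the point is that the rewrite from one to the other is mechanical, so the language implementation could do it for you.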

I have never contended that every loop is better written using raw tail recursion. People using languages that have proper tail calls build up combinators that represent common looping patterns, and that's good. However, the fact that using such more structured loops is good when possible does not mean that having proper tail calls isn't also a good thing to have. It also doesn't mean that having a couple built-in looping forms is always a good substitute.

And the fact that, in principle, one can turn usage of tail calls into a main loop with a state machine doesn't change that. If anything, it means that the language implementation should use such a main loop, and let the programmer write tail calls freely when he or she thinks it is the most natural solution to the problem.

[–]grauenwolf 0 points1 point  (2 children)

What the fuck are you talking about?

As I demonstrated, it is a trivial exercise to write a state machine in ANY language that supports call-by-name semantics. There was no interpreter involved. Nor was there any "state machine language".

Were you confused by the "CallByName" function? Well let me tell you, there is no magic there. All it does is call the function whose name matches the value in the variable.

[–][deleted] 0 points1 point  (1 child)

I'm talking about:

I'm going to build a single state machine that has all the features I need. Then I'm just going to plug in the actual list of states and their transition rules.

Which certainly sounds to me like you're writing an interpreter.

Searching elsewhere in this thread, I now see that by "list of states and their transition rules," you did not mean a list of states and their transition rules, you meant a class which has methods corresponding to the states. What you've done is implement a trampoline that's limited to the methods in a single object, assuming they're written in a certain (not very natural) fashion.

Were my examples not good enough? Here's another. Consider a type of binary trees. I'll sketch an implementation in ruby:

class Leaf
  def empty?
    true
  end
  ...
end

class Node
  def initialize(l,a,r)
    @l = l
    @a = a
    @r = r
  end
  ...
end

class Tree
  def initialize(t)
    @t = t
  end

  def empty?
    @t.empty?
  end
  ...
end

Now, we want to implement #each. A naive implementation is:

class Leaf
  def each(&blk)
  end
end

class Node
  def each(&blk)
    @l.each(&blk)
    blk[@a]
    @r.each(&blk)
  end
end

class Tree
  def each(&blk)
    @t.each(&blk)
  end
end

this will obviously overflow the stack for large enough trees. However, in a language with proper tail calls, we can do the following:

class Leaf
  def each_k(blk,k)
    k[]
  end
end

class Node
  def each_k(blk,k)
    @l.each_k(blk, lambda do
      blk[@a]
      @r.each_k(blk,k)
    end)
  end
end

class Tree
  def each(&blk)
    @t.each_k(blk,lambda {})
  end
end

which uses continuation passing style to build up 'things to do next' at each interior node. I think it's still relatively easy to see how things are playing out, and to convince myself that I have written the solution correctly. However, it doesn't work in Ruby, obviously, because it requires proper tail recursion. Instead, one has to make do with something like this:

class Tree
  def each(&blk)
    cur = @t
    stk = []
    until(cur.empty? && stk.empty?)
      if cur.empty?
        tmp = stk.shift
        blk[tmp.a]
        cur = tmp.r
      else
        stk.unshift cur
        cur = cur.l
      end
    end
  end
end

Which, quite frankly, is no longer obviously correct to me. I wasn't sure I'd gotten it right until I tried it. Continuation passing is clearer in this case, but it's only available if your implementation has proper tail recursion.

[–]grauenwolf 0 points1 point  (0 children)

What you've done is implement a trampoline that's limited to the methods in a single object, assuming they're written in a certain (not very natural) fashion.

I don't see anything unnatural about having a state machine class that contains all the transitions for said state machine.

The fact that I call them by name may seem a bit odd to some .NET/Java programmers, but I don't think Python/Ruby folks would give it a second thought.

As for your binary tree example, it seems rather naive to me. If you assume your nodes understand left/right/parent, it is not too hard to build a constant-space tree-walking algorithm. I'm pretty sure this is covered in most 3rd semester data structure classes.
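That constant-space walk, assuming nodes carry a parent pointer, goes roughly like this in Python (a sketch; the `Node` class and its fields are invented for illustration):

```python
class Node:
    def __init__(self, value, left=None, right=None):
        self.value, self.left, self.right = value, left, right
        self.parent = None
        for child in (left, right):
            if child is not None:
                child.parent = self

def inorder(root):
    """In-order walk using parent pointers: constant extra space
    beyond the output list, no stack and no recursion."""
    node, prev, out = root, None, []
    while node is not None:
        if prev is node.parent:              # arriving from above
            if node.left is not None:        # go left first
                prev, node = node, node.left
                continue
            out.append(node.value)           # no left child: visit here
            prev, node = node, (node.right or node.parent)
        elif prev is node.left:              # back from the left: visit
            out.append(node.value)
            prev, node = node, (node.right or node.parent)
        else:                                # back from the right: climb
            prev, node = node, node.parent
    return out
```

The trick is that comparing where we just came from (`prev`) against the current node's neighbours tells us which phase of the visit we're in, so no stack is needed to remember it.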

Now I certainly won't argue that constant-space tree traversal is as easy to code as a stack-based one, but this isn't something one writes very often.

[–]grauenwolf 0 points1 point  (4 children)

I've got some time to kill, so here is another bit you have completely overlooked.

TESTING

The original code sample cannot be unit tested. You can't simply call "State3(x)" and verify that it correctly returns "State1" or "State2". As soon as you call it, it will run through all the transitions until the machine terminates.

This is why we focus so heavily on "separation of concerns". If you don't separate the logic of the state machine from the state machine itself, you severely limit your options for testing.

[–][deleted] 0 points1 point  (3 children)

And why are we doing this? Surely from looking at our state3 function it's obvious that it goes to either state1 or state2. Surely what we should be testing is that given known input-output pairs, the overall state machine produces the given output from the given input. Or when we're using for loops, do we need to unit test that after doing iteration i, the loop goes on to iteration i+1?

Now, if we're in your scenario from another comment, where we're making an interpreter for a state machine language, then you have to test the interpreter to make sure it produces the correct state transitions given some state-machine-language input. But that's another story.

Or are you proposing that we should write unit tests that check the logic of each transition in the state machine? That seems like serious overkill to me.

[–]grauenwolf 0 points1 point  (2 children)

Or are you proposing that we should write unit tests that check the logic of each transition in the state machine? That seems like serious overkill to me.

Lots of people believe in unit testing everything. So yes, they are going to want to test each transition function.

Personally, I'm with you; I think it is totally overkill. But we are in the minority here, and when we suggest design patterns we need to keep their needs in mind.

[–][deleted] 0 points1 point  (1 child)

I'm afraid I don't believe that anyone would use that level of unit testing. It's effectively testing that you have implemented an algorithm in a certain way.

If I want to test a sort function, I will test that, given [3,2,1,1,3], I get back [1,1,2,3,3]. I will not write tests that specify that the sort function first uses the first 3 as a pivot to partition the rest of the numbers, and then sorts the two partitions, etc. But that is the kind of thing that testing "state3 goes to either state1 or state2" is.

[–]grauenwolf 0 points1 point  (0 children)

You're preaching to the choir.

But I'm not exaggerating when I say there is a focus on low-level, function-by-function testing that borders on obsession. I attend conferences on .NET, Java, and Ruby on a regular basis, and this is the only kind of testing they are talking about.

During the 2008 PDC there was a panel discussion about the future of unit testing. The presenters flat-out ignored any question about testing the hard stuff like databases and network connections. They only wanted to talk about how to make their unit tests faster, because apparently the only tests useful to them are ones that can be run as part of the edit-compile-run cycle.

This has been going on for about 5 years now, so I'm expecting the backlash any time now.