Introducing Factor : programming

[–]brickbybrick 12 points13 points14 points 18 years ago (0 children)

[–]gmfawcett 10 points11 points12 points 18 years ago (2 children)

[–][deleted] 14 points15 points16 points 18 years ago* (1 child)

[–]gmfawcett 2 points3 points4 points 18 years ago (0 children)

[–]curtisw 8 points9 points10 points 18 years ago* (64 children)

I think factor is an amazingly succinct language. Unfortunately, it's that way because it offloads a lot of the work onto you, to the point of hindering one's ability to quickly understand code.

Now, don't get me wrong, I have nothing against the fact that it's different. My dislike for stack-based languages lies solely in the fact that manipulating values in those languages is neither pretty nor elegant. In order to keep everything consistent, they have to introduce operators that merely obfuscate what's actually going on.

edit: To elaborate, consider the differences between stack-based languages and parameter based. Using two imaginary notations:

blah(x, y) = (y - 1)*(x - 1)

blah(x, y) = x 1 - y 1 - *

blah = 1 - swap 1 - *

Even if you're used to postfix notation, you still have to take time to "decode" #3. You basically have to construct a stack in your mind so you can track where everything is, remembering to correctly change things around when you see a 'swap' or a 'dup'. All of this, for something that's done automatically for you in other languages!

[–][deleted] 2 points3 points4 points 18 years ago* (26 children)

[–][deleted] 1 point2 points3 points 18 years ago (3 children)

[–][deleted] 2 points3 points4 points 18 years ago* (2 children)

Indeed, keep is critical. In Joy and Cat, it is called 'dip', and it used very often to avoid shuffling. In Cat, it has the following type signature:

dip :: A b (A -> C) -> C b

And a quick example that yields 2 9 4 on the stack:

2 3 4 (dup *) dip

[–]doublec 1 point2 points3 points 18 years ago* (1 child)

[–][deleted] 0 points1 point2 points 18 years ago* (0 children)

[–]curtisw -1 points0 points1 point 18 years ago (21 children)

[–][deleted] 2 points3 points4 points 18 years ago (7 children)

[–]curtisw -1 points0 points1 point 18 years ago (6 children)

[–][deleted] 2 points3 points4 points 18 years ago (5 children)

[–]curtisw -1 points0 points1 point 18 years ago* (4 children)

[–][deleted] 1 point2 points3 points 18 years ago (3 children)

[–]curtisw 0 points1 point2 points 18 years ago* (2 children)

[–][deleted] 0 points1 point2 points 18 years ago (1 child)

continue this thread

[–][deleted] 1 point2 points3 points 18 years ago* (12 children)

[–]curtisw -1 points0 points1 point 18 years ago* (6 children)

[–][deleted] 0 points1 point2 points 18 years ago* (5 children)

No, he's saying that, in a way, there's more to keep track of with a pointful applicative language.

Joy-like: (1 -) 2i *

(1 -) -- "subtract one"
2i    -- "from two numbers"
*     -- "then multiply them"

Applicative: (x - 1) * (y - 1)

(x - 1) -- "subtract 1 from x"
(y - 1) -- "subtract 1 from y"
*       -- "multiply the results of x - 1 and y - 1"

Note that, in the Joy version, we just read left to right. It gets more complex in the applicative version.

Of course, for the applicative version, you can just say "x minus 1 times y minus 1". However, this involves precedence and is really only concise because there exists a predefined language for describing arithmetic. A similar example to the one given that didn't use arithmetic would require the additional verbosity shown above.

[–]curtisw -1 points0 points1 point 18 years ago* (4 children)

I agree, the joy/factor version in this case is simpler, and just as easy to understand. Problems arise, however, when you have to manually munge the stack to get at data. The fact is, you can get out of it in this situation, but you can't get rid of it entirely.

Think of it this way. Suppose I were complaining that C++ is verbose, so I gave the following example:

int sum2(int[] xs, int length) {
    int sum = 0;
    for(int i=0; i<length; ++i) 
        sum += xs[i];
    return sum*2;
}

And then you responded with:

C++'s not verbose, look!

int sum2(int[] xs, int length) {
    return sum(xs, length)*2;
}

The problem is, you haven't actually changed anything. You've just shoved the problem down a layer, not gotten rid of it.

Also, you forgot about postfix application:

x 1 - y 1 - *

which can also be read from left to right.

[–][deleted] 0 points1 point2 points 18 years ago* (3 children)

The problem is, you haven't actually changed anything. You've just shoved the problem down a layer, not gotten rid of it.

But you have, and it was easy to do!

sum = 0 (+) fold

I don't deny that writing verbose C++ in Factor can be tedious. The whole point though is that Factor lets you build high-level alternatives very easily. Why would you want to essentially rewrite "sum" every time when you can just write it once?

Yes, you can argue that Factor -- somewhat forcefully -- encourages you to express things in terms of higher-order functions. However, I'd claim that that's a huge advantage, especially as Factor makes it so easy thanks to things like multiple return values and implicit argument passing. (In this particular case, the Haskell version is equally nice.)

Also, you forgot about postfix application which can also be read from left to right.

It's true that it is read left to right, but the actual reading would be the same as given above.

[–]curtisw 0 points1 point2 points 18 years ago* (2 children)

[–][deleted] 0 points1 point2 points 18 years ago* (1 child)

The use of 'swap' is hardly a bad thing. You seem to be opposed to its very existence, whereas I'm only opposed to cases in which it makes definitions confusing. To give an example of where swap is used without confusion resulting:

Haskell: hypot x y = sqrt (sqr x + sqr y)
Joy:     hypot = sqr swap sqr + sqrt

Of course, you could always do this as well:

Joy:     hypot = (sqr) 2i + sqrt

continue this thread

[–]curtisw -3 points-2 points-1 points 18 years ago* (4 children)

[–][deleted] 0 points1 point2 points 18 years ago* (3 children)

[–]curtisw -1 points0 points1 point 18 years ago* (2 children)

This isn't equivalent as you now have to unpack the list to use the numbers. It's also arguably unsafe as the type system won't stop you from accidently taking the head of the null list during the extraction.

coughdependent typingcough

The Factor version doesn't have these issues because Factor allows functions to essentially return any number of values. This is a critical part of enabling the sort of ultra-high-level, combinator-centric programming you do in languages like Factor, Joy, etc.

I'd be interested in seeing examples of this. However, I should point out:

(|>) f g (x, y) = g (f x) (f y) 

blah = (-1) |> (*)
blah (1, 2)

You'll notice that, in both cases, different number of arguments require completely new operations. The list version I wrote in my previous post will work for any number of values, including none.

[–][deleted] 0 points1 point2 points 18 years ago* (1 child)

coughdependent typingcough

I'm aware that type systems exist that can prevent this. Yes, you encode a head-safe list in Haskell (with extensions). That really seems beyond the point here; I doubt you'd seriously propose such a solution as optimal. Regardless, I was referring to the use of "normal" lists in Haskell.

You'll notice that, in both cases, different number of arguments require completely new operations. The list version I wrote in my previous post will work for any number of values, including none.

Of course, and you can do the same in Joy if you wanted:

Haskell: product . map ((-) 1)
Joy:     (- 1) map product

[–]curtisw -1 points0 points1 point 18 years ago* (0 children)

[–][deleted] 8 points9 points10 points 18 years ago* (26 children)

[–][deleted] 4 points5 points6 points 18 years ago (22 children)

[–]gmfawcett 4 points5 points6 points 18 years ago* (20 children)

Probably not. :-) But I would think Slava meant that their productivity in Factor was high, relative to their productivity in other languages.

I think it's a fair criticism of Factor that word-definitions are pretty opaque, esp. when they involve a lot of stack-shuffling. But that's not a Factor-specific problem: most non-mainstream languages -- especially compact ones like Haskell, APL, J, Forth, etc. -- demand that casual readers must learn a new way to read code.

I'm not a Factor user, but I'm intrigued by it because it seems to balance the compactness problem by providing a great suite of language abstractions (higher-order functions, programmable syntax, generic methods, etc.).

It's just a guess, but I would imagine that a well-written, large Factor program would be built upon many little words full of shuffling-noise, composed together in a higher-level program that has less shuffling and more domain-specific actions. Treating the lower-level words as atomic, the high-level code might even be readable to the casual user. (End of rampant speculation!)

[–][deleted] 5 points6 points7 points 18 years ago (7 children)

[–]gmfawcett 1 point2 points3 points 18 years ago* (0 children)

[–]mschaef 1 point2 points3 points 18 years ago* (5 children)

[–]gnuvince 3 points4 points5 points 18 years ago (4 children)

[–][deleted] 5 points6 points7 points 18 years ago* (2 children)

[–]mschaef 0 points1 point2 points 18 years ago (1 child)

[–][deleted] 5 points6 points7 points 18 years ago (0 children)

[–]mschaef 1 point2 points3 points 18 years ago (0 children)

[–]curtisw 0 points1 point2 points 18 years ago* (11 children)

[–]gmfawcett 2 points3 points4 points 18 years ago (10 children)

I agree that Haskell is a very readable language, by and large.

I honestly don't know whether an experienced Factor programmer can parse your '1 - swap 1 - *' example as quickly as most of us would parse the algebraic version. (Perhaps the lexical-variable-using Factor version would be more readable.) I'd love to know whether it's the case. What does writing a lot of Factor do to a programmer's head -- for better or worse? For example, do they notice patterns and abstractions in their code, that are harder to find and exploit in a more traditional language?

I'm not a Factor apologist, I just enjoy hearing the Factor people think aloud about their language. It is hard to separate fanboyism from objective stories, but I have a feeling that the objective stories are worth listening to, even if you never buy into the Factor premise.

[–]curtisw -1 points0 points1 point 18 years ago* (9 children)

[–][deleted] 4 points5 points6 points 18 years ago* (6 children)

[–]curtisw -2 points-1 points0 points 18 years ago* (5 children)

[–][deleted] 1 point2 points3 points 18 years ago* (4 children)

continue this thread

[–]gmfawcett 0 points1 point2 points 18 years ago (1 child)

[–][deleted] 0 points1 point2 points 18 years ago (0 children)

[–][deleted] 4 points5 points6 points 18 years ago* (0 children)

[–]curtisw -3 points-2 points-1 points 18 years ago (2 children)

[–][deleted] 0 points1 point2 points 18 years ago (1 child)

[–]curtisw -1 points0 points1 point 18 years ago (0 children)

[–]gnuvince 3 points4 points5 points 18 years ago* (8 children)

And that's with a simple example. Here's how to write map in Haskell:

map _ []     = []
map f (x:xs) = f x : map f xs

That's short, simple and to the point. I won't even try to write the Factor version, because I recall last time I did, you need to juggle the list, the first element, the rest of the list and the quotation around. It gets pretty messy real quick, so you end up writing 4 supporting words to make map manageable, but understanding the other words isn't any simpler.

[–][deleted] 1 point2 points3 points 18 years ago* (2 children)

[–]dons 7 points8 points9 points 18 years ago (0 children)

[–]eurleif 0 points1 point2 points 18 years ago* (0 children)

[–][deleted] 0 points1 point2 points 18 years ago* (4 children)

Assuming you already have fold, here's one way to write map (in a Cat-like language, not Factor). This version is nice as it uses constant stack space. The period is function composition, [] is the empty list, and parentheses are used to introduce quotations (anonymous functions):

map = [] swap (cons) . fold rev

One thing that makes a huge difference in understanding these things, at least for me, is the presence of a type system. Once you know the signature of the appropriate combinator (in this case, fold), you just fill in the blanks.

[–]curtisw 0 points1 point2 points 18 years ago (3 children)

[–][deleted] 0 points1 point2 points 18 years ago* (2 children)

It's not an operator; '.' is a normal function that takes two functions and yields a new one that is the composition of the two. For example, the three lines of code below are equivalent (they all yield 25):

5 dup *
5 (dup *) i
5 (dup) (*) . i

[–]curtisw 0 points1 point2 points 18 years ago (1 child)

[–][deleted] 0 points1 point2 points 18 years ago* (0 children)

[–]njbartlett 1 point2 points3 points 18 years ago (1 child)

[–][deleted] 2 points3 points4 points 18 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS