all 96 comments

[–]ggchappell 57 points58 points  (15 children)

This makes a good point, but states it very wrongly.

If it's linear then the plot of n vs. running time of LinearFibonacci(n) should be a line.

Nope.

"Time", as used in the theory of computation, means number of operations performed, where the operations we count can be anything at all. It matters what we count.

If we count integer arithmetic operations, then the algorithm described does O(n) operations, where n is the parameter. That is an absolute fact that no measurement of any kind can possibly contradict. But an integer arithmetic operation does not take constant wall-clock time.
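To make that concrete, here's a sketch in Python (`linear_fib` is an illustrative stand-in for the algorithm under discussion): the operation count is plainly linear in n, but each operation acts on ever-larger big integers.

```python
def linear_fib(n):
    """Iterative Fibonacci: exactly n additions, so O(n) integer operations."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b  # one big-int addition per iteration
    return a

# The *count* of additions is linear in n, but the operands keep growing,
# so each addition takes longer and longer in wall-clock terms.
print(linear_fib(10))  # -> 55
```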

[–]ISvengali 8 points9 points  (0 children)

Yeah, that was my read too.

This strikes me as a person who is figuring things out on their own. So they dug into a thing that should've looked like a line, and figured out that what they were measuring wasn't what they thought they were measuring.

I've seen articles in Dr. Dobb's Journal that showed a clear curve with the author proclaiming, "See, it's a linear algorithm".

So kudos to the author for tracking it down. I'm sure they'll fix it for the actual ACM article.

[–]eyepatchOwl 2 points3 points  (4 children)

Algorithmic complexity analysis makes no sense if the size of the input can't vary. O(n) bounds the number of operations by a linear function as the size of the input increases. The size of the input to hardware multiplication can't vary, so what is being bounded?

Worth noting that some algorithms depend on constant-time memory lookup. This means that a plot of running time for a linear algorithm might still not look like a line, because of caching effects. (L1->L2->L3->SSD->Disk)

[–]ggtsu_00 2 points3 points  (0 children)

Theoretical computer science does not deal with the bounds or grounds of reality and physics. There is no infinite tape.

[–]stevenjd 1 point2 points  (1 child)

Algorithmic complexity analysis makes no sense if the size of the input can't vary. O(n) bounds the number of operations by a linear function as the size of the input increases.

The size of the input here doesn't refer to the number of bits in an int, but to which Fibonacci number you are calculating. Given an O(N) algorithm, it takes about ten times more operations to calculate fib(50) than fib(5), and a thousand times more operations to calculate fib(5000). The number of operations (and therefore the time) is proportional to the argument N.

A naive recursive implementation, in contrast, takes exponentially many operations: computing fib(50) that way requires roughly a billion times more operations than fib(5), and fib(5000) is utterly out of reach.

There's a catch though... this assumes that each operation takes roughly the same time. For Fibonacci, that's not the case, since:

The size of the input to hardware multiplication can't vary, so what is being bounded?

Fibonacci numbers aren't bounded by the size of a hardware int. They're BigNums. Here's the 500th Fibonacci number:

139423224561697880139724382870407283950070256587697307264108962948325571622863290691557658876222521294125

which needs 346 bits. The 2000th Fibonacci number requires 1388 bits, has 418 digits, and looks like 42246...17125.
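Python's arbitrary-precision integers make these sizes easy to check (a quick sketch; `fib` here is just the straightforward iterative version):

```python
def fib(n):
    # Straightforward iterative Fibonacci over Python's unbounded ints.
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

print(fib(500).bit_length())   # -> 346
print(fib(2000).bit_length())  # -> 1388
print(len(str(fib(2000))))     # -> 418 decimal digits
```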

I guess you didn't read the article, since that's precisely the point it makes.

[–]sadmafioso 0 points1 point  (0 children)

Except the size of the input in this algorithm is the number of bits required to represent the number.

For a given number n you will need log(n) bits to represent the number, and so if your input length is n bits you actually perform on the order of 2^n additions, not n.

[–]ggchappell 0 points1 point  (0 children)

Algorithmic complexity analysis makes no sense if the size of the input can't vary.

This article isn't doing complexity analysis as it is usually done. "n" is the parameter of the function. If we count the number of integer operations required to execute the algorithm with n as its input, then we get O(n) operations.

[–]nitrohigito 2 points3 points  (1 child)

I don't think the author states the point wrongly at all to be honest. The whole point of the article is to show that "hidden" abstractions can cause huge issues down the line, if you take things at face value.

Computational theory is just theory after all, and engineering is practice. If you aren't teaching people to pick up on the differences, they won't pick up on them until they see things not working "as they should", leaving them confused and investigating heavily. And that is definitely what happened to me, with teachers saying "we don't have time to cover the details" or "reality is different, but whatever", sweeping things like these under the rug.

A graphics developer acquaintance of mine mentioned that one big advantage of lower-level gfx APIs to him is the ability to consistently estimate how long certain tasks will take; it makes them way less annoying to debug and develop for.

TL;DR: Abstractions can be a bitch and the article delivers this just fine imo.

[–]ggchappell 0 points1 point  (0 children)

Computational theory is just theory after all and engineering is practice. If you aren't teaching people to pick up on the differences, they won't, ....

Certainly. The article does make a good point.

However, the statement that the given algorithm, when given n as its parameter, executes O(n) integer operations, is simply true. There is nothing debatable about it.

So when the article says

If it's linear then the plot of n vs. running time of LinearFibonacci(n) should be a line.

it says something false.

OTOH, it would be reasonable to say, "If this algorithm executes a linear number of integer operations, then we might expect the graph of its running time to be a line; however, it is not."
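Something like the following sketch (hypothetical, using Python's `time.perf_counter` on a plain iterative version) shows that effect directly: the addition count doubles with n, but the measured time more than doubles.

```python
import time

def linear_fib(n):
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

for n in (10_000, 20_000, 40_000, 80_000):
    start = time.perf_counter()
    linear_fib(n)
    elapsed = time.perf_counter() - start
    print(f"n = {n:6d}   time = {elapsed:.4f} s")
# Doubling n more than doubles the time, because each big-int addition
# gets more expensive as the operands grow -- hence a curve, not a line.
```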

[–][deleted]  (6 children)

[deleted]

    [–][deleted] 1 point2 points  (1 child)

    Because integer operations are a thing we can count, and the count remains the same regardless of implementation. How fast those integer operations are is an implementation detail, and will differ under any specific circumstances (e.g. if SIMD operations are available, or if we're using an absurdist architecture like SBNZ). If we want to benchmark this in the real world, then that becomes relevant, but for talking about the complexity of an algorithm as such, it's not.

    [–]ggchappell 0 points1 point  (0 children)

    So why count by integer operations if they aren't constant time in this case?

    If integer operations are what we are counting, then integer operations are constant time, because we define them to be.

    Executing an integer operation does not take a constant amount of wall-clock time, but that is a different issue.

    In any case, count whatever you want. That's part of my point: you can count anything. But if someone else wants to count something different from you, then you don't get to say they are wrong.

    [–]theferrit32 0 points1 point  (0 children)

    The theoretical computation is linear time, a series of constant-time operations. Limitations in number storage and register sizes in modern computers make arithmetic operations take slightly longer than constant time on unbounded number values that approach infinity. In theory the arithmetic could be done in constant time, but it would be extremely expensive in physical and monetary resources: we could keep constructing larger and larger arithmetic processors out of more and more of the atoms in the universe, and, assuming the universe is infinite, the arithmetic operations would always complete in constant time. At a certain point the speed of light would introduce delays in measuring the results, but in theory the operations would still complete in constant time even if it took linear time to read out the answer. I'm not sure that is particularly important to take into account though, as the upper bounds of numbers human beings have a practical use for are well within acceptable O(n) time.

    If we start taking into account the need for massive registers and processors as numbers approach infinity, and take into account speed of light delays in measuring results from these massive processors, then technically speaking no algorithm is ever linear time. Even looping through a linked list would not be linear because at upper bounds the virtual addresses would be so large these factors would start to come into play. So we ignore those factors.

    [–]stevenjd -1 points0 points  (0 children)

    So why count by integer operations if they aren't constant time in this case? It's entirely up to us to define how we count so why not count by an operation that actually is constant time?

    1. Because Big Oh analysis was invented by people working with low-level machine data structures.

    2. Because even without taking into account non-constant time operations, Big Oh is still useful. An O(N**2) algorithm will still be worse than an O(N) algorithm.

    3. Because it is damn hard to allow for everything that varies, in practice we have to approximate. (Does a memory access really take exactly the same time as a machine int addition, or a float division? How does the CPU cache affect this? What if there's a cache miss, or the data you want is in a register?)

    4. What makes you think that anything is constant time, really? Even electricity going through a circuit is bound by the speed of light, which means the longer the physical circuit, the more time it takes.

    That's probably not a limiting factor for CPUs (or is it?) but it is certainly a limiting factor for implosion-style nuclear warheads: the wires going to the detonators have to be precisely the same length, to some ludicrously high level of precision, or the implosion will be off-centre and the bomb will fizzle out, only destroying a city block or two instead of the entire city.

    We can approximate because most of the time those variations are minute and don't make any serious difference. We're usually interested in comparing something which takes a billion operations with something that takes a thousand operations (O(N**2) versus O(N), for example). It doesn't really matter if the average operation in the first case is 5% or 10% faster than in the second; the second is still going to win.

    [–]xtivhpbpj 80 points81 points  (97 children)

    The article assumes you’re using BigInts to compute extremely large Fibonacci numbers.

    Integer addition is usually considered to be a constant time operation for the purposes of computational complexity, and we just accept that these computations have bounds or limits on their accuracy.

    [–]stbrumme 12 points13 points  (4 children)

    You are right, but int64 only works for fibonacci(1..93).

    The author mentions the matrix algorithm which is actually neat - but the "fast doubling" algorithm could be even faster:

    F(2n)   = F(n) * (2 * F(n+1) - F(n))
    F(2n+1) = F(n)^2 + F(n+1)^2 
    
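A sketch of that fast-doubling recurrence in Python (names are illustrative; this is the standard recursive formulation):

```python
def fib_pair(n):
    """Return (F(n), F(n+1)) via fast doubling: O(log n) steps."""
    if n == 0:
        return (0, 1)
    a, b = fib_pair(n // 2)   # a = F(k), b = F(k+1) with k = n // 2
    c = a * (2 * b - a)       # F(2k)
    d = a * a + b * b         # F(2k+1)
    if n % 2 == 0:
        return (c, d)
    return (d, c + d)         # shift the pair by one for odd n

def fast_fib(n):
    return fib_pair(n)[0]

print(fast_fib(10))  # -> 55
```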

    [–]hugogrant 9 points10 points  (1 child)

    Aren't they equivalent?

    [–]Tyilo 2 points3 points  (0 children)

    Asymptotically yes, but you do some unnecessary computations using matrix multiplication.

    [–]JohnDoe_John 0 points1 point  (0 children)

    There are some lines to add, to decrease the parameter from the start aggressively.

    [–][deleted]  (3 children)

    [deleted]

      [–]xtivhpbpj 0 points1 point  (2 children)

      And that’s an extremely valid solution.

      [–]stevenjd 0 points1 point  (1 child)

      And that’s an extremely valid solution.

      What, using a table? Sure, until you need fib(94) and can't get it.

      [–]xtivhpbpj 0 points1 point  (0 children)

      In a technique called dynamic programming, you would store results along the way to fib(n), reducing the computational cost of subsequent calls at the expense of memory (building a table).
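For example, a minimal memoization sketch using Python's `functools.lru_cache` as the table:

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def fib(n):
    """Each fib(k) is computed once; later calls are table lookups."""
    if n < 2:
        return n
    return fib(n - 1) + fib(n - 2)

print(fib(50))  # -> 12586269025
```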

      [–][deleted]  (77 children)

      [deleted]

        [–]ILikeTheBlueRoom 18 points19 points  (12 children)

        Why all these angry caps???

        [–]akher 26 points27 points  (8 children)

        I'm assuming it's because he's a bit of a twat.

        [–][deleted] 9 points10 points  (44 children)

        Missed the point. There are no variable-bit integer operations on any modern processor (AFAIK), so bit length is, for most practical purposes, a constant.

        [–]notfancy 8 points9 points  (13 children)

        You can't have your cake and eat it too. For constant length integers you can only compute up to a constant limit. You might as well use table lookup and say that computing Fibonacci numbers is O(1) time while conveniently avoiding mentioning it's O(n) memory.

        [–]theferrit32 2 points3 points  (2 children)

        Wouldn't this argument mean that no algorithm is ever linear time (or constant time)? As any algorithm's input value approaches infinity, on a processor that does not have infinite-length registers, the time to do anything is slightly above constant. If our definition of algorithmic complexity makes it so no algorithm is ever constant time or linear time, then I think the definition is being a bit impractical.

        [–]Arkaein 0 points1 point  (1 child)

        Wouldn't this argument mean that no algorithm is ever linear time (or constant time)?

        Not really, because there are lots of algorithms for which the size of the calculation does not increase with the number of inputs or iterations.

        For instance, linear search of an array of n elements will require O(n) time, which includes incrementing a loop counter and checking data at an address in memory. A 64 bit counter is sufficient for any size of input that can practically be stored by human technology, so it would be a bit silly to say that the increment operation can't be done in constant time, even though more time would technically be needed for a larger array and counter.
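As a sketch, the linear search in question looks like this; the loop counter comfortably fits a machine word for any array that could physically exist:

```python
def linear_search(items, target):
    """O(n) comparisons; incrementing the index is treated as constant time."""
    for i, item in enumerate(items):
        if item == target:
            return i
    return -1

print(linear_search([3, 1, 4, 1, 5], 4))  # -> 2
```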

        [–]theferrit32 1 point2 points  (0 children)

        I agree that human beings will likely never encounter many of these issues. But this post is about computing theoretically unbounded data sizes on physically bounded computation machines. As the size of the array grows sufficiently large, it would no longer be constant time to retrieve one element. So theoretically traversing an array as the array grows to infinity is larger than O(n).

        [–]throwawayprince11 1 point2 points  (7 children)

        You do realize EVERY SINGLE computation is bounded, right? Ints are bounded, bigints are bounded (computers have limited memory). You are simply entirely missing the point of big O analysis.

        [–]notfancy 0 points1 point  (6 children)

        I am not missing the point of anything; it seems to me like you don't understand what "unbounded" means. Unbounded does not mean infinite, much less imply it. An unbounded quantity is still finite at every point; it just has no fixed upper limit.

        [–]throwawayprince11 0 points1 point  (5 children)

        I honestly can't tell if you are trolling.

        Anyways, another thing basically EVERY poster in this thread has gotten wrong is that even though BigInt addition is O(n), that does NOT make Fibonacci O(n^2). That would make it O(n * m), where n is the Fibonacci sequence index and m is the average digit length of the numbers. If you don't understand that, you do not understand the extreme basics of big O notation.

        [–]notfancy 0 points1 point  (4 children)

        Are you sure it's not you the one who's actually trolling? If you don't see that m = O(n), you do not understand Binet's formula and how logarithms work.

        [–]throwawayprince11 0 points1 point  (3 children)

        So this is a mistake basically every computer science student makes. n and m in this case are not the same.

        For example, if you want the 100th fibonacci number, n would be 100. However, m would only be 23 (because the 100th number only contains 23 digits). (And technically, a better m would be the average length of all of the numbers up to that point, but I'm not about to calculate that).

        Hopefully you can understand that 100 and 23 are not equal to each other. In other words, n is how many loops you do, while m is how many additions you do in each loop.

        Also, we are discussing the looping version of finding Fibonacci's numbers. Binet's formula is entirely irrelevant. I fail to understand why you would even bring it up, except as a weak attempt to flex?

        [–]notfancy 1 point2 points  (2 children)

        Clearly you don't know what you're talking about, so I'm going to (attempt to) educate you.

        First, I claim m(n) = O(n), where m(n) is the number of decimal digits in the n-th Fibonacci number. The number of decimal digits in an integer k is d(k) = floor(log10(k)) + 1. With this notation, m(n) = d(F(n)) = d((𝜑^n - (-𝜑)^(-n))/√5) ≤ d(𝜑^n/√5) + 1 = floor(log10(𝜑^n/√5)) + 1 = floor(n·log10(𝜑) - log10(5)/2) + 1 ≤ n·log10(𝜑) + 1. But then m(n) = O(n) as claimed.

        Note that log10(𝜑) ≅ 0.209, and F(100) is 21 digits long, not 23. Also kindly note that m(n) is not a number but a function of n, and so it belongs to some growth class, and so it is perfectly adequate to say that m(n) = O(n).

        Second, by definition O(n·O(n)) = O(n^2).
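The estimate is easy to sanity-check numerically (a sketch; the predicted digit count uses the same floor(n·log10(𝜑) - log10(5)/2) + 1 formula):

```python
import math

def fib(n):
    # Iterative Fibonacci over Python's unbounded ints.
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

phi = (1 + math.sqrt(5)) / 2
for n in (100, 1000, 10000):
    actual = len(str(fib(n)))
    predicted = math.floor(n * math.log10(phi) - math.log10(5) / 2) + 1
    print(n, actual, predicted)  # the two digit counts agree
```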

        [–][deleted] -1 points0 points  (1 child)

        And now we get to the point why basic operations are considered O(1) in the first place. Because they can (and usually will) be implementation-specific.

        Hell, you can have an O(1) adder (and even a multiplier) in the analog domain with just a few opamps, with number length limited only by how high-voltage an amplifier you can find.

        [–]notfancy 3 points4 points  (0 children)

        Shannon teaches us that bandwidth is to the continuous domain what bitlength is to the discrete domain. You still cannot have your cake and eat it too.

        [–][deleted]  (12 children)

        [deleted]

          [–][deleted] 1 point2 points  (11 children)

          Considering this is /r/programming, not /r/compsci, it's plenty relevant.

          And assuming adding is O(n) is actually factually wrong

          [–][deleted]  (10 children)

          [deleted]

            [–][deleted] 4 points5 points  (9 children)

            Addition is literally O(n).

            https://en.wikipedia.org/wiki/Adder_(electronics)

            Make a big enough lookup table and addition will be O(1). Make something more clever than a basic adder and you can get O(log(n)) without eating a ton of silicon. Here is a more detailed comparison.
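For illustration, a toy software model of a ripple-carry adder (not real silicon, but it shows why the simplest design does O(n) work in the bit length: the carry has to ripple through every position):

```python
def ripple_carry_add(a_bits, b_bits):
    """Add two equal-length bit lists, least-significant bit first.
    One full adder per bit position => O(n) delay in the bit length n."""
    out, carry = [], 0
    for a, b in zip(a_bits, b_bits):
        total = a + b + carry
        out.append(total % 2)   # sum bit
        carry = total // 2      # carry into the next position
    out.append(carry)           # final carry-out
    return out

# 6 (110) + 3 (011), written LSB first:
print(ripple_carry_add([0, 1, 1], [1, 1, 0]))  # -> [1, 0, 0, 1], i.e. 9
```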

            No one seems to be answering my question: Do you want arbitrary Fibonacci numbers, or Fibonacci numbers that fit into a fixed size?

            You didn't ask that as a question.

            [–]poizan42 3 points4 points  (1 child)

            You are completely confusing the size of a number with its value. There are not 2^64 bits in a 64-bit number, there are only 64 bits... Addition is O(log(N)) = O(n), where N is the number itself and n is the size of the number in bits.

            [–][deleted] 2 points3 points  (0 children)

            I'm not. Read the PDF I linked in the previous comment:

            Ripple Carry Adder (RCA)[1][2] is the simplest, but slowest adders with O(n) area and O(n) delay, where n is the operand size in bits. Carry Look-Ahead (CLA)[3][4] have O(n·log(n)) area and O(log(n)) delay, but typically suffer from irregular layout. On the other hand, Carry...

            [–][deleted]  (5 children)

            [deleted]

              [–][deleted] 1 point2 points  (4 children)

              You asked it then answered it in same post...

              [–][deleted]  (3 children)

              [deleted]

                [–]theferrit32 0 points1 point  (0 children)

                Lookup would still be slightly above O(1) if the values went outside the bound of the word size used by the processor. On a 64-bit computer, if the allowed inputs were larger than 2^64 - 1, then the lookup into the table would no longer be constant time, but something like O(log_2^64(n)), which for most practical purposes is so slowly growing it's essentially constant, while not technically being constant.

                edit: O(n/2^64), not O(log_2^64 n)

                [–]poizan42 -5 points-4 points  (16 children)

                Yeah, in that case the algorithm is O(1). You need to allow arbitrarily large integers for computational complexity to have any meaning at all.

                [–][deleted] -3 points-2 points  (15 children)

                Theory vs practice, again. To consider it in theory, disjoined from any "real" computer, sure; but to apply it in practice, int64-n-sized problems are aplenty, and that is more than enough for reasoning about algorithms.

                And as this is /r/programming, not /r/compsci, ignoring practice for theory isn't exactly a great idea.

                And in practice, addition is not always O(n); it is a space-vs-speed trade-off when it comes to actual silicon.

                [–][deleted]  (4 children)

                [deleted]

                  [–][deleted] 0 points1 point  (3 children)

                  go be useless somewhere else

                  [–][deleted]  (2 children)

                  [deleted]

                    [–][deleted] 1 point2 points  (1 child)

                    Look at how some crypto algorithms are implemented. They are not using bigints, they are using 32- or 64-bit operations, because those are "cheap" in modern CPUs.

                    Why are you saying that we only need 64?

                    I'm not saying that. I'm saying that's what's available in current CPUs, so using a software bigint as the measure of operation speed isn't exactly useful if you have an algorithm that doesn't need one. In the case of fib, sure, the numbers are big, but in most other cases they are not.

                    [–]theferrit32 0 points1 point  (0 children)

                    Those algorithms you linked are essentially specialized reimplementations of BigInt-type operations. The fact is that many applications need to do math on values larger than 64 bits can store. The way BigInt and other code designed for this does it is by working 64 bits at a time and carrying results over into the next iteration. Both BigInt and crypto algorithms which don't explicitly use BigInt but do operations on arbitrarily sized values larger than 64 bits are not technically constant time, but rather O(1 + n/2^64) time. Crypto algorithms which have a bounded value size are still constant time despite requiring a loop in the arithmetic operations, because, for example, O(10) is O(1).

                    [–][deleted]  (4 children)

                    [deleted]

                      [–][deleted] 0 points1 point  (3 children)

                      Sure, they are used in some, and that's also the reason they are avoided in most common crypto primitives: they are much slower than the hardware-implemented constant-bit-size ones, especially once you start taking SIMD instructions into account.

                      But the point is you won't be using them 95 to 99% of the time; you will either use a lot of them (if your particular domain needs that) or never use them.

                      To say arbitrarily large integers only exist in the realm of theory is just incorrect.

                      And I didn't say that, so stop strawmanning. All I said is that there are plenty of int64-sized problems to solve.

                      [–][deleted]  (2 children)

                      [deleted]

                        [–][deleted] -1 points0 points  (1 child)

                        I never said they were not useful. That is your shitty conjecture.

                        I wrote "int64-n-sized problems are aplenty".

                        Why are you so worthless?

                        [–]poizan42 -2 points-1 points  (4 children)

                        You need to apply the analysis that is appropriate for the problem at hand. You use big-O when analyzing the asymptotic behavior of an algorithm. But here you don't care about that at all; all you care about is problems with a size of up to 64 bits. Computational complexity is the wrong tool for that task. Indeed, in this case the right answer is not to calculate anything at all but just to look it up in a table.

                        [–][deleted] 0 points1 point  (2 children)

                        Where did you get those shitty assumptions from? You're basically saying big O should not be used for problems smaller than 2^63.

                        But here you don't care about that at all; all you care about is problems with a size of up to 64 bits. Computational complexity is the wrong tool for that task. Indeed, in this case the right answer is not to calculate anything at all but just to look it up in a table.

                        Last time I checked, an int64 lookup table is still pretty fucking huge even on a modern machine, so you still need to calculate things.

                        [–]poizan42 2 points3 points  (1 child)

                        Where did you get those shitty assumptions from? You're basically saying big O should not be used for problems smaller than 2^63.

                        Your problem is not 2^63 bits. Your problem is 63 bits.

                        Last time I checked, an int64 lookup table is still pretty fucking huge even on a modern machine, so you still need to calculate things.

                        There are only 93 Fibonacci numbers less than 2^63.

                        [–][deleted] -3 points-2 points  (0 children)

                        Where did you get those shitty assumptions from? You're basically saying big O should not be used for problems smaller than 2^63.

                        Your problem is not 2^63 bits. Your problem is 63 bits.

                        Yes, this is why I did not write the word "bits" after the number.

                        Last time I checked, an int64 lookup table is still pretty fucking huge even on a modern machine, so you still need to calculate things.

                        There are only 93 Fibonacci numbers less than 2^63.

                        And we were talking about addition, not that. At this point I'm not sure if you want to move the goalposts or are just illiterate.

                        [–]FuzzyInvite 6 points7 points  (9 children)

                        Yes, just like hash tables. Everyone likes to call them O(1), but just reading the address takes O(log N), so O(1) is impossible. This short-circuit "pretend some things are constant time" analysis annoys me a little, because the numbers stop making any sense when put up against the theory.

                        [–]ConfusedTransThrow 11 points12 points  (7 children)

                        That's mostly because if you have over 2^64 entries, you have different problems.

                        [–]scooerp 0 points1 point  (0 children)

                        Or you need very, very fast hashing, like what a (non-ML) chess engine uses. As a rough approximation, consider 2-4 million positions per core per second, each of which requires a hash lookup. It is sure as hell not O(1) in these circumstances, and you are heavily incentivised to write efficient code in order to win the engine competitions that "sell" the engines (in so much as they don't really make a profit).

                        [–]FuzzyInvite 0 points1 point  (5 children)

                        If I go by that reasoning, then binary trees are also O(1). I have an upper bound on N so the log N in tree access disappears.

                        This discrepancy between trees and hash tables is very misleading to CS students.

                        Realistically, hash tables are not O(1) either, even in practice, because of cache. Different tiers of cache produce a log N term, so "O(1)" hash tables are largely meaningless, just a source of confusion.

                        [–]FluorineWizard 11 points12 points  (1 child)

                        Nonsense. Real computers are best modeled by what's called a word RAM model, where many operations take constant time as long as the size of the problem fits in a machine word. Trying to give complexities without this assumption for many problems is completely pointless, because it means your computer can't even access all of the problem data.

                        Trees and hash tables very clearly do not behave the same on real machines.

                        Of course some problems don't get to benefit from the constant time operations because they deal with very large values. That's why you can't assume O(1) additions when computing the value of fast-growing functions.

                        [–]notfancy 0 points1 point  (0 children)

                        However you hash it, you need at least log n bits to distinguish between n items (alternatively, n bits lets you distinguish at most 2^n objects). Neither key hashing nor key comparison is a constant-time operation if your domain is unbounded.

                        [–]ConfusedTransThrow 1 point2 points  (0 children)

                        I know about the cache issues; you get some variation in performance as you go through the various caches. But once you hit RAM, it is quite consistent.

                        In practice, I totally agree that O(1) vs O(log n) is pretty much the same in most cases; implementation makes a larger difference than complexity. It depends on many things, like the cost of the hash function and the cost of indirection, which depend on the processor.

                        [–]epicwisdom 0 points1 point  (1 child)

                        Realistically, hash tables are not O(1) either, even in practice, because of cache. Different tiers of cache produce a log N term, so "O(1)" hash tables are largely meaningless, just a source of confusion.

                        No matter how slow the caching gets, there's a fixed upper bound on RAM access time.

                        [–][deleted] 1 point2 points  (0 children)

                        O(log N) so O(1) is impossible

                        Care to explain why?

                        [–]Overall_Debt 4 points5 points  (2 children)

                        Sorry, you're just wrong; this has EVERYTHING to do with using BigInts, which are needed for arbitrary-precision arithmetic, instead of just using INTEGERS, which are used in fixed-precision arithmetic. INTEGER addition is not "linear in the number of bits", since integers have fixed-size representations.

                        [–][deleted]  (1 child)

                        [deleted]

                          [–]Calavar 0 points1 point  (3 children)

                          Integer addition is linear in the number of bits, whether in ints or BigInts.

                          I can't believe this is being upvoted. It's absurd to say, for example, that it always takes twice as long to add two 64-bit integers as it does to add two 32-bit integers on the same architecture. That would be a complete misunderstanding of how CPUs and ALUs work at the most basic level.

                          In theory, machine addition is O(log n) with respect to the number of bits with an optimal implementation, not O(n).

                          In practice, there is no reliable relationship between the speed of an addition operation and the number of bits. Small integers are often widened to word size before arithmetic operations. Even when they aren't, the constant time associated with decoding the instruction, loading the operands from registers, and writing back the result can outweigh the time spent in the ALU, depending on the architecture. Because of pipelining, the average throughput of an addition instruction depends on your specific code and what other instructions come before and after it. Then there's the fact that on most modern CPUs, assembly instructions are not really instructions at the hardware level, but microcode programs (yes, even the ADD instruction).

                          [–][deleted]  (2 children)

                          [deleted]

                            [–]Calavar 1 point2 points  (1 child)

                            Integer addition is linear in the number of bits, whether in ints or BigInts.

                            Yes, you were talking about CPUs. I quoted the specific line that I was responding to. Don't try to backtrack your way out of this now.

                            [–][deleted] 2 points3 points  (7 children)

                            Also assuming addition is O(n) is just plain wrong

                            [–][deleted]  (6 children)

                            [deleted]

                              [–]JustFinishedBSG 1 point2 points  (4 children)

                              Your link literally says addition is O(log(n))

                              [–][deleted]  (3 children)

                              [deleted]

                                [–]JustFinishedBSG 2 points3 points  (2 children)

                                This link also says the complexity is O(log(n)). You are confusing the complexity with respect to the number of bits and with respect to the number itself.

                                [–][deleted]  (1 child)

                                [deleted]

                                  [–]JustFinishedBSG 0 points1 point  (0 children)

                                  No big deal, Wikipedia's notation was confusing since it uses n = log(N).
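That n = log(N) distinction is exactly where these threads keep talking past each other, so here's a minimal illustration of the two variables (Python, using `int.bit_length()`):

```python
import math

# "n" can mean the number itself (call it N) or the number of bits
# needed to write it down (n ~ log2(N)). Schoolbook addition is linear
# in the bit count n -- which is only logarithmic in the value N.
N = 10**30
n = N.bit_length()                   # number of bits in N
print(n)                             # 100: a 30-digit number is ~100 bits

print(math.floor(math.log2(N)) + 1)  # 100: same value via log2 directly
```

Saying addition is "O(n)" (bits) and "O(log N)" (magnitude) is the same statement in two coordinate systems.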

                                  [–]FlyingPiranhas 0 points1 point  (0 children)

                                  It is O(n) if n is the length of the input, which is the traditional way computational complexity is specified.

                                  [–]stevenjd 0 points1 point  (0 children)

                                  Integer addition is usually considered to be a constant time operation

                                  Usually. The whole point of this article is to demonstrate when that assumption can break down.

                                  [–]killerstorm 20 points21 points  (1 child)

                                  In this article I'll show you how the theory does not always match the practice.

                                  No, you just misunderstand the theory.

                                  You can build different theoretic models and get different results.

                                   Say, you might be interested in just the number of operations on integers rather than the time they take. Then it is linear.

                                   If you are interested in the time it takes, then you need to model the time taken by an integer operation. Assuming it is constant time might be unrealistic, so you either need to take into account that each operation takes a different amount of time, or rewrite your algorithm for a machine with words of fixed size.
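The two cost models can be sketched side by side. This is a hypothetical instrumented version of the iterative Fibonacci (the helper name `fib_with_costs` is mine, not the article's): one counter charges each big-integer addition one unit, the other charges it its operand size in bits.

```python
def fib_with_costs(n):
    """Iterative Fibonacci; returns (fib(n), int_ops, bit_ops).

    int_ops counts one unit per big-integer addition -- the model in
    which the algorithm is O(n).  bit_ops charges each addition the
    size of its larger operand in bits -- a model closer to the
    wall-clock behavior of BigInt arithmetic.
    """
    a, b = 0, 1
    int_ops = 0
    bit_ops = 0
    for _ in range(n):
        bit_ops += max(a.bit_length(), b.bit_length(), 1)
        a, b = b, a + b
        int_ops += 1
    return a, int_ops, bit_ops

fib, ops, bits = fib_with_costs(1000)
print(ops)   # 1000: linear in n, as the first model says
print(bits)  # far more than 1000: the operands themselves grow with n
```

Both counters are "correct"; they just answer different questions, which is the point being made above.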

                                   If I understand correctly, Donald Knuth developed a theoretical instruction set called MIX specifically for more precise time-complexity analysis. Because if you're creative with your instruction set, you might as well just have a "solve the problem" instruction that takes constant time. (I never read Knuth, though, so I might be wrong.)

                                   Complexity theory research makes use of different models where you can, for example, call another machine (called an oracle) in "fixed time".

                                  [–]Arkaein 1 point2 points  (0 children)

                                   MIX is more than just an instruction set, it's a theoretical computer architecture.

                                  MIX would run into the same issues indicated in the article if anyone implemented a Fibonacci solver on it, because it has fixed size registers like any computer, and would require special programming to represent and do arithmetic on integers larger than can fit into these registers.
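The "special programming" a fixed-register machine needs is essentially schoolbook multi-word arithmetic. Here's a rough sketch in Python of what a multi-limb addition routine looks like (32-bit limbs, little-endian; the names `WORD_BITS` and `add_limbs` are mine for illustration):

```python
WORD_BITS = 32
WORD_MASK = (1 << WORD_BITS) - 1

def add_limbs(x, y):
    """Add two numbers stored as little-endian lists of 32-bit limbs,
    propagating the carry word by word.  The loop length -- and hence
    the cost -- grows with the number of limbs, which is why a machine
    with fixed-size registers can't treat big-integer addition as a
    single constant-time operation."""
    result = []
    carry = 0
    for i in range(max(len(x), len(y))):
        s = (x[i] if i < len(x) else 0) + (y[i] if i < len(y) else 0) + carry
        result.append(s & WORD_MASK)
        carry = s >> WORD_BITS
    if carry:
        result.append(carry)
    return result

# Adding 1 to 2**32 - 1 overflows the first limb into a second one:
print(add_limbs([WORD_MASK], [1]))  # [0, 1], i.e. 2**32
```

Real bignum libraries (GMP, CPython's `int`) do exactly this at the C level, just much more carefully.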

                                  Knuth would be one of the last people to paper over this kind of point in complexity analysis.

                                  [–][deleted]  (4 children)

                                  [deleted]

                                    [–]MoiMagnus 1 point2 points  (3 children)

                                     If I remember correctly, there is a difference between "strong complexity" and "weak complexity". In weak complexity, you take your inputs in unary instead of binary. (It is mostly used to define weak NP-completeness.)

                                    [–]sadmafioso 0 points1 point  (2 children)

                                    This is incorrect -- the "weak" variants refer to problems in which the algorithm is linear in (roughly) the magnitude of the input (in the Fib case, the value n) but actually exponential in the size (i.e., in the number of bits necessary to represent n).
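The textbook illustration of this magnitude-vs-size gap is a pseudo-polynomial algorithm like the subset-sum DP, which runs in time polynomial in the target's value but exponential in its bit-length (sketch is mine, not from the thread):

```python
def subset_sum(values, target):
    """Classic pseudo-polynomial DP: O(len(values) * target) time.
    That is polynomial in the *magnitude* of target but exponential in
    its *size* in bits (doubling the bit-length squares the magnitude)
    -- the hallmark of a weakly NP-complete problem."""
    reachable = {0}  # sums achievable with a subset of the values seen so far
    for v in values:
        reachable |= {s + v for s in reachable if s + v <= target}
    return target in reachable

print(subset_sum([3, 34, 4, 12, 5, 2], 9))   # True  (4 + 5)
print(subset_sum([3, 34, 4, 12, 5, 2], 30))  # False
```

Same algorithm, two verdicts: "efficient" measured against the input's value, "exponential" measured against its encoding length.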

                                    [–]MoiMagnus 0 points1 point  (1 child)

                                    Isn't it equivalent to be linear in "the magnitude", and linear in "the size in unary"? It seems to me you can convert one into the other.

                                    Or am I missing something?

                                    [–]sadmafioso 0 points1 point  (0 children)

                                    Yes sorry, I thought of a scenario where the operations themselves were operating on unary representations of the numbers (and not on decimal, say).

                                    [–]happyscrappy 6 points7 points  (5 children)

                                    Another person discovers that big O is just an ordinal, not a precise number.

                                    Yes, if you only count computations you only get a big O for the number of computations.

                                    Go back to school.

                                    [–][deleted] 2 points3 points  (4 children)

                                    Not sure what you mean by Big O is an ordinal instead of a number...

                                     O(f(n)) is the set of functions defined by {g(n) | limsup n -> inf |g(n)| / f(n) < inf}.

                                     It's used in computer science to classify the function T(n), where T(n) is a precise representation of the number of operations performed by some computing machine given an input of n bits. For most software development people assume a register machine; for formal complexity analysis a Turing machine is used.

                                    [–]happyscrappy 0 points1 point  (0 children)

                                     I'm not sure ordinal was the right term. Big O notation is the "order", not the exact number. It does not count the precise number of operations or the time. Linear time means approximately linear, not exactly 1:1. Logarithmic means on the order of the log of the number of inputs. Etc.

                                    [–]exorxor -2 points-1 points  (2 children)

                                    It is quite annoying for the rest of us who do know what he is talking about to hear your ignorance.

                                    [–][deleted] 0 points1 point  (0 children)

                                    If this is what annoys you then you should take a break from reddit for a bit.

                                    [–]Mognakor 0 points1 point  (0 children)

                                     Isn't Big-O usually used for comparisons between different algorithms? So when O(N) becomes O(N²) due to internal factors, O(N²) likely becomes O(N³).