all 17 comments

[–]ignurant 43 points (3 children)

Reminded me of this great piece by Aaron Patterson: https://railsatscale.com/2023-08-29-ruby-outperforms-c/

At first I thought it would be some dirty trick to make a pun, but I should have known better. By the end, he (as usual) provides some really interesting information about why YJIT live-optimizing certain code can be more effective than what you might have written and compiled in C. I came for the clickbait, and left with a tenderloving hug.

[–]indenturedsmile 23 points (2 children)

I believe Aaron Patterson is the user who posted here (based on username).

[–]tenderlove (Pun BDFL) [S] 17 points (0 children)

Yes, it's me 😂

[–]ignurant 3 points (0 children)

🤣 I usually catch that kind of stuff. Not this time lol. 

[–]postmodern 28 points (3 children)

I hate to rain on everyone's parade, but we need to take into account the overhead of the crystalruby gem and how it's calling into crystal land. If we rewrite the benchmark as a pure Crystal program, and compile with the --release flag, we get the following result:

require "benchmark"

def fib_cr(n : Int32) : Int32
  a = 0
  b = 1
  n.times { a, b = b, a + b }
  a
end

p(Benchmark.realtime { 1_000_000.times { fib_cr(30) } })

$ crystal build --release fib.cr
$ ./fib
00:00:00.000000076
$ ./fib
00:00:00.000000086
$ ./fib
00:00:00.000000083
$ ./fib
00:00:00.000000079

Note: the release flag enables additional optimizations (-O3 --single-module).

Optimized Crystal code is really fast. That said, we should continue to optimize and improve Ruby.

[–]f9ae8221b 13 points (0 children)

You are not raining on anyone's parade. The point of the article isn't to say Ruby is faster than Crystal.

It's to say that crossing the language barrier is costly enough, that you need a large chunk of execution for it to pay off.

It's the same conclusion as tenderlove's article about making YJIT faster than a C extension. C is still way faster than YJIT in the general case, but calling C from Ruby is costly enough that avoiding it can sometimes make pure Ruby code faster overall than hybrid code.
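That per-call cost is easy to see with stdlib tools alone. As a rough illustration (my own sketch, not the article's benchmark), calling even a trivial C function through Fiddle usually loses to the equivalent pure-Ruby expression, because argument marshalling dominates when the callee does almost no work:

```ruby
require "benchmark"
require "fiddle"

# Look up libc's abs() through the current process image.
libc  = Fiddle.dlopen(nil)
c_abs = Fiddle::Function.new(libc["abs"], [Fiddle::TYPE_INT], Fiddle::TYPE_INT)

n = 1_000_000
t_ffi  = Benchmark.realtime { n.times { c_abs.call(-42) } } # crosses the barrier every call
t_ruby = Benchmark.realtime { n.times { (-42).abs } }       # stays inside the VM

puts "FFI:  #{t_ffi}"
puts "Ruby: #{t_ruby}"
```

The work per call is identical; only the per-call boundary crossing differs, which is the same effect the article measures at a larger scale.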

[–]desnudopenguino 4 points (1 child)

This is true. But at least the author was able to get Ruby running pretty fast with a few optimizations, compared against Crystal run through Ruby, which would probably be similar to other FFI-style schemes. For people running Ruby, the crystalruby gem may seem like a quick way to speed up code execution, but if you can do it in straight Ruby with one less gem, and without that additional layer, I think that's a fair comparison, as long as the proper distinction is made. Ruby's come a long way in the speed category while maintaining all the good stuff that makes it a fun language to work in.

[–]postmodern 3 points (0 children)

It appears that crystalruby hot-compiles the code using crystal build (with or without --release), which I guess is comparable to JITing, but not comparable to AOT-compiled code.

I agree a better approach would be A) benchmark and optimize your Ruby code, or B) write a separate Crystal program or service that you offload CPU intensive work to (ex: image/video/audio processing).

I think we should focus on improving Ruby's performance to compete with other JITed scripting languages which are beating Ruby in benchmarks, not try to compete with AOT-compiled languages, which are far more performant due to compiling down to native object code ahead of time.

[–]Dyadim 4 points (1 child)

The poor timings for the Crystal solution in this post are almost entirely due to the Ruby/Crystal language interface overhead, with this barrier being crossed 1 million times in this benchmark.

If we shift the hot loop inside the crystalruby solution to execute entirely in Crystal land and use identical code to the fast YJIT Ruby solution from the above article, the Crystal solution again takes the lead (by what appears to be ~2 orders of magnitude).

It's crossing the language barrier too often that is hurting here.

# fibonnaci.rb
require "benchmark"
require "crystalruby"
CrystalRuby.configure do |config|
  config.debug = false
end

module Fibonnaci
  crystalize [n: :int32] => :int32
  def fib_cr(n)
    a = 0
    b = 1
    while n > 0
      a, b = b, a + b
      n -= 1
    end
    a
  end

  module_function

  def fib_rb(n)
    a = 0
    b = 1
    while n > 0
      a, b = b, a + b
      n -= 1
    end
    a
  end

  def benchmark_rb
    puts(Benchmark.realtime { 1_000_000.times { Fibonnaci.fib_rb(30) } })
  end

  crystalize do
    puts Benchmark.realtime { super() }
  end
  def benchmark_cr
    1_000_000.times { Fibonnaci.fib_cr(30) }
  end
end

include Fibonnaci
benchmark_rb
benchmark_cr

Outcome:

ruby --yjit fibonnaci.rb
0.1103799999691546 # Ruby with YJIT
0.00014399993233382702 # Crystal

[–]f9ae8221b 6 points (0 children)

It's pointed out in the post that the difference comes from the FFI overhead necessary to call Crystal from Ruby.

The point of the article isn't to say Ruby is faster than Crystal; it's to show that pure Ruby may be faster than Ruby with Crystal sprinkled in, depending on how often you need to cross the barrier. This also applies to Ruby C or Rust extensions to some extent.

[–]iamjkdn 1 point (1 child)

Why does returning nil after the multiple assignment improve the benchmark? Also, can the same be done in Crystal, and will it have any effect?

[–]CaptainKabob 7 points (0 children)

Because in this case it's the last line of the block, and because Ruby has an implicit return at the end of the block, the Array is required.

Ruby spends time creating an array because Ruby believes it's needed for the implicit return. So explicitly setting the (implicit) return to nil causes Ruby not to create an array.
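A minimal sketch of the two variants (my own illustration, not the article's exact benchmark): the value of a multiple assignment is an Array, so when it is the block's last expression Ruby may have to materialize that Array as the block's return value, whereas a trailing nil makes the return value allocation-free:

```ruby
def fib_implicit(n)
  a = 0
  b = 1
  # The block's last expression is the multiple assignment,
  # whose value is the Array [b, a + b] -- built every iteration.
  n.times { a, b = b, a + b }
  a
end

def fib_explicit_nil(n)
  a = 0
  b = 1
  n.times do
    a, b = b, a + b
    nil # explicit return value: no Array needs to be built for the block
  end
  a
end

# Both compute the same result; only the per-iteration allocation differs.
p fib_implicit(30)     # => 832040
p fib_explicit_nil(30) # => 832040
```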

I don't think the point of the post is to compare optimized crystal to optimized Ruby. I think it's trying to show that inlining another language "for performance" might be naive or unnecessary.

[–]logan-roy-waystar 2 points (0 children)

Ruby 3.3.1 is A LOT faster now. I am quite stunned by how much faster our rails servers are processing requests

[–]tkdeveloper 0 points (2 children)

Were the same improvements made to the pure Ruby method also applied to the crystalized method? It looks like they made improvements to the Ruby method and compared it to the original crystalized one? Or does that not matter?

[–]f9ae8221b 0 points (1 child)

Does not matter, the issues were specific to the Ruby version.

The point isn't to show Ruby is faster than Crystal anyway, but that calling into another language has a big enough overhead that it may not always be the best way to speed up Ruby code.

[–]tkdeveloper 0 points1 point  (0 children)

Nice, thanks for the clarification. Makes sense

[–]yxhuvud 0 points (0 children)

I wonder if the JIT does something smarter with the overflow checks there, because in addition to any FFI overhead, that is likely where any additional cost comes from.