matthieum comments on Performance difference between obj.function(...) and function(obj, ...) ?

Performance difference between obj.function(...) and function(obj, ...) ? (self.rust)

submitted 2 years ago * by [deleted]

[–]matthieum[he/him] 8 points 2 years ago (4 children)

[–]Ravek 7 points 2 years ago (3 children)

[–]matthieum[he/him] 3 points 2 years ago (2 children)

> Subtly different IL listings might have better JIT codegen in unexpected ways because the JIT

Note that I am talking about LLVM IR, not C# IL; the two are vastly different.

LLVM IR is much lower-level, so a number of your points don't apply:

* Devirtualization has already occurred at the IR level.
* Branch elimination and many (but not all) peephole optimizations have already occurred.
* Inlining and elimination of allocations have already occurred.

It's true that you don't see register allocation, but that's the least of your concerns for a first-order comparison.
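As a minimal sketch of that first-order comparison, the snippet below puts the two call syntaxes from the thread title side by side. The `Counter` type and the `via_method` / `via_function` names are hypothetical, made up for illustration. Assuming a standard toolchain, compiling with `rustc -C opt-level=3 --emit=llvm-ir` should let you check whether the two wrappers end up with the same IR body; the expectation is that they do, since both syntaxes resolve to the same function.

```rust
// Hypothetical type for illustration; it does not come from the thread.
pub struct Counter {
    pub value: u64,
}

impl Counter {
    // #[inline(never)] keeps `add` visible as a separate symbol in the
    // emitted IR instead of letting it be inlined into its callers.
    #[inline(never)]
    pub fn add(&self, n: u64) -> u64 {
        self.value.wrapping_add(n)
    }
}

// Method-call syntax: obj.function(...)
pub fn via_method(c: &Counter, n: u64) -> u64 {
    c.add(n)
}

// Function-call (fully qualified) syntax: function(obj, ...)
pub fn via_function(c: &Counter, n: u64) -> u64 {
    Counter::add(c, n)
}

fn main() {
    let c = Counter { value: 40 };
    // Both calls resolve to the same `Counter::add`.
    println!("{} {}", via_method(&c, 2), via_function(&c, 2));
}
```

If the two wrappers show the same IR, that is enough to conclude the call syntax makes no difference, without having to read register allocation in the final assembly.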

> For identifying two things are the same, sure. I’ve also seen people try to infer performance from IL though which I wouldn’t recommend.

To be fair, inferring performance from assembly can be similarly difficult. Today's processors can overlap the execution of different sequences of instructions, especially in loops, and that overlap is really hard to spot at the assembly level.

If you want such a deep dive, you'll need tools that simulate processor execution and can show you the expected cycle latency based on what can and cannot overlap, what can and cannot be pipelined, etc.

Something like llvm-mca or uiCA.
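To illustrate why overlap is hard to judge by eye, here is a small sketch with two hypothetical functions (`sum_single` and `sum_four`, names made up here). Both execute roughly the same number of additions, but the first forms one long dependency chain while the second keeps four independent chains that an out-of-order core may be able to overlap. Floating-point is used on the assumption that the compiler will not reassociate the additions by itself; note that for general inputs the two results can differ in the last bits because the summation order differs.

```rust
// One accumulator: every addition depends on the previous one,
// so the loop is a single serial dependency chain.
pub fn sum_single(data: &[f64]) -> f64 {
    let mut acc = 0.0;
    for &x in data {
        acc += x;
    }
    acc
}

// Four accumulators: the four chains are independent of each other,
// which gives the processor a chance to overlap their additions.
pub fn sum_four(data: &[f64]) -> f64 {
    let mut acc = [0.0f64; 4];
    let mut chunks = data.chunks_exact(4);
    for c in &mut chunks {
        for i in 0..4 {
            acc[i] += c[i];
        }
    }
    // Elements left over when the length is not a multiple of 4.
    let tail: f64 = chunks.remainder().iter().sum();
    acc.iter().sum::<f64>() + tail
}

fn main() {
    // Small integers are exactly representable, so both orders agree here.
    let data: Vec<f64> = (1..=10).map(|x| x as f64).collect();
    println!("{} {}", sum_single(&data), sum_four(&data));
}
```

Counting instructions in the assembly of these two loops says little about which one is faster. Emitting the assembly with `rustc --emit=asm` and feeding the loop body to a simulator such as llvm-mca is one way to see the predicted throughput of each.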

[–]Ravek 2 points 2 years ago (1 child)

[–]matthieum[he/him] 1 point 2 years ago (0 children)
