"Clean" Code, Horrible Performance in Rust

chrs@programming.dev · 6 months ago

"Clean" Code, Horrible Performance in Rust

gedhrel@lemmy.world · 6 months ago

Casey’s video is interesting, but his example is framed as moving from 35 cycles/object to 24 cycles/object being a 1.5x speedup.

Another way to look at this is, it’s a 12-cycle speedup per object.

If you’re writing a shader or a physics sim this is a massive difference.

If you’re building typical business software, it isn’t; that 10,000-line monster method does crop up, and it’s a maintenance disaster.

I think extracting “clean code principles lead to a 50% cost increase” is a message that needs taking with a degree of context.

Alex@programming.dev · 6 months ago

Did author knows about difference between static and dynamic dispatch? 🤦🏻‍♂️

Turun@feddit.de · 6 months ago

It would be interesting to see if an iterator instead of a manual for loop would increase the performance of the base case.

My guess is not, because the compiler should know they are equivalent, but would be interesting to check anyway.

zweieuro@lemmy.world · 6 months ago

Correct me if I am wrong but isn’t “loop unrolling/unwinding” something that the c++ and rust compilers do? Why does the loop here not get unwound?

Giooschi@lemmy.world · 6 months ago

Loop unrolling is not really the speedup, autovectorization is. Loop unrolling does often help with autovectorization, but is not enough, especially with floating point numbers. In fact the accumulation operation you’re doing needs to be associative, and floating point numbers addition is not associative (i.e. (x + y) + z is not always equal to (x + (y + z)). Hence autovectorizing the code would change the semantics and the compiler is not allowed to do that.

"Clean" Code, Horrible Performance in Rust

"Clean" Code, Horrible Performance in Rust

https://chrs.dev/blog/clean-code-rust/