codegen: add a pass for late conversion of known modify ops to call atomicrmw #57010

vtjnash · 2025-01-10T01:02:22Z

The ExpandAtomicModify can recognize our pseudo-intrinsic julia.atomicmodify and convert it into some of known atomicrmw expressions, or simplify it with more inlining, as applicable. Ideally we could get this pass upstreamed, since there's nothing specific to julia about this pass, and LLVM's IR cannot express this quite correctly without making it a new intrinsic.

This ensures that now our @atomic modify is as fast as Threads.Atomic!

julia> @code_llvm Threads.atomic_add!(r, 10)
; Function Signature: atomic_add!(Base.Threads.Atomic{Int64}, Int64)
;  @ atomics.jl:307 within `atomic_add!`
define i64 @"julia_atomic_add!_2680"(ptr noundef nonnull align 8 dereferenceable(8) %"x::Atomic", i64 signext %"v::Int64") #0 {
top:
; ┌ @ Base_compiler.jl:94 within `modifyproperty!`
   %0 = atomicrmw add ptr %"x::Atomic", i64 %"v::Int64" acq_rel, align 8
; └
; ┌ @ Base_compiler.jl:54 within `getproperty`
   ret i64 %0
; └
}

base/atomics.jl

gbaraldi · 2025-01-10T02:28:50Z

Why is there such a large aotcompile.cpp diff? Looks unrelated

src/aotcompile.cpp

vtjnash · 2025-01-13T21:48:47Z

Why is there such a large aotcompile.cpp diff? Looks unrelated

It is mostly support code for this, since if you don't have the IR emitted for the target, then it has to use a loop&call, which isn't want people want to see, so we need to make sure to emit the target code too into all of the correct compile units. We could land some of it separately, but it wouldn't do anything (be mostly untested) until this landed

…tomicrmw The ExpandAtomicModify can recognize our pseudo-intrinsic julia.atomicmodify and convert it into some of known atomicrmw expressions, or simplify it with more inlining, as applicable. This ensures that now our `@atomic` modify is as fast as `Threads.Atomic` for the cases we implement now.

oscardssmith reviewed Jan 10, 2025

View reviewed changes

base/atomics.jl Show resolved Hide resolved

oscardssmith added performance Must go faster multithreading Base.Threads and related functionality atomics labels Jan 10, 2025

gbaraldi reviewed Jan 10, 2025

View reviewed changes

src/aotcompile.cpp Show resolved Hide resolved

vtjnash force-pushed the jn/atomic-modify-opt branch from 8cd489b to 666985c Compare January 17, 2025 14:31

vtjnash added 2 commits January 17, 2025 14:32

remove deprecated Threads.Atomics

c98b43f

vtjnash force-pushed the jn/atomic-modify-opt branch from 666985c to c98b43f Compare January 17, 2025 14:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

codegen: add a pass for late conversion of known modify ops to call atomicrmw #57010

codegen: add a pass for late conversion of known modify ops to call atomicrmw #57010

vtjnash commented Jan 10, 2025

gbaraldi commented Jan 10, 2025

vtjnash commented Jan 13, 2025

codegen: add a pass for late conversion of known modify ops to call atomicrmw #57010

Are you sure you want to change the base?

codegen: add a pass for late conversion of known modify ops to call atomicrmw #57010

Conversation

vtjnash commented Jan 10, 2025

gbaraldi commented Jan 10, 2025

vtjnash commented Jan 13, 2025