[xla:runtime] Use structs to define gpu runtime custom calls
LLVM seems to have trouble inlining custom call handlers that are defined through function pointers. When the handler is defined as a struct, the custom call body is typically fully inlined into the CustomCallHandler template instantiation, which generates better code.

PiperOrigin-RevId: 481300474
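A minimal sketch of the difference, for illustration only. The names below (FnPtrHandler, StructHandler, Add, AddFn) are hypothetical stand-ins and are not the actual XLA runtime types; the real CustomCallHandler template is more involved. The point is only that a handler templated on a struct knows its call target at compile time, while a handler holding a function pointer makes an indirect call that the optimizer has to see through before it can inline.

```cpp
#include <cstdint>

// Hypothetical handler that stores a function pointer. The call through
// fn_ is indirect, so inlining depends on the optimizer proving which
// function the pointer refers to.
class FnPtrHandler {
 public:
  using Fn = int64_t (*)(int64_t, int64_t);
  explicit FnPtrHandler(Fn fn) : fn_(fn) {}
  int64_t Call(int64_t a, int64_t b) const { return fn_(a, b); }

 private:
  Fn fn_;
};

// Hypothetical handler templated on a callable struct. The target of
// Impl::operator() is part of the type, so the body is directly visible
// inside each template instantiation.
template <typename Impl>
class StructHandler {
 public:
  int64_t Call(int64_t a, int64_t b) const { return Impl{}(a, b); }
};

// Custom call body written as a free function (function pointer style).
int64_t AddFn(int64_t a, int64_t b) { return a + b; }

// The same custom call body written as a struct with operator().
struct Add {
  int64_t operator()(int64_t a, int64_t b) const { return a + b; }
};

// Both variants compute the same result; only the dispatch differs.
int64_t CallViaPointer(int64_t a, int64_t b) {
  return FnPtrHandler(&AddFn).Call(a, b);
}

int64_t CallViaStruct(int64_t a, int64_t b) {
  return StructHandler<Add>().Call(a, b);
}
```

Whether the function pointer variant is actually inlined depends on the compiler and optimization level; this sketch does not measure that, it only shows the two ways of defining the handler.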