    modify complex template for elementwise ops (#33071) · dbc08d69
    Committed by chentianyu03
    * modify complex template for elementwise ops
    
    * modify mul, div grad struct
    
    * add complex template for CudaShuffleDownSync and CudaShuffleXorSync funcs and fix the bug when deleting cuda<9000
    
    * fix shuffle func args bug
    
    * fix shuffle func args bug
    
    * fix shuffle func args bug
cuda_device_function.h 8.3 KB