- 15 Jun, 2022 3 commits
-
-
Committed by Guoxia Wang
-
Committed by zhouweiwei2014
* add some kernel(csr*dense->csr, dense*dense->csr) of SparseTensor matmul * fix CI * fix CI * fix comment * fix comment
-
Committed by Yiqun Liu
Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to support large tensor. (#43506) * Change some data type from int to int64_t in GetGpuLaunchConfig1D to support large tensor. * Use int64_t in ElementwiseKernel as index type to support large tensor.
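The reason a 32-bit index breaks on large tensors can be sketched in a few lines. This is a minimal illustration, not Paddle's actual `GetGpuLaunchConfig1D`; the helper names `wrap_int32` and `launch_config_1d` and the default of 512 threads per block are assumptions made for the example.

```python
def wrap_int32(value):
    """Reinterpret an integer as a signed 32-bit value (two's complement)."""
    value &= 0xFFFFFFFF
    return value - 2**32 if value >= 2**31 else value


def launch_config_1d(numel, threads_per_block=512):
    """Hypothetical helper: return (blocks, threads) covering numel elements.

    Python integers are arbitrary precision, so this plays the role of the
    int64_t arithmetic described in the commit message.
    """
    if numel <= 0:
        raise ValueError("numel must be positive")
    threads = min(threads_per_block, numel)
    blocks = (numel + threads - 1) // threads  # ceiling division
    return blocks, threads


numel = 3 * 1024**3  # 3 Gi elements, more than INT32_MAX
print(wrap_int32(numel))        # what a 32-bit int index would hold
print(launch_config_1d(numel))  # computed without overflow
```

With a 32-bit index the element count wraps to a negative number, so any block count or bounds check derived from it is wrong; wide arithmetic avoids that.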
-
- 13 Jun, 2022 1 commit
-
-
Committed by zyfncg
* fix stride_slice bug * fix bug
-
- 10 Jun, 2022 1 commit
-
-
Committed by Wilber
-
- 07 Jun, 2022 3 commits
-
-
Committed by shixingbo
-
Committed by Wilber
-
Committed by niuliling123
-
- 06 Jun, 2022 1 commit
-
-
Committed by niuliling123
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 04 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 31 May, 2022 1 commit
-
-
Committed by xiongkun
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 * make EInsumOP support bf16 * add unittest for BF16 * add condition for test_BF16 * fix bugs * fix
-
- 30 May, 2022 1 commit
-
-
Committed by limingshu
* 1st commit * fix useless change in header transpose_kernel_h file * add sync
-
- 27 May, 2022 1 commit
-
-
Committed by zyfncg
* refactor the optional tensor * remove optional<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
- 26 May, 2022 1 commit
-
-
Committed by YuanRisheng
-
- 25 May, 2022 1 commit
-
-
Committed by Leo Chen
* fix maybe-uninitialized warning * fix compile * fix xpu compile * fix npu compile * fix infer compile * fix compile * fix compile
-
- 20 May, 2022 3 commits
-
-
Committed by niuliling123
-
Committed by Leo Chen
* use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output * add flags to control compute type * default to false * add unit test * default to true
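Why the compute type matters for fp16 inputs can be shown with a small accumulation experiment. The sketch below does not call cuBLAS; it emulates half and single precision rounding with the standard `struct` module, and the helper names `to_fp16`, `to_fp32`, and `dot` are assumptions made for illustration.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def dot(a, b, round_acc):
    """Dot product of fp16 inputs; round_acc picks the accumulator precision."""
    acc = 0.0
    for x, y in zip(a, b):
        acc = round_acc(acc + to_fp16(x) * to_fp16(y))
    return to_fp16(acc)  # the output is stored as fp16 in both cases


ones = [1.0] * 4096
print(dot(ones, ones, to_fp16))  # fp16 accumulator
print(dot(ones, ones, to_fp32))  # fp32 accumulator, fp16 output
```

With an fp16 accumulator the running sum stops growing at 2048, because 2049 is not representable in half precision and rounds back to 2048. Accumulating in fp32 and rounding only the final result gives the exact 4096.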
-
Committed by YuanRisheng
-
- 19 May, 2022 1 commit
-
-
Committed by zyfncg
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * fix bug of higher order derivative * fix bug of FillZeroForEmptyGradInput * remove redundant vector in grad_node * fix bug of test_deformable_conv_v1_op * fix bug of test_deformable_conv_v1_op * some refactor
-
- 16 May, 2022 1 commit
-
-
Committed by niuliling123
-
- 12 May, 2022 1 commit
-
-
Committed by Shuangchi He
-
- 10 May, 2022 2 commits
-
-
Committed by BrilliantYuKaimin
* Create elementwise_heaviside_op.cc * add ElementwiseHeavisideFunctor * Create test_elementwise_heaviside_op.py * add the Python interface for heaviside * add heaviside in white list * add the signature for heaviside * add the kernel for heaviside * add the gradient kernel for heaviside * register the heaviside gradient * adjust code format * Update elementwise_sig.cc * add heaviside in __all__ * Update heaviside docs * Update math.py * Update math.py * Update math.py
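For readers unfamiliar with the operator, the elementwise Heaviside step function and its gradient can be sketched in plain Python. This is a scalar reference under the usual two-argument definition (0 for negative `x`, `y` at `x == 0`, 1 for positive `x`), not the kernel added by the commit; the function names and the NaN pass-through are assumptions of this sketch.

```python
def heaviside(x, y):
    """Two-argument Heaviside step: y supplies the value at x == 0."""
    if x != x:  # NaN input: passed through (an assumption of this sketch)
        return x
    if x < 0:
        return 0.0
    if x > 0:
        return 1.0
    return y


def heaviside_grad(x, y, dout):
    """Gradient of the piecewise definition above, returned as (dx, dy).

    The function is flat in x wherever it is differentiable, so dx is 0.
    The output depends on y only at x == 0, so dy is dout there and 0 elsewhere.
    """
    dx = 0.0
    dy = dout if x == 0 else 0.0
    return dx, dy


xs = [-2.0, 0.0, 3.5]
print([heaviside(x, 0.5) for x in xs])
print([heaviside_grad(x, 0.5, 1.0) for x in xs])
```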
-
Committed by shixingbo
-
- 09 May, 2022 1 commit
-
-
Committed by niuliling123
-
- 01 May, 2022 1 commit
-
-
Committed by Lijunhui
-
- 28 Apr, 2022 1 commit
-
-
Committed by FlyingQianMM
* set device id of Place() to get GPUContext needed by LimitGridDim in ElemwiseGradBroadcast * fix code style
-
- 27 Apr, 2022 1 commit
-
-
Committed by zyfncg
* optimize performance of dygraph * optimize performance of dygraph and elementwise_add * optimize the trace op * fix bug * fix bug * fix unittest bug * fix code format
-
- 25 Apr, 2022 2 commits
-
-
Committed by limingshu
* change sequential logic * change some quotes * add some notations * change wrong note style.
-
Committed by Chen Weihang
-
- 18 Apr, 2022 2 commits
-
-
Committed by Lijunhui
-
Committed by zhangkaihuo
-
- 16 Apr, 2022 1 commit
-
-
Committed by 王明冬
-
- 14 Apr, 2022 2 commits
-
-
Committed by Lijunhui
* register elementwise_xxx
-
Committed by Chen Weihang
* change dispatch to visit * resolve conflict
-
- 13 Apr, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 12 Apr, 2022 2 commits
-
-
Committed by Lijunhui
* init commit no push * collect compile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: Chen Weihang <chenweihang@baidu.com>
-
Committed by FlyingQianMM
add an inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
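The inner loop mentioned here is commonly written as a grid-stride loop, in which each thread handles several elements when the launch has fewer threads than elements. The following is a sequential Python simulation of that indexing pattern only; it is not the `CUDA_KERNEL_LOOP` macro itself, and the function names are hypothetical.

```python
def grid_stride_indices(thread_id, total_threads, n):
    """Indices one simulated thread visits: tid, tid + stride, tid + 2*stride, ..."""
    return range(thread_id, n, total_threads)


def fill_constant(n, value, blocks, threads_per_block):
    """Simulate a fill kernel launched with blocks * threads_per_block threads."""
    out = [None] * n
    total_threads = blocks * threads_per_block
    for tid in range(total_threads):  # on a GPU these iterations run in parallel
        for i in grid_stride_indices(tid, total_threads, n):
            out[i] = value
    return out


# 4 simulated threads cover 10 elements; no element is skipped.
print(list(grid_stride_indices(1, 4, 10)))
print(fill_constant(10, 0.0, blocks=2, threads_per_block=2))
```

Without the inner loop, the same launch would initialize only the first 4 elements, which is the kind of failure that appears when the data is larger than the launched grid.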
-
- 07 Apr, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 03 Apr, 2022 1 commit
-
-
Committed by FlyingQianMM
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
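Limiting the grid dimension amounts to clamping each requested dimension to what the device allows. The sketch below is an assumption-laden illustration, not the `LimitGridDim` in `gpu_launch_config.h`: the limits are the commonly documented CUDA maxima (2^31 - 1 in x, 65535 in y and z), whereas real code would query them from the device, and `limit_grid_dim` and `grid_for` are hypothetical names.

```python
# Commonly documented CUDA grid limits; real code should query the device.
MAX_GRID_DIM = (2**31 - 1, 65535, 65535)


def limit_grid_dim(grid, max_grid=MAX_GRID_DIM):
    """Clamp each requested grid dimension to the device maximum."""
    return tuple(min(g, m) for g, m in zip(grid, max_grid))


def grid_for(numel, block=256):
    """Blocks needed to cover numel elements with a 1D block of the given size."""
    return ((numel + block - 1) // block, 1, 1)


print(limit_grid_dim(grid_for(10**6)))       # already within the limits
print(limit_grid_dim((100000, 100000, 1)))   # y is clamped to 65535
```

A clamped grid covers every element only if the kernel also loops over the remaining indices, so the two changes usually go together.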
-