- 23 5月, 2022 1 次提交
-
-
由 shixingbo 提交于
-
- 20 5月, 2022 5 次提交
-
-
由 niuliling123 提交于
-
由 Leo Chen 提交于
* use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output * add flags to control compute type * default to false * add unit test * default to true
-
由 YuanRisheng 提交于
-
由 Weilong Wu 提交于
-
由 xiaoguoguo626807 提交于
* add tril_indices cpu kernal * modify tril_indice cpu op * modify bug * modify bug * add tril_indices python api * add tril_indices python api * resolve conflict * add tril_indices test * modify details * add tril_indices.cu * pythonapi pass * save tril_indices * CPU tril_indices pass * delete vlog * modify test_tril_indices_op.py * delete tril_indices_kernel.cc.swp * delete tril_indice.cu * modify code style * add newline in creation.py * modify creation.py linux newline * delete annotation * check code style * check .py style add final_state?? * modify code style * add gpu_tril_indices * modify gpu_compiled_juage * modify gpu judge * code style * add test example * modify english document modify english document modify english document modify document modify document * modify pram name * modify pram name * modify pram * reduce test ex
-
- 19 5月, 2022 3 次提交
-
-
由 zyfncg 提交于
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * fix bug of higher order derivative * fix bug of FillZeroForEmptyGradInput * remove redundant vector in grad_node * fix bug of test_deformable_conv_v1_op * fix bug of test_deformable_conv_v1_op * some refacotr
-
由 zyfncg 提交于
* remove shared_storage * fix bug * fix rnn bug
-
由 Chen Weihang 提交于
* refine enforce code * refine enforce code * fix compile failed * fix infrt failed
-
- 18 5月, 2022 4 次提交
-
-
由 Feiyu Chan 提交于
Add Code Generation for operators, op makers and argument mapping functions (#41772)
-
由 Sławomir Siwek 提交于
* matmul refactor * remove UT which only check ENFORCE output * code format * improve memory usage
-
由 niuliling123 提交于
-
由 zyfncg 提交于
* add intermediate for some api * fix bug * fix fluid.layer
-
- 17 5月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* adapt faster tokenizer op * add eager test * add unittest
-
- 16 5月, 2022 3 次提交
-
-
由 niuliling123 提交于
-
由 Yiqun Liu 提交于
-
由 zyfncg 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * int_array supports constructed by gpu tensor * add some test * polish code * adjust full api * add unittest * add unittest Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
- 13 5月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 12 5月, 2022 2 次提交
-
-
由 Shuangchi He 提交于
-
由 tiancaishaonvjituizi 提交于
-
- 11 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * remove node pruning
-
- 10 5月, 2022 3 次提交
-
-
由 xiongkun 提交于
* change logic for optimize * modifty
-
由 BrilliantYuKaimin 提交于
* Create elementwise_heaviside_op.cc * add ElementwiseHeavisideFunctor * Create test_elementwise_heaviside_op.py * 增加heaviside的python接口 * add heaviside in white list * 增加heaviside的签名 * 增加heaviside的核函数 * 增加heaviside梯度的核函数 * 增加heaviside梯度的注册 * 调整代码格式 * Update elementwise_sig.cc * add heaviside in __all__ * Update heaviside docs * Update math.py * Update math.py * Update math.py
-
由 shixingbo 提交于
-
- 09 5月, 2022 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add AdamW jit kernel * Second implementation * Add missing header * Correct number of jit kernels in the test
-
由 niuliling123 提交于
-
- 07 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* the copy type of data transform for gpu_pinned to gpu change from syna to async * refactor code
-
- 06 5月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 05 5月, 2022 3 次提交
-
-
由 xiongkun 提交于
-
由 zhangkaihuo 提交于
-
由 QingshuChen 提交于
* update xpu depends *test=kunlun * minor *test=kunlun Co-authored-by: Nroot <root@yq01-sys-hic-p40-0091.yq01.baidu.com>
-
- 04 5月, 2022 2 次提交
-
-
由 XiaoguangHu 提交于
* fix bug of batch_norm_grad kernel with fp16 * format code
-
由 XiaoguangHu 提交于
-
- 01 5月, 2022 1 次提交
-
-
由 Lijunhui 提交于
-
- 30 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove useless lod copy * fix test failed * revert meta change * revert tensor change
-
- 29 4月, 2022 2 次提交
-
-
由 xiongkun 提交于
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default. * add opt_einsum dependence * add yaml and support eager model * fix by code review
-
由 YuanRisheng 提交于
* add double yaml * add inline func
-
- 28 4月, 2022 3 次提交
-
-
由 zyfncg 提交于
-
由 Chen Weihang 提交于
* opt attr eaque perf * opt attr select code * fix one hot infermeta * polish get attr impl * fix tests failed * add testcases
-
由 FlyingQianMM 提交于
* set device id of Place() to get GPUContext needed by LimitGridDim in ElemwiseGradBroadcast * fix code style
-