- 07 6月, 2022 4 次提交
-
-
由 Guoxia Wang 提交于
-
由 Wilber 提交于
-
由 limingshu 提交于
Transpose optimization with assitant of Chengdu Supercomputing Center and auto_tune operation (#42704)
-
由 niuliling123 提交于
-
- 06 6月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 02 6月, 2022 2 次提交
-
-
由 Siming Dai 提交于
* support heter reindex * add unittest, fix bug * add comment * delete empty line * refine example * fix codestyle * add disable static
-
由 Li Min 提交于
* extend forward fast_ln_kernel to support more column values.
-
- 01 6月, 2022 3 次提交
-
-
由 YuanRisheng 提交于
* add yaml * fix infrt compile bugs
-
由 Aganlengzi 提交于
-
由 chentianyu03 提交于
* add conv3d yaml * add conv3d_grad, conv3d_double_grad * add final_state_conv3d test case * add conv3d double test case * add depthwise_conv2d grad yaml * add depthwise_conv2d double grad test case * modify the order of args * add depthwise_conv2d_grad_grad config
-
- 31 5月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* fix assign kernel copy impl * fix test failed
-
由 thunder95 提交于
* rrelu逻辑部分 * unregistered op kernel (unresolved) * commit before merge * 丰富测试用例 * 修复rrelu-sig的bug * 修复cpu环境测试 * 修改拼写错误 * 修改code format * 尝试优化测试用例timeout的问题 * 优化测试用例 * 移除seed, 优化随机函数 * update en doc for rrelu * fix rrelu en docs, test=document_fix * add paper link for en docs, test=document_fix * udpate en doc * add r,test=document_fix
-
由 xiongkun 提交于
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 * make EInsumOP support bf16 * add unittest for BF16 * add condition for test_BF16 * fix bugs * fix
-
- 30 5月, 2022 4 次提交
-
-
由 crystal 提交于
-
由 thunder95 提交于
* nanmedian op * 修改cuda kernel的bug * 修复count_if在其他硬件平台不兼容 * 修复某些cpu硬件不兼容 * 修复某些cpu硬件不兼容 * 修复isnan判断 * 兼容numpy低版本不支持全部nan的情况 * 兼容numpy低版本不支持全部nan的情况 * fix code example * fix api comment error * 修改反向传播逻辑以及c++处理逻辑 * 完成修改建议 * typo pre_dim * update en docs, test=document_fix * remove numpy in en doc, test=document_fix * add r,test=document_fix * 添加api到all * follow advice from chenwhql
-
由 limingshu 提交于
* 1st commit * fix usless change in header transpose_kernel_h file * add sync
-
由 Aganlengzi 提交于
* addmm supports 1-d input * fix coverage * fix * more ut
-
- 27 5月, 2022 2 次提交
-
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
由 xiongkun 提交于
-
- 26 5月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
-
由 YuanRisheng 提交于
* move instance_norm * change mutable_data * fix compile bugs
-
- 25 5月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* fix maybe-uninitialized warning * fix compile * fix xpu compile * fix npu compile * fix infer compile * fix compile * fix compile
-
由 xiongkun 提交于
* change logic for optimize * modifty * optimize the backward speed of EinsumOp * add cache optimizer for einsum op * EinsumOp: fix new dygraph mode error * fix bug * change Cache->InnerCache * fix code * fix * add nan inf utils for einsum op * add as_extra * Compatible with v2.3 EinsumOp * remove dispensable
-
- 24 5月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move grad_add * fix unittest bugs * fix compile bugs
-
- 23 5月, 2022 4 次提交
-
-
由 YuanRisheng 提交于
* add double grad yaml * fix bugs when compile infrt
-
由 zyfncg 提交于
* remove storage * add glog include * add glog include * add glog include
-
由 zhouweiwei2014 提交于
* remove is_init_py of RandomGenerator, and use Global Generator if not OP seed * fix comment
-
由 shixingbo 提交于
-
- 20 5月, 2022 4 次提交
-
-
由 niuliling123 提交于
-
由 Leo Chen 提交于
* use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output * add flags to control compute type * default to false * add unit test * default to true
-
由 YuanRisheng 提交于
-
由 xiaoguoguo626807 提交于
* add tril_indices cpu kernal * modify tril_indice cpu op * modify bug * modify bug * add tril_indices python api * add tril_indices python api * resolve conflict * add tril_indices test * modify details * add tril_indices.cu * pythonapi pass * save tril_indices * CPU tril_indices pass * delete vlog * modify test_tril_indices_op.py * delete tril_indices_kernel.cc.swp * delete tril_indice.cu * modify code style * add newline in creation.py * modify creation.py linux newline * delete annotation * check code style * check .py style add final_state?? * modify code style * add gpu_tril_indices * modify gpu_compiled_juage * modify gpu judge * code style * add test example * modify english document modify english document modify english document modify document modify document * modify pram name * modify pram name * modify pram * reduce test ex
-
- 19 5月, 2022 2 次提交
-
-
由 zyfncg 提交于
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * fix bug of higher order derivative * fix bug of FillZeroForEmptyGradInput * remove redundant vector in grad_node * fix bug of test_deformable_conv_v1_op * fix bug of test_deformable_conv_v1_op * some refacotr
-
由 zyfncg 提交于
* remove shared_storage * fix bug * fix rnn bug
-
- 18 5月, 2022 2 次提交
-
-
由 niuliling123 提交于
-
由 zyfncg 提交于
* add intermediate for some api * fix bug * fix fluid.layer
-
- 16 5月, 2022 2 次提交
-
-
由 niuliling123 提交于
-
由 Yiqun Liu 提交于
-