- 26 1月, 2021 3 次提交
-
-
由 lilong12 提交于
-
由 jakpiase 提交于
* added external reorder to profiler * resolved conflict * added enable_static * initial version of lstm, not working yet * added lstm to operators.cmake * added vanilla lstm mkldnn op * added peephole weights integration * minor changes * added formatting * added fusion_lstm_mkldnn to static_whitelist * added formatting * removed comment * moved use_peepholes attribute inside is_cached block * reverted wrong changes * minor formatting change * minor changes
- 25 1月, 2021 4 次提交
-
-
由 arlesniak 提交于
* More precise mkldnn kernel choice in GetExpectedKernelType * Fixes after review * Refresh develop for CI * CI experiment * get back from CI exper
-
由 Jacek Czaja 提交于
-
由 Shang Zhizhou 提交于
* add dla * add dla done * add python api Co-authored-by: Nshangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>
-
由 chentianyu03 提交于
* add abs test case * use std::abs to fix abs bug * fix the abs bug * fix abs bug
-
- 22 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 21 1月, 2021 3 次提交
-
-
由 Pei Yang 提交于
-
由 Thunderbrook 提交于
* build gpu task core * format
-
由 石晓伟 提交于
-
- 20 1月, 2021 11 次提交
-
-
由 Jacek Czaja 提交于
-
由 TTerror 提交于
* support reduce_max op on kunlun * support reduce_max op on kunlun * support reduce_max op on kunlun * support reduce_max op on kunlun
-
由 QingshuChen 提交于
-
由 wuhuanzhou 提交于
* optimize unity build, test=develop * fix code style error, test=develop * fix code style error and test /MP settings, test=develop
-
由 liuyuhui 提交于
-
由 wanghuancoder 提交于
* delete empty line of pybing.cc, test=develop * use nvtx push pop in timeline, test=develop * change year, test=develop * add #ifdef PADDLE_WITH_CUDA, test=develop * add #ifndef WIN32, test=develop * is_pushed to is_pushed_, test=develop
-
由 chentianyu03 提交于
* rewrite abs op * rewrite abs op and remove abs in activation * remove abs register in old codes * fix abs_grad type error * fix abs double_grad output name error * modify abs_grad, abs_grad_grad functor for windows building * format code style * fix the bug of result is nan when the divisor is zero * add missing abs attr and add abs for float16
-
由 Wilber 提交于
-
由 Tao Luo 提交于
-
由 tangwei12 提交于
* add trainers for pserver Change-Id: I1a75793ec81ce126d07f4c47cae09b95d530bbc8
-
由 wanghuancoder 提交于
* add some RecordEvent, for dygraph timeline, test=develop * change GpuMemcpySync to memory::Copy, test=develop * fix compile problem, test=develop * fix compile problem, test=develop * fix, test=develop * fix, test=develop
-
- 19 1月, 2021 9 次提交
-
-
由 liym27 提交于
-
由 WangXi 提交于
-
由 ykkk2333 提交于
* add rmsprop_op_xpu test=kunlun * modified rmsprop_op_xpu error code. test=kunlun
-
由 Zhou Wei 提交于
-
由 Zhen Wang 提交于
-
由 Leo Chen 提交于
* unify calling cudaSetDevice * fix compile
-
由 pangyoki 提交于
-
由 Leo Chen 提交于
* support layer_norm fp16 in dygraph amp * add ut * refine code
-
由 wanghuancoder 提交于
-
- 18 1月, 2021 7 次提交
-
-
由 hutuxian 提交于
-
由 hutuxian 提交于
-
由 liuyuhui 提交于
-
由 QingshuChen 提交于
-
由 wanghuancoder 提交于
-
由 taixiurong 提交于
-
由 JZ-LIANG 提交于
-
- 17 1月, 2021 1 次提交
-
-
由 guofei 提交于
* Modify the calculation logic of LambOptimizer
-
- 16 1月, 2021 1 次提交
-
-
由 Adam Osewski 提交于
* Move pass tester helper functions to single common place. * Use helper functions in two more fuse pass tests.
-