- 08 9月, 2021 1 次提交
-
-
由 Zhong Hui 提交于
-
- 05 8月, 2021 1 次提交
-
-
由 hong 提交于
* first test version * add test exec; * add data transfer; test=develop * add new exec head; * add memcpy; test=develop * add python fetch * add new test * add graph node; test=develop * remove useless new executor test; test=develop * remove gperf dependency; test=develop * fix compile bugs; test=develop * remove useless code; test=develop * remove useless code; test=develop * add uni test; test=develop * polish code; test=develop * polish code; test=develop * add interpreter cmakefile; test=develop * remove useless code; test=develop
-
- 02 8月, 2021 1 次提交
-
-
由 furnace 提交于
* [NPU] add reduce_max * [NPU] delete skipIf * [NPU] add atrrs support or check * [NPU] add attr out_dtype * [NPU] delete debug codes
-
- 02 7月, 2021 1 次提交
-
-
由 niuliling123 提交于
-
- 30 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 21 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 14 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 17 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
- 25 8月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 21 8月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 13 8月, 2020 1 次提交
-
-
由 wawltor 提交于
Update the code for the min,max, test=develop
-
- 09 6月, 2020 1 次提交
-
-
由 wawltor 提交于
Add the support the 5d,6d tensor support for the reduce ops; Add the same time, the compile time, it was 22 minutes, it was 21 minutes after fixed.
-
- 17 4月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
* fix error message of l2_normalize, matmul, mean, etc. * add the test case for those ops
-
- 05 4月, 2020 1 次提交
-
-
由 wawltor 提交于
* Add the sum op to API 2.0, test=develop * Fix the import meesage in common_ops_import
-
- 23 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 31 10月, 2019 1 次提交
-
-
由 hong 提交于
* refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop
-
- 28 10月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop
-
- 12 10月, 2019 1 次提交
-
-
由 Guo Sheng 提交于
-
- 10 10月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 03 10月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
* fix the error message for reduce_mean and reduce_sum op test=develop * fix typo test=develop * fix according review advice test=develop * fix the test test=develop * fix test=develop
-
- 26 9月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 05 9月, 2019 1 次提交
-
-
由 Leo Chen 提交于
-
- 14 5月, 2019 1 次提交
-
-
由 lvmengsi 提交于
* test=develop, double backward reduce_mean * add comment. test=develop * fix format. test=develop * rename GradGrad -> DoubleGrad. test=develop * fix op_use_default_grad_op_maker.spec. test=develop
-
- 12 4月, 2019 2 次提交
-
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
bug fix: reduce_all, reduce_any register GRAD_OP, but have not defined GradKernel
-
- 16 11月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop
-
- 22 7月, 2018 2 次提交
-
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
- 07 6月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "split into multiple .ccl" * "refine file structure" * "refine files" * "remove the cmakelist" * "fix typo" * "fix typo" * fix ci
-
- 05 6月, 2018 1 次提交
-
-
由 whs 提交于
-
- 23 5月, 2018 1 次提交
-
-
由 whs 提交于
* Enhance reduce op for multi dims. * Uncomment some unitest. * Uncomment unitest. * Remove unused code. * Fix infershape and python wrapper. * Add more examples. * Fix l2_normalize. * Fix normalization_wrapper. * Polish code. 1. Rename unitest function. 2. Rename const variable.
-
- 19 4月, 2018 1 次提交
-
-
由 Abhinav Arora 提交于
* Fix CPPLint errors in multiclass_nms, nccl, nce, reduce and save_load_combine * Fix
-
- 07 3月, 2018 1 次提交
-
-
由 zhouhanqing 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 13 12月, 2017 1 次提交
-
-
由 guosheng 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 11 11月, 2017 1 次提交
-
-
由 Dong Zhihong 提交于
-