- 21 12月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 wangchaochaohu 提交于
-
- 18 12月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 17 12月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 16 12月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 15 12月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 14 12月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 08 12月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
This reverts commit befd6d53.
-
- 03 12月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* improve performance of elementwise_sum_grad
-
- 01 12月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* pass stop_gradient for cast op * improve performance of elementwise_add grad * use tensor copy async * dygraph branch * fix dygraph branch * add ut
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 30 12月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 19 11月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 30 9月, 2019 1 次提交
-
-
由 danleifeng 提交于
Improve elementwise operators performance in same dimensions
-
- 14 5月, 2019 1 次提交
-
-
由 Kaipeng Deng 提交于
* add elementwise_add_grad_grad op. test=develop * use defined GradMaker. test=develop
-
- 13 5月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Optimize the elementwise op with CUDA kernels. test=develop * Support setting of attr in op config file. test=develop * Add the support the setting dtype and initializer in config. test=develop * Save workspace. * Add initializer "zeros". test=develop * Fix compiling error. * Support the use of existed file to initailize tensor in op_tester. * Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims. test=develop
-
- 16 11月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop
-
- 08 11月, 2018 1 次提交
-
-
由 chengduo 提交于
* fix input<tensor> test=develop * fix split_ids test=develop * ElementwiseMul should not support SelectedRows * fix scale op test=develop * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar() * fix operator * refine MultiOutput * fix MultiOutput test=develop * disable test_dist_save_load test=develop * fix elementwise_op test=develop * add get_sparse_as_op test=develop * add info for check test=develop * rename get_sparse_as_op with extract_rows_as_op. test=develop * elementwise doesn't support selected_rows * fix regularizer * remove extract_rows_as test=develop * fix ci test=develop * add test for sum_op * fix regularizer test=develop * test=develop * fix pserver weight decay multi inputs test=develop
-
- 22 8月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 14 8月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 06 8月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 01 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add gradient register" * "make some enhance" * "better format" * "fix typo" * "fix reuse" * "fix get expected kernel" * "change the mkldnn code" * "fix mkldnn" * "fix mkldnn failed test" * "add comment"
-
- 24 5月, 2018 8 次提交
-
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
- 23 2月, 2018 1 次提交
-
-
由 Yu Yang 提交于
* Speed up elemwise grad * Fix bug * Add macro for MAX_BLOCK_DIM
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 03 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 02 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 16 1月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 15 1月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-