- 28 9月, 2020 4 次提交
-
-
由 lilong12 提交于
-
由 wanghuancoder 提交于
* add paddle.fluid._cuda_synchronize, test=develop * fix bug about core_avx core_noavx, test=develop * delete CPUPlace and XPUPlace, test=develop
-
由 liym27 提交于
* Support assignment to a Variable in dynamic mode. Note: not deal with backward. * Rewrite VarBase __setitem__ for high-performance. * try to test 3 means to do __setitem__ and test the performance of 3 means. * Retain the means of the highest performance: C++ code and don't trace op.
-
由 lilong12 提交于
* add gloo initializer, test=develop
-
- 27 9月, 2020 1 次提交
-
-
由 Li Fuchen 提交于
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
- 26 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 23 9月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* Makes the Bind Method more intelligent * Makes the Bind Method more intelligent * fix unittest * fix unittest * fix conflict
-
- 21 9月, 2020 2 次提交
-
-
由 Leo Chen 提交于
* support use add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when sereral sum ops inserts at same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
由 huangxu96 提交于
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.
-
- 15 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 14 9月, 2020 2 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 Zhen Wang 提交于
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240) * update amp_check_finite_and_scale_op for static_amp. * use amp_check_finite_and_scale in static graph amp. * update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op). * add update_loss_scaling op in cpp. * add update_loss_scaling_op unit test. * update the doc of the check_finite_and_unscale op * Update the process of gradients updating skipping if the gradients have infinite values. * update the way to zero grads. * update test_update_loss_scaling_op.py * add log info when find infinite grads. * add the unit test for UpdateLossScaling Layer.
-
- 08 9月, 2020 1 次提交
-
-
由 wangguanzhong 提交于
* enhance collect_op for dygraph, test=develop * enhance detection ops with lod, test=develop * support none bbox left in generate_proposals, test=develop * unfiy MultiLevelRoisNum, test=develop * update core.ops, test=develop * add op register for new input & output, test=develop
-
- 07 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 04 9月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 03 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 02 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test * Repair capi_gpu test
-
- 01 9月, 2020 1 次提交
-
- 31 8月, 2020 2 次提交
-
-
由 arlesniak 提交于
* get use of global 'use_mkldnn' in layer_helper * update for CI * update for CI, relu test * update for CI, relu test added, make FLAGS_use_mkldnn a public flag * added more strict tests, fixes after review * fixes after review * fixes after review, CI stuff
-
由 yaoxuefeng 提交于
-
- 28 8月, 2020 4 次提交
-
-
由 Wilber 提交于
* refine paddle inference api Co-authored-by: Nnhzlx <nhzlx.dragon@gmail.com>
-
由 Leo Chen 提交于
* refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed *
🐞 fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci -
由 joanna.wozna.intel 提交于
* Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test
-
由 Zhen Wang 提交于
* update the demo code and the doc of varbase.backward. * update the doc of the fake interface `paddle.fluid.Variable`. * remove BackwardStrategy.
-
- 27 8月, 2020 1 次提交
-
-
由 lilong12 提交于
add collective op for cpu using gloo and paddle.distributed.* apis
-
- 25 8月, 2020 2 次提交
-
-
由 Zhang Ting 提交于
* add unique_v2 op * remove unique_v2 op * update doc
-
由 wanghuancoder 提交于
* optimized transformation form tensor to numpy, test=develop * optimized transformation form tensor to numpy, pass pre-commit, test=develop * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop * modify py:array construct, test=develop * fix _fetch_var to use deep copy, test=develop
-
- 24 8月, 2020 2 次提交
-
-
由 wanghuancoder 提交于
* api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear, test=develop * api2.0 fix code examples, test=develop * modify test_bilinear_api, about place,to_tensor , test=develop * re pass pre-commit, test=develop * Update common.py * fix BilinearTensorProduct ci error, test=develop
-
由 wanghuancoder 提交于
add op_function_generator.exe retry in windows
-
- 23 8月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 21 8月, 2020 1 次提交
-
-
由 QingshuChen 提交于
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 19 8月, 2020 1 次提交
-
-
由 ceci3 提交于
* add SyncBatchNorm,test=develop
-
- 18 8月, 2020 2 次提交
-
-
由 Leo Chen 提交于
-
由 yaoxuefeng 提交于
-
- 17 8月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* print user-friendly error message * adjust error sumary
-
- 16 8月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 wangchaochaohu 提交于
-
- 15 8月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* expose and unify the Tensor concepts to the user * expose tensor to user * add copy place for Tensor * add copy place for Tensor * add note * add macro PADDLE_WITH_CUDA * remove RUN_TYPE=DIST * fix some error
-
- 14 8月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 13 8月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* add auto_cast, test=develop * add loss scaler, test=develop * add comments, test=develop * refine code, test=develop * refine code, test=develop * do not set flags automatically, test=develop * fix custom op bug, test=develop * add more test, test=develop * refine enable logic, test=develop * enable amp test with GPU, test=develop * add unittest * add test for found_inf * follow comments * follow comments * remove global variable, use singleton * add some notes * update comments * update comments * update comments * add use_dynamic_loss_scaling argument * refine found_inf * refine found_inf
-