- 10 2月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add dropout * add reshape * add slice * refien slice unittest * refine slice unittest * add cpu bf16 kernel
-
- 18 11月, 2021 1 次提交
-
-
由 Li Min 提交于
* fix bug to support dropout eval grad computing. * Remove useless code.
-
- 15 9月, 2021 1 次提交
-
-
由 Li Min 提交于
-
- 03 9月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 03 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 16 12月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* improve grad perf
-
- 11 12月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* improve drop out * add VectorizedRandomGeneratorWithGenerator * fix bug * modify according to comments
-
- 04 9月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 13 4月, 2020 1 次提交
-
-
由 mapingshuo 提交于
* add cuda kernel for seed, test=develop
-
- 10 12月, 2019 1 次提交
-
-
由 mapingshuo 提交于
* add seed op
-
- 03 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 20 8月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
* cuda optimie for dropout * remove tmp swp file * fix compile error test=develop * test=develop optimize the cuda realization of dropout op * remove unsed code test=develop * remove tmp file test=develop
-
- 28 4月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* refine_dropout_mem,test=develop * # This is a combination of 14 commits. # The first commit's message is: remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066) # This is the 2nd commit message: Fleet unify distributed training (#16791) * implement distributed transpiler with fleet # This is the 3rd commit message: ParallelDyGraph with GPU collective mode (#16827) implement dygraph.parallel.DataParallel to hook reduce op. # This is the 4th commit message: Init mixed precision training interface (#16856) * Init mixed precision training interface * Add fp16 test script test=develop * All initializers support float16 test=develop * Code cleanup & add more code annotations test=develop * Update API spec test=develop * Add usage example in doc test=develop # This is the 5th commit message: fix reference_count_pass,test=develop (#17060) test=develop # This is the 6th commit message: Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090) * Cache the information of linear interpolation in forward and use it in backward. test=develop * Fix cuda kernel. test=develop # This is the 7th commit message: remove unnecessary prepare_data (#17080) test=develop # This is the 8th commit message: fix interpolate cu. test=develop (#17101) # This is the 9th commit message: test=develop, double backward leaky_relu (#17067) backward of backward: leaky_relu # This is the 10th commit message: fix fuse optimizer ops (#17102) test=develop # This is the 11th commit message: truncated_gaussian_random supported in distributed training, test=develop (#17091) # This is the 12th commit message: Detailed coordinate description for yolov3 loss (#17007) * Detailed coordinate description for yolov3 loss test=develop * modified api.spec test=develop * modified loss name * fix api.spec test=develop * polish description test=develop * modified api.spec test=develop # This is the 13th commit message: fix test_weight_decay (#17109) test=develop # This is the 14th commit message: Path flag (#17105) * fix python/paddle/fluid/__init__.py detecting problems
-
- 30 1月, 2019 1 次提交
-
-
由 Yibing Liu 提交于
* Some improvements to support bert mixed precision training test=develop * Revert the cast in layer_norm test=develop
-
- 11 12月, 2018 1 次提交
-
-
由 Yu Yang 提交于
The macro should be defined by compiler rather than by source. test=develop
-
- 24 10月, 2018 1 次提交
-
-
由 phlrain 提交于
-
- 23 10月, 2018 1 次提交
-
-
由 phlrain 提交于
-
- 20 4月, 2018 1 次提交
-
- 19 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* accelerate dropout * accelerate dropout * "fix the dropout test" * "rerun ci" * "fix ci" * "rerun ci" * "fix ci" * "fix" * "stage" * disable
-
- 27 3月, 2018 1 次提交
-
-
由 gongweibao 提交于
-
- 22 3月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 20 3月, 2018 2 次提交
-
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
- 12 2月, 2018 2 次提交
-
-
由 qingqing01 提交于
-
由 dzhwinter 提交于
* "merge random generator kernel and mul" * "fix dropout"
-
- 10 2月, 2018 2 次提交
- 30 1月, 2018 1 次提交
-
-
由 caoying03 提交于
-
- 26 12月, 2017 2 次提交
-
-
由 Luo Tao 提交于
-
由 chengduoZH 提交于
-
- 21 12月, 2017 1 次提交
-
-
由 Yibing Liu 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 24 11月, 2017 1 次提交
-
-
由 QI JUN 提交于
* is_training to is_test in dropout op * handle dropout and batch_norm operator when prune pdesc in testing mode * handle dropout and batch_norm operator when prune pdesc in testing mode * add get_inference_program method * fix dropout op * fix ci * test data after each batch training * refine code * refine test_book3 * fix ci * follow comments
-
- 28 9月, 2017 1 次提交
-
-
由 Yu Yang 提交于
-
- 20 9月, 2017 1 次提交
-
-
由 dangqingqing 提交于
-
- 19 9月, 2017 2 次提交
-
-
由 Xinghai Sun 提交于
-
由 Xinghai Sun 提交于
Change type of dropout_prob to template typename.
-
- 16 9月, 2017 1 次提交
-
-
由 Xinghai Sun 提交于
-
- 03 9月, 2017 1 次提交
-
-
由 Xinghai Sun 提交于
-
- 02 9月, 2017 1 次提交
-
-
由 Xinghai Sun 提交于
-