- 30 8月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
* Support memory eager deletion on recurrent OP (#17710) Test PaddingRNN on V100 GPU device. Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU. GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR) Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR) * Fix random test_recurrent_op failure (#18718) The change includes 3 things: 1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1. 2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values. 3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
-
- 29 8月, 2019 4 次提交
-
-
由 chengduo 提交于
* update executor feed
-
由 Jiabin Yang 提交于
This PR cherry-picked the fix of multiple Layers parameter missing error in dygraph mode,the original one is #18968
-
由 tangwei12 提交于
* fix bug in Class MultiSlotDataGenerator's function _gen_str, test=develop (#18222) * fix some bug when merge sparse embedding parameters, test=develop (#18223) * fix communicator with pyreader (#18350) * delete AllocatorFacade destructor (#18606) * fix distribute transpiler GRPC error code 4, RPC Deadline (#18984) * merge pr #18441
-
由 Yibing Liu 提交于
test=release/1.5
-
- 28 8月, 2019 2 次提交
- 27 8月, 2019 2 次提交
-
-
由 Jiabin Yang 提交于
test=release/1.5, fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328) (#19414) * add default getItem * test=develop, fix has_attr disabled error in Layer * test=develop, fix GroupNorm and deepcf bug on attrs
-
由 LielinJiang 提交于
* fix depthwise conv gpu kernel bug, test=develop * add more depthwise conv test, test=develop
-
- 26 8月, 2019 6 次提交
-
-
由 LielinJiang 提交于
* make_roi_perspective_transform_op_return_mask_and_matrix * make_roi_perspective_transform_op_return_mask_and_matrix
-
由 chengduo 提交于
test=release/1.5
-
由 Zhaolong Xing 提交于
* CHERRY_PICK 18941, 18860: TRT fp16 support. test=release/1.5 * CHERRY_PICK 19213: Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. 1. fix affine channel fuse pass. 2. fix condition block op. 3. fix merge lod tensor op bug. 4. fix memory optim cause by reset lod op. test=release/1.5
-
由 石晓伟 提交于
-
由 石晓伟 提交于
-
由 石晓伟 提交于
* add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop
-
- 21 8月, 2019 3 次提交
-
-
由 chengduo 提交于
* Add call stack info during runtime and compile time test=develop
-
由 Zeng Jinle 提交于
-
由 chengduo 提交于
* add warning info for CPU_NUM test=develop * update dygraph parallel.py test=develop * prune the feed op in compiler test=release/1.5 * remove compile from PE test=develop * test CUDAPinnedPlace in reader test=release/1.5
-
- 20 8月, 2019 2 次提交
-
-
由 silingtong123 提交于
* add PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#19211) * test=develop,Modify PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS
-
由 chengduo 提交于
* fix REGISTER_OP_WITHOUT_GRADIENT test=develop
-
- 19 8月, 2019 1 次提交
-
-
由 zhouwei25 提交于
* add function:error code,test=develop * add function:fix inference lib bug,test=develop * fix the bug that PYTHON_EXECUTABLE not exists
-
- 16 8月, 2019 2 次提交
-
-
由 wopeizl 提交于
* merge from develop: add tensorrt support for win test=develop
-
由 silingtong123 提交于
* fix warpctc dynamic library not found issue on mac and windows platform
-
- 15 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* fix gather op bug test=release/1.5
-
- 29 7月, 2019 2 次提交
-
-
由 chengduo 提交于
* fix backward bug
-
由 Zeng Jinle 提交于
-
- 26 7月, 2019 1 次提交
-
-
由 FDInSky 提交于
[cherry pick]fix roi_align_op cpu backward's bug
-
- 25 7月, 2019 3 次提交
-
-
由 wangchaochaohu 提交于
* rewrite the conv_op using cudnn_conv_helper * add workspace limit for v7 test=develop * fix test=develop * add half float test=develop * fix test=develop * fix test=develop * revise code style test=develop * fix test=develop
-
由 qingqing01 提交于
-
由 qingqing01 提交于
-
- 24 7月, 2019 1 次提交
-
-
由 xsrobin 提交于
* modify install GPU97 * modify install GPU97
-
- 18 7月, 2019 1 次提交
-
-
由 xsrobin 提交于
-
- 08 7月, 2019 3 次提交
-
-
由 Zhaolong Xing 提交于
fix mask rcnn add interface for setting optim_cache_dir(eg: when in trt int8 mode, and load model from memory, there should be a interface for setting the trt calibration table data dir) test=release/1.5
-
由 Jiabin Yang 提交于
* test=release/1.5, cherry-pick hide not_support for dygraph * test=release/1.5, cherry-pick hide not_support for dygraph
-
由 zhaoyuchen2018 提交于
Add path to handle 1D vector
-
- 07 7月, 2019 1 次提交
-
-
由 chengduo 提交于
test=release/1.5
-
- 05 7月, 2019 4 次提交
-
-
由 bingyanghuang 提交于
-
由 gongweibao 提交于
-
由 tensor-tang 提交于
test=release/1.5
-
由 xsrobin 提交于
-