- 28 6月, 2019 3 次提交
-
-
由 chengduo 提交于
* add cuda_is_available test=release/1.5
-
由 Zeng Jinle 提交于
-
由 石晓伟 提交于
* Update the Anakin interfaces for content-dnn and MLU (#17890) * update anakin-engine interfaces for content-dnn test=develop * support only-gpu mode of Anakin modify eltwise parse test=develop * modification for thread-safe test=develop * Integrated template instance test=develop * increase template parameters test=develop * support MLU predictor test=develop * update anakin cmake files test=develop * update TargetWrapper::set_device * update the initialization of anakin subgraph test=develop * use the default constructor of base class test=develop * modify the access level of anakin engine (#18015) test=develop * fix ci test cmake test=develop
-
- 27 6月, 2019 1 次提交
-
-
由 chengduo 提交于
* update pe reduce config test=release/1.5 * drop the local_exe_scopes of the previous parallel_executor test=release/1.5
-
- 26 6月, 2019 2 次提交
-
-
由 chengduo 提交于
test=release/1.5
-
由 tensor-tang 提交于
* fix softrelu doc * update API doc test=release/1.5
-
- 25 6月, 2019 6 次提交
-
-
由 Hongyu Liu 提交于
* sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-*x to elmentwise_op; test=develop * add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop
-
由 Guo Sheng 提交于
test=release/1.5 * Fix the GetExpectedKernelType of add_position_encoding_op. * Fix the doc of lstm_unit outputs in nn.py.
-
由 Yiqun Liu 提交于
test=release/1.5
-
由 chengduo 提交于
* fix default value of fluid.memory_optimize test=release/1.5
-
由 Yibing Liu 提交于
* Use TensorCopySync for sequence_unpad op * Fix the tensor memory alloc bug test=release/1.5
-
由 lujun 提交于
add dygraph api doc for fluidDoc and api spec
-
- 24 6月, 2019 3 次提交
-
-
由 Hongyu Liu 提交于
* fix slice op bug; test=develop * fix variabel test bug; test=develop * remove slice while true; test=develop
-
由 chengduo 提交于
test=release/1.5
-
由 lujun 提交于
Repair error prompt: Users are prompted to check whether the model or parameter files are damaged when loading parameters are wrong. * cherry pick 18000, test=release/1.5
-
- 22 6月, 2019 1 次提交
-
-
由 wopeizl 提交于
* cherry-pick the inference update for win test=develop * test=develop
-
- 21 6月, 2019 1 次提交
-
-
由 lvmengsi 提交于
* update some op doc, test=release/1.5
-
- 20 6月, 2019 2 次提交
-
-
由 qingqing01 提交于
* Update backward appending stragety to support double backward and fix some bug. (#18104) * Update backward.py: - If there is no input grad var in all outputs of previous ops, do not append this op into graph. - Only apply this stragety when double backward. * Update some double backward op. * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
-
由 翟飞跃 提交于
-
- 19 6月, 2019 7 次提交
-
-
由 翟飞跃 提交于
* add mkldnn Int8v2 slim doc (#17909) * Change int8v2 CAPI unit test name and add log in the prediction stage (#18200) * fix issue 18111;test=develop * fix timer;test=develop * refine code;test=develop * test=release/1.5
-
由 chengduo 提交于
* add multi process reader test=release/1.5
-
由 tangwei12 提交于
* fix save/load in fleet (#17675) * fix save/load in Fleet * add UT framework of Fleet (#18058) * add paddle cloud role maker for customized usage, note this is only for industrial users that have cloud environment pre-configuration (#18121) add paddle cloud role maker for specific cloud usage. This pr will simplifies user's configuration in distributed training. * assign role_maker before use (#18137)
-
由 FlyingQianMM 提交于
Cherry pick retinanet_target_assign_op(#17893), sigmoid_focal_loss_op(#17895) and retinanet_detection_output_op(#17896) for supporting retinanet (#18141) * test=release/1.5 Fix conflicts in test_layers.py when adding target assign operator for supporting retinanet. Cherry pick #17893 * test=release/1.5 Add sigmoid focal loss operator for supporting retinanet. Cherry pick #17895 * test=release/1.5 Add detection output operator for supporting retinanet. Cherry pick #17896 * test=release/1.5 fix wrong code style in test_layers.py when cherry pick retinanet_target_assign #17893 * test=release/1.5 Fix type error of std::pow in sigmoid_focal_loss. Cherry pick #17895
-
由 chengduo 提交于
* update execution_strategy option default value test=release/1.5 * fix doc error test=release/1.5
-
由 chengduo 提交于
* remove nccl dep when the number of GPU is 1 test=develop * use multi card run syncBN test=release/1.5
-
由 hutuxian 提交于
Add trainer_desc proto DEPS to solve CI random fail.
-
- 18 6月, 2019 4 次提交
-
-
由 AIFollowers 提交于
Add cascade rcnn support.
-
由 Wojciech Uss 提交于
Cherry pick #18077 and #18111 unify FP32 vs. INT8 comparison tests output, reuse C-API INT8 unit test application (#18145) * unify FP32 vs. INT8 comparison tests output (#18111) test=release/1.5 * reuse C-API INT8 unit test application (#18077) test=release/1.5
-
由 Zeng Jinle 提交于
* fix dygraph mem leak, test=release/1.5 * polish msg, test=release/1.5
-
由 cjt222 提交于
cherry pick for deform roi pooling
-
- 17 6月, 2019 1 次提交
-
-
由 hutuxian 提交于
cherry-pick for (https://github.com/PaddlePaddle/Paddle/pull/17402) Add Pipeline Concurrency Train Mode: - Cpp: pipeline_trainer & section_worker - Python: PipelineOptimizer - Add a new data_feed type: PrivateInstantDataFeed - Add a test demo of pipeline trainer and the test model is gnn - Do not support win32 now
-
- 15 6月, 2019 3 次提交
-
-
由 Sylwester Fraczek 提交于
* fix multithreading issue test=develop * rview fixes test=develop * reivew fix: omp->cpu, infernce_api.cc->pybind.cc test=release/1.5
-
由 Zeng Jinle 提交于
* fix py_reader iterable bug, test=release/1.5 * move data from buffered_reader,test=release/1.5
-
由 chengduo 提交于
* update CPU_NUM config test=develop
-
- 14 6月, 2019 2 次提交
-
-
由 gongweibao 提交于
-
由 ruri 提交于
-
- 13 6月, 2019 4 次提交
-
-
由 wawltor 提交于
test=release/1.5 cherry-pick from #17952 The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.
-
由 Wojciech Uss 提交于
Added unit test for QAT FP32 & INT8 comparison (#17814) Disable MKLDNN FC in Resnet50 test (#18030) test=release/1.5
-
由 tensor-tang 提交于
test=release/1.5
-
由 gongweibao 提交于
-