提交 · accb132f0f2ec150eb97685764553e109f0c4f15 · BaiXuePrincess / Paddle

15 6月, 2019 3 次提交
- S
  
  fix slim int8 mkldnn multithreading issue (#18009) · accb132f
  由 Sylwester Fraczek 提交于 6月 15, 2019
  
  accb132f
- T
  add approval to requirements.txt · 2e1d8cf7
  由 tianshuo78520a 提交于 6月 15, 2019
```
add luotao to approval requirements.txt
```
  2e1d8cf7
- C
  Fix bug of scope_buffered_ssa_graph_executor (#18100) · 24e988a4
  由 chengduo 提交于 6月 15, 2019
```
* fix code bug
test=develop
```
  24e988a4
14 6月, 2019 6 次提交
- H
  Modify format of GPU allocation failure log. (#18034) · 3f55ab0f
  由 Huihuang Zheng 提交于 6月 14, 2019
```
As title

test=develop
```
  3f55ab0f
- G
  
  Fix reinitialized ncclid error! (#18025) · f5caf344
  由 gongweibao 提交于 6月 14, 2019
  
  f5caf344
- W
  Add warning for cudnn warpctc kernel in CUDA9\CUDA10. (#18046) · 354643d8
  由 whs 提交于 6月 14, 2019
```
test=develop
```
  354643d8
- Q
  Hidden paddle.fluid.layers.detection_map. (#18033) · e81756f1
  由 qingqing01 提交于 6月 14, 2019
```
* Remove layers.detection_map API
* Since uers can use fluid.metrics.DetectionMAP to calculate mAP of current-batch and cumulative-batch. layers.detection_map only can calculate cur-batch mAP.
```
  e81756f1
- Y
  Optimize fused_elewise_activation_grad op. (#18041) · 660c1a65
  由 Yiqun Liu 提交于 6月 14, 2019
```
test=develop
```
  660c1a65
- L
  add Mobilienet ssd int8 analyzer tester (#18075) · 46625415
  由 lidanqing 提交于 6月 14, 2019
```
* add pascalvoc preprocess script and mobilenet-ssd analyzer_tester, wait 17737

* change converting local dataset to downloading and converting tarfile
test=develop

* change the test data_path
test=develop

* change copyright (c) 2016 to copyright (c) 2019
test=develop
```
  46625415
13 6月, 2019 6 次提交

石

fix ci test cmake test=develop (#18060) · 42f12a4a
由石晓伟提交于 6月 13, 2019

42f12a4a
C
Update CPU_NUM config (#18059) · b5a1c146
由 chengduo 提交于 6月 13, 2019
```
* update CPU_NUM config
test=develop
```
b5a1c146

refactor the function ConvFwdPrimitiveDesc (#17897) · f8ecc3de

由 lidanqing 提交于 6月 13, 2019

* refractor the function ConvFwdPrimitiveDesc
test=develop

* change according to review
test=develop

* use pointer way without boost::optional
test=develop

* pass vector to function by reference instead of raw vector
test=develop

* change pointer to shared_ptr
test=develop

f8ecc3de

M

Disable MKLDNN FC in Resnet50 test (#18030) · 8462e2b8
由 Michał Gallus 提交于 6月 13, 2019

8462e2b8

Added unit test for QAT FP32 & INT8 comparison (#17814) · 78e93286

由 Wojciech Uss 提交于 6月 13, 2019

* added unit test for QAT FP32 & INT8 comparison

test=develop

* enabled other models and updated filenames

test=develop

* added accuracy check and multiple batch handling

test=develop

* removed quantization_mkldnn_pass.py

test=develop

* cleanup

test=develop

* updated model paths

test=develop

* renamed tests without MKL-DNN

test=develop

* fix reusing mkldnn pool2d primitive

test=develop

* add performance measuring

test=develop

* fix accuracy statistics

test=develop

* removed non-mkldnn tests

test=develop

* added conv2d_depthwise->conv2d mkldnn transformation

test=develop

* format update

test=develop

* fixed creating key for pool2d grad

test=develop

* added pass

* Fix the accuracy issue while using float precision to get the scale.

test=develop

* Fix the format issue when 'X' is not nchw.

test=develop

* removed output comparing and changed number of images

test=develop

* cmake and comment fix

test=develop

* updated acc threshold for QAT comparison tests

test=develop

* added OMP_NUM_THREADS setting

test=develop

* enable all QAT INT8 tests

test=develop

* restored upstream version of a file

test=develop

* modified directory names

test=develop

78e93286

T
concat op support negative axis (#18045) · 566bf2ec
由 tensor-tang 提交于 6月 13, 2019
```
test=develop
```
566bf2ec

12 6月, 2019 12 次提交

Y
Optimize the concat and split cuda implementation for cases when the number of... · 7e463c84
由 Yiqun Liu 提交于 6月 12, 2019
```
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)

test=develop
```
7e463c84
T
fix save/load in fleet (#17675) · 101f74cb
由 tangwei12 提交于 6月 12, 2019
```
* fix save/load in Fleet
* add UT framework of Fleet
```
101f74cb
H

add trainer_desc proto DEPS (#18019) · f1d458da
由 hutuxian 提交于 6月 12, 2019

f1d458da

Fix GetExpectedKernelType of add_position_encoding_op (#17935) · a06b316b

由 Guo Sheng 提交于 6月 12, 2019

* Fix the GetExpectedKernelType of add_position_encoding_op.
test=develop

* Fix the doc of lstm_unit outputs in nn.py.
test=develop

a06b316b

combine noavx and avx package (#17889) · 5c06bff2

由 tensor-tang 提交于 6月 12, 2019

* support avx and noavx core

* add catch and give some log

test=develop

* fix build

test=develop

* add missing package

test=develop

* fix pybind name

test=develop

* fix import error

test=develop

* conbime noavx core

test=develop

* add requirements

test=develop

* fix unkown message

test=develop

* fix api spec

test=develop

* refine and clean

test=develop

* update

* pass dist ut

* follow comments

test=develop

* refine scripts

test=develop

5c06bff2

Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3

由 wawltor 提交于 6月 12, 2019

* test=develop
The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.

* test=develop
Fix some code format problem, and the same time add the test case in gather and scatter op

8eb134c3

update load_error_info, test=develop (#18000) · 75fcd292

由 lujun 提交于 6月 12, 2019

Repair error prompt: Users are prompted to check whether the model or parameter files are damaged when loading parameters are wrong.

75fcd292

石
modify the access level of anakin engine (#18015) · 04ea7cb0
由石晓伟提交于 6月 12, 2019
```
test=develop
```
04ea7cb0

test=develop (#17984) · 2ae8decc

由 wawltor 提交于 6月 12, 2019

Fix bug in sequence_unpad op, when allocate the output memory do not match actual memory, check memory failed. Fix this bug by allocating the output memeory in correct code position.

2ae8decc

Fix edit distance doc (#17947) · 9d6640ff

由 ruri 提交于 6月 12, 2019

* fix im2sequence padding bug, test=develop

* fix edit_distance, test=develop

* add API.spec,test=develop

9d6640ff

Z
Add shape not match doc to data layer (#17936) · a1bdf25e
由 Zeng Jinle 提交于 6月 12, 2019
```
* add shape not match doc to data layer, test=develop

* fix API.spec md5
test=develop
```
a1bdf25e

add deformable psroi pooling (#17827) · 871af28d

由 cjt222 提交于 6月 12, 2019

* add deformable psroi pooling

* test=develop

* test=develop

* test=develop
modify format

* fix bug

* test=develop run ci

* test=develop
add API.spec

* add test_layers.py

* run ci again

* test=develop
run ci again

* run ci again

* test=develop
run ci again

* test=develop
run ci again

* test=develop
run ci again

* add space between two lines

* test=develop
add space between two lines

* test=develop
add space between lines

* test=develop
modify comment in nn.py

* test=develop
add space between two lines

* test=develop
add space between two lines

* update API.spec

* run ci again

* test=develop
run ci again

* rerun ci

* test=develop
rerun ci

* change input shape

* run ci

* test=develop
run ci

* modify format of nn.py

* test=develop

* test=develop

* test=develop
update API.spec

* test=develop
fix API doc

* modify API comment

* modift API comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
modift comment

* test=develop
modift comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
add inference in nn.py

* test=develop
update API.spec

* test=develop
resolve confict

* test=develop
update API.spec

871af28d

11 6月, 2019 7 次提交

add unfold op (new op),test=develop (#17944) · 40885c22

由 SunGaofeng 提交于 6月 11, 2019

* add unfold op
test=develop

* fix divide bug in python3 when calculating output width and height
test=develop

* add name=None in python api, move redundant code into inline function

* try to trigger ci for this code
test=develop

40885c22

[MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965) · 84bb45c0

由 Jacek Czaja 提交于 6月 11, 2019

* - removed is_reusing_

* - Added TID to keys for reusing apart from softmax PD

* - compilation fix

* - Yet another compilation fix

* - Batch Norm and Conv adapted

* - Fix to softmax MT

* - Fixes to MT code of MKL-DNN

* - Lint fixes

test=develop

84bb45c0

G

Polish codes of old prs. (#17938) · da9143c1
由 gongweibao 提交于 6月 11, 2019

da9143c1

石

Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5

由石晓伟提交于 6月 11, 2019

* update anakin-engine interfaces for content-dnn

test=develop

* support only-gpu mode of Anakin

modify eltwise parse

test=develop

* modification for thread-safe

test=develop

* Integrated template instance

test=develop

* increase template parameters

test=develop

* support MLU predictor

test=develop

* update anakin cmake files

test=develop

* update TargetWrapper::set_device

* update the initialization of anakin subgraph

test=develop

* use the default constructor of base class

test=develop

bce259e5

T

added monitoring of python/requirements.txt file (#17957) · 410907f6
由 tianshuo78520a 提交于 6月 11, 2019

410907f6

Pipeline Concurrency (#17402) · 969e6378

由 hutuxian 提交于 6月 11, 2019

Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of pipeline trainer and the test model is gnn
- Do not support win32 now

969e6378

Light mem reuse strategy for inference. (#17925) · 4e8d5a03

由 Zhaolong Xing 提交于 6月 11, 2019

* fix: when use the load model from memory mode, the RAM occupy is high

test=develop

* ligth mem reuse
test=develop

* fix cpplint
test=develop

4e8d5a03

10 6月, 2019 6 次提交
- T
  fix merge conflict of 'Remove attribute in Allocator::Allocate' and... · 53fd507b
  由 Tao Luo 提交于 6月 10, 2019
```
fix merge conflict of 'Remove attribute in Allocator::Allocate' and elementwise_add_mkldnn_op (#17949)

test=develop
```
  53fd507b
- Z
  refine sum stack api doc (#17923) · 3847d9fc
  由 zhaoyuchen2018 提交于 6月 10, 2019
```
test=develop
```
  3847d9fc
- J
  
  refine GetExpectedKernelType in conat op, test=develop (#17934) · aab4d12c
  由 jerrywgz 提交于 6月 10, 2019
  
  aab4d12c
- Z
  Remove attribute in Allocator::Allocate (#17878) · 3ece61f7
  由 Zeng Jinle 提交于 6月 10, 2019
```
* remove attribute in Allocator::Allocate, test=develop

* fix travis ci error, test=develop
```
  3ece61f7
- Y
  Enable seq_pool op to accept len 0 input (#17284) · 33d1e565
  由 Yibing Liu 提交于 6月 10, 2019
```
* Enable seq_pool op to accept len 0 input

test=develop

* Update sequence_pool's api

test=develop

* Add more unittest cases for seq_pool op

test=develop

* Remove legacy comments

test=develop

* Don't use template in op maker

test=develop
```
  33d1e565
- Y
  Fix the format issue when 'X' is not nchw. (#17833) · 9b501736
  由 Yihua Xu 提交于 6月 10, 2019
```
test=develop
```
  9b501736

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致