Commits · 7e31d5a2967029d0d2a9d4405b792276a9245015 · BaiXuePrincess / Paddle

13 Jun, 2019 1 commit

cherry pick from combine noavx and avx package (#17889) (#18036) · 7e31d5a2

Authored by tensor-tang on Jun 13, 2019

test=release/1.5

* support avx and noavx core
* add catch and give some log
* fix build
* add missing package
* fix pybind name
* fix import error
* combine noavx core
* add requirements
* fix unknown message
* fix api spec
* refine and clean
* update
* follow comments
* refine scripts

7e31d5a2
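
The bullets above describe shipping an AVX and a no-AVX core side by side and catching load failures with a log message. A minimal sketch of that fallback pattern follows. It assumes hypothetical module names (`core_avx`, `core_noavx`) and a hypothetical helper `load_first_available`; the commit message does not spell out the actual names or the detection logic.

```python
import importlib
import logging

logger = logging.getLogger(__name__)


def load_first_available(candidates):
    """Import and return the first module in `candidates` that loads.

    Each failure is logged rather than silently swallowed, so a user can
    see why a preferred build was skipped.
    """
    errors = []
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError as exc:
            logger.warning("could not import %s: %s", name, exc)
            errors.append(f"{name}: {exc}")
    raise ImportError("no usable core module; tried " + "; ".join(errors))


# Hypothetical names: prefer the AVX build, fall back to the no-AVX build.
# core = load_first_available(["core_avx", "core_noavx"])
```

Trying the candidates in order keeps the preference explicit, and the final `ImportError` reports every attempt instead of only the last one.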

03 Jun, 2019 2 commits
- add support for cuda9 on windows test=develop (#17594) · 3d0e1204
  Authored by wopeizl on Jun 03, 2019
```
* add support for cuda9 on windows test=develop
* use different git address for cuda9 compatible on windows 
```
  3d0e1204
- use the bj as default address instead of cdn test=develop (#17795) · 82b834cb
  Authored by wopeizl on Jun 03, 2019
```
cdn.bcebos.com can be randomly unstable for unknown reasons, so restore the default address to bj.bcebos.com.
```
  82b834cb
31 May, 2019 2 commits
- fix the dll not found issue on windows (#17750) · f893914f
  Authored by wopeizl on May 31, 2019
```
* fix the dll not found issue on windows
```
  f893914f
- [NGraph] Added lookup table to ngraph engine test=develop (#17647) · 2c58f1a8
  Authored by baojun on May 30, 2019
  
  2c58f1a8
30 May, 2019 1 commit
- Add deformable conv v2 op,test=develop (#17145) · bba57cdd
  Authored by Bai Yifan on May 30, 2019
```
* unit commits, test=develop

* update API.spec, test=develop
```
  bba57cdd
29 May, 2019 1 commit

Optimize the concat and split kernel for special cases when the number of... · 5782ddda

Authored by Yiqun Liu on May 29, 2019

Optimize the concat and split kernel for special cases when the number of inputs/outputs is 2 (#17415)

* Optimize the concat and split kernel for special cases that the number of inputs/outputs is 2.
test=develop

* Refine codes.
test=develop

* Correct the condition.
test=develop

* Move the definition of tmp_data outside the if statement.

* Print the cudnn minor version.
test=develop

* Fix the case when in_num/o_num is 1 in concat/split op.
test=develop

* Remove const_cast.
test=develop

5782ddda

24 May, 2019 2 commits

[MKL-DNN] Add Fully Connected Op for inference only (#15226) · 0c39b97b

Authored by Michał Gallus on May 24, 2019

* fuse mul and elementwise add to fc

* Reimplement the FC forward operator

* Fix FC MKLDNN integration by transposing weights

* Add FC MKLDNN Pass

test=develop

* FC MKLDNN Pass: change memcpy to std::copy

* Fix MKLDNN FC handling of mismatched input and weights dims

* Lower tolerance for MKL-DNN in resnet50 test

test=develop

* Adjust FC to support MKLDNN Op placement

test=develop

* Adjust Placement Op to set use_mkldnn attribute for graph

test=develop

* MKLDNN FC: fix weights format so that gemm version is called

test=develop

* FC MKLDNN: Remove tolerance decrease from tester_helper

* FC MKL-DNN: Refactor the code, change input reorder to weight reorder

* MKL-DNN FC: Introduce operator caching

test=develop

* FC MKL-DNN: Fix the tensor type in ExpectedKernelType

test=develop

* FC MKL-DNN: fix style changes

test=develop

* FC MKL-DNN: fallback to native on non-supported dim sizes

test=develop

* FC MKLDNN: fix CMake paths

test=develop

* FC MKLDNN: Refine placement pass graph mkldnn attribute

test=develop

* Fix Transpiler error for fuse_conv_eltwise

test=develop

* Fix missing STL includes in files

test=develop

* FC MKL-DNN: Enable new output size computation

Also, refine pass to comply with newest interface.
test=develop

* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled

* FC MKL-DNN: Allow Weights to use oi or io format

* FC MKL-DNN: Adjust UT to work with correct dims

test=develop

* Enable MKL DEBUG for resnet50 analyzer

test=develop

* FC MKL-DNN: Improve Hashing function

test=develop

* FC MKL-DNN: Fix shape for fc weights in transpiler

* FC MKL-DNN: Update input pointer in re-used fc primitive

* Add log for not handling fc fuse for unsupported dims

test=develop

* FC MKL-DNN: Move transpose from pass to Op Kernel

test=develop

* FC MKL-DNN: Disable transpose in unit test

test=develop

* FC MKL-DNN: Remove fc_mkldnn_pass from default list

* Correct Flag for fake data analyzer tests

test=develop

* FC MKL-DNN: Add comment about fc mkldnn pass disablement

test=develop

* FC MKL-DNN: Disable fc in int8 tests

test=develop

0c39b97b
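
The first bullet of this commit fuses a `mul` followed by an elementwise add into a single fully connected op. A minimal sketch of why the two forms are equivalent, in plain Python on nested lists. The function names are illustrative only and are not the Paddle or MKL-DNN API.

```python
def mul(x, w):
    """Matrix product of x (n x k) and w (k x m), as nested lists."""
    return [[sum(xi * wj for xi, wj in zip(row, col)) for col in zip(*w)]
            for row in x]


def elementwise_add(y, b):
    """Add bias vector b (length m) to every row of y (n x m)."""
    return [[v + bj for v, bj in zip(row, b)] for row in y]


def fc(x, w, b):
    """Fused form: each output element is computed as dot(row, col) + bias."""
    return [[sum(xi * wj for xi, wj in zip(row, col)) + bj
             for col, bj in zip(zip(*w), b)]
            for row in x]


x = [[1.0, 2.0], [3.0, 4.0]]
w = [[1.0, 0.0, 2.0], [0.0, 1.0, 3.0]]
b = [0.5, -0.5, 1.0]

# The fused op gives the same result as the two separate ops.
assert fc(x, w, b) == elementwise_add(mul(x, w), b)
```

The fused version avoids materializing the intermediate `mul` output, which is the usual motivation for this kind of graph pass.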

update ngraph to v0.19 test=develop (#17582) · 6101fd57

Authored by mozga-intel on May 23, 2019

6101fd57

21 May, 2019 1 commit
- remove unused SERIAL compiler option (#17500) · 3d19f44a
  Authored by Tao Luo on May 21, 2019
```
test=develop
```
  3d19f44a
20 May, 2019 1 commit
- fix the random compilation failure on windows test=develop (#17475) · ca3ba378
  Authored by wopeizl on May 20, 2019
```
* fix the random compilation failure on windows 
```
  ca3ba378
15 May, 2019 1 commit

add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206

Authored by jiaqi on May 15, 2019

* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop

* fix global shuffle bug, fix pull dense bug, fix release memory bug, fix shrink error
add client flush, add get data size
test=develop

* fix global shuffle bug
test=develop

* fix global shuffle bug
test=develop

* fix code style
test=develop

* fix code style & modify pslib cmake
test=develop

* fix error of _role_maker
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix windows compile error of fleet
test=develop

* fix global shuffle bug

* add comment
test=develop

* update pslib.cmake
test=develop

* fix fill sparse bug
test=develop

* fix push sparse bug
test=develop

66d51206

13 May, 2019 1 commit
- Revert "rename the default version from '0.0.0' to 'latest' (#17304)" (#17356) · c843e64c
  Authored by Jiabin Yang on May 13, 2019
```
This reverts commit f456c8be.
```
  c843e64c
10 May, 2019 1 commit
- rename the default version from '0.0.0' to 'latest' (#17304) · f456c8be
  Authored by wopeizl on May 10, 2019
```
* rename the default version from '0.0.0' to 'latest'
```
  f456c8be
07 May, 2019 2 commits

remove unused FLAGS_warpctc_dir (#17162) · ff1661f1
Authored by Tao Luo on May 07, 2019
```
* remove unused FLAGS_warpctc_dir

test=develop

* remove FLAGS_warpctc_dir

test=develop
```
ff1661f1

Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a

Authored by 石晓伟 on May 07, 2019

* cherry-pick commit from 88770542

* cherry-pick commit from 3f0b97df

* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn

(cherry picked from commit 8643dbc2)

* Cherry-Pick from 16662 : Anakin subgraph cpu support

(cherry picked from commit 7ad182e1)

* Cherry-pick from 1662, 16797.. : add anakin int8 support

(cherry picked from commit e14ab180)

* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4

(cherry picked from commit 4b9fa423)

* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2

Support ShuffleNet and MobileNet-v2, test=release/1.4

(cherry picked from commit a6fb066f)

* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4

(cherry picked from commit 8121b3ec)

* 1. add shuffle_channel_detect

(cherry picked from commit 6efdea89)

* update shuffle_channel op convert, test=release/1.4

(cherry picked from commit e4726a06)

* Modify symbol export rules

test=develop

a72dbe9a

20 Apr, 2019 1 commit
- update ngraph to v0.18 test=develop · 855bb4d4
  Authored by baojun-nervana on Apr 19, 2019
  
  855bb4d4
18 Apr, 2019 1 commit
- Polish DGC code (#16818) · cbdb8a17
  Authored by gongweibao on Apr 18, 2019
  
  cbdb8a17
11 Apr, 2019 1 commit
- disable the share lib for protobuf test=develop (#16778) · b6150e1f
  Authored by wopeizl on Apr 11, 2019
  
  b6150e1f
03 Apr, 2019 1 commit
- Revert "Model data cryption link all lib (#16555)" · 0b2aec14
  Authored by Chen Weihang on Apr 03, 2019
```
test=develop
This reverts commit c38c7c56.
```
  0b2aec14
02 Apr, 2019 1 commit

Model data cryption link all lib (#16555) · c38c7c56

Authored by Chen Weihang on Apr 02, 2019

* link the libwbaes.so into paddle

* polish detail, test=develop

* try fix mac_pr_ci error, test=develop

* add compile option, test=develop

* fix ci error, test=develop

* ignore failed to find mac lib, test=develop

* change cdn to bj, cdn can't get the latest version

* trigger ci, test=develop

* temporary delete win32 lib linking, test=develop

* change https to http, test=develop

* turn compile option on to off

* turn compile option off to on, test=develop

* try lib compiled by gcc4.8, test=develop

* update lib version, test=develop

* link other lib, test=develop

* add setup config

* delete false, test=develop

* delete no_soname, test=develop

* recover so name set

* fix, test=develop

* adjust make config, test=develop

* remove link to wbaes, test=develop

* remove useless define, test=develop

c38c7c56

30 Mar, 2019 1 commit
- Fix windows compilation error! (#16546) · fea91164
  Authored by gongweibao on Mar 30, 2019
```
* fix compiled
test=develop

* follow comments test=develop
```
  fea91164
29 Mar, 2019 1 commit
- resolve conflicts with the develop branch test=develop · bddb2cd3
  Authored by Shixiaowei02 on Mar 28, 2019
  
  bddb2cd3
28 Mar, 2019 2 commits
- Add DGC(Deep Gradient Compression) interface. (#15841) · eb83abea
  Authored by gongweibao on Mar 28, 2019
  
  eb83abea
- fix compile issue test=develop (#16447) · b1d26051
  Authored by baojun on Mar 27, 2019
  
  b1d26051
25 Mar, 2019 1 commit
- fix cdn issue, test=develop (#16423) · de3b70a1
  Authored by liuwei1031 on Mar 25, 2019
```
* fix cdn issue, test=develop

* fix cdn issue, test=develop
```
  de3b70a1
22 Mar, 2019 1 commit
- 1. Add ANAKIN_ROOT compile option · f3a2e4b3
  Authored by nhzlx on Mar 22, 2019
```
2. refine trt code
test=develop
```
  f3a2e4b3
15 Mar, 2019 1 commit

Support sync batch norm. (#16121) · 8ad672a2

Authored by qingqing01 on Mar 15, 2019

* Support Sync Batch Norm.
* Note: do not enable it when running on a single device.

Usage:

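import paddle.fluid as fluid

# tp and loss_mean are assumed to be the training Program and the loss variable built earlier.
build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True  # synchronize batch norm statistics across devices
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)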

8ad672a2
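
Synchronized batch norm normalizes with statistics taken over the combined batch from all devices instead of each device's local batch. A minimal sketch of how per-device partial sums can be combined into a global mean and variance follows. It is an illustration only, not Paddle's implementation, and the helper name `global_mean_var` is hypothetical.

```python
def global_mean_var(per_device_batches):
    """Combine per-device (count, sum, sum of squares) into global stats."""
    count = total = total_sq = 0.0
    for batch in per_device_batches:
        # Each device only needs to contribute three numbers per channel.
        count += len(batch)
        total += sum(batch)
        total_sq += sum(v * v for v in batch)
    mean = total / count
    var = total_sq / count - mean * mean  # biased variance, E[x^2] - E[x]^2
    return mean, var


device0 = [1.0, 2.0]
device1 = [3.0, 6.0]
mean, var = global_mean_var([device0, device1])
```

With a single device the combined statistics are simply the local ones, so in this sketch synchronization changes nothing in that case.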

09 Mar, 2019 1 commit

Upgrade MKLDNN to v0.18-rc and fix issue caused by lib/lib64 (#15861) · db120b93

Authored by Brian Liu on Mar 09, 2019

* Upgrade MKLDNN to v0.18-rc and fix issue caused by lib/lib64

Upgrade MKLDNN to v0.18-rc
Also fix the issue during upgrade

test=develop

* Rebase MKLDNN to rls-v0.18 branch

Some issues in v0.18-rc that caused INT8 conv op unit test failures were fixed
in the rls-v0.18 branch

test=develop

* Upgrade MKLDNN from v0.18rc to formal v0.18 tag

test=develop

* Fix the windows compile issue.

test=develop

db120b93

04 Mar, 2019 4 commits
- fix lib64 test=develop · 4cfc5b49
  Authored by baojun-nervana on Feb 27, 2019
  
  4cfc5b49
- polish cudnn related code and fix bug. (#15164) · 4449e855
  Authored by dzhwinter on Feb 27, 2019
```
* staged.

* polish code

* polish code. test=develop

* polish code. test=develop

* api change. test=develop

* fix default value. test=develop

* fix default value. test=develop
```
  4449e855
- Optimize gelu operation with mkl erf. · b48d56e8
  Authored by Yihua Xu on Feb 26, 2019
```
test=develop
```
  b48d56e8
- Update ngraph version to v0.14 test=develop · dea34134
  Authored by baojun-nervana on Feb 25, 2019
  
  dea34134
28 Feb, 2019 1 commit
- fix lib64 test=develop · b51e4dc0
  Authored by baojun-nervana on Feb 27, 2019
  
  b51e4dc0
27 Feb, 2019 1 commit

Authored by dzhwinter on Feb 27, 2019

* staged.

* polish code

* polish code. test=develop

* polish code. test=develop

* api change. test=develop

* fix default value. test=develop

* fix default value. test=develop

225c11a9

26 Feb, 2019 2 commits
- Optimize gelu operation with mkl erf. · 73967886
  Authored by Yihua Xu on Feb 26, 2019
```
test=develop
```
  73967886
- Update ngraph version to v0.14 test=develop · 2ffacdeb
  Authored by baojun-nervana on Feb 25, 2019
  
  2ffacdeb
25 Feb, 2019 1 commit
- Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
  Authored by liangan1 on Feb 25, 2019
```
test=develop
```
  4acc5220
22 Feb, 2019 2 commits

Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
Authored by tensor-tang on Feb 22, 2019
```
* Revert "Optimize Gelu with MKL Erf function (#15770)"

This reverts commit 676995c8.

* test=develop
```
ee2321de

Optimize Gelu with MKL Erf function (#15770) · 676995c8

Authored by Yihua Xu on Feb 22, 2019

* Optimize for gelu operator

* Set up the low accuracy mode of MKL ERF function.

test=develop

* Only enable MKLML ERF when OS is linux

* Use the special mklml version that includes the vmsErf function to verify the gelu mkl kernel.

test=develop

* Add the CUDA macro to avoid NVCC's compile issue.

test=develop

* Add the TODO comments for mklml library modification.

test=develop

* Clean Code

test=develop

* Add a comment about the macro for the NVCC compiler.

test=develop

676995c8
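
For context, GELU can be written exactly in terms of the error function, which is why a fast vectorized erf speeds up the operator. A minimal sketch using Python's `math.erf` as a stand-in for the MKL `vmsErf` routine named in the commit; the function names `gelu` and `gelu_tanh` are illustrative, and the tanh form is the common approximation shown only for comparison.

```python
import math


def gelu(x):
    """Exact GELU: x * Phi(x), with Phi written via the error function."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x):
    """Widely used tanh approximation, shown for comparison."""
    return 0.5 * x * (1.0 + math.tanh(
        math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)))
```

The two forms agree closely for typical activations, so the erf-based version mainly trades the approximation for an exact formula backed by an optimized library call.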

BaiXuePrincess / Paddle is in sync with the fork source project
