Commits · b8333edef6e1e7eb1a0c22121d375c9660d91e61 · BaiXuePrincess / Paddle

13 Oct, 2019 1 commit

Add Multihead matmul fuse pass (#20167) · b8333ede

Committed by zhaoyuchen2018 on Oct 13, 2019

* Add multihead fuse pass for ernie opt

* Refine softmax

test=develop

* Refine cuda kernel

* Refine cuda version

* Refine cmake

test=develop

* refine header file

* refine test case and pass
* refine comments

b8333ede

12 Oct, 2019 1 commit

Add ConvTranspose + BatchNorm fuse pass (#20161) · 7faa3e95

Committed by Adam on Oct 12, 2019

* Add ConvTranspose + BatchNorm fuse pass
test=develop

* Add tests for conv+bn and conv_transpose+bn passes
test=develop

7faa3e95

10 Oct, 2019 3 commits

fix parse content in CreatePreLoadReaders (#20258) · 22b80e12

Committed by xujiaqi01 on Oct 10, 2019

Fix parse content in CreatePreLoadReaders. Before this fix, parse content did not work if you used dataset.set_parse_content together with dataset.preload.

22b80e12

New save load interface (#20148) · fa43e80e

Committed by hong on Oct 10, 2019

* add new save load interface; test=develop

* add new save interface; test=develop

* add save load interface ;

* fix save load error;

* fix dygraph set dict bug;

* add save load unit test; test=develop

* fix test_imperative_optimizer bug; test=develop

* fix unittest optimizer bug; test=develop

* fix code coverage; test=develop

* fix coverage; test=develop

* add document for apis; test=develop

* fix unittest error; test=develop

* fix save load unit test error; test=develop

* fix error message; test=develop

* change set_parameter set_optimizer to save_dygraph; test=develop

* add load_graph check; test=develop

* fix api spec; test=develop

fa43e80e

simplify op_info.h, test=develop (#20195) · c20b11ba

Committed by Zeng Jinle on Oct 10, 2019

c20b11ba

08 Oct, 2019 1 commit

update op compatible list; test=develop (#20175) · 0ec2c081

Committed by hong on Oct 08, 2019

0ec2c081

07 Oct, 2019 1 commit

trainer from dataset fetch targets (#19760) · c9139c3d

Committed by tangwei12 on Oct 07, 2019

add executor.FetchHandler for train/infer from the dataset

c9139c3d

30 Sep, 2019 2 commits

Add place deps for fused_all_reduce_op_handle (#20077) · bfa55c9d

Committed by chengduo on Sep 30, 2019

test=develop

bfa55c9d

remove map type from var_type_traits.h, test=develop (#20090) · 5fef859c

Committed by Zeng Jinle on Sep 30, 2019

5fef859c

29 Sep, 2019 1 commit

fix op_compatiable_compile_error, test=develop (#20076) · 4ad66c77

Committed by Zeng Jinle on Sep 29, 2019

4ad66c77

28 Sep, 2019 2 commits

Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02

Committed by qingqing01 on Sep 28, 2019

* Custom ops must be written according to the framework OP spec.
* Package fluid_framework.so and headers into whl.
* Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
* Export some C-APIs to merge OpInfo between core.so and custom_op.so.
* Add unit testing.
* Update API.spec.

1a3eef02

Follow comment of Merged QAT PR 18970 (#19979) · 9de67725

Committed by bingyanghuang on Sep 28, 2019

* Follow Wangzhen's comment in PR 18970, test=develop

* Review comments, test=develop

* Leave fake quantization around mul

test=develop

* Replace Fake with Real Quantized Mul

test=develop

* Fix bug in quantize placement pass

Nodes in the graph are now checked by type instead of by node name when they are to be marked for quantization. test=develop

9de67725

27 Sep, 2019 5 commits

update operator compatible info, test=develop (#19978) · 01b9d079

Committed by 石晓伟 on Sep 27, 2019

* update operator compatible info, test=develop

* revert cmake/version.cmake, test=develop

* add unit_tests and fix bugs, test=develop

* update ../paddle/fluid/framework/framework.proto, test=develop

* fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop

* update paddle/fluid/framework/version_test.cc, test=develop

* add comments and rename interfaces, test=develop

01b9d079

Disable conv requant squash (#20041) · f5221ac1

Committed by joanna.wozna.intel on Sep 27, 2019

* Fix conv2d+dequantize squash for residual fusion

test=develop

* Disable conv-requant squash

test=develop

f5221ac1

codegen code for reconstruction (#19728) · c9ea317b

Committed by wangchaochaohu on Sep 27, 2019

* codegen code for reconstruction test=develop

* fix the cmake test=develop

* fix review advice test=develop

c9ea317b

the integrated communicator (#19849) · 8f0b3c05

Committed by tangwei12 on Sep 27, 2019

* add a base class for the Communicator
* add AsyncCommunicator Impl for async distributed training

8f0b3c05

Paddle error message stack shaping and optimization (#19895) · b9163350

Committed by Chen Weihang on Sep 27, 2019

* shape and optimize paddle error message stack, test=develop

* limit exception type & add unittest, test=develop

* fix multi-platform problem, test=develop

* fix related unittest failure, test=develop

* add doc & fix unittest errors, test=develop

* fix function name error, test=develop

* update tensor test exception msg compare, test=develop

* remove unittest on win32, the dir format is different, test=develop

* remove useless package, test=develop

* add paddle enforce handler unittest, test=develop

* add exception checkout, test=develop

* fix coverage failed, test=develop

* fix op registry test failed, test=develop

* refactor whole pr, test=develop

* remove test in CMakelist, test=develop

* fix coverage, test=develop

b9163350

26 Sep, 2019 3 commits

disable fuse_all_optimizer_ops (#19966) · 2450d15b

Committed by chengduo on Sep 26, 2019

test=develop

2450d15b

Add dtype for coalesce_tensor_op (#20016) · 101a2b61

Committed by chengduo on Sep 26, 2019

Add dtype for coalesce_tensor_op

101a2b61

Add new data layer (#19916) · 88af4ab6

Committed by Huihuang Zheng on Sep 26, 2019

The new "fluid.data" differs from the old "fluid.layers.data" as follows:

1. Add shape and dtype check.
2. Remove the "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
3. Remove the "stop gradient" parameter because the data layer doesn't do back-propagation.

TODO:
Data fed to the data layer by the executor is now checked. Should we also check the feed data of readers in the future?

88af4ab6
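
The shape and dtype check mentioned in the "Add new data layer" commit can be pictured with a small standalone sketch. This is not Paddle's implementation: the function name `check_feed`, its parameters, and the convention that `None` or `-1` means "any size" are assumptions made for illustration only.

```python
def check_feed(declared_shape, declared_dtype, fed_shape, fed_dtype):
    """Raise ValueError if fed data does not match the declared data layer.

    Hypothetical helper: a declared dimension of None or -1 is treated
    as "any size" (an assumption of this sketch, not a documented rule).
    """
    if fed_dtype != declared_dtype:
        raise ValueError(
            "dtype mismatch: declared %s, fed %s" % (declared_dtype, fed_dtype))
    if len(fed_shape) != len(declared_shape):
        raise ValueError(
            "rank mismatch: declared %d dims, fed %d dims"
            % (len(declared_shape), len(fed_shape)))
    for axis, (want, got) in enumerate(zip(declared_shape, fed_shape)):
        if want is None or want == -1:
            continue  # wildcard dimension, e.g. the batch size
        if want != got:
            raise ValueError(
                "shape mismatch on axis %d: declared %d, fed %d"
                % (axis, want, got))


# A batch of 8 images matches a layer declared with a free batch dimension.
check_feed([None, 3, 224, 224], "float32", (8, 3, 224, 224), "float32")
```

Because there is no implicit "append_batch_size" step in this sketch, the declared shape has to spell out the batch dimension itself, which is the behavior the commit message describes for the new layer.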

25 Sep, 2019 1 commit

fix memory leak in HogwildWorker (#19956) · f50e701b

Committed by xujiaqi01 on Sep 25, 2019

fix memory leak in HogwildWorker, whose ops are explicitly deleted in destructor

f50e701b

24 Sep, 2019 3 commits

support change shuffle and train thread num (#19841) · cedc0477

Committed by xujiaqi01 on Sep 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477

add inplace to assign op, test=develop (#19927) · cc157d59

Committed by Zeng Jinle on Sep 24, 2019

cc157d59

clean tensor array (#19930) · 55ce6969

Committed by chengduo on Sep 24, 2019

test=develop

55ce6969

23 Sep, 2019 3 commits

Delete local execution scopes (#19749) · d7251a8e

Committed by chengduo on Sep 23, 2019

* Add RecordHistoryLocalExecScopes
test=develop

d7251a8e

remove the useless warning for user to avoid confuse test=develop (#19871) · 5452b6a1

Committed by wopeizl on Sep 23, 2019

* remove the useless warning for user to avoid confuse test=develop

5452b6a1

Add op compatible information (#19910) · 85b398f1

Committed by hong on Sep 23, 2019

* add op compatible information; test=develop

* add enum type

* add enum type; test=develop

85b398f1

20 Sep, 2019 2 commits

Set states of recurrent op as dependent vars in prune (#19865) · e1171142

Committed by Huihuang Zheng on Sep 20, 2019

* Set states of recurrent op as dependent vars in prune of save inference model

This PR fixes the save/load inference model problem of RNN models.

The cause of the bug is that save_inference_model prunes OPs that don't contribute to Output. But in recurrent_op, States are not Output, so OPs that refer to States get pruned.

This fix adds States of recurrent_op as dependent vars so that OPs referring to States won't be pruned.

e1171142
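
The pruning behavior described in the recurrent-op commit can be illustrated with a toy backward walk over an op list. This is a simplified sketch, not Paddle's prune code: `prune_ops`, the `(name, inputs, outputs)` tuple layout, and the op and variable names are all hypothetical. The sketch also assumes that the state variable is not listed among the recurrent op's ordinary inputs, which is what makes it invisible to the plain walk.

```python
def prune_ops(ops, targets, dependent_vars=()):
    """Return the names of ops needed to compute `targets`.

    `ops` is a list of (name, inputs, outputs) tuples in execution order.
    Variables in `dependent_vars` are treated as needed even though no
    kept op lists them as an input (hypothetical sketch).
    """
    needed = set(targets) | set(dependent_vars)
    kept = []
    # Walk backwards: an op is kept if it produces something still needed,
    # and then its own inputs become needed as well.
    for name, inputs, outputs in reversed(ops):
        if needed.intersection(outputs):
            kept.append(name)
            needed.update(inputs)
    kept.reverse()
    return kept


ops = [
    ("fc", ["x"], ["h"]),
    ("init_state", ["x"], ["s0"]),  # produces the recurrent state
    ("recurrent", ["h"], ["y"]),    # uses s0 via a separate state slot
]

without_fix = prune_ops(ops, targets=["y"])
with_fix = prune_ops(ops, targets=["y"], dependent_vars=["s0"])
print(without_fix)  # ['fc', 'recurrent']
print(with_fix)     # ['fc', 'init_state', 'recurrent']
```

Without the extra dependent variable, the op that touches the state is dropped because nothing on the path to the output names it; declaring the state as dependent keeps it.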

fix reduce and broadcast to avoid multi-stream, test=develop (#19889) · b754700f

Committed by Zeng Jinle on Sep 20, 2019

b754700f

19 Sep, 2019 4 commits

Fix conv2d+dequantize squash for residual fusion (#19545) · 3f1d0234

Committed by joanna.wozna.intel on Sep 19, 2019

* Fix conv2d+dequantize squash for residual fusion

test=develop

* Change condition

test=develop

3f1d0234

Fix deps of prune (#19876) · a35557d8

Committed by Huihuang Zheng on Sep 19, 2019

Add boost as dependency of prune

fix #19862

a35557d8

fix SplitLodTensor when batch_size = 0, test=develop (#19866) · 578a2f5d

Committed by Leo Chen on Sep 19, 2019

578a2f5d

Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6

Committed by Yiqun Liu on Sep 19, 2019

* Add fc_elementwise_layernorm_fuse pass and unittest.

* Add fused_fc_elementwise_layernorm op and its GPU kernel.
test=develop

* Apply fc_elementwise_layernorm_fuse_pass to GPU inference.

* Add the setting of attrs in the definition of binary_op.
test=develop

* Add comment.

* Implement the unittest.
test=develop

* Change the unittest name of layer_norm.
test=develop

3cd985a6

18 Sep, 2019 3 commits

refine executor_gc_helper codes, test=develop (#19814) · 3f87464e

Committed by Zeng Jinle on Sep 18, 2019

3f87464e

fix gc bug in controlflow ops, test=develop (#19827) · 3fd3b663

Committed by Zeng Jinle on Sep 18, 2019

3fd3b663

[Bug fix] Disable memory reuse on feeded variables (#19835) · db26de83

Committed by Zeng Jinle on Sep 18, 2019

* fix memory reuse bug on feeding variables, test=develop

* add comments to reference count members, test=develop

db26de83

17 Sep, 2019 4 commits

rm return in vfork (#19734) · 40c66f8d

Committed by Thunderbrook on Sep 17, 2019

* rm return in vfork

* rm return in vfork
test=develop

40c66f8d

support preload thread, optimize hdfs log, fix master+patch bug (#19695) · 6bf298bf

Committed by xujiaqi01 on Sep 17, 2019

* support preload thread
* sleep before fleet wrapper exit for pslib core dump
* optimize hdfs log
* fix master+patch bug

6bf298bf

Feature/add transform data dygraph (#19707) · cc311bdf

Committed by Jiabin Yang on Sep 17, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix compile error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* add transform_data to dygraph

* test=develop, refactor name to make it easier to understand

* test=develop, refactor name to make it easier to understand

* add test and change input to const ref for safety

* test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ

* add ut for data transform

* refine ut for data_transform

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

* add test_tracer on multiple devices

* test=develop, change place to mutable for data transform

* test=develop, add transform data on same place test and remove useless log

* test=develop, Add to do for data layout and ut for conv2d with no bias

cc311bdf

disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778) · 754fd57e

Committed by Zeng Jinle on Sep 17, 2019

754fd57e
