提交 · 2a792de7c263fa038ba3e29285e4d9d7e86ab6ca · BaiXuePrincess / Paddle

02 3月, 2020 1 次提交
- T
  
  Fix typos (#22799) · 7244b2a2
  由 tianshuo78520a 提交于 3月 02, 2020
  
  7244b2a2
28 2月, 2020 1 次提交
- T
  
  fix typo word (#22765) · e8f64889
  由 tianshuo78520a 提交于 2月 28, 2020
  
  e8f64889
24 2月, 2020 1 次提交
- T
  SYNC with communicaotor (#22344) (#22725) · 78716128
  由 tangwei12 提交于 2月 24, 2020
```
* add sync communicator and implement
```
  78716128
18 2月, 2020 2 次提交

1

support dumping params/grads in transpiler mode (#22490) (#22649) · 9e80551d
由 123malin 提交于 2月 18, 2020

9e80551d

multi-loss optimization by adding a DownpourOpt worker (#22025) (#22638) · 750c6f42

由 yaoxuefeng 提交于 2月 18, 2020

* update

* update test=develop

* update compile set test=develop

* update compile set test=develop

* update test=develop

* update test=develop

* update test=develop

* update compile setting test=develop

* update compile setting test=develop

* update run demo test=develop

* update test=develop

* update test=develop

* fix test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update format test=develop

* update format test=develop

* update style test=develop

* update style test=develop

* change style test=develop

* change style test=develop

* change style test=develop

* add dataset unittest test=develop

* update test=develop

* update for record test=develop

* udpate style for record test=develop

* update for record test=develop

* update for record test=develop

* update for record test=develop

* fix format test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

* update test=develop

750c6f42

17 2月, 2020 1 次提交

cherry-pick 22551. test=develop test=release/1.7 (#22609) · 6945a80b

由 Wilber 提交于 2月 17, 2020

[cherry-pick] #22551

当一个模型中有多个fc_lstm子图的时候，且其中fc共用了同一个persistable的bias，此时不应该将bias节点删除，只将非persistable的节点去除即可。

6945a80b

13 2月, 2020 1 次提交
- T
  fix bug with compiledProgram (#22495) (#22566) · a8f85f2c
  由 tangwei12 提交于 2月 13, 2020
```
* add thread barrier for the compiled program
```
  a8f85f2c
12 2月, 2020 1 次提交
- G
  Make assign op support LoDTensorArray and modify while_loop API (#22309) (#22525) · d8a2aa5d
  由 guofei 提交于 2月 12, 2020
```
This PR makes assign op support LoDTensorArray and enable the loop_vars in
while_loop to support tuple or list.
```
  d8a2aa5d
11 2月, 2020 1 次提交

cherry-pick 22509. test=develop test=release/1.7 (#22527) · 49a80b45

由 Wilber 提交于 2月 11, 2020

[cherry-pick] #22509

支持不依赖nccl进行编译。

多卡下，如果没有打开WITH_NCCL开关编译，多卡不能通信，则只能选择一张卡使用

49a80b45

07 2月, 2020 1 次提交
- J
  [Cherry-pick] Add dequant-scale squash (#22409) (#22473) · 6892deb1
  由 joanna.wozna.intel 提交于 2月 07, 2020
```
* Add dequant scale squash

* Correct dequant-scale squash test

test=release/1.7
```
  6892deb1
05 2月, 2020 3 次提交

W
cherry-pick 22384 and 22371. test=develop test=release/1.7 (#22453) · fb98116c
由 Wilber 提交于 2月 05, 2020
```
[cherry-pick] #22384 and #22371

22384增加了WITH_NCCL开关

22371修改了fluid依赖lite的commit id
```
fb98116c
X
add GeneralRoleMaker (#22295) (#22446) · 7171b20e
由 xujiaqi01 提交于 2月 05, 2020
```
* add GeneralRoleMaker which is for general usage
* test=develop
```
7171b20e

cherry pick 1.7 , fix copy table, add hdfs ls retry, fix save inference (#22447) · e87ddb28

由 xujiaqi01 提交于 2月 05, 2020

* fix copy table bug (#22432)

* fix copy table bug of lost some feasign
* test=develop

* add hdfs ls retry time and sleep time, fix save inference (#22433)

* add hdfs ls retry time and sleep time, fix save inference
* test=develop

e87ddb28

04 2月, 2020 3 次提交

X
add collective communication library in fleet (#22211) (#22435) · be528bf2
由 xujiaqi01 提交于 2月 04, 2020
```
* add collective communication library in fleet to replace mpi
* test=develop
```
be528bf2
石

remove anakin from code, test=release/1.7 (#22421) · 399bda2b
由石晓伟提交于 2月 04, 2020

399bda2b

[DNNL] Fix accuracy in INT8 FC (#22404) (#22410) · a13490a0

由 Michał Gallus 提交于 2月 04, 2020

test=release/1.7

* Enable quantize to reorder to nchw as well

* Correct FC MKL-DNN input dim requirements to accept 3D

* Improve DNNL FC format, error and 3D input handling

* Improve error checking in FC

* Improve PADDLE_ENFORCE messages in fc-related files

* Remove data layout attribute from obligatory pass args

* Fix message in fc_mkldnn_pass to be logically correct

a13490a0

21 1月, 2020 1 次提交
- L
  
  change std::cout to log(INFO), vlog (#22316) (#22337) · 90ce4aea
  由 lidanqing 提交于 1月 21, 2020
  
  90ce4aea
20 1月, 2020 1 次提交
- T
  integrated HALF_ASYNC to communicator (#21869) (#22343) · fa4e0e82
  由 tangwei12 提交于 1月 20, 2020
```
* add half_async in the communicator
* fix DistributedStrategy
```
  fa4e0e82
19 1月, 2020 1 次提交
- A
  
  Preserve shape in inplace operators (#22369) · ecc52688
  由 Adam 提交于 1月 19, 2020
  
  ecc52688
15 1月, 2020 3 次提交
- Z
  
  fix the type error caused by setting bool attr in OpDesc. release/1.7 (#22257) (#22264) · e1eb5650
  由 Zhen Wang 提交于 1月 15, 2020
  
  e1eb5650
- Z
  
  fix the bug of assert_is_op_output. test=release/1.7 (#22262) (#22293) · bb27ddac
  由 Zhen Wang 提交于 1月 15, 2020
  
  bb27ddac
- Z
  [cherry-pick] faster build by reduce by-product, reduce linking library and... · f785229f
  由 zhouwei25 提交于 1月 15, 2020
```
[cherry-pick] faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11(#22230)
```
  f785229f
14 1月, 2020 1 次提交
- Z
  
  support the fusion of batch_norm and relu for AMP. test=release/1.7 (#22210) · c63a63d5
  由 Zhen Wang 提交于 1月 14, 2020
  
  c63a63d5
09 1月, 2020 4 次提交

test Optimizer in dygraph (#21949) · d0f0a252

由 zhongpu 提交于 1月 09, 2020

* test Optimizer in dygraph, test=develop

* add optest for Optimizer in dygraph, test=develop

* fix adagrad optimizer, test=develop

* fix dpsgd optimizer, test=develop

* fix test_optimizer.py, test=develop

* fix dpsgd optimizer, this op only support cpu, test=develop

* add optest for optimizer, test=develop

* add description for dpsgd, test=develop

* add rmsprop to white_list in unused_var_check.cc, test=develop

* polish code style, test=develop

* polish code style, test=develop

* delete seed attribute for DpsgdOptimizer, test=develop

* change testing to debugging, test=develop

d0f0a252

J

Add multiple quantize operators fuse (#22062) · 5b2e98aa
由 joanna.wozna.intel 提交于 1月 09, 2020

5b2e98aa

Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144) · 96980c22

由 Yiqun Liu 提交于 1月 09, 2020

* Polish the PADDLE_ENFORCE in fusion_group pass related codes.
test=develop

* Correct the unittest because of the change relu_grad's formula.
test=develop

96980c22

W
add support for nested profiling event and printing in different level (#22061) · c3876cf8
由 wangchaochaohu 提交于 1月 09, 2020
```
* add support for nested profiling event and printing in different level
```
c3876cf8

07 1月, 2020 3 次提交
- L
  
  fix xception precision problem, test=develop (#22124) · 724b13e4
  由 liu zhengxi 提交于 1月 07, 2020
  
  724b13e4
- Y
  Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094) · b1401fb7
  由 Yiqun Liu 提交于 1月 07, 2020
```
test=develop
```
  b1401fb7
- B
  
  fix format in operator.cc (#22101) · 4b4a9cc8
  由 bingyanghuang 提交于 1月 07, 2020
  
  4b4a9cc8
06 1月, 2020 3 次提交
- S
  
  test=develop, remove unused parameter from class RuntimeInferShapeContext constructors (#22046) · 6c20e7c4
  由 silingtong123 提交于 1月 06, 2020
  
  6c20e7c4
- J
  
  [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088) · b0b27ff6
  由 Jacek Czaja 提交于 1月 06, 2020
  
  b0b27ff6
- H
  
  Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029) · dd436156
  由 Huihuang Zheng 提交于 1月 06, 2020
  
  dd436156
05 1月, 2020 1 次提交
- J
  
  [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747) · ad8a9cb8
  由 Jacek Czaja 提交于 1月 05, 2020
  
  ad8a9cb8
03 1月, 2020 2 次提交

Add the first implememtation of fusion_group op (#19621) · d4832077

由 Yiqun Liu 提交于 1月 03, 2020

* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
test=develop

* Call CUDA driver api to launch the kernel compiled by nvrtc.
test=develop

* Disable for mac and windows.
test=develop

* Refine the codes to support manually specified num_threads and workload_per_thread.
test=develop

* Refine the CUDA kernel to support large dims.
test=develop

* Add DeviceCodePool to manage all device codes.

* Add the first implementation fusion_group op.

* Add unit-test for fusion_group op.

* Add the check of result.

* Add the check of nvrtc in unit-test.
test=develop

* Add comment to explain the inputs, outputs and features of fusion_group op.
test=develop

* Disable fusion_group op for mac and windows.
test=develop

* Make the compiling of device code return status instead of hanging up.
test=develop

* Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.

* Unify fusion_group_op's input and output names.
test=develop

* Add the check of CUDA driver library in unittest.
test=develop

* Refine the calling of PADDLE_ENFORCE.
test=develop

d4832077

M

[DNNL] 3D Fully-Connected (#21746) · 61921084
由 Michał Gallus 提交于 1月 03, 2020

61921084

29 12月, 2019 1 次提交

Fix multi-threads memory out of bounds error for passes (#21920) · 196e20df

由 liu zhengxi 提交于 12月 29, 2019

* fix seqconv_eltadd_relu pass during multi-threads predictor, test=develop

* fix attention_lstm_fuse_pass during multi-threads inference, test=develop

* fix embedding_fc_lstm_fuse_pass during multi-threads inference, test=develop

* fix fc_lstm_fuse_pass during multi-threads inference, test=develop

* fix seq_concat_fc_fuse_pass during multi-threads inference, test=develop

196e20df

27 12月, 2019 1 次提交
- 石
  fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841) · 03479469
  由石晓伟提交于 12月 27, 2019
```
* fix multi-thread error of fc_gru_fuse_pass.cc, test=develop

* export FLAGS and GLOG symbols, test=develop
```
  03479469
25 12月, 2019 2 次提交
- P
  
  fix trt calib not working bug, test=develop (#21934) · 3e5008ad
  由 Pei Yang 提交于 12月 25, 2019
  
  3e5008ad
- Q
  Pack imperative/layer into paddle_framework.so (#21921) · 20667458
  由 qingqing01 提交于 12月 25, 2019
```
* Pack imperative/layer into paddle_framework.so
```
  20667458

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致