提交 · 6b1e1f0dda5573306f1b4c5eb35e52a15f2591bb · 机器未来 / Paddle

20 11月, 2019 4 次提交

Enable generating code for a given subgraph. (#21126) · 6b1e1f0d

由 Yiqun Liu 提交于 11月 20, 2019

* Enable generating code for a given subgraph.

* Support sorting the subgraph.

* Remove the rearange of expressions because we use the sorted subgraph directly.

* Enable generating code for a subgraph which is composed of grad ops.

* Use expression information to check the accuracy in unittest.

* Separate load and store from computation expressions.
test=develop

* Improve the loading statements in generated codes.
test=develop

* Remove unused arguments from formal list.
test=develop

6b1e1f0d

Z
Fix topk compile failed on windows (#21243) · 3ff5cc2d
由 zhaoyuchen2018 提交于 11月 20, 2019
```
* Fix topk compile failed on windows
* Use explicit cast for assign data
```
3ff5cc2d
P
fix trt weight bug (#21231) · 2e2f92a5
由 Pei Yang 提交于 11月 20, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
2e2f92a5

optimize assign op to avoid copy data from GPU to GPU (#21181) · 01a96463

由 Zhang Ting 提交于 11月 20, 2019

* optimize assign op to avoid copy data from GPU to GPU, test=develop

* modified GetkernelTypeForVar and just avoid device transform, test=develop

01a96463

19 11月, 2019 6 次提交

Z

Determine whether to copy and link inference lib by ON_INFER (#20931) · c0dcb090
由 zhouwei25 提交于 11月 19, 2019

c0dcb090
D

extend elementwise broadcast function (#20957) · 0e7baabe
由 danleifeng 提交于 11月 19, 2019

0e7baabe
A
Fix GELU grad error (#21204) · d623e863
由 Adam 提交于 11月 19, 2019
```
test=develop
```
d623e863
Z

refine Tensor method, test=develop (#21031) · a152315b
由 Zeng Jinle 提交于 11月 19, 2019

a152315b

fix data_norm op to avoid impractical normalization result test=develop (#21152) · b5d8ba83

由 yaoxuefeng 提交于 11月 19, 2019

* fix auc drop first commit test=develop

* update datanorm op

* update datanorm with enforce test=develop

* update test=develop

* update format test=develop

* update format

* update format test=develop

* add unit test test=develop

* update unit test test=develop

* update format test=develop

* update format test=develop

* update API description test=develop

* update API description test=develop

* update format test=develop

* fix codes as comments test=develop

* fix description as comments test=develop

* fix description as comments test=develop

* update codes.. test=develop

b5d8ba83

Polish jit trace codes (#21218) · 67e88424

由 Zeng Jinle 提交于 11月 19, 2019

* polish jit trace codes, test=develop

* polish codes again by removing var_id, test=develop

67e88424

18 11月, 2019 6 次提交

Fix warn of gcc8 (#21205) · cdb3d279

由 Zeng Jinle 提交于 11月 18, 2019

* fix warnings oof gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop

cdb3d279

fix sporadically hang issue on windows(#21201) · d8b6cf2b

由 liuwei1031 提交于 11月 18, 2019

cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows

d8b6cf2b

modified error message and API doc for channel_last supported Op (#21002) · 9cbe7bcc

由 Zhang Ting 提交于 11月 18, 2019

* modified error message for conv and conv_transpose, test=develop

* modified doc of conv and conv_transpose op, test=develop

* modified the expression for error message, test=develop

* modified error message for group_norm op, test=develop

* modified detail of Attr(data_format) or Attr(data_layout)

* add ValueError in API doc for maxout op, test=develop

9cbe7bcc

Z
TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525
由 Zhaolong Xing 提交于 11月 18, 2019
```
* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop
```
65f70525
G

Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118) · 56b5d147
由 guofei 提交于 11月 18, 2019

56b5d147
W

Fix INF bug of softmax_cross_entropy_op (#21165) · 3c98ec90
由 WangXi 提交于 11月 18, 2019

3c98ec90

15 11月, 2019 5 次提交
- X
  fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052) · 23876de5
  由 xujiaqi01 提交于 11月 15, 2019
```
* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop
```
  23876de5
- Y
  
  Fix jit tls issue (#21151) · eec9c9cb
  由 Yihua Xu 提交于 11月 15, 2019
  
  eec9c9cb
- G
  fix cmake fails on inference_download_and_uncompress (#21185) · a9d4eed3
  由 GaoWei8 提交于 11月 15, 2019
```
* solve cmake fails on inference_download_and_uncompress
test=develop

* solve cmake fails on inference_download_and_uncompress
test=develop
```
  a9d4eed3
- X
  add copy table (#21086) · 9e045170
  由 xujiaqi01 提交于 11月 15, 2019
```
* copy some feasigns and corresponding embeddings from one sparse table to another
* copy all feasigns and corresponding embeddings from one sparse table to another
* copy all dense params from one table to another
* copy some local vars to other local vars
```
  9e045170
- R
  
  Refine edit distance cn (#21121) · aeb88791
  由 ruri 提交于 11月 15, 2019
  
  aeb88791
14 11月, 2019 8 次提交
- K
  
  fix elementwise_mod float point kernel. test=develop (#21183) · 98b59cb8
  由 Kaipeng Deng 提交于 11月 14, 2019
  
  98b59cb8
- Z
  Add friendly dygraph trace API (#21091) · 5fdfbe34
  由 Zeng Jinle 提交于 11月 14, 2019
```
* friendly trace interface, test=develop

* refine TracedLayer, test=develop

* add some docs, test=develop
```
  5fdfbe34
- C
  
  fix detail error message error, test=develop (#21170) · 4bd94636
  由 Chen Weihang 提交于 11月 14, 2019
  
  4bd94636
- W
  
  Fix warpctc in padding mode. (#21033) · cfdd1fc2
  由 whs 提交于 11月 14, 2019
  
  cfdd1fc2
- C
  Add examples for error message writing specification - NotFound, OutOfRange,... · 8da0cd53
  由 Chen Weihang 提交于 11月 14, 2019
```
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)

* add examples for error msg spec, test=develop

* change ENFORCE to ENFORCE_**, test=develop

* add more already exists examples, test=develop
```
  8da0cd53
- Z
  Improve topk performance. (#21087) · b93870e6
  由 zhaoyuchen2018 提交于 11月 13, 2019
```
* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  b93870e6
- A
  Add relative error measure when (value > 1) (#21144) · d74ea085
  由 Adam 提交于 11月 14, 2019
```
* Add relative error measure when value > 1
test=develop

* Move code to CheckError function
test=develop
```
  d74ea085
- C
  
  change cuda enforce & add example (#21142) · b3a3e6f6
  由 Chen Weihang 提交于 11月 14, 2019
  
  b3a3e6f6
13 11月, 2019 3 次提交
- C
  Add examples for error message writing specification - PreconditionNotMet,... · 8414575b
  由 Chen Weihang 提交于 11月 13, 2019
```
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)

* add examples for error spec, test=develop

* change ENFORCE to ENFORCE_**, test=develop
```
  8414575b
- C
  Add examples for error message writing specification - InvalidArgument (#21132) · 7e5f74b8
  由 Chen Weihang 提交于 11月 13, 2019
```
* add examples for error msg spec, test=develop

* change ENFORCE to ENFORCE_**, test=develop

* fix error, test=develop
```
  7e5f74b8
- C
  
  add examples for resource exhausted error, test=develop (#21140) · 27fa9c10
  由 Chen Weihang 提交于 11月 13, 2019
  
  27fa9c10
12 11月, 2019 6 次提交
- Z
  Add Asypadding for conv fusion. (#21041) · 4a544762
  由 zhaoyuchen2018 提交于 11月 12, 2019
```
* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.
```
  4a544762
- W
  
  Fix dgc buffer illegal & reuse velocity (#21012) · de5d3ff6
  由 WangXi 提交于 11月 12, 2019
  
  de5d3ff6
- C
  fix instance norm (#21042) · f62a9291
  由 ceci3 提交于 11月 12, 2019
```
* fix instance norm

* update unitest,test=develop
```
  f62a9291
- Z
  
  remove so many logs of parallel executor, test=develop (#21105) · d625aaf0
  由 Zeng Jinle 提交于 11月 12, 2019
  
  d625aaf0
- L
  fix the computation for dx (grad for x) for prelu operation. (#20949) · e249d9a3
  由 lilong12 提交于 11月 12, 2019
```
* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop
```
  e249d9a3
- C
  Further simplify the C++ error info stack (#21093) · edd6680a
  由 Chen Weihang 提交于 11月 12, 2019
```
* simplify C++ error stack by rewrite Place, test=develop

* polish assignment overload func, test=develop
```
  edd6680a
11 11月, 2019 2 次提交

Z

add check for input channels and Attr(groups), test=develop (#21095) · e0285eae
由 Zhang Ting 提交于 11月 11, 2019

e0285eae

Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2

由 Yiqun Liu 提交于 11月 11, 2019

* Add the check of lod_level between compile-time and runtime.
test=develop

* Fix bug in check_compile_vs_runtime.
test=develop

* Fix the check of output when it is dispensiable or intermediate.
test=develop

* Share lod of x to out in match_matrix_tensor op in compile-time.

* Implement GetLoDLevel in InferShapeContext.

* Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
test=develop

* Enable check_compile_vs_runtime in test_match_matrix_tensor.

* Add the implementation of SetLoDLevel in InferShapeContext.

* Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.

* Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.

* Refine some ops and unittests.
test=develop

* Fix a typo.
test=develop

* Remove the check of var type, and change int to int32_t.
test=develop

* Add unittest for Get/SetLoDLevel.
test=develop

35f17ae2

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致