提交 · f297a33285e1037f4b5ccf4f4b06dc2680052d3d · 机器未来 / Paddle

04 4月, 2020 1 次提交

由 Leo Chen 提交于 4月 04, 2020

* fix init_gflags with 'python -c', test=develop

* add test, test=develop

* use sys.executable instead of python, test=develop

* keep dummy, test=develop

f297a332

03 4月, 2020 1 次提交
- C
  Add op inout check macro to simplify error message writing (#23430) · 7f1ad510
  由 Chen Weihang 提交于 4月 03, 2020
```
* add op inout check macro, test=develop

* fix enforce_test, test=develop
```
  7f1ad510
02 4月, 2020 1 次提交
- A
  Delete is_test attribute from activation operators (#23318) · da7c73f8
  由 Adam 提交于 4月 02, 2020
```
* Delete is_test from activation operators
test=develop

* Revent unneeded changes
test=develop
```
  da7c73f8
01 4月, 2020 1 次提交
- 石
  
  reverts the commit 23177, test=develop (#23363) · 5c59d213
  由石晓伟提交于 4月 01, 2020
  
  5c59d213
31 3月, 2020 2 次提交
- Y
  fix nccl comm double free bug (#23344) · 0471476a
  由 Yi Liu 提交于 3月 31, 2020
```
As nccl comm is not created by CUDADeviceContext, it should be destroyed by the creator as the best practice of RAII.
```
  0471476a
- W
  Profiler refine (#23294) · 1ee2a9a4
  由 wangchaochaohu 提交于 3月 31, 2020
```
* refine output of profiler for child event 
```
  1ee2a9a4
30 3月, 2020 2 次提交
- Y
  
  Initialize global nccl_comm in PE (#23275) · 2169e6fb
  由 Yi Liu 提交于 3月 30, 2020
  
  2169e6fb
- 石
  
  supports thread-binding stream, test=develop (#23177) · 75ebb48a
  由石晓伟提交于 3月 30, 2020
  
  75ebb48a
27 3月, 2020 1 次提交
- Z
  
  code polish for adding const qualifier, test=develop, test=document_fix (#23248) · 77b4dc80
  由 Zeng Jinle 提交于 3月 26, 2020
  
  77b4dc80
25 3月, 2020 1 次提交
- Z
  
  add cuda resource pool for BufferedReader, test=develop (#23152) · bba74071
  由 Zeng Jinle 提交于 3月 25, 2020
  
  bba74071
19 3月, 2020 1 次提交
- S
  
  added mkldnn swish activation (#23041) · abee05a8
  由 Sylwester Fraczek 提交于 3月 19, 2020
  
  abee05a8
18 3月, 2020 1 次提交
- Y
  initialize global nccl context in dygraph (#23037) · 121b2aed
  由 Yi Liu 提交于 3月 18, 2020
```
initialize global nccl context in dygraph
test=develop
```
  121b2aed
13 3月, 2020 1 次提交
- W
  
  remove debug log test=develop (#22994) · 99db0cf7
  由 wangchaochaohu 提交于 3月 13, 2020
  
  99db0cf7
12 3月, 2020 1 次提交
- W
  
  refine the profiler print test=develop (#22968) · c979c9f2
  由 wangchaochaohu 提交于 3月 12, 2020
  
  c979c9f2
07 3月, 2020 2 次提交
- Z
  
  fix compute ratio of profile, test=develop (#22872) · ca9c8b41
  由 Zhang Ting 提交于 3月 07, 2020
  
  ca9c8b41
- W
  refine the profiler print (#22823) · dbb0b9b3
  由 wangchaochaohu 提交于 3月 07, 2020
```
* refine the profiler print test=develop
```
  dbb0b9b3
04 3月, 2020 1 次提交

Add flags to limit gpu memory (#22793) · d41d802b

由 Zeng Jinle 提交于 3月 04, 2020

* add recorded cuda memory apis, fix typo, test=develop

* add more ut, test=develop

* follow comments, test=develop

* fix py35 incompatible issues, test=develop

d41d802b

03 3月, 2020 1 次提交
- Z
  
  fix print bug of profile, test=develop (#22804) · 72ff5a09
  由 Zhang Ting 提交于 3月 03, 2020
  
  72ff5a09
02 3月, 2020 2 次提交
- W
  
  polish the profiler_help code (#22811) · 8456c3f4
  由 wangchaochaohu 提交于 3月 02, 2020
  
  8456c3f4
- W
  Profile code refine (#22800) · 7578fcba
  由 wangchaochaohu 提交于 3月 02, 2020
```
* add profiler_help.h to refine the code test=develop
```
  7578fcba
26 2月, 2020 1 次提交
- A
  
  Add cpu_info without XBYAK (#22716) · 2b80e9a7
  由 Adam 提交于 2月 26, 2020
  
  2b80e9a7
25 2月, 2020 1 次提交
- Z
  add framework overhead ratio in profile report (#22590) · f97f3f93
  由 Zhang Ting 提交于 2月 25, 2020
```
* add framework overhead ratio, test=develop

* print GpuMemcpy overhead, test=develop
```
  f97f3f93
24 2月, 2020 1 次提交
- W
  Fusion group profile support (#22718) · 611411b9
  由 wangchaochaohu 提交于 2月 24, 2020
```
* add support for the driver api callback and fix the profiler name show bug
```
  611411b9
23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
21 2月, 2020 1 次提交
- Y
  
  Add the support of fp16 in fusion_group (#22239) · 22bbd547
  由 Yiqun Liu 提交于 2月 21, 2020
  
  22bbd547
19 2月, 2020 1 次提交
- W
  fix the profile print error (#22665) · a089072c
  由 wangchaochaohu 提交于 2月 19, 2020
```
* fix the profile print error test=develop
```
  a089072c
18 2月, 2020 1 次提交
- W
  add flag to control profile level in python API (#22319) · c65c6ae5
  由 wangchaochaohu 提交于 2月 18, 2020
```
* add python flag to control profile level test=develop
```
  c65c6ae5
14 2月, 2020 2 次提交
- C
  
  fix enforce test error, test=develop (#22610) · fe685cc1
  由 Chen Weihang 提交于 2月 14, 2020
  
  fe685cc1
- C
  Fix mismatch with plus sign in the line (#22588) · 266106da
  由 Chen Weihang 提交于 2月 14, 2020
```
* reproduce match error, test=develop, test=document_fix

* fix mismatch error, test=develop, test=document_fix
```
  266106da
10 2月, 2020 1 次提交

Compile without nccl deps. [2/2] (#22484) · de009152

由 Wilber 提交于 2月 10, 2020

Compile without nccl deps. [1/2]
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

de009152

07 2月, 2020 1 次提交
- L
  optimize performance of interpolate op (#22436) · 2b1386b2
  由 LielinJiang 提交于 2月 07, 2020
```
* optimize interpolate op, test=develop
```
  2b1386b2
06 2月, 2020 1 次提交
- W
  
  use enum class to replace the usage of enum in some condition test=develop (#22464) · 77dd0d97
  由 wangchaochaohu 提交于 2月 07, 2020
  
  77dd0d97
05 2月, 2020 1 次提交

add WITH_NCCL option for cmake. (#22384) · 7bc4b095

由 Wilber 提交于 2月 05, 2020

cmake选项中添加了WITH_NCCL，显示指定是否编译NCCL的部分代码，WITH_NCCL默认打开，但如果WITH_GPU为OFF，则关闭WITH_NCCL

添加了PADDLE_WITH_NCCL定义

单机单卡能够关闭NCCL编译，多卡的话需要默认打开NCCL，如果关闭NCCL，则只能使用单卡
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

7bc4b095

31 1月, 2020 1 次提交

[DNNL] Fix accuracy in INT8 FC (#22404) · 269db0d1

由 Michał Gallus 提交于 1月 31, 2020

* Enable quantize to reorder to nchw as well

* Correct FC MKL-DNN input dim requirements to accept 3D

* Improve DNNL FC format, error and 3D input handling

test=develop

* Improve error checking in FC

test=develop

* Improve PADDLE_ENFORCE messages in fc-related files

* Remove data layout attribute from obligatory pass args

test=develop

* Fix message in fc_mkldnn_pass to be logically correct

test=develop

269db0d1

10 1月, 2020 1 次提交
- W
  fix the bug of profile update (#22207) · 621d3e0b
  由 wangchaochaohu 提交于 1月 11, 2020
```
* fix the bug of profile update test=develop
```
  621d3e0b
09 1月, 2020 3 次提交
- 石
  
  [Feature] Lite subgraph (#22114) · ad0dfb17
  由石晓伟提交于 1月 09, 2020
  
  ad0dfb17
- Y
  Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144) · 96980c22
  由 Yiqun Liu 提交于 1月 09, 2020
```
* Polish the PADDLE_ENFORCE in fusion_group pass related codes.
test=develop

* Correct the unittest because of the change relu_grad's formula.
test=develop
```
  96980c22
- W
  add support for nested profiling event and printing in different level (#22061) · c3876cf8
  由 wangchaochaohu 提交于 1月 09, 2020
```
* add support for nested profiling event and printing in different level
```
  c3876cf8
08 1月, 2020 2 次提交
- Z
  Refine stack op to improve xlnet performance, test=develop (#22142) · 3d4f2aa6
  由 zhaoyuchen2018 提交于 1月 08, 2020
```
stack's wait cost a lot of cpu time, use cuda kernel to do memory copy
will reduce cpu time.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  3d4f2aa6
- Z
  
  fix allocator strategy comment, test=develop, test=document_fix (#22121) · 4c2df8e4
  由 Zeng Jinle 提交于 1月 08, 2020
  
  4c2df8e4

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致