提交 · f6dbf8e3a4e65e9de5366d11c29bff7661fac510 · Crayon鑫 / Paddle

18 4月, 2020 1 次提交

由 Zhang Ting 提交于 4月 18, 2020

* update eigen, test=develop

* remove patches, test=develop

* add definition of -fabi-version, test=develop

* add patch for TensorBlock.h, test=develop

* test windows, test=develop

* only update eigen for Linux, test=develop

* add code comments, test=develop

b89dd86f

17 4月, 2020 1 次提交

石

DeviceContext Split, test=develop (#23737) · 2d01cc85

由石晓伟提交于 4月 17, 2020

* supports thread-binding stream, test=develop

* avoid using thread_local variables in dtor, test=develop

* modify the stream priority enum, test=develop

2d01cc85

15 4月, 2020 1 次提交

Correct the wrong name in the flag comment (#22977) · c2a60bb1

由 guofei 提交于 4月 15, 2020

Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce) based on the information from our official website.

c2a60bb1

14 4月, 2020 1 次提交
- Y
  Fix CUDAHandleHolder destruction problem. (#23772) · 14e7041c
  由 Yi Liu 提交于 4月 14, 2020
```
eagerly release cuda resources before cuda enviroment destroying
test=develop
```
  14e7041c
11 4月, 2020 1 次提交

[DNNL][INT8][FP32] MatMul (#23395) · a63bcf9a

由 Michał Gallus 提交于 4月 11, 2020

* Initial FP32 DNNL MatMul Implementation

* Implement int8 DNNL MatMul

* Unify in-kernel-naming, clean UTs

* MatmuL: Introduce op caching

* Final adjustments

test=develop

* Remove dy_graph disablement

test=develop

* Change dnnl header name to new one

test=develop

* Contrain multi head check to prevent fails

test=develop

* Resolve dnnl header problems on MAC CI

* Variable namings to kernel and skip_grad_ci added

test=develop

* Prevent MAC CI from failing

* Prevent windows build from failing

test=develop

* Modify UTs to conform to the rules

* Modify MatMul aux functions namings

test=develop

a63bcf9a

10 4月, 2020 4 次提交
- L
  test=develop, add addmm op (#23384) · 1c08a213
  由 littletomatodonkey 提交于 4月 10, 2020
```
add addmm op
```
  1c08a213
- Z
  
  fix GET_DATA_SAFELY ptr, test=develop (#23679) · 674355a0
  由 Zeng Jinle 提交于 4月 09, 2020
  
  674355a0
- S
  
  show the exception messages of cpp inference library in msvc (#23702) · c6d14bc8
  由 silingtong123 提交于 4月 10, 2020
  
  c6d14bc8
- T
  
  solve mklml memory leak (#23557) · e4f1b1c5
  由 Tao Luo 提交于 4月 10, 2020
  
  e4f1b1c5
09 4月, 2020 1 次提交

Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a

由 mozga-intel 提交于 4月 09, 2020

* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop

* Remove ngraph from:
1. Cmake file
2. Python file
test=develop

3baaee9a

08 4月, 2020 1 次提交
- Z
  
  API(place-related) error message enhancement (#23515) · 480530c4
  由 Zhang Ting 提交于 4月 08, 2020
  
  480530c4
04 4月, 2020 2 次提交

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

Dev/fix init flags (#23465) · f297a332

由 Leo Chen 提交于 4月 04, 2020

* fix init_gflags with 'python -c', test=develop

* add test, test=develop

* use sys.executable instead of python, test=develop

* keep dummy, test=develop

f297a332

03 4月, 2020 1 次提交
- C
  Add op inout check macro to simplify error message writing (#23430) · 7f1ad510
  由 Chen Weihang 提交于 4月 03, 2020
```
* add op inout check macro, test=develop

* fix enforce_test, test=develop
```
  7f1ad510
02 4月, 2020 1 次提交
- A
  Delete is_test attribute from activation operators (#23318) · da7c73f8
  由 Adam 提交于 4月 02, 2020
```
* Delete is_test from activation operators
test=develop

* Revent unneeded changes
test=develop
```
  da7c73f8
01 4月, 2020 1 次提交
- 石
  
  reverts the commit 23177, test=develop (#23363) · 5c59d213
  由石晓伟提交于 4月 01, 2020
  
  5c59d213
31 3月, 2020 2 次提交
- Y
  fix nccl comm double free bug (#23344) · 0471476a
  由 Yi Liu 提交于 3月 31, 2020
```
As nccl comm is not created by CUDADeviceContext, it should be destroyed by the creator as the best practice of RAII.
```
  0471476a
- W
  Profiler refine (#23294) · 1ee2a9a4
  由 wangchaochaohu 提交于 3月 31, 2020
```
* refine output of profiler for child event 
```
  1ee2a9a4
30 3月, 2020 2 次提交
- Y
  
  Initialize global nccl_comm in PE (#23275) · 2169e6fb
  由 Yi Liu 提交于 3月 30, 2020
  
  2169e6fb
- 石
  
  supports thread-binding stream, test=develop (#23177) · 75ebb48a
  由石晓伟提交于 3月 30, 2020
  
  75ebb48a
27 3月, 2020 1 次提交
- Z
  
  code polish for adding const qualifier, test=develop, test=document_fix (#23248) · 77b4dc80
  由 Zeng Jinle 提交于 3月 26, 2020
  
  77b4dc80
25 3月, 2020 1 次提交
- Z
  
  add cuda resource pool for BufferedReader, test=develop (#23152) · bba74071
  由 Zeng Jinle 提交于 3月 25, 2020
  
  bba74071
19 3月, 2020 1 次提交
- S
  
  added mkldnn swish activation (#23041) · abee05a8
  由 Sylwester Fraczek 提交于 3月 19, 2020
  
  abee05a8
18 3月, 2020 1 次提交
- Y
  initialize global nccl context in dygraph (#23037) · 121b2aed
  由 Yi Liu 提交于 3月 18, 2020
```
initialize global nccl context in dygraph
test=develop
```
  121b2aed
13 3月, 2020 1 次提交
- W
  
  remove debug log test=develop (#22994) · 99db0cf7
  由 wangchaochaohu 提交于 3月 13, 2020
  
  99db0cf7
12 3月, 2020 1 次提交
- W
  
  refine the profiler print test=develop (#22968) · c979c9f2
  由 wangchaochaohu 提交于 3月 12, 2020
  
  c979c9f2
07 3月, 2020 2 次提交
- Z
  
  fix compute ratio of profile, test=develop (#22872) · ca9c8b41
  由 Zhang Ting 提交于 3月 07, 2020
  
  ca9c8b41
- W
  refine the profiler print (#22823) · dbb0b9b3
  由 wangchaochaohu 提交于 3月 07, 2020
```
* refine the profiler print test=develop
```
  dbb0b9b3
04 3月, 2020 1 次提交

Add flags to limit gpu memory (#22793) · d41d802b

由 Zeng Jinle 提交于 3月 04, 2020

* add recorded cuda memory apis, fix typo, test=develop

* add more ut, test=develop

* follow comments, test=develop

* fix py35 incompatible issues, test=develop

d41d802b

03 3月, 2020 1 次提交
- Z
  
  fix print bug of profile, test=develop (#22804) · 72ff5a09
  由 Zhang Ting 提交于 3月 03, 2020
  
  72ff5a09
02 3月, 2020 2 次提交
- W
  
  polish the profiler_help code (#22811) · 8456c3f4
  由 wangchaochaohu 提交于 3月 02, 2020
  
  8456c3f4
- W
  Profile code refine (#22800) · 7578fcba
  由 wangchaochaohu 提交于 3月 02, 2020
```
* add profiler_help.h to refine the code test=develop
```
  7578fcba
26 2月, 2020 1 次提交
- A
  
  Add cpu_info without XBYAK (#22716) · 2b80e9a7
  由 Adam 提交于 2月 26, 2020
  
  2b80e9a7
25 2月, 2020 1 次提交
- Z
  add framework overhead ratio in profile report (#22590) · f97f3f93
  由 Zhang Ting 提交于 2月 25, 2020
```
* add framework overhead ratio, test=develop

* print GpuMemcpy overhead, test=develop
```
  f97f3f93
24 2月, 2020 1 次提交
- W
  Fusion group profile support (#22718) · 611411b9
  由 wangchaochaohu 提交于 2月 24, 2020
```
* add support for the driver api callback and fix the profiler name show bug
```
  611411b9
23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
21 2月, 2020 1 次提交
- Y
  
  Add the support of fp16 in fusion_group (#22239) · 22bbd547
  由 Yiqun Liu 提交于 2月 21, 2020
  
  22bbd547
19 2月, 2020 1 次提交
- W
  fix the profile print error (#22665) · a089072c
  由 wangchaochaohu 提交于 2月 19, 2020
```
* fix the profile print error test=develop
```
  a089072c
18 2月, 2020 1 次提交
- W
  add flag to control profile level in python API (#22319) · c65c6ae5
  由 wangchaochaohu 提交于 2月 18, 2020
```
* add python flag to control profile level test=develop
```
  c65c6ae5
14 2月, 2020 1 次提交
- C
  
  fix enforce test error, test=develop (#22610) · fe685cc1
  由 Chen Weihang 提交于 2月 14, 2020
  
  fe685cc1

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致