提交 · 34122e665ed529cd0e55ae4d51a96bc21a188969 · BaiXuePrincess / Paddle

25 4月, 2020 1 次提交
- W
  
  fix warning mac compiler (#24138) · 6bf26ef1
  由 wangchaochaohu 提交于 4月 25, 2020
  
  6bf26ef1
24 4月, 2020 2 次提交

由 Guo Sheng 提交于 4月 24, 2020

* Add cholesky_op forward part. test=develop

* Complete cholesky_op forward part. test=develop

* Add cholesky_op backward part. test=develop

* Complete cholesky_op backward part. test=develop

* Refine cholesky_op error check and docs. test=develop

* Add grad_check unit test for cholesky_op. test=develop

* Fix sample code in cholesky doc. test=develop

* Refine some error messages of cholesky_op. test=develop

* Refine some error messages of cholesky_op. test=develop

* Remove unused input in cholesky_grad. test=develop

* Remove unused input in cholesky_grad. test=develop

* Fix stream for cusolverDnSetStream. test=develop

* Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
test=develop

* Add CUSOLVER ERROR in enforce.h
test=develop

* Fix the missing return value in cholesky. test=develop

a8c0fb4e

W

Reduce the construction time of fuction about profiler (#24117) · 6ba7c3ac
由 wangchaochaohu 提交于 4月 24, 2020

6ba7c3ac

23 4月, 2020 1 次提交
- 石
  
  declare the stream::Priority as enum class, test=develop (#24013) · 34d7d6ae
  由石晓伟提交于 4月 23, 2020
  
  34d7d6ae
22 4月, 2020 2 次提交
- J
  
  [DNNL] Added elementwise_add mkl-dnn inplace (#23477) · c6c65c65
  由 Jacek Czaja 提交于 4月 22, 2020
  
  c6c65c65
- 石
  
  add boost dependency to cuda_stream (#24032) · db6d8673
  由石晓伟提交于 4月 22, 2020
  
  db6d8673
21 4月, 2020 1 次提交

石

New feature: thread local allocator, test=develop (#23989) · d2584a70

由石晓伟提交于 4月 21, 2020

* add the thread_local_allocator, test=develop

* refactor the thread_local_allocator, test=develop

* provides option setting strategy, test=develop

d2584a70

20 4月, 2020 1 次提交

Optimize the error messages of paddle CUDA API (#23816) · 78170037

由 Zhou Wei 提交于 4月 20, 2020

* Optimize the error messages of paddle CUDA API, test=develop

* fix the error messages of paddle CUDA API, test=develop

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop

* remove build_ex_string,test=develop

* merge conflict,test=develop

78170037

18 4月, 2020 1 次提交

Update eigen (#23203) · b89dd86f

由 Zhang Ting 提交于 4月 18, 2020

* update eigen, test=develop

* remove patches, test=develop

* add definition of -fabi-version, test=develop

* add patch for TensorBlock.h, test=develop

* test windows, test=develop

* only update eigen for Linux, test=develop

* add code comments, test=develop

b89dd86f

17 4月, 2020 1 次提交

石

DeviceContext Split, test=develop (#23737) · 2d01cc85

由石晓伟提交于 4月 17, 2020

* supports thread-binding stream, test=develop

* avoid using thread_local variables in dtor, test=develop

* modify the stream priority enum, test=develop

2d01cc85

15 4月, 2020 1 次提交

Correct the wrong name in the flag comment (#22977) · c2a60bb1

由 guofei 提交于 4月 15, 2020

Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce) based on the information from our official website.

c2a60bb1

14 4月, 2020 1 次提交
- Y
  Fix CUDAHandleHolder destruction problem. (#23772) · 14e7041c
  由 Yi Liu 提交于 4月 14, 2020
```
eagerly release cuda resources before cuda enviroment destroying
test=develop
```
  14e7041c
11 4月, 2020 1 次提交

[DNNL][INT8][FP32] MatMul (#23395) · a63bcf9a

由 Michał Gallus 提交于 4月 11, 2020

* Initial FP32 DNNL MatMul Implementation

* Implement int8 DNNL MatMul

* Unify in-kernel-naming, clean UTs

* MatmuL: Introduce op caching

* Final adjustments

test=develop

* Remove dy_graph disablement

test=develop

* Change dnnl header name to new one

test=develop

* Contrain multi head check to prevent fails

test=develop

* Resolve dnnl header problems on MAC CI

* Variable namings to kernel and skip_grad_ci added

test=develop

* Prevent MAC CI from failing

* Prevent windows build from failing

test=develop

* Modify UTs to conform to the rules

* Modify MatMul aux functions namings

test=develop

a63bcf9a

10 4月, 2020 4 次提交
- L
  test=develop, add addmm op (#23384) · 1c08a213
  由 littletomatodonkey 提交于 4月 10, 2020
```
add addmm op
```
  1c08a213
- Z
  
  fix GET_DATA_SAFELY ptr, test=develop (#23679) · 674355a0
  由 Zeng Jinle 提交于 4月 09, 2020
  
  674355a0
- S
  
  show the exception messages of cpp inference library in msvc (#23702) · c6d14bc8
  由 silingtong123 提交于 4月 10, 2020
  
  c6d14bc8
- T
  
  solve mklml memory leak (#23557) · e4f1b1c5
  由 Tao Luo 提交于 4月 10, 2020
  
  e4f1b1c5
09 4月, 2020 1 次提交

Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a

由 mozga-intel 提交于 4月 09, 2020

* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop

* Remove ngraph from:
1. Cmake file
2. Python file
test=develop

3baaee9a

08 4月, 2020 1 次提交
- Z
  
  API(place-related) error message enhancement (#23515) · 480530c4
  由 Zhang Ting 提交于 4月 08, 2020
  
  480530c4
04 4月, 2020 2 次提交

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

Dev/fix init flags (#23465) · f297a332

由 Leo Chen 提交于 4月 04, 2020

* fix init_gflags with 'python -c', test=develop

* add test, test=develop

* use sys.executable instead of python, test=develop

* keep dummy, test=develop

f297a332

03 4月, 2020 1 次提交
- C
  Add op inout check macro to simplify error message writing (#23430) · 7f1ad510
  由 Chen Weihang 提交于 4月 03, 2020
```
* add op inout check macro, test=develop

* fix enforce_test, test=develop
```
  7f1ad510
02 4月, 2020 1 次提交
- A
  Delete is_test attribute from activation operators (#23318) · da7c73f8
  由 Adam 提交于 4月 02, 2020
```
* Delete is_test from activation operators
test=develop

* Revent unneeded changes
test=develop
```
  da7c73f8
01 4月, 2020 1 次提交
- 石
  
  reverts the commit 23177, test=develop (#23363) · 5c59d213
  由石晓伟提交于 4月 01, 2020
  
  5c59d213
31 3月, 2020 2 次提交
- Y
  fix nccl comm double free bug (#23344) · 0471476a
  由 Yi Liu 提交于 3月 31, 2020
```
As nccl comm is not created by CUDADeviceContext, it should be destroyed by the creator as the best practice of RAII.
```
  0471476a
- W
  Profiler refine (#23294) · 1ee2a9a4
  由 wangchaochaohu 提交于 3月 31, 2020
```
* refine output of profiler for child event 
```
  1ee2a9a4
30 3月, 2020 2 次提交
- Y
  
  Initialize global nccl_comm in PE (#23275) · 2169e6fb
  由 Yi Liu 提交于 3月 30, 2020
  
  2169e6fb
- 石
  
  supports thread-binding stream, test=develop (#23177) · 75ebb48a
  由石晓伟提交于 3月 30, 2020
  
  75ebb48a
27 3月, 2020 1 次提交
- Z
  
  code polish for adding const qualifier, test=develop, test=document_fix (#23248) · 77b4dc80
  由 Zeng Jinle 提交于 3月 26, 2020
  
  77b4dc80
25 3月, 2020 1 次提交
- Z
  
  add cuda resource pool for BufferedReader, test=develop (#23152) · bba74071
  由 Zeng Jinle 提交于 3月 25, 2020
  
  bba74071
19 3月, 2020 1 次提交
- S
  
  added mkldnn swish activation (#23041) · abee05a8
  由 Sylwester Fraczek 提交于 3月 19, 2020
  
  abee05a8
18 3月, 2020 1 次提交
- Y
  initialize global nccl context in dygraph (#23037) · 121b2aed
  由 Yi Liu 提交于 3月 18, 2020
```
initialize global nccl context in dygraph
test=develop
```
  121b2aed
13 3月, 2020 1 次提交
- W
  
  remove debug log test=develop (#22994) · 99db0cf7
  由 wangchaochaohu 提交于 3月 13, 2020
  
  99db0cf7
12 3月, 2020 1 次提交
- W
  
  refine the profiler print test=develop (#22968) · c979c9f2
  由 wangchaochaohu 提交于 3月 12, 2020
  
  c979c9f2
07 3月, 2020 2 次提交
- Z
  
  fix compute ratio of profile, test=develop (#22872) · ca9c8b41
  由 Zhang Ting 提交于 3月 07, 2020
  
  ca9c8b41
- W
  refine the profiler print (#22823) · dbb0b9b3
  由 wangchaochaohu 提交于 3月 07, 2020
```
* refine the profiler print test=develop
```
  dbb0b9b3
04 3月, 2020 1 次提交

Add flags to limit gpu memory (#22793) · d41d802b

由 Zeng Jinle 提交于 3月 04, 2020

* add recorded cuda memory apis, fix typo, test=develop

* add more ut, test=develop

* follow comments, test=develop

* fix py35 incompatible issues, test=develop

d41d802b

03 3月, 2020 1 次提交
- Z
  
  fix print bug of profile, test=develop (#22804) · 72ff5a09
  由 Zhang Ting 提交于 3月 03, 2020
  
  72ff5a09
02 3月, 2020 2 次提交
- W
  
  polish the profiler_help code (#22811) · 8456c3f4
  由 wangchaochaohu 提交于 3月 02, 2020
  
  8456c3f4
- W
  Profile code refine (#22800) · 7578fcba
  由 wangchaochaohu 提交于 3月 02, 2020
```
* add profiler_help.h to refine the code test=develop
```
  7578fcba

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致