Commits · 6e326ca2c6b5b2011d2612ea0b4165b7cdb1c819 · BaiXuePrincess / Paddle

20 Aug, 2019 1 commit

optimize the realization of cuda dropout (#19136) · 6e326ca2

Committed by wangchaochaohu on Aug 20, 2019

* cuda optimize for dropout

* remove tmp swp file

* fix compile error test=develop

* test=develop optimize the cuda realization of dropout op

* remove unused code test=develop

* remove tmp file test=develop

6e326ca2

19 Aug, 2019 6 commits

Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213) · 76c95af0

Committed by Zhaolong Xing on Aug 19, 2019

* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memory optim (diff)
test=develop

* fix ci about PADDLE_ENFORCE
fix merge lod infer op ut
test=develop

76c95af0

Remove warning in batch_norm_op (#19260) · 5fc8de44
Committed by qingqing01 on Aug 19, 2019

5fc8de44

Add match_matrix_tensor op (#18525) · 78a3d837

Committed by Aurelius84 on Aug 19, 2019

* add match_matrix_tensor op test=develop

* fix ignore unittest if with_mkl=off test=develop

* clean code and rm is_test param test=develop

* modify API.spec test=develop

* rm useless code in search_compute.h test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

* Add API test code test=develop

* clean code in search_compute.h

* modify PADDLE_ENFORCE and clean search_compute.h test=develop

* fix code style test=develop

78a3d837

merge develop to solve conflict, also fix API doc, test=develop (#18823) · 5b6673c4
Committed by Zeng Jinle on Aug 19, 2019

5b6673c4

add fl_listen_and_serv &fl_transpiler,test=develop (#19091) · 539c8707

Committed by zhang wenhui on Aug 19, 2019

Add the fl_listen_and_serv op for federated learning; fl_distribute_transpiler adds this op to the pserver program. The op just listens on the endpoint and performs sum and scale.

539c8707
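The "sum and scale" step mentioned in this commit message can be illustrated with a minimal plain-Python sketch. It does not use any Paddle API: the function name `sum_and_scale`, the parameter names, and the choice of `1 / number_of_trainers` as the scale factor are all assumptions made for illustration, since the commit message does not state them.

```python
from typing import Dict, List


def sum_and_scale(updates: List[Dict[str, List[float]]]) -> Dict[str, List[float]]:
    """Sum per-trainer values element-wise, then scale by 1/len(updates).

    Hypothetical helper: the averaging scale is an assumption, not taken
    from the commit.
    """
    if not updates:
        raise ValueError("need at least one trainer update")
    scale = 1.0 / len(updates)
    merged = {}
    for name in updates[0]:
        # Group the i-th element of this parameter across all trainers.
        columns = zip(*(update[name] for update in updates))
        merged[name] = [sum(column) * scale for column in columns]
    return merged


# Two hypothetical trainers reporting values for the same parameters.
trainer_updates = [
    {"fc.w": [1.0, 2.0], "fc.b": [0.5]},
    {"fc.w": [3.0, 4.0], "fc.b": [1.5]},
]
print(sum_and_scale(trainer_updates))
```

With the two updates above, each merged value is the element-wise mean of what the trainers sent.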

change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205) · af0fbd90
Committed by silingtong123 on Aug 19, 2019
```
* print error code if cuda related API fails
```
af0fbd90

18 Aug, 2019 1 commit
- Unset unittests http_proxy env to avoid timeout. (#19269) · fd4b15a2
  Committed by gongweibao on Aug 18, 2019
```
Unset unittests http_proxy env to avoid timeout.
```
  fd4b15a2
16 Aug, 2019 2 commits
- fix temporal_shift OP PADDLE_ENFORCE. test=develop (#19161) · 2848cb79
  Committed by Kaipeng Deng on Aug 16, 2019
```
* fix temporal_shift OP PADDLE_ENFORCE. test=develop

* fix HasInput/HasOutput ENFORCE. test=develop
```
  2848cb79
- move_flags_to_unified_files_for_management, test=develop (#19224) · 708bd979
  Committed by Zeng Jinle on Aug 16, 2019
  
  708bd979
15 Aug, 2019 2 commits

Add generalized Conv+Activation MKLDNN fuse pass creation (#19072) · b837689e
Committed by Adam on Aug 15, 2019
```
test=develop
```
b837689e

Add padding support for crf_decoding (#19057) · 50b1cab1

Committed by Yibing Liu on Aug 15, 2019

* Add padding support for crf_decoding

* Fixes in compute kernel

test=develop

* Update API Spec

test=develop

* Update API.spec

test=develop

* Avoid using paddle_enforce

test=develop

* Fix enforce

test=develop

50b1cab1

14 Aug, 2019 3 commits
- Fix gather op bug (#19168) · b5ba801e
  Committed by chengduo on Aug 14, 2019
```
* fix gather op bug
test=develop
```
  b5ba801e
- Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166) · 80eab822
  Committed by Leo Chen on Aug 14, 2019
```
* remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR(), test=develop

* remove SplitIdsOpGradMaker since it is buggy and not tested, update spec file, test=develop
```
  80eab822
- Use CUDAPinnedPlace in buffered_reader (#19112) · c70a97f4
  Committed by chengduo on Aug 14, 2019
```
Use CUDAPinnedPlace in buffered_reader
```
  c70a97f4
13 Aug, 2019 1 commit

Instag Implementation (#18394) · 6ac32d09

Committed by Jiawei Wang on Aug 13, 2019

* instag lod tensor impl

* First PR for instag

* First PR for instag

* Before adding Selection Rows.

* Change name from instag to filter_instag, and upgrade the impl of filter_instag

* Change name from instag to filter_instag, and upgrade the impl of filter_instag

* Fix yapf error in gradient_checker.py to pass Travis-CI

* Fix Filter Instag Grad test=develop

* Fix Filter Instag Grad test=develop

* 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop

* Impl Loss_weight and empty output handler

* change Loss Weight datatype to Float32, and add Loss Weight as 2nd output

* 1) Support Tensor Input(without LOD) 2) Add Unit test

* Filter By Instag Final test=develop

* Update API.spec for filter_by_instag test=develop

* Update API.spec for filter_by_instag 2 test=develop

* Add Filter By Instag Coverage

* code format of test_layers.py

* code format test_layers.py test=develop

* Make API args more readable test=develop

* Make API args more readable and pass code format test=develop

* Filter By Instag Op, Rename Map to Index Map test=develop

* Filter By Instag Op, code format err in filter_by_instag_op.cc  test=develop

* Filter by instag op: code format of cpp files test=develop

* Filter by instag Op: Api spec modification test=develop

* Filter by instag Op: Api spec doc id modification test=develop

* Filter by instag Op: Api spec and doc preview  test=develop test=document_preview

* Filter By Instag Op, fix doc error test=document_preview test=develop

* Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop

* Filter By Instag Op, fix Api spec test=document_preview test=develop

* Filter By Instag Op, fix Paddle Enforce deprecated warning test=document_preview test=develop

* Filter By Instag Op, fix Paddle Enforce deprecated and code format warning test=document_preview test=develop

6ac32d09

12 Aug, 2019 5 commits
- Add hard swish op (new op) (#19001) · 20f18930
  Committed by huangjun12 on Aug 12, 2019
```
* add hard_swish activation op (new op)
test=develop

* remove redundancy files

* modify document content of HardSwish OP

* add API test in test_layers.py

* add dynamic_graph for test_hard_swish
```
  20f18930
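For readers unfamiliar with the activation this commit adds, here is a minimal scalar sketch of hard swish in plain Python. It is not the Paddle kernel. The formula `x * min(max(x + offset, 0), threshold) / scale` and the defaults `threshold=6`, `scale=6`, `offset=3` follow the commonly used MobileNetV3 definition and should be treated as assumptions relative to this commit.

```python
def hard_swish(x: float, threshold: float = 6.0,
               scale: float = 6.0, offset: float = 3.0) -> float:
    """Piecewise-linear approximation of swish (illustrative sketch).

    Defaults are assumed from the common MobileNetV3 formulation.
    """
    # Clamp (x + offset) into [0, threshold], then gate x with it.
    gate = min(max(x + offset, 0.0), threshold)
    return x * gate / scale


for value in (-4.0, -1.0, 0.0, 1.0, 4.0):
    print(value, hard_swish(value))
```

Inputs at or below `-offset` map to zero, and inputs at or above `threshold - offset` pass through unchanged; only the region in between is curved.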
- Replace Relu with bounded Relu in MobileNetV2 quantization (#18988) · bce72c7f
  Committed by joanna.wozna.intel on Aug 12, 2019
```
test=develop
```
  bce72c7f
- refine infer shape in box decoder and assign op, test=develop (#19118) · 1fc242a7
  Committed by wangguanzhong on Aug 12, 2019
  
  1fc242a7
- Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
  Committed by gongweibao on Aug 12, 2019
```
Polish fleet API to support cuda collective mode and nccl2 mode
```
  29d87812
- fix code too big test=develop (#19111) · 945f3cf6
  Committed by Kevin on Aug 12, 2019
```
Fix seq_pool failed when input dims is too large.
Resolve issue #3023
```
  945f3cf6
09 Aug, 2019 4 commits

remove unused inplace act codes, test=develop (#19079) · 88f111f8
Committed by Zeng Jinle on Aug 09, 2019

88f111f8

add eye op, kernel and unittest test=develop (#18980) · 4397cb31

Committed by ShenLiang on Aug 09, 2019

* add eye op,test=document_preview test=develop

* fix the API.spec, test=develop

* fix the document, test=document_preview test=develop

* add unittest for CI coverage, test=develop

4397cb31

Add trilinear_interp OP (#18711) · f86fead6

Committed by Kaipeng Deng on Aug 09, 2019

* add trilinear interp. test=develop

* fix unittest. test=develop

* add python api and test_layers. test=develop

* refine API.spec. test=develop

* fix format. test=develop

* add python API test. test=develop

* format code. test=develop

* refine code structure. test=develop

* fix format

* fix doc. test=develop

* fix coverage. test=develop

* fix format. test=develop

f86fead6

optimize error message for "embedding" and "cross_entropy" OP (#18765) · c2063217
Committed by Zhang Ting on Aug 09, 2019
```
* optimize error message, test=develop

* optimize error message, test=develop
```
c2063217

06 Aug, 2019 2 commits

Add the check of lod in sequence_softmax kernel. (#18996) · a445c335
Committed by Yiqun Liu on Aug 06, 2019
```
* Add the check of lod in sequence_softmax kernel.
test=develop

* Refine the comments.
test=develop
```
a445c335

Add var_conv_2d op (#18518) · e681d655

Committed by Kevin on Aug 06, 2019

* fix overflow by int32 mul test=develop

* fix reference nullptr

* fix codestyle test=develop

* modify to point in ContextProjectFunctor test=develop

* modify to point in ContextProjectFunctor test=develop

* modify . to -> test=develop

* add var_conv_2d op test=develop

* edit api.spec test=develop

* ignore unittest if with_mkl=off test=develop

* fix python3 division test=develop

* fix ignore unittest bug test=develop

* remove useless code test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

e681d655

05 Aug, 2019 2 commits
- fix for multithreading test_analyzer_image_classification --num_threads=X (#18265) · e53f517a
  Committed by pawelpiotrowicz on Aug 05, 2019
```
test=develop
```
  e53f517a
- support tensor input for ctc align op (#18887) · faf6890b
  Committed by Liufang Sang on Aug 05, 2019
```
* test=develop support Tensor input for ctc_align_op

* test=develop add some comment
```
  faf6890b
02 Aug, 2019 3 commits

fix concat check info typo (#18975) · b62c4f9b
Committed by hutuxian on Aug 02, 2019

b62c4f9b

Open gc by default (#18836) · 7ac748ad

Committed by Zeng Jinle on Aug 02, 2019

* open gc by default, test=develop

* fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop

* fix conditional_block op eager deletion bug, test=develop

* add some comments to reviewers, test=develop

7ac748ad

Fusion: seqpool_cvm_concat (#18471) · ee2f296e

Committed by 石晓伟 on Aug 02, 2019

* add fusion_seqpool_cvm_concat test=develop

* simplify pass, test=develop

* fix code style, test=develop

ee2f296e

01 Aug, 2019 3 commits

Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c

Committed by wawltor on Aug 01, 2019

* test=develop
Add the unique_with_counts op, which computes the unique elements of the input data and outputs the corresponding indices and counts.

* test=develop
Check the input and dtype in the op of unique_with_counts

* test=develop
test=document_preview
update the API.spec for `unique_with_counts` and, at the same time, optimize the Python API of the `unique_with_counts` op

* test=develop
test=document_preview
Fix some Python API problems in the `unique_with_counts` op, and change the error message in this op.

* Fix some API problem in the op of `unique_with_counts`
test=develop
test=document_preview

* test=develop
test=document_preview
Fix the api sample of op `unique_with_counts`, and update api.spec

3ab1866c
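The behaviour described for `unique_with_counts` in this commit (unique values, an index per input element, and a count per unique value) can be sketched in plain Python as follows. This is not the Paddle operator. Keeping unique values in order of first appearance is an assumption of this sketch, and the function here only shares its name with the op for readability.

```python
from typing import Hashable, List, Sequence, Tuple


def unique_with_counts(values: Sequence[Hashable]) -> Tuple[List, List[int], List[int]]:
    """Return (unique values, index of each input in the unique list, counts).

    Illustrative sketch; first-appearance ordering is assumed.
    """
    out: List = []
    index: List[int] = []
    count: List[int] = []
    position = {}  # value -> its slot in `out`
    for value in values:
        if value not in position:
            position[value] = len(out)
            out.append(value)
            count.append(0)
        slot = position[value]
        index.append(slot)
        count[slot] += 1
    return out, index, count


out, index, count = unique_with_counts([2, 3, 3, 1, 5, 3])
print(out)    # [2, 3, 1, 5]
print(index)  # [0, 1, 1, 2, 3, 1]
print(count)  # [1, 3, 1, 1]
```

The `index` list lets a caller rebuild the original input as `[out[i] for i in index]`.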

- Removed passing X from FWD to GRAD via device context (#18911) · 5cf2d385

Committed by Jacek Czaja on Aug 01, 2019

test=develop

- Extracted key generation from FWD and GRAD into separate function

test=develop

- Compilation fix

test=develop

- another compilation

test=develop

5cf2d385

Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
Committed by LielinJiang on Aug 01, 2019
```
* fix depthwise conv gpu kernel bug, test=develop
* add more depthwise conv test, test=develop
```
22fa4c2d

31 Jul, 2019 5 commits

fix several security bugs reported by security team (#18831) · 0d996908

Committed by liuwei1031 on Jul 31, 2019

* fix security issue, test=develop

* bug fix, test=develop

* throw an exception when null pointer data with non-zero length PaddleBuf is passed, test=develop

0d996908

Trt fp16 support (#18860) · 61238d31

Committed by Zhaolong Xing on Jul 31, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

61238d31

[DyGraph] Make multi-card program faster (#18892) · 20859c08
Committed by chengduo on Jul 31, 2019
```
* update parallel.py
test=develop
```
20859c08

Add center Loss Op Support (#18681) · 24f85431

Committed by HaoRen on Jul 31, 2019

* support center loss
* change tensor copy  api to high level api tensorcopy

* test=develop rewrite the center_loss cuda_kernel to make it faster
and add document of the center loss api,also update test function

* test=document_preview test=develop
update document of center loss

* test=document_preview test=develop
modify API.spec, modify test code, remove unused const_cast

24f85431

use mkl to accelerate gelu_grad (#18099) · 86e494eb
Committed by Leo Zhao on Jul 31, 2019
```
test=develop
```
86e494eb
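As background for the `gelu_grad` commit above, the following plain-Python sketch shows the derivative that such a kernel evaluates. It assumes the exact erf-based GELU, `gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2)))`, which the commit message does not confirm, and it says nothing about how the MKL-accelerated version is actually written.

```python
import math


def gelu(x: float) -> float:
    """Exact (erf-based) GELU; assumed form, for illustration only."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_grad(x: float) -> float:
    """d/dx gelu(x) = Phi(x) + x * phi(x), with Phi/phi the normal CDF/PDF."""
    cdf = 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    pdf = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return cdf + x * pdf


# Compare the analytic gradient with a central finite difference.
step = 1e-5
for point in (-2.0, -0.5, 1.0, 3.0):
    numeric = (gelu(point + step) - gelu(point - step)) / (2.0 * step)
    print(point, gelu_grad(point), numeric)
```

The gradient needs one `erf` and one `exp` per element, which is the kind of element-wise work a vectorized math library can batch.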

BaiXuePrincess / Paddle is up to date with the upstream project it was forked from