Commits · 138ecf24aa3a6cc5b64a2a38f5ccfb33cc4aae98 · BaiXuePrincess / Paddle

21 Aug, 2020 1 commit

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

Committed by QingshuChen on Aug 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
  * test=kunlun

* support xpu op in separate file
  * test=kunlun

* update XPU error message and remove duplicated code
  * test=kunlun

* minor
  * test=kunlun

* minor
  * test=kunlun

29 Nov, 2019 1 commit

Add dygraph execution context (#20157) · ac854670

Committed by hong on Nov 29, 2019

* add_dygraph_execution_context

* add dygraph infershape context and execution context; test=develop

* fix imperative bug; test=develop

* remove the inputs/outputs interface from execution context,
  because it has the same function as inputNames;
  test=develop

* remove tracer_test ctest; test=develop

* fix split op bug; test=develop

* fix unit tests bug; test=develop

* fix distribute test bug; test=develop

* fix ngraph compile bug; test=develop

* fix grad maker bug; test=develop

* fix load op bugs; test=develop

* fix operator.cc construct bug; test=develop

* remove useless name find in operator; test=develop

* add tracer_test; test=develop

* fix concat, split bug; test=develop

* remove tracer_test unit test; test=develop

* fix attribute check bug; test=develop

* add test code to fix coverage; test=develop

* remove useless code, change check backward input in engine; test=develop

* unlock var type infer shape;test=develop

* add ShareAllLoD api; test=develop

* add dygraph infershape context unit test; test=develop

* remove increase and decrease lod in dygraph; test=develop

* add override; test=develop

* fix increase decrease lod; test=develop

* fix paddle_enforce; test=develop

* disable lod op dygraph check; test=develop

* fix paddle enforce error; test=develop

* add comment for op_registry and OperatorBase; test=develop

* optimize the comment of op_registry; test=develop

* fix format of comment; test=develop

* fix format of comment; test=develop

* optimize the format of comment; test=develop

* optimize the format of the comment; test=develop

* optimize comment of op_registry; test=develop

31 Oct, 2019 1 commit

GradMaker for dygraph (#19706) · 8c4573a3

Committed by hong on Oct 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix compile error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add imperative type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix inference_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

19 Aug, 2019 1 commit
- Fix REGISTER_OP_WITHOUT_GRADIENT (#19251) · 8a89ca94
  Committed by chengduo on Aug 19, 2019
```
* fix REGISTER_OP_WITHOUT_GRADIENT
test=develop
```
25 Feb, 2019 1 commit
- Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
  Committed by liangan1 on Feb 25, 2019
```
test=develop
```
26 Dec, 2018 1 commit
- fix test issues on windows · 01c00b07
  Committed by peizhilin on Dec 26, 2018
```
test=develop
```
18 Dec, 2018 2 commits
- include the mkl fix only · b601f2de
  Committed by peizhilin on Dec 18, 2018
```
test=develop
```
- add mkl,ctc support for windows · 5a6d7fe2
  Committed by peizhilin on Dec 18, 2018
11 Dec, 2018 1 commit
- fix clang · 1735022a
  Committed by Xin Pan on Dec 11, 2018
```
test=develop
```
05 Dec, 2018 1 commit
- allow customize kernel selection · 41c28d54
  Committed by Xin Pan on Dec 05, 2018
```
test=develop
```
22 Nov, 2018 1 commit

Windows/online (#14474) · d9a1f3e5

Committed by wopeizl on Nov 22, 2018

* add recordio support

* disable openblas multi-threading on windows since it is not supported;
  adjust the python script

* code style

* code style
test=develop

* add create_recordio_file_reader back

* fix code style
test=develop

* fix the gtest.cmake on windows

* fix cc_test on windows

* fix the win build
test=develop

* remove fused compile support on windows
test=develop

* add the jit support
test=develop

* add the jit support, test=develop

* add the jit support, test=develop

* add the jit back
fix compile error on windows

* rollback test=develop

* test case fix

* disable DSO by default on windows

* exclude warpctc_op on windows

* exclude the dynload_warpctc out on windows
test=develop

* fix the scripts error
test=develop

* disable avx on windows by default
test=develop

* re-organize the cmake file

* disable mkl on windows by default

* add warp_ctc back

* fix the dependency

* fix the dependency

* fix the build issue on windows

* remove unsupported flag on windows

* code style

* code style
test=develop

* fix issue

* add profiler, parallel_executor back

* clean up the pre-definitions on windows

* fix build issue

* test=develop

21 Nov, 2018 1 commit
- clean up the pre-definitions on windows · 6e66fadb
  Committed by peizhilin on Nov 21, 2018
24 Sep, 2018 1 commit
- "fix link error" (#13545) · 97636a9f
  Committed by dzhwinter on Sep 24, 2018
12 Sep, 2018 1 commit
- add demo · c3e1fb5a
  Committed by dzhwinter on Sep 12, 2018
02 Sep, 2018 1 commit
- switch to 9.2 · 75681c0a
  Committed by dzhwinter on Sep 02, 2018
25 Aug, 2018 1 commit
- more platform is done · d7f98f37
  Committed by dzhwinter on Aug 25, 2018
03 Jul, 2018 1 commit
- Remove Op::Clone method · 4e4438a8
  Committed by yuyang18 on Jul 03, 2018
```
It was previously used by NetOp.
```
02 Jul, 2018 3 commits
- Add register kernel functor and shrink reshape op · 82866d4a
  Committed by yuyang18 on Jul 02, 2018
```
* Shrink reshape_op library size
* Users can register a standard C++ functor as an op kernel
```
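As a rough illustration of the second bullet in the message above (registering a standard C++ functor as an op kernel), here is a minimal sketch. It does not use Paddle's actual macros or classes: `ToyContext`, `ToyKernel`, `ToyKernelRegistry`, `RegisterToyKernel`, `RunToyKernel`, and `ScaleByTwo` are hypothetical names invented for this example.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an execution context: just a buffer of floats.
struct ToyContext {
  std::vector<float> data;
};

// Assumption of this sketch: a kernel is any callable taking the context.
using ToyKernel = std::function<void(ToyContext&)>;

// A function-local static avoids static-initialization-order problems.
inline std::map<std::string, ToyKernel>& ToyKernelRegistry() {
  static std::map<std::string, ToyKernel> registry;
  return registry;
}

// Registers a callable under an op name; returns false if the name is taken.
inline bool RegisterToyKernel(const std::string& op, ToyKernel kernel) {
  return ToyKernelRegistry().emplace(op, std::move(kernel)).second;
}

// A plain C++ functor used as a kernel.
struct ScaleByTwo {
  void operator()(ToyContext& ctx) const {
    for (float& v : ctx.data) v *= 2.0f;
  }
};

// Looks up and runs a kernel; returns false when the op name is unknown.
inline bool RunToyKernel(const std::string& op, ToyContext& ctx) {
  auto it = ToyKernelRegistry().find(op);
  if (it == ToyKernelRegistry().end()) return false;
  it->second(ctx);
  return true;
}
```

Because the registry stores a `std::function`, a lambda or a free function could be registered the same way as the `ScaleByTwo` functor.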
- Make Kernel registered as a function · 3b00ed81
  Committed by yuyang18 on Jul 02, 2018
- Polish reshape op · 1ce478f1
  Committed by yuyang18 on Jul 02, 2018
07 Jun, 2018 2 commits

split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2

Committed by dzhwinter on Jun 07, 2018

* "split into multiple .ccl"

* "refine file structure"

* "refine files"

* "remove the cmakelist"

* "fix typo"

* "fix typo"

* fix ci

Mkldnn layout (#11040) · 3ff9ba0e

Committed by mozga-intel on Jun 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout support in Paddle so that MKLDNN-friendly memory layouts
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive, which
hurt performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels need to be updated in a similar way to
achieve the best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inheritance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

18 Apr, 2018 1 commit
- remove REGISTER_OP and REGISTER_OP_EX · 68d96385
  Committed by Yang Yang on Apr 17, 2018
17 Apr, 2018 1 commit
- first commit · dafe06af
  Committed by Yang Yang on Apr 13, 2018
12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
09 Feb, 2018 1 commit
- cumsum operator (#8288) · 725e6448
  Committed by emailweixu on Feb 09, 2018
17 Jan, 2018 1 commit
- change DEVICE_TYPE in op_registry to LIBRARY_TYPE (#7588) · 6f71f89d
  Committed by Qiao Longfei on Jan 17, 2018
03 Jan, 2018 1 commit
- add more comments in CMakelists.txt of operator · 2d2b6332
  Committed by Luo Tao on Jan 03, 2018
27 Dec, 2017 1 commit

"refine kernel registrar" (#6998) · 35c1683e

Committed by dzhwinter on Dec 27, 2017

* "refine kernel registrar"

* "refine registrar with multikey"

* "fix register"

* "refine multikernel register"

* "fix CI"

* "fix CI"

* "fix registry"

* "switch GPU to CUDA"

* "add register macro test case"

* "fix CI"

25 Dec, 2017 1 commit
- GPUPlace to CUDAPlace (#6960) · 0d2235aa
  Committed by dzhwinter on Dec 25, 2017
24 Dec, 2017 1 commit
- rm unused RegisterOp method in OpRegistry · 6b99402d
  Committed by qiaolongfei on Dec 24, 2017
22 Dec, 2017 1 commit

Enforce drop_empty_grad=false When the input of an op is duplicable. · 0bfa1f7c

Committed by xuwei06 on Dec 01, 2017

For an input argument that holds a list of variables, drop_empty_grad is not allowed because it makes the correspondence between a variable and its gradient ambiguous. Use REGISTER_OP_EX to register the op, or call InputGrad(?,false) in GradOpDescMaker.
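The ambiguity can be shown with a small sketch. It is not Paddle's implementation: `ToyInputGrad` is a hypothetical helper, and the `"@GRAD"` suffix and the empty-string placeholder are conventions assumed for the example. When empty gradients are dropped from a list input, the position of a gradient no longer identifies the variable it belongs to.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Builds gradient variable names for a list-valued (duplicable) input.
// needs_grad[i] says whether inputs[i] requires a gradient.
// Hypothetical helper; the "@GRAD" suffix is an assumed naming convention.
inline std::vector<std::string> ToyInputGrad(
    const std::vector<std::string>& inputs,
    const std::vector<bool>& needs_grad, bool drop_empty_grad) {
  std::vector<std::string> grads;
  for (std::size_t i = 0; i < inputs.size(); ++i) {
    if (needs_grad[i]) {
      grads.push_back(inputs[i] + "@GRAD");
    } else if (!drop_empty_grad) {
      // An empty placeholder keeps grads[i] aligned with inputs[i].
      grads.push_back("");
    }
    // With drop_empty_grad == true the entry is skipped, so later
    // gradients shift left and the index-to-variable mapping is lost.
  }
  return grads;
}
```

For inputs `x0, x1, x2` where only `x1` needs no gradient, keeping the placeholder yields three entries aligned with the inputs, while dropping it yields two entries whose second element belongs to `x2` rather than `x1`.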

21 Dec, 2017 1 commit
- Rename XXDescBind --> XXDesc (#6797) · 09189732
  Committed by Yu Yang on Dec 21, 2017
  由 Yu Yang 提交于 12月 21, 2017
```
* Rename XXDescBind --> XXDesc

* Fix Compile
```
20 Dec, 2017 1 commit
- Move framework.proto to proto namespace (#6718) · e445b3ff
  Committed by Yu Yang on Dec 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main changes are:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
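A minimal sketch of the first item, templating math functors and kernels on a device context type rather than on a place, could look as follows. The types `ToyCPUContext`, `ToyCUDAContext`, `ToyScale`, and `ToyScaleKernel` are hypothetical and only mirror the general idea, not Paddle's real classes.

```cpp
#include <cassert>
#include <vector>

// Hypothetical device contexts; real ones would hold streams, handles, etc.
struct ToyCPUContext {
  int num_threads = 1;
};
struct ToyCUDAContext {
  int stream_id = 0;
};

// Primary template: a math functor parameterized on the context type.
template <typename DeviceContext, typename T>
struct ToyScale;

// CPU specialization: the functor receives the context object itself,
// so device resources are reachable without a separate lookup by place.
template <typename T>
struct ToyScale<ToyCPUContext, T> {
  void operator()(const ToyCPUContext& ctx, T factor,
                  std::vector<T>* x) const {
    (void)ctx;  // a fuller implementation could use ctx.num_threads
    for (T& v : *x) v *= factor;
  }
};

// A kernel templated on the same context type forwards it to the functor.
template <typename DeviceContext, typename T>
struct ToyScaleKernel {
  void Compute(const DeviceContext& ctx, std::vector<T>* x) const {
    ToyScale<DeviceContext, T>()(ctx, static_cast<T>(3), x);
  }
};
```

In this sketch a `ToyScale<ToyCUDAContext, T>` specialization is left undefined on purpose; instantiating `ToyScaleKernel<ToyCUDAContext, float>::Compute` would fail at compile time until one is provided.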

08 Nov, 2017 1 commit

Polish OpWithKernel · bbdac7f7

Committed by Yu Yang on Nov 07, 2017

* Change `IndicateDataType` to `GetKernelType` to make it easier to
  understand.
* Change `OpKernelKey` to `OpKernelType`
* Let operator developers customize which kernel the operator uses
  at runtime.
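The last item can be sketched as a virtual hook that an operator overrides to choose its kernel key. This is an illustration under assumptions, not the actual Paddle interface: `ToyKernelType`, `ToyOpWithKernel`, `ToyCpuOnlyOp`, and the reduced two-field key (data type and place) are invented for the example.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <tuple>

enum class ToyDataType { kFloat32, kInt64 };
enum class ToyPlace { kCPU, kCUDA };

// Simplified stand-in for a kernel key: data type plus place.
struct ToyKernelType {
  ToyDataType data_type;
  ToyPlace place;
  bool operator<(const ToyKernelType& o) const {
    return std::tie(data_type, place) < std::tie(o.data_type, o.place);
  }
};

// Base operator: by default the key follows the input type and the place.
struct ToyOpWithKernel {
  virtual ~ToyOpWithKernel() = default;
  virtual ToyKernelType GetKernelType(ToyDataType input_type,
                                      ToyPlace place) const {
    return {input_type, place};
  }
  // Returns the name of the selected kernel, or "" if none is registered.
  std::string Select(const std::map<ToyKernelType, std::string>& kernels,
                     ToyDataType input_type, ToyPlace place) const {
    auto it = kernels.find(GetKernelType(input_type, place));
    return it == kernels.end() ? std::string() : it->second;
  }
};

// An operator that customizes selection: always pick the CPU kernel.
struct ToyCpuOnlyOp : ToyOpWithKernel {
  ToyKernelType GetKernelType(ToyDataType input_type,
                              ToyPlace) const override {
    return {input_type, ToyPlace::kCPU};
  }
};
```

With kernels registered for float32 on both places, the base operator running on `kCUDA` selects the CUDA kernel, while `ToyCpuOnlyOp` selects the CPU kernel for the same request.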

01 Nov, 2017 1 commit

Feature/executor use program bind (#5196) · 1363ddb6

Committed by Yu Yang on Oct 31, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

29 Oct, 2017 1 commit

Cast Operator (#5149) · b84e8226

Committed by Yu Yang on Oct 28, 2017

* Cast Operator

Cast input variable to other data type

* Fix compile error

* Add cast op

* Follow comments

BaiXuePrincess / Paddle: in sync with the fork's source project