Commits · f2e92c5d1371c49761fa10bbe430f939e8ceea10 · 机器未来 / Paddle

31 Jan, 2018 · 1 commit

Add variant of new load and save ops for storing model params in a single file (#7909) · 2e907c36

Committed by Siddharth Goyal on Jan 30, 2018

* Add save_combine_op

* Add load_combine_op and test

* Add unit-test

* Add a delete to free buffer memory

* Add new variant of load/save

* Fix unit-test

* Add another unit test for compatibility with original save/load

* Address review comments and simplify logic

* Address review comments and simplify code - part 2

* Fix naming issues and CMake problems

* Address review comments

* Fix LoD information in tests

* Address review comments: round 2

2e907c36
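
The idea named in this commit title, storing several model parameters in one file instead of one file per parameter, can be sketched in a few lines of Python. This is an illustration only and not Paddle's serialization format or API: the function names `save_combine` and `load_combine` borrow the op names purely for readability, and the length-prefixed layout and the parameter names `fc.w` and `fc.b` are assumptions made for the example.

```python
import io
import struct


def save_combine(stream, params):
    """Write every (name, values) pair into one binary stream, in order."""
    stream.write(struct.pack("<I", len(params)))
    for name, values in params:
        encoded = name.encode("utf-8")
        stream.write(struct.pack("<I", len(encoded)))
        stream.write(encoded)
        stream.write(struct.pack("<I", len(values)))
        stream.write(struct.pack("<%df" % len(values), *values))


def load_combine(stream):
    """Read back the (name, values) pairs in the order they were written."""
    (count,) = struct.unpack("<I", stream.read(4))
    params = []
    for _ in range(count):
        (name_len,) = struct.unpack("<I", stream.read(4))
        name = stream.read(name_len).decode("utf-8")
        (n,) = struct.unpack("<I", stream.read(4))
        values = list(struct.unpack("<%df" % n, stream.read(4 * n)))
        params.append((name, values))
    return params


# Round trip through an in-memory buffer standing in for the single file.
buffer = io.BytesIO()
save_combine(buffer, [("fc.w", [0.5, -1.25]), ("fc.b", [2.0])])
buffer.seek(0)
restored = load_combine(buffer)
```

Because the reader consumes records in the order they were written, a combined file of this kind must be loaded with the same parameter ordering that was used when saving.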

30 Jan, 2018 · 1 commit
- More efficient, add check on python side · 6e17babe
  Committed by xzl on Jan 30, 2018

  6e17babe
28 Jan, 2018 · 1 commit
- Format doc & add unit test for dynamic_lstmp api · 634faab1
  Committed by Yibing Liu on Jan 28, 2018

  634faab1
23 Jan, 2018 · 1 commit
- ../../../../../paddle/api · 06db7038
  Committed by xzl on Jan 23, 2018

  06db7038
22 Jan, 2018 · 1 commit
- 1. Add sequence_num as edit distance op's output · 1bc8de32
  Committed by wanghaoshuang on Jan 22, 2018

  2. Fix evaluator using 'reduce_sum' op instead of 'mean' op

  1bc8de32
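
One plausible reading of this change, offered as an assumption rather than a description of the actual evaluator code: when batches contain different numbers of sequences, averaging per-batch means weights sequences unevenly, while accumulating summed distances together with a sequence count gives the exact overall average. The sketch below uses hypothetical helper names (`edit_distance`, `batch_stats`) and plain Python instead of Paddle ops.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (row-by-row DP)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def batch_stats(hyps, refs):
    """Return (sum of distances, number of sequences) for one batch."""
    dists = [edit_distance(h, r) for h, r in zip(hyps, refs)]
    return sum(dists), len(dists)


# Two batches of unequal size (made-up data).
batches = [
    (["kitten"], ["sitting"]),
    (["abc", "abc", "abc"], ["abc", "abd", "abc"]),
]

total, count = 0, 0
batch_means = []
for hyps, refs in batches:
    s, n = batch_stats(hyps, refs)
    total += s
    count += n
    batch_means.append(s / n)

overall_avg = total / count  # every sequence weighted equally
mean_of_means = sum(batch_means) / len(batch_means)  # every batch weighted equally
```

With unequal batch sizes the two figures differ, which is why an evaluator that wants a per-sequence average needs both the summed distance and the sequence count from each batch.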
18 Jan, 2018 · 1 commit
- Bugfix/beamsearch op (#7611) · 3388e52d
  Committed by Yan Chunwei on Jan 18, 2018

  3388e52d
14 Jan, 2018 · 1 commit

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

Committed by dzhwinter on Jan 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv transpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* "fix pooling"

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

12 Jan, 2018 · 1 commit
- feature/add print op (#6799) · 3423022e
  Committed by Yan Chunwei on Jan 12, 2018

  3423022e
11 Jan, 2018 · 1 commit
- 1. Fix warpctc grad op · b1af5e43
  Committed by wanghaoshuang on Jan 11, 2018

  2. Add check grad test

  b1af5e43
09 Jan, 2018 · 1 commit

Port WarpCTC Operator (#5107) · b5fda272

Committed by Yiqun Liu on Jan 09, 2018

* Add Seq2BatchFunctor, which will be used in WarpCTCOp.

* Implement WrapCTCFunctor and WrapCTCKernel.

* Add unittest of warpctc_op.

* Modify the check_output interface in the python unittest framework to allow checking a subset of outputs.

* Use absolute offset lod in warpctc_op and related functors.

* Refine the comments of warpctc_op.

* The new python unittest supports checking a subset of the outputs, so revoke the previous change.

* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.

* Update to the newest codes.

* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and move the computation of dimensions out of the functors.

b5fda272
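
The transform described in this commit, from packed sequences indexed by absolute LoD offsets to a padded tensor of shape [max_sequence_length, num_sequences, sequence_width], can be sketched as follows. This is a minimal pure-Python illustration, not the PaddingLoDTensorFunctor code: the function name `pad_lod_tensor`, the nested-list representation, and the `pad_value` parameter are assumptions made for the example.

```python
def pad_lod_tensor(rows, lod, pad_value=0.0):
    """Pad packed sequences into [max_sequence_length][num_sequences][sequence_width].

    rows: packed time steps, each a list of sequence_width floats.
    lod:  absolute offsets; sequence i occupies rows[lod[i]:lod[i + 1]].
    """
    num_sequences = len(lod) - 1
    lengths = [lod[i + 1] - lod[i] for i in range(num_sequences)]
    max_len = max(lengths) if lengths else 0
    width = len(rows[0]) if rows else 0
    # Start from a tensor filled with the padding value.
    padded = [
        [[pad_value] * width for _ in range(num_sequences)]
        for _ in range(max_len)
    ]
    # Copy time step `step` of sequence `seq` to position [step][seq].
    for seq in range(num_sequences):
        for step in range(lengths[seq]):
            padded[step][seq] = list(rows[lod[seq] + step])
    return padded


# Two sequences of lengths 2 and 3, each time step of width 2 (made-up data).
rows = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0], [4.0, 4.0], [5.0, 5.0]]
lod = [0, 2, 5]
padded = pad_lod_tensor(rows, lod)
```

Here the shorter first sequence is filled with `pad_value` at the last time step, so every sequence occupies the same number of steps in the padded layout.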

03 Jan, 2018 · 2 commits
- add more comments in CMakelists.txt of operator · 2d2b6332
  Committed by Luo Tao on Jan 03, 2018

  2d2b6332
- refine comments in CMakelists.txt of operator · 5974c1b7
  Committed by Luo Tao on Jan 03, 2018

  5974c1b7
02 Jan, 2018 · 4 commits
- manually pybind some specific operators · e4e95bee
  Committed by Luo Tao on Jan 02, 2018

  e4e95bee
- auto pybind when *_op.cc contains several operators · f3851fe5
  Committed by Luo Tao on Jan 02, 2018

  f3851fe5
- for del DEPS · 554f6967
  Committed by sweetsky0901 on Jan 02, 2018

  554f6967
- for makelist update · 0df22907
  Committed by sweetsky0901 on Jan 02, 2018

  0df22907
29 Dec, 2017 · 1 commit
- move cos_sim_functor to math · 24cf2fcd
  Committed by chengduoZH on Dec 29, 2017

  24cf2fcd
27 Dec, 2017 · 2 commits
- fix nccl cmake error in ONLY_CPU mode · b654e6f7
  Committed by Luo Tao on Dec 27, 2017

  b654e6f7
- refine CMakeLists.txt when add op need DEPS · b6796962
  Committed by Luo Tao on Dec 27, 2017

  b6796962
25 Dec, 2017 · 1 commit
- update remove unused code · 700bd24b
  Committed by typhoonzero on Dec 25, 2017

  700bd24b
19 Dec, 2017 · 1 commit
- parallel_do skeleton pass compile · 9d2c77e6
  Committed by Yang Yang on Dec 19, 2017

  9d2c77e6
12 Dec, 2017 · 2 commits

modify for some update in trunk · a3addcdc
Committed by sweetsky0901 on Dec 12, 2017

a3addcdc

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main fixes are as follows:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

01 Dec, 2017 · 3 commits
- add grpc benchmark · c66c65cb
  Committed by typhoonzero on Dec 01, 2017

  c66c65cb
- Fix grpc compile warning (#6050) · 1b612d3a
  Committed by Yancey on Dec 01, 2017

  * fix grpc compile warn

  * update

  * -Wnon-virtual-dtor -> -Wno-non-virtual-dtor

  1b612d3a
- add switch for distributed support · 1a852861
  Committed by typhoonzero on Dec 01, 2017

  1a852861
28 Nov, 2017 · 1 commit

Send recv op (#5520) · 0a8a86e0

Committed by 武毅 on Nov 28, 2017

* WIP send recv op

* WIP send recv

* put grpc impl in details

* put grpc impl in details

* update wip

* update proto

* update proto

* update proto

* clean cmake

* wip on op implementations

* wip on op implementations

* compile ok adding ut

* wip unitest

* add extern cares for linking

* wip add ut

* working version send recv

* revert optimizer.py

* update test cmake

* add libtool to dockerfile

* update cmake dependency

* update cmake depends

* update cmake grpc depends

* fix cmake dependency

* fix compile error

* fix compile

* follow comments

* update

* update copyfrom

0a8a86e0

27 Nov, 2017 · 2 commits

Complete max_sequence_len_op (#5913) · 33fa2dfb
Committed by fengjiayi on Nov 27, 2017

33fa2dfb

Conv cudnn 3d (#5783) · a06bec12

Committed by 武毅 on Nov 27, 2017

* conv cudnn 3d

* update test case

* update

* update

* follow comments and remove groups from helper

* update

* refine

* update

* follow comments2

* update

* fix compile

a06bec12

26 Nov, 2017 · 1 commit

Feature/copytensor (#5455) · 45062fe5

Committed by dzhwinter on Nov 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

22 Nov, 2017 · 1 commit
- test unpool ok cpu · 90f664d0
  Committed by sweetsky0901 on Nov 22, 2017

  90f664d0
21 Nov, 2017 · 2 commits
- add unpool2d make ok · 45a8c9dd
  Committed by sweetsky0901 on Nov 21, 2017

  45a8c9dd
- add unpool · bc45335e
  Committed by sweetsky0901 on Nov 21, 2017

  bc45335e
18 Nov, 2017 · 1 commit
- Adding logical operators for beam search and control flow (#5708) · 6cfcf624
  Committed by Abhinav Arora on Nov 18, 2017

  6cfcf624
16 Nov, 2017 · 1 commit

support adagrad sparse update (#5272) · d7bf372d

Committed by QI JUN on Nov 15, 2017

* adam sparse support

* fix gpu build error

* fix ci

* fix ci

* fix adagrad sparse update bug

* fix gpu build error

d7bf372d

13 Nov, 2017 · 3 commits

add conv3d_trans_cudnn_op · 3a507b44
Committed by chengduoZH on Nov 13, 2017

3a507b44

BeamSearchDecodeOp (#5498) · a4106278

Committed by Qiao Longfei on Nov 13, 2017

* init trieconcat_op

* add basic implementation

* add test

* add more test

* update unit test

* add PackAllSteps test

* fix PackAllSteps

* all test passed

* clean code

* remove state inside helper

* rename prob to score

* optimize RemoveFromEnd

* use destructor to delete BeamNode recursively

* optimize interface

* add comment to interface

* optimize data structure

* use template to define the type of score

* use template parameter for BeamHelper

* change father to parent

* rename TrieConcat to BeamSearchOutConcat

* use LoDTensorArray

* rename BeamSearchOutConcat to BeamSearchDecode

* refine code

* keep all candidate sentences in beam_search_decode_op, do not consider endid

* use unique_ptr

* fix compare bug

* fix lod compile problem

a4106278

Fix compiling for softmax_with_cross_entropy_op. · 91d4fc69
Committed by dangqingqing on Nov 13, 2017

91d4fc69

11 Nov, 2017 · 2 commits
- Use G++ to compile some cu operators. · f5e36765
  Committed by dangqingqing on Nov 11, 2017

  f5e36765
- this for maxout op new add · 058bdd34
  Committed by wanghaox on Nov 11, 2017

  058bdd34

机器未来 / Paddle is in sync with its fork source project