提交 · 45c988d86a43bf34667ce7110972fff8dcaf20de · Crayon鑫 / Paddle

16 3月, 2018 2 次提交

Demostration of cmake refine for HIP support. · 45c988d8

由 sabreshao 提交于 3月 16, 2018

1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.

45c988d8

Q

Delete the detection_output_op, which had been split into several operators. (#9121) · 7c1a0b77
由 qingqing01 提交于 3月 16, 2018

7c1a0b77

15 3月, 2018 1 次提交

Implement Select OP (#9088) · 1e4c504e

由 Thuan Nguyen 提交于 3月 15, 2018

* Fix old documentation for channel_recv

* Initial design of CSP select

* Redesign channel implementation for Select Op

* Remove unecessary header

* Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.

* Init python select op API

* Python select bug fix when checking op creates block

* Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op

* Add in addition code for select op

* Init fibonacci test from python

* implement fibonnaci sequence test

* update fib unit test

* Improve select test cases

* Shorten non-pep-8-ed lines

* Add methods on channel needed by select op

* Fix compile issues, finish implementation, still need to debug code

* Fix issue with fibonncci test, it works now!

* Change QueueMessage callback to take in an ChannelAction enum, fix select unit test

* Fix case attributes

* Fix issue with select control flow

* Make cases - previously on each selectcase conditional_block - attributes to select

* Use class constants for type of channel

* Change select op to take in "cases" attribute

* return boolean from select callback function to tell Channel if this RECV or SEND should be executed

* Improve attributes and inputs comments on select op

* Fix issues with python unit test

* Assert fibonacci final output

* Fix issue when channel name / channel var is null for "default" case in select op

* Assert base select test output

* Make QueueMessage use shared pointer and modify the order of the callback

* Fixing the order in which the callback is called

* Move channel utility methods to paddle/fluid/operators/concurrency/channel_util

* Create channel_util and move channel util methods

* Fix crash when calling select_op

* Fix deadlock

* Fix issue of channel destructor deadlock

* Fix precommit issues

* Accidentally checked in changes to beam_search_op, reverting change.

* Fix dependency issue in concurrency cmake

* add device_context dependency for concurrency target

1e4c504e

13 3月, 2018 1 次提交

Repair nccl op test (#8575) · 7287630e

由 QI JUN 提交于 3月 13, 2018

* fix nccl op unit test

* fix build error

* format code

* refine nccl related unit test

* fix build error

* add setGPUData

* clean up

* follow comments

* rm test_nccl.cu

* follow comment

* rm wait

7287630e

07 3月, 2018 3 次提交
- L
  
  rename concat_functor to concat, refine CMakeLists based on comments · 3ddc9971
  由 Luo Tao 提交于 3月 07, 2018
  
  3ddc9971
- P
  MKLDNN conv2d kernel added (#8451) · 8c71adaa
  由 pzelazko-intel 提交于 3月 07, 2018
```
* MKLDNN conv2 OP kernel added

* TODOs added

* mkldnn conv2d OP refactor

* CanCUDNNBeUsed and CanMKLDNNBeUsed moved
```
  8c71adaa
- Y
  
  FIX CI · 4690b9c9
  由 Yu Yang 提交于 3月 07, 2018
  
  4690b9c9
06 3月, 2018 1 次提交
- Y
  
  Extract create_reader_op to three files · 4d8345e3
  由 Yu Yang 提交于 3月 06, 2018
  
  4d8345e3
02 3月, 2018 3 次提交
- T
  
  fix fluid distribute build · f94a758c
  由 typhoonzero 提交于 3月 02, 2018
  
  f94a758c
- L
  
  refine operator/math/CMakeLists.txt, seperate im2col from math_function · f67275a9
  由 Luo Tao 提交于 3月 01, 2018
  
  f67275a9
- C
  
  refine concat_op · 60e7ee06
  由 chengduoZH 提交于 2月 28, 2018
  
  60e7ee06
27 2月, 2018 3 次提交
- L
  
  combine batch_size_like.cc into batch_size_like.h · 6dd3a61b
  由 Luo Tao 提交于 2月 27, 2018
  
  6dd3a61b
- C
  
  follow comments · 62fe2f28
  由 chengduoZH 提交于 2月 27, 2018
  
  62fe2f28
- C
  
  refine cmake for cudnn · 16fc5e38
  由 chengduoZH 提交于 2月 26, 2018
  
  16fc5e38
16 2月, 2018 1 次提交

Generating random numbers with given batch size (#8337) · 6752b06f

由 emailweixu 提交于 2月 15, 2018

* Generating random numbers with given batch size

uniform_random_batch_size_like_op
gaussian_random_batch_size_like_op

* More comments about random seed.

* Move test_*_random_batch_size_like_op to unittests

6752b06f

10 2月, 2018 2 次提交
- L
  move paddle/pybind/pybind.h to paddle/fluid/pybind/pybind.h, and cancel the... · 77f04fd9
  由 Luo Tao 提交于 2月 10, 2018
```
move paddle/pybind/pybind.h to paddle/fluid/pybind/pybind.h, and cancel the test_parallel_op temporary
```
  77f04fd9
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
07 2月, 2018 1 次提交
- F
  
  fix compile errors · c1349d98
  由 fengjiayi 提交于 2月 07, 2018
  
  c1349d98
01 2月, 2018 1 次提交
- F
  
  Complete CreateShuffleReaderOp · 1696cb0e
  由 fengjiayi 提交于 2月 01, 2018
  
  1696cb0e
31 1月, 2018 1 次提交

Add variant of new load and save ops for storing model params in a single file (#7909) · 2e907c36

由 Siddharth Goyal 提交于 1月 30, 2018

* Add save_combine_op

* Add load_combine_op and test

* Add unit-test

* Add a delete to free buffer memory

* Add new variant of load/save

* Fix unit-test

* Add another unit test for compatibility with original save/load

* Address review comments and simplify logic

* Address review comments and simplify code - part 2

* Fix naming issues and CMake problems

* Address review comments

* Fix LoD information in tests

* Address review comments: round 2

2e907c36

30 1月, 2018 1 次提交
- X
  
  More efficient, add check on python side · 6e17babe
  由 xzl 提交于 1月 30, 2018
  
  6e17babe
28 1月, 2018 1 次提交
- Y
  
  Format doc & add unit test for dynamic_lstmp api · 634faab1
  由 Yibing Liu 提交于 1月 28, 2018
  
  634faab1
23 1月, 2018 1 次提交
- X
  
  ../../../../../paddle/api · 06db7038
  由 xzl 提交于 1月 23, 2018
  
  06db7038
22 1月, 2018 1 次提交
- W
  1. Add sequence_num as edit distance op's output · 1bc8de32
  由 wanghaoshuang 提交于 1月 22, 2018
```
2. Fix evaluator using 'reduce_sum' op instead of 'mean' op
```
  1bc8de32
18 1月, 2018 1 次提交
- Y
  
  Bugfix/beamsearch op (#7611) · 3388e52d
  由 Yan Chunwei 提交于 1月 18, 2018
  
  3388e52d
14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

12 1月, 2018 1 次提交
- Y
  
  feature/add print op (#6799) · 3423022e
  由 Yan Chunwei 提交于 1月 12, 2018
  
  3423022e
11 1月, 2018 1 次提交
- W
  1. Fix warpctc grad op · b1af5e43
  由 wanghaoshuang 提交于 1月 11, 2018
```
2. Add check grad test
```
  b1af5e43
09 1月, 2018 1 次提交

Port WarpCTC Operator (#5107) · b5fda272

由 Yiqun Liu 提交于 1月 09, 2018

* Add Seq2BatchFunctor, which will be used in WarpCTCOp.

* Implement WrapCTCFunctor and WrapCTCKernel.

* Add unittest of warpctc_op.

* Modify the check_output inferface in python unittest framework to allow check a subset of outputs.

* Use absolute offset lod in warpctc_op and related functors.

* Refine the comments of warpctc_op.

* The new python unittest supports checking a subset of the outputs, so revoke the previous change.

* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.

* Update to the newest codes.

* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.

b5fda272

03 1月, 2018 2 次提交
- L
  
  add more comments in CMakelists.txt of operator · 2d2b6332
  由 Luo Tao 提交于 1月 03, 2018
  
  2d2b6332
- L
  
  refine comments in CMakelists.txt of operator · 5974c1b7
  由 Luo Tao 提交于 1月 03, 2018
  
  5974c1b7
02 1月, 2018 4 次提交
- L
  
  manually pybind some specific operators · e4e95bee
  由 Luo Tao 提交于 1月 02, 2018
  
  e4e95bee
- L
  
  auto pybind when *_op.cc contains several operators · f3851fe5
  由 Luo Tao 提交于 1月 02, 2018
  
  f3851fe5
- S
  
  for del DEPS · 554f6967
  由 sweetsky0901 提交于 1月 02, 2018
  
  554f6967
- S
  
  for makelist update · 0df22907
  由 sweetsky0901 提交于 1月 02, 2018
  
  0df22907
29 12月, 2017 1 次提交
- C
  
  move cos_sim_functor to math · 24cf2fcd
  由 chengduoZH 提交于 12月 29, 2017
  
  24cf2fcd
27 12月, 2017 2 次提交
- L
  
  fix nccl cmake error in ONLY_CPU mode · b654e6f7
  由 Luo Tao 提交于 12月 27, 2017
  
  b654e6f7
- L
  
  refine CMakeLists.txt when add op need DEPS · b6796962
  由 Luo Tao 提交于 12月 27, 2017
  
  b6796962
25 12月, 2017 1 次提交
- T
  
  update remove unused code · 700bd24b
  由 typhoonzero 提交于 12月 25, 2017
  
  700bd24b
19 12月, 2017 1 次提交
- Y
  
  parallel_do skeleton pass compile · 9d2c77e6
  由 Yang Yang 提交于 12月 19, 2017
  
  9d2c77e6

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致