提交 · 3560e6806c8626e886c97780334f8bf69dae1e05 · Crayon鑫 / Paddle

31 3月, 2021 1 次提交

[ROCM] Add ROCm support for warpctc op (#31817) (#31971) · 3560e680

由 furnace 提交于 3月 31, 2021

* bugfix for warpctc

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix WARPCTC_WITH_HIP invalid

* Add logs to find out why can not dlopen libwarpctc.so

* fix warpctc commit id

* fix unit test test_warpctc_op

* Optime failed log for dlopen

* Optime failed log for dlopen

* Delete extra changes

* fix warpctc commit id

* fix warpctc commit id

* Add is_compiled_with_rocm for test_warpctc_op

* fix warpctc commit id

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* fix code style problems

3560e680

28 9月, 2020 1 次提交
- L
  
  fix tests warpctc (#27639) · 516d84b2
  由 Li Fuchen 提交于 9月 28, 2020
  
  516d84b2
27 9月, 2020 1 次提交

add support to float64 input of warpctc op. (#27399) · 1501a80f

由 Li Fuchen 提交于 9月 27, 2020

* add float64 input to ctc_loss

* modified error message of  warpctc

* update repo and tag of warpctc

* add test for warpctc with float64 input

* modified warpctc.cmake to make sure build always

* resolved sample code bug of warpctc

* add core.ops in warpctc dygraph

* fix a bug of test

1501a80f

23 9月, 2020 1 次提交

Make the Bind Method of Tensor more automatic (#27270) · 1e1ae5c5

由 Zhou Wei 提交于 9月 23, 2020

* Makes the Bind Method more intelligent

* Makes the Bind Method more intelligent

* fix unittest

* fix unittest

* fix conflict

1e1ae5c5

21 8月, 2020 1 次提交
- L
  add functional ctc_loss and CTCLoss class. (#26384) · dbf232a9
  由 Li Fuchen 提交于 8月 21, 2020
```
* add functional ctc_loss and CTCLoss class.

* modified docstring of ctc_loss and CTCLoss
```
  dbf232a9
30 4月, 2020 1 次提交

OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement (#24261) · 5dc069d0

由 Li Fuchen 提交于 4月 30, 2020

* enhance add_position_encoding error message, test=develop

* enhance warpctc & scaled_dot_product_attention error message, test=develop

* modified error message and ctest of scaled_dot_product_attention, test=develop

5dc069d0

13 12月, 2019 1 次提交

use large input shape for accuracy test (#21716) · d528ffaa

由 zhupengyang 提交于 12月 13, 2019

affine_grid, label_smooth, spectral_norm, warpctc,
nearest_interp, data_norm, match_matrix_tensor,
var_conv_2d, fused_embedding_seq_pool

test=develop

d528ffaa

29 11月, 2019 1 次提交

Add dygraph execution context (#20157) · ac854670

由 hong 提交于 11月 29, 2019

* add_dygraph_execution_context

* add dygraph infershape context and execution context; test=develop

* fix imperative bug; test=develop

* remove inputs outputs interface from execution context,
because it have same function with inputNames;
test=develop

* remove tracer_test ctest; test=develop

* fix split op bug; test=develop

* fix unitests bug; test=develop

* fix distribute test bug; test=develop

* fix ngraph compile bug; test=develop

* fix grad maker bug; test=develop

* fix load op bugs; test=develop

* fix operator.cc construct bug; test=develop

* remove useless name find in operator; test=develop

* add tracer_test; test=develop

* fix concat, split bug; test=develop

* remove tracer_test unitest; test=develop

* fix attribute check bug; test=develop

* add test code to fix converage; test=develop

* remove useless code, change check backward input in engin; test=develop

* unlock var type infer shape;test=develop

* add ShareAllLoD api; test=develop

* add dygraph infershape context unitest; test=develop

* remove increase and decrease lod in dygraph; test=develop

* addd override; test=develop

* fix increase descrease lod; test=develop

* fix paddle_enforce; test=develop

* disable lod op dygraph check; test=develop

* fix paddle enforce error; test=develop

* add comment for op_registry and OperatorBase; test=develop

* optimize the comment of op_registry; test=develop

* fix format of comment; test=develop

* fix format of comment; test=develop

* optimize the format of comment; test=develop

* optimize the format of the comment; test=develop

* optimize comment of op_registry; test=develop

ac854670

14 11月, 2019 1 次提交
- W
  
  Fix warpctc in padding mode. (#21033) · cfdd1fc2
  由 whs 提交于 11月 14, 2019
  
  cfdd1fc2
11 9月, 2019 1 次提交
- T
  paddle::framework::vectorize() templatization (#19730) · ec9bc1bd
  由 Tao Luo 提交于 9月 11, 2019
```
remove unused accuracy-diff warpctc-cudnn implementation

test=develop
```
  ec9bc1bd
27 8月, 2019 1 次提交

Support Tensor input with padding for warpctc op (#19322) · 482ce818

由 vincentXiyu 提交于 8月 27, 2019

* support tensor input with padding for warpctc op

* merge with develop

* test=develop

* modified python API examples test=develop

* nn.py is modified for code coverage test=develop

* update documents info about warpctc op in API.spec test=develop

* add test_warpctc_with_padding in test_layers test=develop

* add warning log for cuda_version back to warpctc_op.cc

* modify API.spec for warpctc op test=develop

* modify API.spec

* update warpctc test to new CompiledProgram API test=develop

* modify code examples for warpctc op test=develop

* modify API.spec for warpctc op test=develop

* modify API.spec for warpctc op test=develop

482ce818

12 6月, 2019 1 次提交

Cherry-pick: fix random CI failure. (#18011) · 0bf25351

由 Huihuang Zheng 提交于 6月 12, 2019

* Cherry-pick fix random Python3 CI failure.

In some tests, SWEs used "print('xxx').format('xxx')". The syntax
is only supported in Python2, not python3. However, since those
lines are related to data download, if the CI machines already have
the data, it passes CI tests. That causes random failure.

* Cherry-pick: disable CUDNN case of test_warpctc_op

Also temporary disable a unit test. The test will be fixed under high priority.

0bf25351

10 6月, 2019 1 次提交
- H
  Ignore a unit test which failed on cuda9/10 python3 ci task (#17950) · 9f519baf
  由 Huihuang Zheng 提交于 6月 10, 2019
```
TODO: it is a temporary fix for Paddle release 1.5. We have to fix
this failed unit test soon.

test=develop
```
  9f519baf
16 11月, 2018 1 次提交

Add cudnn ctc loss (#12366) · b32c13dc

由 Wu Yi 提交于 11月 16, 2018

* add cudnn ctc loss

* wip add test test=develop

* wip

* wip

* done test=develop

* move include cudnn test=develop

* test test=develop

* fix build test=develop

* fix build test=develop

* fix build on cudnn5 test=develop

* fix cudnn5 build test=develop

* fix cudnn5 build test=develop

* merge develop softmax functor change test=develop

b32c13dc

15 8月, 2018 1 次提交
- M
  
  Add print_function for all python files · 99d3f089
  由 minqiyang 提交于 8月 15, 2018
  
  99d3f089
07 8月, 2018 1 次提交

Fix pybind11 problem · 6abe819f

由 minqiyang 提交于 8月 07, 2018

Fix str and bytes problem
Fix sorted problem
Fix math problem
Fix CI problem

6abe819f

26 7月, 2018 2 次提交
- M
  
  Remove python3 relative import of unittest · 9fc13fde
  由 minqiyang 提交于 7月 26, 2018
  
  9fc13fde
- M
  
  Change iter_parameters back and port unittests code to Python3 · 35e6abd7
  由 minqiyang 提交于 7月 26, 2018
  
  35e6abd7
15 6月, 2018 1 次提交

Modify Pybind LoDTensor API according to length-based LoD (#11106) · 417fcf4f

由 Kexin Zhao 提交于 6月 15, 2018

* add lod_tensor util and modify pybind

* refind pybind LoDTensor API and modify LoDTensor and DataFeeder test

* fix test error

* fix detection map op test

* fix reorder_lod_tensor test

* fix seq_concat_op

* fix chunk evel op test

* fix target assign op

* fix warp ctc op

* address comments step 1: reverse reset_lod op

* step 2: modify op test

* add warning message

* remove has_valid_lod

* add back has_valid_lod

* address comments

* add exception catching trial

417fcf4f

22 5月, 2018 1 次提交
- Y
  
  enable serial tests · b920d2c2
  由 yuyang18 提交于 5月 22, 2018
  
  b920d2c2
21 5月, 2018 1 次提交
- Y
  
  Skip hang op · 0ce84027
  由 yuyang18 提交于 5月 21, 2018
  
  0ce84027
24 2月, 2018 1 次提交
- L
  
  move Fluid API code out of V2 API code · b11956a0
  由 Luo Tao 提交于 2月 24, 2018
  
  b11956a0
13 2月, 2018 1 次提交

Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a

由 Xin Pan 提交于 2月 13, 2018

Currently, our tests run with 2 GPUs, the init time is absurdly long:
about 4s for each process.  Currently, we run each OP test on
different processes. This PR:

1. create cmake function py_test_modules which will generate the
Makefile that runs a list of Python unittest module in a single Python
process.

2. move all "python unittest compatible" (e.g., used the unittest
package, not just a regular python file). from fluid/tests to
fluid/tests/unittests.

3. cmake now will run all OP tests in fluid/tests/unittests in a
single process, except the time-consuming tests, they are separated
into different processes to utilize parallelism. Please make sure to
use the unittest package if you put the python test file in
fluid/tests/unittests

4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
to disable unittest, we can not do it when running all tests in a
single process since it will terminate the process without running the
other tests. Instead, the test is disabled in
fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
item. Please disable the unittest from
fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
Python file, for all Python file in fluid/tests/unittests/.

5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
tests in separate process so that they can be tested individually.

cde6241a

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
21 1月, 2018 1 次提交

"fix decode bug" (#7711) · e983cc90

由 dzhwinter 提交于 1月 21, 2018

* "fix decode bug"

* "follow commnet"

* "fix error"

* "fix hook bug"

* fix based comment

* fix copyright

* fix based on comment

e983cc90

15 1月, 2018 2 次提交
- D
  Feature/hooks (#7513) · b9b75377
  由 dzhwinter 提交于 1月 15, 2018
```
* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci
```
  b9b75377
- W
  Fix sequence scale functor cuda kernel · 8f37c3c2
  由 wanghaoshuang 提交于 1月 15, 2018
```
1. Fix kernel
2. Add more test case
```
  8f37c3c2
13 1月, 2018 1 次提交

1. Fix warpctc grad tensor initial bug. · 137f0dfc

由 wanghaoshuang 提交于 1月 13, 2018

2. Remove num_seq arguments.
3. Refine CUDA kernel of ScaleLoDTensorFunctor.
4. Change max_relative_error of gradient unitest to 0.007

137f0dfc

11 1月, 2018 2 次提交
- W
  
  Uncomment check output in unitest · fd24e195
  由 wanghaoshuang 提交于 1月 11, 2018
  
  fd24e195
- W
  1. Fix warpctc grad op · b1af5e43
  由 wanghaoshuang 提交于 1月 11, 2018
```
2. Add check grad test
```
  b1af5e43
09 1月, 2018 1 次提交

Port WarpCTC Operator (#5107) · b5fda272

由 Yiqun Liu 提交于 1月 09, 2018

* Add Seq2BatchFunctor, which will be used in WarpCTCOp.

* Implement WrapCTCFunctor and WrapCTCKernel.

* Add unittest of warpctc_op.

* Modify the check_output inferface in python unittest framework to allow check a subset of outputs.

* Use absolute offset lod in warpctc_op and related functors.

* Refine the comments of warpctc_op.

* The new python unittest supports checking a subset of the outputs, so revoke the previous change.

* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.

* Update to the newest codes.

* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.

b5fda272

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致