- 18 1月, 2018 2 次提交
-
-
由 Abhinav Arora 提交于
-
由 Yu Yang 提交于
* Fix ParallelDo not support empty input gradient * Polish ParallelDo and fix several bugs * Fix CI * Fix CI
-
- 16 1月, 2018 3 次提交
-
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 sidgoyal78 提交于
-
- 15 1月, 2018 12 次提交
-
-
由 gongweibao 提交于
Fix grpc bugs
-
由 dzhwinter 提交于
* add copyright hook * add copyright hook * refine copyright hook * "test copyright hook" * fix check style * fix ci
-
由 Yancey1989 提交于
-
由 Yancey1989 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 wanghaoshuang 提交于
1. Fix kernel 2. Add more test case
-
由 guosheng 提交于
-
由 yangyaming 提交于
-
由 Qiao Longfei 提交于
* fix while_grad_op first step loss lod problem * optimize code
-
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-
- 13 1月, 2018 2 次提交
-
-
由 wanghaoshuang 提交于
2. Remove num_seq arguments. 3. Refine CUDA kernel of ScaleLoDTensorFunctor. 4. Change max_relative_error of gradient unitest to 0.007
-
由 Cao Ying 提交于
* update code comments. * update the comments. * follow comments.
-
- 12 1月, 2018 4 次提交
-
-
由 Yan Chunwei 提交于
-
由 xuwei06 提交于
-
由 xuwei06 提交于
-
由 xuwei06 提交于
We need this operator to assign value to a tensor and the values are stored in the program so that they can be used independent of python.
-
- 11 1月, 2018 4 次提交
-
-
由 Abhinav Arora 提交于
-
由 wanghaoshuang 提交于
-
由 gongweibao 提交于
Async GRPC sendrecv
-
由 wanghaoshuang 提交于
2. Add check grad test
-
- 10 1月, 2018 8 次提交
-
-
由 Yibing Liu 提交于
-
由 Yibing Liu 提交于
-
由 Yibing Liu 提交于
-
由 Yang Yang(Tony) 提交于
feature/parallel_gpu
-
由 Yang Yu 提交于
-
由 Qiao Longfei 提交于
-
由 Qiao Longfei 提交于
* add lod tensor ToAbsOffset test * add share lod to topk op and softmax op
-
由 xuwei06 提交于
-
- 09 1月, 2018 4 次提交
-
-
由 yangyaming 提交于
-
由 fengjiayi 提交于
-
由 Yancey 提交于
* test dist word2vec * multiple trainers work
-
由 Yiqun Liu 提交于
* Add Seq2BatchFunctor, which will be used in WarpCTCOp. * Implement WrapCTCFunctor and WrapCTCKernel. * Add unittest of warpctc_op. * Modify the check_output inferface in python unittest framework to allow check a subset of outputs. * Use absolute offset lod in warpctc_op and related functors. * Refine the comments of warpctc_op. * The new python unittest supports checking a subset of the outputs, so revoke the previous change. * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor. * Update to the newest codes. * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
-