- 08 Oct, 2019 3 commits
-
-
Committed by Chengmo
* Speed up GEO-SGD (#20158) * delete debug vlog & add rpc function & fix word2vec bug & speed up GEO-SGD
-
Committed by tangwei12
Heartbeat for distributed async training.
-
Committed by silingtong123
-
- 07 Oct, 2019 1 commit
-
-
Committed by Yibing Liu
* Update crf_decoding api & example * Update api spec * Fix linear chain crf api * Avoid sharing data pointer with input * Simplify the logic in linear_chain_crf_decoding * Add unittest for crf_decoding when label & path both are set * Update API spec * Add unittest for layers && correct infer_shape in chunk_eval test=release/1.6
-
- 04 Oct, 2019 1 commit
-
-
Committed by Li Fuchen
* Set lod level of sequence_unpad's output to 1 at compile time test=release/1.6
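
For readers unfamiliar with the operator, the sketch below illustrates what removing padding from a batch of sequences can look like, and why the result naturally carries one level of LoD (sequence offsets). It is a plain-Python illustration, not PaddlePaddle's implementation; the function name and the offset-based LoD layout are assumptions made for this example.

```python
def sequence_unpad(padded, lengths):
    """Drop padding from each row; return (flat_data, lod).

    Hypothetical helper for illustration only. `lod` is a list with a
    single level, stored as cumulative offsets into `flat_data`.
    """
    if len(padded) != len(lengths):
        raise ValueError("one length per padded row is required")
    flat = []
    offsets = [0]
    for row, n in zip(padded, lengths):
        if not 0 <= n <= len(row):
            raise ValueError("length must fit inside the padded row")
        flat.extend(row[:n])             # keep only the real elements
        offsets.append(offsets[-1] + n)  # start offset of the next sequence
    return flat, [offsets]


padded = [[1, 2, 0, 0], [3, 4, 5, 0], [6, 0, 0, 0]]
flat, lod = sequence_unpad(padded, [2, 3, 1])
```

Here `lod` holds exactly one list of offsets, which is what "lod level 1" refers to in this sketch.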
-
- 03 Oct, 2019 2 commits
-
-
Committed by zhaoyuchen2018
test=release/1.6 * Add multihead op for ernie opt * Refine softmax * Refine kernel. * Refine cuda kernel * Refine cuda version * Refine cmake
-
Committed by liym27
1. support asymmetric padding; 2. support padding algorithm: "SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. change doc of python API and c++; test=release/1.6
-
- 02 Oct, 2019 1 commit
-
-
Committed by Chengmo
* refactor geo sgd & communicator
-
- 01 Oct, 2019 1 commit
-
-
Committed by danleifeng
-
- 29 Sep, 2019 2 commits
-
-
Committed by chengjuntao
-
Committed by wangguanzhong
* refine doc, test=document_fix * add API.spec,test=develop,test=document_fix
-
- 28 Sep, 2019 4 commits
-
-
Committed by silingtong123
* test=develop, argument shape support tensor and tensor in list * test=develop, Increasing the coverage of CI tests * test=develop, modify the document and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the interface of UniformInitializer * test=develop, modify the interface of XavierInitializer and MSRAInitializer * test=develop, modify based on review comments * test=develop, modify based on review comments * test=develop, modify based on review comments
-
Committed by liym27
* fix pool2d pool3d: 1. support asymmetric padding; 2. support padding algorithm: "SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. support inferring shape when input has negative dims at compile time; 5. change doc of python API and c++; 6. fix bug in cuda kernel when Attr(adaptive) is true. test=develop,test=document_preview * fix 'tensors' to 'Tensors'. test=develop,test=document_preview * add test to cover ValueError. test=develop,test=document_preview * resolve conflict in test_pool2d. test=develop
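
As an illustration of the padding options named in this commit, the sketch below computes the output size of one spatial dimension under "VALID", "SAME", and explicit (possibly asymmetric) padding. It follows the widely used convention for these two algorithms and is an assumption, not code taken from the commit; the function name `pool_output_size` is hypothetical, and options such as ceil mode are ignored.

```python
def pool_output_size(in_size, ksize, stride, padding="VALID"):
    """Return (out_size, pad_before, pad_after) for one spatial dimension.

    `padding` is "VALID", "SAME", or an explicit (pad_before, pad_after)
    pair, which may be asymmetric. Illustrative helper only.
    """
    if padding == "VALID":
        pad_before = pad_after = 0           # no padding at all
    elif padding == "SAME":
        target = -(-in_size // stride)       # ceil(in_size / stride)
        pad_total = max((target - 1) * stride + ksize - in_size, 0)
        pad_before = pad_total // 2          # any odd remainder goes after
        pad_after = pad_total - pad_before
    else:
        pad_before, pad_after = padding      # explicit padding
    out_size = (in_size + pad_before + pad_after - ksize) // stride + 1
    return out_size, pad_before, pad_after


valid = pool_output_size(7, 3, 2, "VALID")
same_odd = pool_output_size(7, 3, 2, "SAME")
same_even = pool_output_size(8, 3, 2, "SAME")
explicit = pool_output_size(7, 3, 2, (1, 2))
```

With an input of size 8, "SAME" already yields asymmetric padding (0 before, 1 after), which is one reason a pooling op needs to accept different padding on each side.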
-
Committed by Adam
test=develop
-
Committed by lvmengsi
-
- 27 Sep, 2019 6 commits
-
-
Committed by Kaipeng Deng
* fix pool infershape. test=develop * fix unittest coverage. test=develop * fix format. test=develop
-
Committed by chengduo
* make pad and split support fp16 test=develop
-
Committed by lvmengsi
-
Committed by tangwei12
* add a base class for the Communicator * add AsyncCommunicator Impl for async distributed training
-
Committed by danleifeng
Polish English docs of elementwise_add/sub/mul/div
-
Committed by Li Fuchen
Use AllocateTmpTensor() for creating temporary tensors in warpctc.
-
- 26 Sep, 2019 11 commits
-
-
Committed by wangchaochaohu
-
Committed by Adam
test=develop
-
Committed by joanna.wozna.intel
* Fix conv2d+dequantize squash for residual fusion test=develop * Correct int8 input test=develop * Add if exclude or include padding in pool2d mkldnn test=develop
-
Committed by Aurelius84
* x.dims == y.dims test=develop * refine comment
-
Committed by Aurelius84
* fix input shape check test=develop * move PADDLE_ENFORCE test=develop
-
Committed by chengduo
Add dtype for coalesce_tensor_op
-
Committed by Zhaolong Xing
test=develop test=document_fix
-
Committed by gongweibao
Polish elementwise max min pow document to add more examples
-
Committed by Aurelius84
-
Committed by Tao Luo
test=develop
-
Committed by Chen Weihang
* add lod check for sequence op, test=develop * delete unnecessary check in expand op, test=develop
-
- 25 Sep, 2019 6 commits
-
-
Committed by zhongpu
* add kernel for fill_op, test=develop * modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop * add op test for fill_op, test=develop * REGISTER COP CUDA KERNEL, test=develop * update test_fill_op.py, test=develop * change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop * fix op test, test=develop * add header file, test=develop
-
Committed by wangchaochaohu
* add support tensor and tensorlist for strided_slice OP test=develop * fix the comment test=develop * fix test=develop * fix the bug test=develop * delete log test=develop * fix API.spec test=develop * fix test=develop
-
Committed by lvmengsi
* fix bn
-
Committed by Bob Zhu
* add support of matmul with multiple heads even with different width and height. The original matmul with multiple heads supports only mat_a.width == mat_b.height; in that case, mat_b is horizontally split. This patch extends the support to the case where mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height; in this case, mat_b is vertically split. One example: A is [3, 8], B is [2, 16], and head_number is 4. A will be split as [3, 2] and B will be (vertically) split as [2, 4]. The final result will be 4 matrices of [3, 4], i.e. [3, 16] test=develop * refactor the code of matmul with multiple heads even with different width and height test=develop
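
The shape arithmetic in this commit message can be checked with a small plain-Python sketch. It covers only the newly supported case (mat_a.width / head_number == mat_b.height) and assumes the per-head products are concatenated left to right in head order; the helper names are hypothetical and this is not the kernel code from the patch.

```python
def matmul(a, b):
    """Naive dense matrix product of two list-of-lists matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def split_columns(m, parts):
    """Split matrix m into `parts` equal-width column blocks."""
    width = len(m[0]) // parts
    return [[row[p * width:(p + 1) * width] for row in m]
            for p in range(parts)]


def multihead_matmul(a, b, head_number):
    """Multiply A and B head by head, then concatenate the results."""
    a_width, b_height, b_width = len(a[0]), len(b), len(b[0])
    if a_width % head_number or b_width % head_number:
        raise ValueError("widths must be divisible by head_number")
    if a_width // head_number != b_height:
        raise ValueError("need mat_a.width / head_number == mat_b.height")
    a_heads = split_columns(a, head_number)   # e.g. 4 blocks of [3, 2]
    b_heads = split_columns(b, head_number)   # e.g. 4 blocks of [2, 4]
    products = [matmul(x, y) for x, y in zip(a_heads, b_heads)]
    # Concatenate the per-head [3, 4] results along columns: [3, 16].
    return [sum((p[i] for p in products), []) for i in range(len(a))]


A = [[r * 8 + c for c in range(8)] for r in range(3)]     # shape [3, 8]
B = [[r * 16 + c for c in range(16)] for r in range(2)]   # shape [2, 16]
out = multihead_matmul(A, B, 4)
```

With the shapes from the commit message, `out` has 3 rows and 16 columns, matching the stated [3, 16] result.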
-
Committed by Liufang Sang
* refine ctc align op with padding * refine api sample code
-
Committed by Aurelius84
* Remove last-dim constraints of seq_pad and seq_unpad test=develop * fix test_layer api code test=develop * fix sequence_pad_op.cc conflict test=develop * remove test_analyzer_mm_dnn test=develop * fix vectorize bug test=develop * fix vectorize<int> test=develop
-
- 24 Sep, 2019 2 commits
-
-
Committed by jhjiangcs
-
Committed by Yang Zhang
* Add float16 support to `sync_batch_norm_op` test=develop * Add test for sync_bn with FP16 input test=develop
-