- 26 Nov, 2019 2 commits
-
-
Committed by Jacek Czaja
-
Committed by Michał Gallus
* Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop
-
- 25 Nov, 2019 3 commits
-
-
Committed by zhaoyuchen2018
* Improve argsort performance. - Running argsort on 200000 elements on a V100 is ~190x faster: 0.53s before optimization, 0.0027s after. - Add fp16 support * Refine error message * Refine code test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by WangXi
-
Committed by Zeng Jinle
* add global value getter setter, test=develop * fix error messages, test=develop
-
- 24 Nov, 2019 3 commits
-
-
Committed by Dong Daxiang
* fix fetch handler problem and refactor. When a user defines a FetchHandler class, he or she should initialize the handler with a variable dict. Each key of the variable dict is a user-defined name, and each value is a Variable generated from the Python API. For each fetch, the user should implement a handler function in which fetched_result_dict is available, so the fetched values can be accessed with the user-defined keys.
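The handler pattern described in this commit message can be sketched in plain Python. This is only an illustration under stated assumptions: `FetchHandlerSketch`, `LossRecorder`, and `fake_fetch` are hypothetical names, strings stand in for real Variable objects, and nothing here is taken from the actual PaddlePaddle implementation.

```python
class FetchHandlerSketch:
    """Hypothetical stand-in for the FetchHandler base class (not Paddle's code)."""

    def __init__(self, var_dict):
        # Keys are user-defined names; values are the variables to fetch.
        self.var_dict = var_dict

    def handler(self, fetched_result_dict):
        # Subclasses implement this; it is called once per fetch.
        raise NotImplementedError


class LossRecorder(FetchHandlerSketch):
    """Example user-defined handler that records the value stored under 'loss'."""

    def __init__(self, var_dict):
        super().__init__(var_dict)
        self.seen = []

    def handler(self, fetched_result_dict):
        # Fetched values are looked up with the same user-defined keys.
        self.seen.append(fetched_result_dict["loss"])


def fake_fetch(handler, values_by_variable):
    """Hypothetical driver: build fetched_result_dict and pass it to the handler."""
    result = {name: values_by_variable[var] for name, var in handler.var_dict.items()}
    handler.handler(result)


# Usage: the string "loss_var" is a placeholder for a real Variable.
recorder = LossRecorder({"loss": "loss_var"})
fake_fetch(recorder, {"loss_var": 0.5})
fake_fetch(recorder, {"loss_var": 0.25})
```

The point of the design is that the user never deals with internal variable names when reading results; the keys chosen at construction time are the only lookup handles.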
-
Committed by gongweibao
-
- 22 Nov, 2019 3 commits
-
-
Committed by Liufang Sang
* add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop
-
Committed by hutuxian
Previously, the CVM OP could only run on CPU. This PR implements its GPU kernel and also improves the unit tests for the CVM OP.
-
Committed by Chen Weihang
* polish code details, test=develop * further polish hint msg, test=develop
-
- 21 Nov, 2019 4 commits
-
-
Committed by xujiaqi01
* fix fs_client_param bug, user can set this config through fleet_desc_file or fleet config * test=develop
-
Committed by zhongpu
-
Committed by zhongpu
* open dygraph op test, test=develop * modify to_variable, test=develop * modify input and output for dygraph, test=develop * modify input and output for dygraph(fix bug), test=develop * fix input processing of dygraph op test, test=develop * fix bug, test=develop * fix op test, test=develop * fix forward bug for dygraph, test=develop * fix mkldnn op test for forward, test=develop * update nn.py for dygraph, test=develop * fix crop_tensor_op, test=develop * fix elementwise_mul_op, test=develop * fix fill_op, test=develop * fix some mkldnn op, test=develop * open backward op test for dygraph, test=develop * delete log, test=develop * close backward op test for dygraph, test=develop * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop * fix optest backward bug for dygraph, test=develop * fix optest backward bug for dygraph, test=develop * close backward op test for dygraph, test=develop * close backward op test for dygraph, test=develop * open dygraph op test, test=develop * fix op test for dygraph, fix GradOpDescMaker, test=develop * fix bug for linear_chain_crf_op.h, test=develop * remove log, test=develop * remove log, test=develop * remove log for op_test.py, test=develop * remove log for op_test.py, test=develop * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop * fix bug for test_increment_ngraph_op.py, test=develop * fix lod for op test in dygraph, test=develop * refactor op_test.py to reduce redundant code, test=develop * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop * remove debug log, test=develop * remove redundant code in base.py, test=develop * fix some error in optest, test=develop * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop * refactor op_test.py, test=develop * remove redundant writing, test=develop * fix error(get tensor of the grad variable), test=develop * fix test_concat_mkldnn 
test_conv2d_mkldnn, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix some redundant code, test=develop * resolve conflict and rewrite paddle error message, test=develop
-
Committed by lijianshe02
* add input type and input data type check for Print_op test=develop
-
- 20 Nov, 2019 1 commit
-
-
Committed by liym27
* add control flow API: case. test=develop * delete 'raise TypeError' in _error_message() and return a string. test=develop * polish API document. test=develop
-
- 19 Nov, 2019 2 commits
-
-
Committed by danleifeng
-
Committed by yaoxuefeng
* fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop
-
- 18 Nov, 2019 6 commits
-
-
Committed by Zeng Jinle
* fix warnings of gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop
-
Committed by danleifeng
-
Committed by liym27
* add API switch_case. test=develop add Nest * modify code according to reviews: 1.Attr(branch_index) support 'uint8' and 'int64' besides 'int32'. 2.remove useless code. test=develop * replace fluid.layers.data with fluid.data and polish API document. test=develop
-
Committed by guofei
-
Committed by WangXi
-
Committed by Zeng Jinle
-
- 15 Nov, 2019 1 commit
-
-
Committed by xujiaqi01
* fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop
-
- 14 Nov, 2019 4 commits
-
-
Committed by Kaipeng Deng
-
Committed by Zeng Jinle
* friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop
-
Committed by whs
-
Committed by Tao Luo
* add input type and dtype check template, and update some APIs check * refine check template, and update some APIs check in nn.py * update some APIs check in loss.py test=develop
-
- 13 Nov, 2019 1 commit
-
-
Committed by gongweibao
use 2 cards test=develop
-
- 12 Nov, 2019 5 commits
-
-
Committed by zhaoyuchen2018
* Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.
-
Committed by WangXi
-
Committed by lilong12
modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802) * modify the implementation of save_persistables and save_inference_model functions for fleet collective, test=develop * add ut, test=develop
-
Committed by ceci3
* fix instance norm * update unittest, test=develop
-
Committed by lilong12
* set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop
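For readers unfamiliar with the operator, the standard PReLU definition is f(x) = x for x > 0 and f(x) = alpha * x otherwise. The following is a minimal pure-Python sketch of that formula using the 0.25 default mentioned in the commit. The function name `prelu_sketch` is hypothetical, and treating "element mode" as one alpha per input element is an assumption made for illustration, not a description of the Paddle kernel.

```python
def prelu_sketch(xs, alpha=0.25):
    """Apply PReLU to a flat list of numbers.

    alpha may be a single number (shared by all elements) or a sequence with
    one slope per element (assumed meaning of "element mode").
    """
    if isinstance(alpha, (list, tuple)):
        alphas = list(alpha)
    else:
        alphas = [alpha] * len(xs)
    if len(alphas) != len(xs):
        raise ValueError("alpha must be a scalar or match the input length")
    # Positive inputs pass through; non-positive inputs are scaled by alpha.
    return [x if x > 0 else a * x for x, a in zip(xs, alphas)]


# Usage with the default slope and with per-element slopes.
shared = prelu_sketch([-4.0, 2.0])
per_element = prelu_sketch([-2.0, -2.0], alpha=[0.5, 0.25])
```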
-
- 11 Nov, 2019 3 commits
-
-
Committed by Huihuang Zheng
-
Committed by Huihuang Zheng
-
Committed by Yiqun Liu
* Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop
-
- 08 Nov, 2019 2 commits
-
-
Committed by joanna.wozna.intel
* Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdb, reversing changes made to 2ce6473f. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd7. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop
-
Committed by LielinJiang
-