- 26 11月, 2019 16 次提交
-
-
由 Youwei Song 提交于
* add axis check for concat op test=develop * fix PADDLE_ENFORCE format test=develop * move to ComputeAxis for InferShape check test=develop
-
由 itminner 提交于
-
由 Tao Luo 提交于
* make CUDA_ARCH_NAME default Auto test=develop * refine warning test=develop
-
由 Zhen Wang 提交于
* fix some typos in AMP. test=develop * delete useless codes. test=develop
-
由 zhaoyuchen2018 提交于
* Fix ernie pythoin infer diff * Refine mask test=develop
-
由 Lv Mengsi 提交于
* fix_bn * revert unittest,test=develop
-
由 lilong12 提交于
* add the framework support for distfc and ut, test=develop * fix the implementation of shard_index_op, test=develop
-
由 Zeng Jinle 提交于
-
由 hong 提交于
* change download log formate; test=develop * add unittest for data download; test=develop * remove cache before download; test=develop
-
由 GaoWei8 提交于
* Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop
-
由 ruri 提交于
-
由 silingtong123 提交于
-
由 silingtong123 提交于
-
由 Jacek Czaja 提交于
-
由 Michał Gallus 提交于
* Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop
-
由 Dong Daxiang 提交于
* fix logger problem test=develop * refine logger test=develop
-
- 25 11月, 2019 9 次提交
-
-
由 zhouwei25 提交于
-
由 wangchaochaohu 提交于
* fix the fill_constant op precious problem test=develop
-
由 zhaoyuchen2018 提交于
* Improve argsort performance. - Give 200000 data to compute argsort on v100, can speed up ~190x before opt cost: 0.53s after opt cost:0.0027s - Add fp16 support * Refine error message * Refine code test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 lijianshe02 提交于
-
由 juncaipeng 提交于
-
由 Thunderbrook 提交于
* print table stat test=develop * notes test=develop * notes test=develop
-
由 zhouwei25 提交于
-
由 WangXi 提交于
-
由 Zeng Jinle 提交于
* add global value getter setter, test=develop * fix error messages, test=develop
-
- 24 11月, 2019 5 次提交
-
-
由 Leo Zhao 提交于
* use prefetch to load next mem into cache test=develop * remove hard code memcpy om pyramid_hash_ff test=develop
-
由 Dong Daxiang 提交于
* fix fetch handler problem and refactor when a user define FetchHandler class, he or she should initialize a handler with variable dict. the key of a variable dict is a user defined name, the value of a variable dict is a Varaible generated from python API. For each fetching, a user should implement handler function in which fetched_result_dict will be available and the user can access the fetched value with user defined keys.
-
由 gongweibao 提交于
-
由 Yiqun Liu 提交于
* Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. test=develop * Print the subgraph when check failed. test=develop
-
- 22 11月, 2019 7 次提交
-
-
由 Yihua Xu 提交于
* Fix the crash issue when scale or bias was null-pointer. test=develop * Add the error message for passing CI. test=develop
-
由 Zhang Ting 提交于
-
由 Liufang Sang 提交于
* add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop
-
由 hutuxian 提交于
Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel. What's more, we improve the UTs about CVM OP.
-
由 Yihua Xu 提交于
* Avoid the string as the key of map to improve the jit performance. test=develop * Use map to replace unordered_map. test=develop
-
由 Chen Weihang 提交于
* polish code details, test=develop * futher polish hint msg, test=develop
-
由 Yi Liu 提交于
pass the argument `allow_out_of_range` of one_hot op to c++ back end.
-
- 21 11月, 2019 3 次提交
-
-
由 xujiaqi01 提交于
* fix fs_client_param bug, user can set this config through fleet_desc_file or fleet config * test=develop
-
由 Thunderbrook 提交于
* general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop * solve pslib stop core test=develop * barrier test=develop * add notes test=develop
-
由 zhongpu 提交于
-