- 26 10月, 2020 2 次提交
-
-
由 Zhou Wei 提交于
-
由 mapingshuo 提交于
* add sharding
-
- 23 10月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
* add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor
-
由 lidanqing 提交于
-
- 22 10月, 2020 4 次提交
-
-
由 Double_V 提交于
-
由 Feiyu Chan 提交于
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace * add unittest for tensors in cuda pinned place * skip test for cuda pinned place on cpu machines
-
由 Leo Chen 提交于
* fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass
-
由 WangXi 提交于
-
- 21 10月, 2020 6 次提交
-
-
由 Wilber 提交于
-
由 danleifeng 提交于
* nccl init support hostname and ip; test=develop
-
由 wangguanzhong 提交于
-
由 Pei Yang 提交于
-
由 Double_V 提交于
-
由 Zhou Wei 提交于
-
- 20 10月, 2020 8 次提交
-
-
由 wangguanzhong 提交于
* fix generate_proposal_labels in cascade-rcnn series model, test=develop * fix example code & unittest, test=develop * update code from review comments, test=develop
-
由 Leo Chen 提交于
* fill_constant supports nan and inf * add ut
-
由 zhupengyang 提交于
-
由 Double_V 提交于
* add stack pool2d roi_align xpu op,test=kunlun * error message opt, test=kunlun * add xpu unittest,test=kunlun * skip check grad,test=kunlun * fix boostget , test=kunlun * error message opt for XPU, test=kunlun * add rois_num for roi_align xpu OP, test=develop
-
由 xiaoting 提交于
-
由 joanna.wozna.intel 提交于
* Add AVX512 instruction check for C-API * Fix formatting
-
由 wangchaochaohu 提交于
-
由 yinhaofeng 提交于
* lookup_table_v2_op_xpu report errors;test=kunlun * lookup_table_v2_op_xpu report errors;test=kunlun
-
- 19 10月, 2020 13 次提交
-
-
由 yinhaofeng 提交于
* lookup_table_xpu op report errors;test=kunlun * add adam xpu op;test=kunlun * reset lookup * change adam wrong;test=kunlun
-
由 TeslaZhao 提交于
-
由 Chengmo 提交于
* fix error message,test=kunlun * fix, test=kunlun
-
由 LutaoChu 提交于
Fix diag OP bug on Windows Python3.8 ,remove the std::min
-
由 Pei Yang 提交于
-
由 huangxu96 提交于
* Still has bugs. * Fixed allclose_op bug, which cannot deal with some cases of fp64 inputs. * improved CUDA kernel performance. * Changed CUDA code. * Fixed a bug in cuda kernel which cannot deal with large dimension input, and added an unittest for it. * Add a test case for float32 input.
-
由 pangyoki 提交于
* fix multinomial doc * fix multinomial error message * little doc change * fix Categorical class doc * optimize format of error message * fix CPU Kernel error message format * fix isinf and isnan error in WindowsOPENBLAS CI * delete inf and nan * add manual_seed in sample code * little error message change * change error message to InvalidArgument * add full point for error message and add manual_seed in CPU environment
-
由 Kaipeng Deng 提交于
-
由 Double_V 提交于
* add stack pool2d roi_align xpu op,test=kunlun * error message opt, test=kunlun * add xpu unittest,test=kunlun * skip check grad,test=kunlun * fix boostget , test=kunlun * error message opt for XPU, test=kunlun
-
由 pangyoki 提交于
* Add truncated_gaussian_random_op XPU kernel * Add truncated_gaussian_random_op XPU kernel, test=kunlun * little change, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * little change, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * little change, test=kunlun * add TODO, test=kunlun
-
由 pangyoki 提交于
* Add gaussian_random XPU kernels * commit kunlun, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * add TODO, test=kunlun
-
由 pangyoki 提交于
* support uniform_random op on Baidu Kunlun * change dtype of attr shape from int to int64_t * kunlun ci, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format * run Kunlun CI, test=kunlun * add TODO, test=kunlun
-
由 xiaoting 提交于
* polish error message,test=kunlun * polish error,test=kunlun * polish error,test=kunlun * polish error,test=kunlun
-
- 18 10月, 2020 1 次提交
-
-
由 liuyuhui 提交于
* addd * add cast_op_xpu, test=kunlun * fix bug for cast_op_xpu,test=kunlun * add concat_op_xpu, test=kunlun * slove conflicts, test=kunlun * fix bug,test=kunlun * add assign_op_xpu, test=kunlun * fix bug,test=kunlun * test=kunlun;test=develop * fix concat bug,test=kunlun * fix check_dygraph set in test_concat_op_xpu.py,test=kunlun * fix error message,test=kunlun Co-authored-by: Nmapingshuo <mps2012@yeah.net>
-
- 16 10月, 2020 4 次提交
-
-
由 Guo Sheng 提交于
* Incorporate cudnn_lstm into LSTM api. test=develop * Make coalesce_tensor support alignment optionally. test=develop * Reorganize RNN apis. test=develop * Fix cudnn rnn layout conversion. test=develop * Add sequence_length support for RNN cudnn implement. Add optional init_h and init_c gradient for cudnn_lstm_op. test=develop * Use create_parameter for rnn cudnn impl. test=develop * Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program. test=develop * Update RNN api unittest to use set_device. test=develop * Fix set_place for unit tests of RNN apis. test=develop * Fix use_align in coalesce_tensor_op. test=develop * Adjust RNN apis arguments according to comments. test=develop * Polish documents for SimpleRNN apis. test=develop * Refine random seed in cudnn_lstm_op. Expose rnn params from sublayers to RNN. test=develop * Fix RNN saving for jit.save. Refine cudnn_lstm dropout behavior. test=develop * Fix doc of GRU. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Remove updates on cudnn_lstm temporarily. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Refine random seed in cudnn_lstm_op. test=develop * Fix test_lstm by adjust ConcreteProgram buffer getter. test=develop * Use create_parameter instead of create_var for rnn._flat_weight for static graph usage. test=develop * Remove W input for cudnn_lstm to pass unused_var_check. test=develop * Add test_predict for RNN unit tests coverage. test=develop * Fix code style of rnn. test=develop * Fix F.rnn usage in rnn.py. test=develop
-
由 chentianyu03 提交于
* change paddle.fluid.layers.reduce_sum to paddle.sum in sample codes * format codes
-
由 Guanghua Yu 提交于
-
由 Jack Zhou 提交于
* test=kunlun; Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast): * elementwise_div op * elementwise_max op * elementwise_mul op (with grad op) * elementwise_sub op (with grad op) * 0.05->0.01 * add xpu error message description;test=kunlun
-