- 20 Oct, 2020 4 commits
-
-
Committed by yinhaofeng
* lookup_table_v2_op_xpu report errors;test=kunlun
* lookup_table_v2_op_xpu report errors;test=kunlun
-
Committed by yinhaofeng
* xpu adam op (#28031)
* lookup_table_xpu op report errors;test=kunlun
* add adam xpu op;test=kunlun
* reset lookup
* change adam wrong;test=kunlun
* add adam xpu op;test=kunlun
-
Committed by Double_V
-
Committed by TeslaZhao
-
- 19 Oct, 2020 13 commits
-
-
Committed by Chengmo
* fix error message,test=kunlun
* fix, test=kunlun
-
Committed by huangxu96
* Fixed an allclose_op bug where some cases of fp64 inputs were not handled.
* Improved CUDA kernel performance.
* Fixed a bug in the CUDA kernel, which could not handle large-dimension input, and added a unit test for it.
* Added a test case for float32 input.
-
Committed by xiaoting
-
Committed by Double_V
* add stack pool2d roi_align xpu op,test=kunlun
* error message opt, test=kunlun
* add xpu unittest,test=kunlun
* skip check grad,test=kunlun
* fix boostget , test=kunlun
* error message opt for XPU, test=kunlun
-
Committed by pangyoki
* Add truncated_gaussian_random_op XPU kernel
* Add truncated_gaussian_random_op XPU kernel, test=kunlun
* little change, test=kunlun
* change boost_get to BOOST_GET_CONST
* change boost_get to BOOST_GET_CONST, test=kunlun
* little change, test=kunlun
* use Generator to generate random number and optimize format, test=kunlun
* little change, test=kunlun
* add TODO, test=kunlun
-
Committed by pangyoki
* Add gaussian_random XPU kernels
* commit kunlun, test=kunlun
* new version, test=kunlun
* change boost_get to BOOST_GET_CONST, test=kunlun
* use Generator to generate random number and optimize format, test=kunlun
* add TODO, test=kunlun
-
Committed by pangyoki
* support uniform_random op on Baidu Kunlun
* change dtype of attr shape from int to int64_t
* kunlun ci, test=kunlun
* new version, test=kunlun
* change boost_get to BOOST_GET_CONST
* change boost_get to BOOST_GET_CONST, test=kunlun
* use Generator to generate random number and optimize format
* run Kunlun CI, test=kunlun
* add TODO, test=kunlun
-
Committed by pangyoki
* fix multinomial doc
* fix multinomial error message
* little doc change
* fix Categorical class doc
* optimize format of error message
* fix CPU Kernel error message format
* fix isinf and isnan error in WindowsOPENBLAS CI
* delete inf and nan
* add manual_seed in sample code
* little error message change
* change error message to InvalidArgument
* add full stop to error message and add manual_seed in CPU environment
-
Committed by LutaoChu
Fix diag OP bug on Windows Python3.8, remove the std::min
-
Committed by liuyuhui
* add
* add cast_op_xpu, test=kunlun
* fix bug for cast_op_xpu,test=kunlun
* add concat_op_xpu, test=kunlun
* solve conflicts, test=kunlun
* fix bug,test=kunlun
* add assign_op_xpu, test=kunlun
* fix bug,test=kunlun
* test=kunlun;test=develop
* fix concat bug,test=kunlun
* fix check_dygraph set in test_concat_op_xpu.py,test=kunlun
* fix error message,test=kunlun
Co-authored-by: mapingshuo <mps2012@yeah.net>
-
Committed by Kaipeng Deng
-
Committed by xiaoting
* polish error message,test=kunlun
* polish error,test=kunlun
* polish error,test=kunlun
* polish error,test=kunlun
-
Committed by Guo Sheng
* Incorporate cudnn_lstm into LSTM api (#27217)
* Incorporate cudnn_lstm into LSTM api. test=develop
* Make coalesce_tensor support alignment optionally. test=develop
* Reorganize RNN apis. test=develop
* Fix cudnn rnn layout conversion. test=develop
* Add sequence_length support for RNN cudnn implement. Add optional init_h and init_c gradient for cudnn_lstm_op. test=develop
* Use create_parameter for rnn cudnn impl. test=develop
* Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program. test=develop
* Update RNN api unittest to use set_device. test=develop
* Fix set_place for unit tests of RNN apis. test=develop
* Fix use_align in coalesce_tensor_op. test=develop
* Adjust RNN apis arguments according to comments. test=develop
* Polish documents for SimpleRNN apis. test=develop
* Refine random seed in cudnn_lstm_op. Expose rnn params from sublayers to RNN. test=develop
* Fix RNN saving for jit.save. Refine cudnn_lstm dropout behavior. test=develop
* Fix doc of GRU. test=develop
* Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop
* Remove updates on cudnn_lstm temporarily. test=develop
* Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop
* Refine random seed in cudnn_lstm_op. test=develop
* Fix test_lstm by adjust ConcreteProgram buffer getter. test=develop
* Use create_parameter instead of create_var for rnn._flat_weight for static graph usage. test=develop
* Remove W input for cudnn_lstm to pass unused_var_check. test=develop
* Add test_predict for RNN unit tests coverage. test=develop
* Fix code style of rnn. test=develop
* Fix F.rnn usage in rnn.py. test=develop
* Fix test_lstm unittest failed and Add more unittest (#28029)
* fix test_lstm unittest failed
* add more unittest
* modify cmakelist
* fix judgement
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
- 17 Oct, 2020 2 commits
- 16 Oct, 2020 2 commits
-
-
Committed by tangwei12
* add size method for large scale
* add large scale UT
* add ut for checkpoint
-
Committed by mapingshuo
-
- 15 Oct, 2020 6 commits
-
-
Committed by Zhou Wei
* add tensor clone (#27953)
* add tensor clone
* fix unittest test_var_base
* fix bug of tensor copy of CUDAPinnedPlace (#27966)
-
Committed by 123malin
* test=develop, fix geo sgd communicator and gloo http_init for ps
-
Committed by Guanghua Yu
-
Committed by Feiyu Chan
1. support channel last in BatchNorm*d (#27875)
2. fix a bug in batch_norm_op cuda kernel by extracting ResizeToChannelFirst(Last), TransToChannelFirst(Last) to operators/layer_utils.h
-
Committed by mapingshuo
-
Committed by Qinghe JING
-
- 14 Oct, 2020 13 commits
-
-
Committed by Leo Chen
* support setting xpu place
* add ut, test=kunlun
-
Committed by MRXLT
* fix adam
* fix gpu adam
* fix code style
* fix ut
* update ut add cuda code
-
Committed by Double_V
* add stack pool2d roi_align xpu op,test=kunlun
* error message opt, test=kunlun
* add xpu unittest,test=kunlun
* skip check grad,test=kunlun
* fix boostget , test=kunlun
-
Committed by Jack Zhou
Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast
-
Committed by wangchaochaohu
-
Committed by Chengmo
* add sparse tensor load method
-
Committed by tangwei12
-
Committed by wawltor
Update the code for the topk message optimization
-
Committed by Chen Weihang
* polish some error message
* add white list
* revert shell script change
-
Committed by 123malin
* test=develop, bug fix for parameter_recv
* test=develop, for unittest, test_fleet_rolemaker_new
-
Committed by QingshuChen
*test=kunlun
-
Committed by zhang wenhui
* add multitask
* add multitask, test=develop
* fix code style, test=develop
* add partial push dense, test=develop
* fix has_key in py3, test=develop
* fix, test=develop
* fix, test=develop
* fix, test=develop
-
Committed by zhang wenhui
* fix norm api doc, test=develop
* fix error message, test=develop
* fix api norm, test=develop
* add adagrad, test=develop
* fix bug, test=develop
* fix bug, test=develop
* add spectral_norm, test=develop
* fix adagrad, test=develop
* merge , test=develop
-