- 31 1月, 2023 8 次提交
-
-
由 张春乔 提交于
-
由 RedContritio 提交于
-
由 Yiqun Liu 提交于
* Unify the gpu implementation of stack and unstack to reuse the optimization. * Optimize the cuda implementation of unstack. * Use GpuMemcpyAsync instead of memory::Copy. * Fix error of calculating the index. * Use FastDivMod to further imporve the performance of unstack.
-
由 张春乔 提交于
* change int -> int64 * Update python/paddle/fluid/tests/unittests/ir/inference/test_trt_convert_one_hot.py
-
由 mjxs 提交于
-
由 LiYuRio 提交于
-
由 姜永久 提交于
* update erf gumbel_softmax ..ops * lint * reset sequence_conv * reset exponetial&interp nearest
-
由 姜永久 提交于
* rm flags_retain grad in pybind * retain grads for xpu test * set retain grad for xpu * rm flag * lint --------- Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
-
- 30 1月, 2023 14 次提交
-
-
由 jiangcheng 提交于
-
由 RedContritio 提交于
* add dim check for spectral_norm * add unittest out of range for spectral_norm * use ValueError when dim out of range for spectral_norm * update dim limit and add unittest for spectral_norm
-
由 RedContritio 提交于
* add pivots type check and fix batchsize error * add unittest for batchsize = 0 * fix nullptr in lu_unpack fix batchsize error in LU_Unpack add nullptr check in OneFunctor * remove exception in device code
-
由 张春乔 提交于
-
由 Ryan 提交于
* add unstack axis check * IndexErr -> ValueError * add static select
-
由 Ryan 提交于
* add pinv check * add unitest * update unitest * roll back * fix not call stupid bug * use context
-
由 engineer1109 提交于
replace all TensorFromVector & TensorToVector AssignKernel async copy
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
由 Ryan 提交于
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
由 wanghuancoder 提交于
* refine _found_inf
-
由 gem5 提交于
-
由 sneaxiy 提交于
-
由 sneaxiy 提交于
* make FLAGS_gemm_use_half_precision_compute_type=false defaultly * fix comments
-
- 29 1月, 2023 9 次提交
-
-
由 jiangcheng 提交于
-
由 zhangbo9674 提交于
-
由 sneaxiy 提交于
-
由 houj04 提交于
-
由 sneaxiy 提交于
* add missing proto file * fix windows ci * fix ci compile error
-
由 ronnywang 提交于
[CustomDevice] registering feed_dense_tensor, feed_sparse_coo_tensor, feed_strings kernels for custom device (#50042) * [CustomDevice] registering feed_dense_tensor, feed_sparse_coo_tensor, feed_strings kernels for custom device * update * update * update
-
由 LiYuRio 提交于
* remove max_slot_num * fix test case
-
由 jiangcheng 提交于
* [CINN] collect inplace var into cinn op desc's kInplaceVarNames attribute * attr move from op desc to subgraph * GetFetchIds from var_map instead of var_model_to_program_map_
-
由 Yuang Liu 提交于
-
- 28 1月, 2023 3 次提交
-
-
由 姜永久 提交于
* update tril unbind ops for eager tests * add eager op test * reset unbind * lint * lint * rm eager checker * copyright 2018->2023
-
由 LiYuRio 提交于
-
由 GGBond8488 提交于
* fix xpu unitest error * fix xpu unitest error
-
- 26 1月, 2023 1 次提交
-
-
由 张春乔 提交于
* fix div 0 error of fftfreq * fix div 0 error of fftfreq * bug fix * add 'n' value check
-
- 25 1月, 2023 1 次提交
-
-
由 limingshu 提交于
Co-authored-by: Nzhangbopd <1299246947@qq.com>
-
- 20 1月, 2023 4 次提交
-
-
由 Jiabin Yang 提交于
-
由 jakpiase 提交于
* fix for matmul_grad * another fix for matmul_grad * fix
-
由 sprouteer 提交于
-
由 jameszhang 提交于
* update xccl lib & use native Reduce in dygraph * minor
-