- 30 Dec, 2021 13 commits
-
-
Committed by xiongkun
* fix wait for tiexing * fix word2vec model. new_exe support EOF Exception in ReadOp now
-
Committed by xiongkun
* refine run_program_op_grad output var name * add default for global_block. for pass the eagle_generator_cmd * fix * ; * fix * const cast * mutable block
-
Committed by jakpiase
* working test for padding only * added full conv2d grad kernel * removed some trash * minor change * Ci fix * format fix
-
Committed by zmxdream
-
Committed by sneaxiy
-
Committed by JingZhuangzhuang
-
Committed by Chen Weihang
* remove offset in storage * revert api change * fix custom op slice bug * fix mutable_data error
-
Committed by From00
-
Committed by Chen Weihang
-
Committed by Xiaoxu Chen
* extend Distribution baseclass for supporting multivariant distribution and prob method * add ExponentialFamily base class and entropy using Bregman divergence * add dirichlet probability distribution
-
Committed by Xiaoxu Chen
* add dirichlet sample op and cpu backend kernel * add Dirichlet op cuda kernel (#6) * add dirichlet op hip kernel Co-authored-by: Feiyu Chan <chenfeiyu@baidu.com>
-
Committed by Leo Guo
* Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. test=kunlun Co-authored-by: Zibin <guozibin@baidu.com>
-
Committed by tianshuo78520a
-
- 29 Dec, 2021 23 commits
-
-
Committed by Leo Chen
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
-
Committed by ShenLiang
* fix bug of dp in pfp16 * fix topo
-
Committed by zhouweiwei2014
-
Committed by yaoxuefeng
add hashtable dynamic mf support
-
Committed by yaoxuefeng
add dynamic mf size api
-
Committed by zhangbo9674
* add bn_1d_2d_3d for fp16 decorate * add unittest
-
Committed by JZ-LIANG
* auto parallel sharding base * chmod * add unittest * set unittest cmake dist label * revise code according to review * chmod
-
Committed by Qi Li
-
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update OS info * split host_event_recorder * split host_event_recorder * update * update * update * update * update * update * update Co-authored-by: liutiexing <liutiexing@google.com>
-
Committed by Huihuang Zheng
Fix Buddy Allocator random CI failure due to machine environment.
-
Committed by 王明冬
-
Committed by 小湉湉
-
Committed by ykkk2333
-
Committed by Shang Zhizhou
-
Committed by heliqi
* del mkldnn options of baseline * add timeout for matmul_scale_fuse_pass * add timeout for matmul
-
Committed by TTerror
* add argsort/scatter for kunlun * update test_scatter * update xpu.cmake * update xpu.cmake * fix scatter
-
Committed by sneaxiy
-
Committed by Tao Luo
-
Committed by sneaxiy
-
Committed by limingshu
-
Committed by WangXi
-
- 28 Dec, 2021 4 commits
-
-
Committed by limingshu
* first commit * pass ctest of elementwise_div_grad
-
Committed by zhiboniu
-
Committed by baoachun
-
Committed by From00
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage * Utilize StreamSafeCUDAAllocator to support fast GC in new executor * Fix compile error for Windows and ROCm * Fix compile error for Windows * Modify UT stream_safe_cuda_alloc_test * Modify UT stream_safe_cuda_alloc_test * Rewrite fast GC * Rewrite fast GC * Fix compile error for BOOST_GET_CONST * Fix compile error for BOOST_GET_CONST * Changes default stream for StreamSafeCUDAAllocator * Fix a small CI error * Remove some redundant code * Fix conflict * Fix compile error for ROCm * Fix Windows CI error * Fix CI error * Remove some unnecessary code * Fix CI error * Add UT for fast GC * Fix CI error * add device-agnostic stream class * add stream.h * fix ut * fix cpu compile * Use RWLock in GetAllocator * Fix CI error Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
-