- 30 12月, 2021 19 次提交
-
-
由 Haohongxiang 提交于
* add cpu kernel of lstsq * update * modify code style * modify unittest * remove support for complex
-
由 zhangkaihuo 提交于
将cuSparse的handle与DeviceContext进行绑定,避免op中进行创建和销毁 添加对cuSparse中dense和sparse转换的API进行封装 添加对封装的API的单测
-
由 LiYuRio 提交于
-
由 Jiabin Yang 提交于
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen * Enabled generation for Operators without Grad/Inputs/Outputs * Resolved operators without input * Fixed merge conflicts * Enabled Eager AutoCodeGen for 10+ more operators * Refactored Eager AutoCodeGen with more organized helper objects * Enabled Eager AutoCodeGen for operators with multiple OpBases * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen * Adjusted function generation/call between Python-C API & Dygraph API * Synchronized auto-generated Python-C API with Dygraph Forward Functions * support more eager tensor api * fix merge compile error * fix compile error and fit develop code * support pure CPU * fix some logic error in eager_mode * support _varbase_creator in eager mode * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs * for eager mode * refine * support multiple constructor for eager tensor * add place related code * polish code * specific randint with dtype of int64 * Support pure cpu test * eager logic * refine test in pure cpu * eager logic * eager logic * eager logic, test=develop * skip core.eager when in inference, test=develop * refine, test=develop * refine, test=develop * call RetainGrad after run forward kernel, test=develop * refine, test=develop * support dygraph util, meta, guard test * support inference test * refine test and fix initializer failed * support create varbase and fix retain grad error * fix windows error * support test_imperative_basic test in eager mode * remove additional log in variable.h * remove additional log in variable.h * remove additional code create in merge Co-authored-by: Njim19930609 <jim19930609@gmail.com> Co-authored-by: NWang Huan <wanghuan29@baidu.com>
-
由 wenbin 提交于
* dynamic shape clone supported
-
由 limingshu 提交于
-
由 xiongkun 提交于
* fix wait for tiexing * fix work2vec model. new_exe support EOF Exception in ReadOp now
-
由 xiongkun 提交于
* refine run_program_op_grad output var name * add default for global_block. for pass the eagle_generator_cmd * fix * ; * fix * const cast * mutable block
-
由 jakpiase 提交于
* working test for padding only * added full conv2d grad kernel * removed some trash * minor change * Ci fix * format fix
-
由 zmxdream 提交于
-
由 sneaxiy 提交于
-
由 JingZhuangzhuang 提交于
-
由 Chen Weihang 提交于
* remove offset in storage * revert api change * fix custom op slice bug * fix mutable_data error
-
由 From00 提交于
-
由 Chen Weihang 提交于
-
由 Xiaoxu Chen 提交于
* extend Distribution baseclass for supporting multivariant distribution and prob method * add ExponentialFamily base class and entropy using Bregman divergence * add dirichlet probability distribution
-
由 Xiaoxu Chen 提交于
* add dirichlet sample op and cpu backend kernel * add Dirichlet op cuda kernel (#6) * add dirichlet op hip kernel Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com>
-
由 Leo Guo 提交于
* Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. test=kunlun Co-authored-by: NZibin <guozibin@baidu.com>
-
由 tianshuo78520a 提交于
-
- 29 12月, 2021 21 次提交
-
-
由 Leo Chen 提交于
-
由 Chen Weihang 提交于
-
由 Zhanlue Yang 提交于
-
由 ShenLiang 提交于
* fix bug of dp in pfp16 * fix topo
-
由 zhouweiwei2014 提交于
-
由 yaoxuefeng 提交于
add hashtable dynamic mf support
-
由 yaoxuefeng 提交于
add dynamic mf size api
-
由 zhangbo9674 提交于
* add bn_1d_2d_3d for fp16 decorate * add unittest
-
由 JZ-LIANG 提交于
* auto parallel sharding base * chmod * add unitest * set unitest cmake dist label * revise code according to rewiew * chmod
-
由 Qi Li 提交于
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update OS info * split host_event_recorder * split host_event_recorder * update * update * update * update * update * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 Huihuang Zheng 提交于
Fix Buddy Allocator random CI failure due to machine environment.
-
由 王明冬 提交于
-
由 小湉湉 提交于
-
由 ykkk2333 提交于
-
由 Shang Zhizhou 提交于
-
由 heliqi 提交于
* del mkldnn options of baseline * add timeout for matmul_scale_fuse_pass * add timeout for matmul
-
由 TTerror 提交于
* add argsort/scatter for kunlun * update test_scatter * update xpu.cmake * update xpu.cmake * fix scatter
-
由 sneaxiy 提交于
-
由 Tao Luo 提交于
-
由 sneaxiy 提交于
-