- 12 Apr, 2022 21 commits
-
-
Committed by feng_shuai
* strided_slice * fix: compiler error because of size() * fix: warning * fix : warning * init input_shape * fix:forget punctuation
-
Committed by hong
* add layer norm infermeta * add layer norm yaml * polish layer norm infer meta * add layer norm to black list
-
Committed by JingZhuangzhuang
* add python share_data interface * Update inference_api.cc * Update inference_api.cc * add python share_data interface
-
Committed by chentianyu03
* exchange assign and assign_raw kernel name * fix register error
-
Committed by danleifeng
* perform SlotRecordInMemoryDataFeed feedvec;test=develop
-
Committed by hong
-
Committed by Lijunhui
* init commit no push * collect compile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: Chen Weihang <chenweihang@baidu.com>
-
Committed by Leo Chen
-
Committed by feng_shuai
* add trt support for slice op * fix:output dims bug * fix: test * fix:for c++ coverage * fix:c++ coverage * fix: fix test bug * fix: CI test
-
Committed by wuyefeilin
-
Committed by Zhanlue Yang
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures
-
Committed by Chen Weihang
* add context pool unittests * fix timeout * polish details * change option pos * add dll decl for windows * fix pre-commit error * move dll_decl and export DeviceContext * replace lost dll_decl.h
-
Committed by Aurelius84
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw * fix xpu unittest failed
-
Committed by joanna.wozna.intel
-
Committed by JingZhuangzhuang
* fix_paddle_numel_check * fix_paddle_numel_check
-
Committed by FlyingQianMM
add an inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
-
Committed by Chen Weihang
* add new method for custom double grad * add tanh double grad unittest * change year * revert tensor init method
-
Committed by zyfncg
* support setting vector out size in yaml * support setting size of vector<tensor> for out in yaml
-
Committed by liutiexing
-
Committed by zyfncg
-
Committed by 王明冬
-
- 11 Apr, 2022 9 commits
-
-
Committed by 石晓伟
-
Committed by zhouweiwei2014
-
Committed by Allen Guo
-
Committed by jakpiase
-
Committed by ykkk2333
-
Committed by YuanRisheng
* add multi_dot,maxout,multiplex yaml * add code coverage
-
Committed by chentianyu03
* add assign yaml * add assign api * add assign backward api * add assign * add assign yaml * add assign * assign yaml * add assign raw kernel and use assign_raw in yaml * merge develop branch * add missing python_api
-
Committed by xiongkun
* gather op * add mod * [Yaml] final state for uniform and uniform_random
-
Committed by sneaxiy
-
- 10 Apr, 2022 5 commits
-
-
Committed by Liu-xiandong
* [KP]fix bug when TruncatedNormal cannot fall back in cpu * delete useless comment * delete useless comment
-
Committed by Chen Weihang
-
Committed by baoachun
-
Committed by Wilber
-
Committed by baoachun
* add mkldnn int8 pass * add mkldnn int8 pass * update pass
-
- 09 Apr, 2022 5 commits
-
-
Committed by zhaocaibei123
* update name * update name * fix test * fix fleet bind * update name * update name * fix test * fix gpups wrapper * remove Push/Pull/Load/Save with context in client and wrapper base class * fix * fix * remove some interface * fix * remove * code style * recover * fix * remove code unused * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable * fix * fix * fix * recover * remove unused code * recover unittest * fix * remove * fix * remove code unuseful * remove * fix * recover * remove Co-authored-by: esythan <esythan@126.com>
-
Committed by crystal
-
Committed by hong
-
Committed by 王明冬
-
Committed by limingshu
* Using the maximum workspace_size of all algorithms to limit the workspace size in exhaustive search mode. * Use the system cudaMalloc and cudaFree to allocate workspace during searching. * Enable switch of two kinds of workspace setting methods. Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
-