- 01 Mar, 2022: 11 commits
- 
- 
Committed by phlrain
- 
Committed by phlrain
- 
Committed by zhangbo9674: * add layer norm * add p norm * add reduce sum * refine layer norm register bf16 for cudnn811 * add bf16 cast for hip * add unittest * refine rocm * refine layer_norm unittest * refine reduce op * refine unittest * enhance atol for reduce unittest
- 
Committed by wenbin: * remove * pass * more pass
- 
Committed by zhangchunle
- 
Committed by zhangbo9674: * add scale gather sum * refine CUDA_ATOMIC_WRAPPER ADD for bf16 * add gather unittest * solve conflict * add scale unittest * add sum unittest * solve conflict * refine gather unittest * refine unittest
- 
Committed by HydrogenSulfate
- 
Committed by zyfncg: * add multi input for infer_shape * support multi output for infershape * fix split bug * fix bug of concat * support vector<MetaTensor*> in infrt * fix bug
- 
Committed by Aurelius84: * [Phi] Migrate logical_and/or/not/xor into Phi * fix unittest * fix function name
- 
Committed by ShenLiang: * add reducer
- 
Committed by sneaxiy: * vectorize lamb kernel * remove flags, add ut * remove useless codes * refine code, add param order
 
- 
- 28 Feb, 2022: 5 commits
- 
- 
Committed by ronnywang
- 
Committed by zhangchunle: * update;test=cpu-py3
- 
Committed by Chen Weihang: * rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
- 
Committed by zhangbo9674: * refine bf16 amp-o1 logic * refine amp GLOG * refine unittest * refine unittest
- 
Committed by zyfncg: * fix selected_rows bug in C++ API * add optional for C++ API * data transform support optional * remove data transform for optional vector<Tensor> * adjust some format of function * fix empty bug
 
- 
- 27 Feb, 2022: 1 commit
- 
- 
Committed by Leo Chen: * fix pylayer problem with amp * add ut * refine code
 
- 
- 26 Feb, 2022: 2 commits
- 
- 
Committed by zyfncg: * Support custom implement for C++ API * rename api_invoke_impl to api_custom_impl * remove manual_api * delete mutable_data in copy_to api * fix problem of copy_to * add unittest for infer_meta_fn_factory * fix split config in yaml * fix split config in yaml * modify sum api yaml * add copy_to wrapped infermeta * rollback copy impl
- 
Committed by Weilong Wu: * Support Eager Hook, expose interface to python * Fix CI issue
 
- 
- 25 Feb, 2022: 6 commits
- 
- 
Committed by jakpiase
- 
Committed by sneaxiy: * add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smaller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
- 
Committed by zn
- 
Committed by zhangbo9674: * add ele_add * add ele_mul * add ele_sub * solve conflict * fix npu * refine ele_add * add ele_mul unittest * refine ele_sub * refine ci * refine unittest
- 
Committed by joeqiao12
- 
Committed by fwenguang
 
- 
- 24 Feb, 2022: 11 commits
- 
- 
Committed by Allen Guo: * Update IpuStrategy Python Part * add docs * add add_custom_op for ipu_strategy * fix build warning * rm unneeded part * clean api * fix typo * update option names * update IpuStrategy doc
- 
Committed by zn
- 
Committed by ronnywang
- 
Committed by crystal: * fix eigh test * modify atol and rtol
- 
Committed by xiaoting: * fix maxunpool2d example, test=document_fix * fix maxunpool2d example, test=document_fix
- 
Committed by jakpiase: * added nearest interp v2 bf16 * disabled bilinear interp nhwc test * added skipping UT for gpu * added NHWC support * removed unnecessary statements * minor change * CI fix * added appropriate changes to interpolate_v1 * fix after review * minor change * minor change * revert unwanted deletions * CI fix
- 
Committed by Zhanlue Yang: * Refactored GradNodeAccumulation data structure and behaviour * Fixed CI issues * Fix compilation issues * Fixed minor issues * Reverted changes for intermediate and OverwriteOutput * fixed minor issue * Fixed code format issues * Fixed CI-Coverage issue * Fixed CI issues
- 
Committed by Leo Chen: * fix 'invalid escape sequence' * fix assert error
- 
Committed by Huihuang Zheng: Add note for Place of Executor in parallel environment
- 
Committed by JZ-LIANG
- 
Committed by wanghuancoder: * eager, test=develop * fix bug, test=develop * eager, test=develop * merge legacy to fluid * eager, test=develop * eager, test=develop * Refactor TensorAdd func by template and remove gradient_accumulation in eager * Remove needless target name * eager, test=develop * eager, test=develop * Use overload instead of template * Remove legacy code * Remove legacy code * selectedrows, test=develop * Remove DataType test * eager, test=develop * eager, test=develop * support gan, test=develop * Using Tensor directly instead of using EagerTensor * support gradient_accumulation * make test_imperative_lod_tensor_to_selected_rows longer * make test_imperative_lod_tensor_to_selected_rows longer * refine code * ptb, test=develop * Rename all EagerTensor to Tensor * Rename some EagerTensor to Tensor * rename EagerTensor to EagerVariable * eager, test=develop * eager, test=develop * eager, test=develop * eager, test=develop * add more test * eager, test=develop * Support copiable selected rows and merge develop * save load, eager, test=develop * save load, eager, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * revert static_runner, test=develop * EagerTensor to Tensor, test=develop * refine, test=develop * refine, test=develop * clear grad, test=develop * merge, develop * merge, develop * merge, test=develop * merge, test=develop Co-authored-by: JiabinYang <360788950@qq.com> Co-authored-by: Weilong Wu <veyron_wu@163.com>
 
- 
- 23 Feb, 2022: 4 commits
- 
- 
Committed by jakpiase
- 
Committed by ShenLiang: * add processgroup_nccl
- 
Committed by zhouweiwei2014: * change CUDA implementation of bernoulli OP * fix CI
- 
Committed by zhangxiaoci: * test=kunlun
 
- 
 
