- 14 Feb, 2022 2 commits
- 11 Feb, 2022 22 commits
- 
Committed by jakpiase: * added shape oneDNN kernel * removed unnecessary import from test * added skipping tests for GPU * refactoring * refactored shape kernel * added tests in new framework * removed one line * minor change * added newline at EOF * added formatting * added attributes as extra
- 
Committed by fwenguang
- 
Committed by joeqiao12
- 
Committed by zhangbo9674: * add transpose unbind * add unittest * refine transpose unittest
- 
Committed by zn: Co-authored-by: zhangna <zhangna@cambricon.com>
- 
Committed by fwenguang
- 
Committed by Feiyu Chan: * move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
- 
Committed by Chen Weihang: * remove xxx_info include * fix namespace error * resolve conflict * skip xpu context in registry * fix macro error * resolve conflict * resolve conflict * revert xpu convert * remove trans to fluid place * remove useless headers
- 
Committed by Zhang Zheng: * Optimize performance of softmax_bwd when axis!=-1 * fix * fix * fix * fix
- 
Committed by Lijunhui: * bilinear_fw init * optimize code * pre-compute linear_interp input index
- 
Committed by JingZhuangzhuang
- 
Committed by zhangchunle
- 
Committed by zhangkaihuo
- 
Committed by Chen Weihang: * move grad get expected pten kernel args * fix reduce sum error * fix element_sub_grad failed * revert kernel judge change
- 
由 ziyoujiyi 提交于* delete gloo connect retry * the_one_ps dirs reconstruct * . * . * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify * refactor ps optimize * refactor ps optimize * refactor ps optimize * . * . * . * . * . * . * refactor theoneps * the_one_ps * add ps pass unittest * add ps pass unittest * ps unitest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * ps unittest frame * add cpu_async_ps_mode test * add cpu_async_ps_mode test * add cpu_async_ps_mode test * ps unittest ready * ps unittest ready * solve dist_pass init conflict * solve import CommContext error * unittest ok * implement AllocateFrom * solve setup.py.in conflict * solve conflict * solve conflict * solve conflict * . * . * cpu-async-ps minimize test ok & gpu minimize test ok Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
- 
Committed by Wangzheee: * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved
- 
Committed by liutiexing: * add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor Co-authored-by: liutiexing <liutiexing@google.com>
- 
Committed by Leo Chen
- 
Committed by Shang Zhizhou: * test=document_fix;add print pten kernel tool * test=document_fix * test=document_fix * test=document_fix * test=document_fix * add print_pten_kernels tool * add print_pten_kernels tool * fix windows compile * notest,test=rocm_ci * add merge tool * add comments
- 
Committed by chenjian: * add event node implementation * modify profiler.stop interface * fix according to review * fix file mode * modify class method name in event_node.cc * modify LLONG_MAX to ULLONG_MAX * fix ci error * fix ci error
- 
Committed by Zhang Ting: * improve backward performance * support different dtypes for elementwise ops
- 
Committed by zyfncg: * fix code conflict * generate inter_meta register * clear cache * just try * add sign c++ api * polish some code
 
- 10 Feb, 2022 16 commits
- 
Committed by 0x45f: * refine Assign * add UT
- 
Committed by fwenguang: * [MLU] add mlu kernel for accuracy op * fix license format * fix error message
- 
Committed by furnace: [NPU] add reduce_min
- 
Committed by TeFeng Chen: * add a graph pass to share MemOptVarInfos of external variables into subgraph * update pass name * fix compile failed * add share_mem_opt_info_to_subgraph_pass test * share_mem_opt_info_to_subgraph_pass_test pass * modify some codes for better style and more robust * update cmake
- 
Committed by Wei Shengyu: * change dtype of pooling mask to 'int32' for Paddle2ONNX * empty commit to rerun ci * fix format
- 
Committed by Zhanlue Yang: * Removed debug info * Added automatic code generation for final state Eager Dygraph * Modified backward yaml * Added EagerUtils helper functions for final state CodeGen * Adjusted CMakeFiles to support compilation for final state auto generated codes * Added python-c code generation for final state Eager Dygraph * Fixed minor issue * Fixed yaml.load() method failure * Fixed minor issues * Refactored Python-C Attributes Parsing Functions * Fixed minor issue with Python-C AddFunctions * Fixed issues from merge * Fixed merge issues
- 
Committed by zyfncg
- 
Committed by chenyanlann
- 
Committed by hong: * move masked select cpu kernel * add masked selected gpu kernel; test=develop * fix bugs; test=develop * bug fix; test=develop * bug fix; test=develop * add namespace to set mask array; test=develop * fix bug; test=develop * fix bugs; test=develop * fix ddim bug; test=develop * fix npu op bug; test=develop * fix xpu dependency bug; test=develop * move kernel args to sig.cc; test=develop
- 
Committed by Wilber
- 
Committed by wenbin: * mkldnn conv fix * definition
- 
Committed by sneaxiy: * add _get_parameter func to lamb * remove duplicate code
- 
Committed by Zhanlue Yang
- 
Committed by zyfncg: * refactor C++ API code-gen * fix windows problem of C++ API
- 
Committed by crystal: * optimize conv1d forward * add conv opt * Optimize memory copy * delete share data with * set num_filters=512 * add nlc optimize * Optimize num_filter=512 data on A100 and V100 * Fix the workspace_size size setting of filter
- 
Committed by zhangbo9674: * add squeeze unsqueeze stack * add unittest * add cpu kernel
 
