- 17 9月, 2021 13 次提交
-
-
由 zhangbo9674 提交于
* add pure fp16 major function in auto_cast & tracer * support master weight in dygraph for pure fp16 * check mix dtype of fp16&fp32 for check_finite_and_unscale op * change pure fp16 funtion name * refine some bug in auto_cast * refine auto_cast interface logic * add param _casted_by_pure_fp16 for class Layer * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator * refine pure_fp16_decorator as decorator * add unittest * add comment * add comment * support recompute * add comment for auto_cast and decorator * support to_static_state_dict for paddle.jit.save * unlimite models num and optimizers num * add lookup_table in black_list * fix momentum and layer state_dict * fix bug in layer state_dict * fix bug in layer state_dict_helper * refine unittest * refine test_momentun_op * refine interface and some code * refine amp_decorator interface * refine pure fp16 interface * refine master weight interface
-
由 Leo Chen 提交于
* temporally disable the warnings * disable ut
-
由 Guoxia Wang 提交于
-
由 andyjpaddle 提交于
* add pinv api, test=develop * add linalg pinv api, test=develop * update example code, test=develop
-
由 Haohongxiang 提交于
* Support EMA in Paddle2.x and Fleet * update * update * update * modify ut of ema * modify docs * modify bugs * update * update * update * modify ut
-
由 Haipeng Wang 提交于
* add scale_op in model save step is not necessary, just fix the prune method to support static graph and inplace op * fix jit.save, no need to add scale_op to each outputvar anymore. fix prune_with_input, now it supports inplace op * temporarily disable test_trt_dynamic_shape.TRTDynamicShapeOutOfBound2Test
-
由 津 提交于
-
由 xiaoxiaohehe001 提交于
* add_skip_layernorm * add_skip_layernorm * add_skip_layernorm * add_skip_layernorm * add_skip_layernorm * add_skip_layernorm * add_skiplayernorm_teller * add_skip_layernorm * add_skip_layernorm_teller * add_skip_layernorm_teller * add_skip_layernorm * add_skip_teller
-
由 Leo Chen 提交于
* expose cuda stream to users * add ut
-
由 津 提交于
* add test * add test * add test
-
由 津 提交于
* add test * add test * add test * add test * add test
-
由 yeliang2258 提交于
* update equal op, input Y can be float,int,bool or tensor * update test * update code style * update code style * update doc * update str check * remote str * add type check
-
由 0x45f 提交于
-
- 16 9月, 2021 13 次提交
-
-
由 Yuang Liu 提交于
-
由 Zhong Hui 提交于
-
由 Aganlengzi 提交于
* [NPU] add index_select_grad kernel and unit tests * dim=0 not need transpose
-
由 Kaipeng Deng 提交于
* fix DataLoader exit with SIGABRT/SIGSEGV. test=develop
-
由 Haohongxiang 提交于
* Support new API linalg.cond in paddle * check code style * check code style * modify codes * add docs_eng of linalg.cond * add svd_norm for linalg.cond * modify docs_en of cond * add support for empty input in dynamic mode * modify set_time of unittest * update * modify unittest of cond * update * remove cond in paddle.__all__ * pull latest codes * merge latest codes * update
-
由 crystal 提交于
-
由 Wangzheee 提交于
* fix gather * fix
-
由 0x45f 提交于
* fix no_grad context error in dy2stat * remove useless comments * fix error by drop_kids in python * add test and fix review
-
由 Guoxia Wang 提交于
* support fp16 dtype
-
由 lilong12 提交于
* update
-
由 wuhuanzhou 提交于
PR主要功能:针对fusion等子图替换场景,支持Python侧开发并注册Pass。 背景 Pass是指输入一个深度学习计算图Graph,依照一定条件进行修改,输出修改后的Graph的过程; 当前PaddlePadle框架编写Pass代码存在以下问题: 用户需要手写Graph的条件匹配、在Graph上的修改代码; 对Graph操作需要深入底层框架代码,了解Graph的结构,并且知道相关Pass写法; 我们提出了针对fusion等子图替换类Pass的优化方案以支持用户在Python侧开发注册Pass,提升二次开发体验: 用户只需要输入匹配和替换的子图描述,由深度学习框架编写的代码来生成匹配和替换的逻辑,不需要用户对Graph进行匹配和替换操作; API级别的替换,用户可以通过Paddle的Python API构造子图,从而不需要知道Graph的结构,也能写Paddle的Graph Pass代码
-
由 WangXi 提交于
-
由 zhangkaihuo 提交于
-
- 15 9月, 2021 14 次提交
-
-
由 jakpiase 提交于
* fixed slice error * added handling of StartsTensor+List and EndsTensor+List * fix for ppyolo model
-
由 王明冬 提交于
* clip op extra information when export model,test=ocr * rename clip_extra parameter to kwargs in save_inference_model, test=ocr
-
由 zyfncg 提交于
* Change the invoking method of settiem from numpy to set_value op when value is not tensor * fix the check logic for inplace in setitem * fix the unittest problem caused by setitem doesn't support fp16 * modify some code format in setitem
-
由 zhaoyingli 提交于
* add dist_attr for dist op * add unitest * update inputname * update function name * add unitest * update CMakeLists.txt for CI * fix dis_matmul * fix compile error * update matmul to matmul_v2
-
由 pangyoki 提交于
* add beam_search npu op * fix CMakeList and add unittest * fix bug of beam search npu op * fix unittest * let input ids become int64 * set output ids to int64_t * delete check_dygraph * fix beam_width=1
-
由 WeiXin 提交于
* support numpy.ndarray index. * polish code.
-
由 Qi Li 提交于
* [NPU] fix depthwise_conv2d_grad, test=develop * remove debug files, test=develop
-
由 JingZhuangzhuang 提交于
Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>
-
由 YuanRisheng 提交于
* Add New Op: gumbel_softmax * Add New Op: gumbel_softmax * Add New Op: gumbel_softmax (amend) * add __main__ function in unit test * fix bugs when test in windows ci * update en docs * delete reletive error in unit test * delete relative error in unit test * set hard=True in unit test
-
由 Siming Dai 提交于
Add paddle.cuda.device.stream_guard API
-
由 WangXi 提交于
-
由 xiaoxiaohehe001 提交于
* add_split_op * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller * add_split_teller
-
由 xiaoxiaohehe001 提交于
* add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller * add_transpose_teller
-
由 xiaoxiaohehe001 提交于
* add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller * add_scale_teller
-