- 17 Jan, 2023 8 commits
-
-
Committed by danleifeng
-
Committed by Jiabin Yang
-
Committed by Huihuang Zheng
Support 0d Tensor in ConditionalBlockOp
1. Add dygraph 0d tensor support for ConditionalBlockOp
2. Set scalar loss shape when `append_backward`
-
Committed by 姜永久
* rm retain grad
* fix zero_dim
* fix zero_dim for xpu
* reset zero dim for xpu
* reset xpu
* reset custom_relu
* Reset flip
* fix zero dim
-
Committed by zhangkaihuo
-
Committed by zhouweiwei2014
-
Committed by WangZhen
* Support call backward() without params in dy2st
-
Committed by Xiaoxu Chen
* support elementwise base func
* fix compiling error and add test
* support vjp for div using comp
* remove additional change
* fix dy2st error with magic num
* fix dy magic num
* another magic
* another magic
* another magic
* add skip rename strategy
* support add vjp
* support add with new axis cal
* support sub vjp
* [prim] add multiply vjp rules
* [prim] add multiply vjp rules
* [prim] fix no infershape with composite in _append_backward_ops
* [prim] add expand vjp rule
* [prim] add exp vjp rule
* uncomment infer shape for reshape/sum static prim api
* [prim] fix tanh nullptr error
* remove some print message
* fix magic number in run_program relative tests @JiaBinYang
* [prim] add expand,multiply,exp vjp rules
* fix only support single direction reduce error
* infer reduce dims using out dims
Co-authored-by: NJiabinYang <360788950@qq.com>
-
- 16 Jan, 2023 10 commits
-
-
Committed by Weilong Wu
-
Committed by wawltor
-
Committed by Yuanle Liu
* add trt_support_nhwc_pass
-
Committed by wangxiaoning
-
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
* Remove climits.
* Fix bug of paddle.save. It may cause bug for saving sharded optimizer state_dict() in parallel.
-
Committed by QingshuChen
-
Committed by Weilong Wu
-
Committed by zqw_1997
-
Committed by xiaoguoguo626807
-
Committed by Yulong Ao
* [Auto Parallel] Rename methods of ProcessMesh
* [Auto Parallel] Impl the python process_mesh by the c++ one
* [Auto Parallel] Add some minor modifications
* [Auto Parallel] Rename some methods
* [Auto Parallel] Remove unnecessary codes
* [Auto Parallel] Add back some removed files
* [Auto Parallel] Fix bugs
* [Auto Parallel] Fix a bug
* Update process_mesh.cc
* [Auto Parallel] Merge dist attrs of Python into C++
* [Auto Parallel] Add back deleted importing
* [Auto Parallel] Add back removed unittest
* [Auto Parallel] Remove type qualifiers of return types
* [Auto Parallel] Fix some bugs
* [Auto Parallel] Fix a bug of the quant pass
* [Auto Parallel] Fix the code style
* [Auto Parallel] Clear some fluid APIs
-
- 15 Jan, 2023 1 commit
-
-
Committed by Jiabin Yang
* support elementwise base func
* fix compiling error and add test
* remove additional param
* support vjp for div using comp
* remove additional change
* fix dy2st error with magic num
* fix dy magic num
* another magic
* another magic
* add more test
* fix windows problem
* another magic
* fix windows compile
* invoke ci
* add skip rename strategy
* support add vjp
* fix test_tanh
* support add with new axis cal
* fix resnet and some test
* add composite log
* support sub vjp
* enhance_tests
* support more dtype for full
-
- 13 Jan, 2023 21 commits
-
-
Committed by Weilong Wu
-
Committed by cyber-pioneer
-
Committed by Wang Bojun
* add fmha_flashattention oss plugin
-
Committed by zhouweiwei2014
-
Committed by wanghuancoder
-
Committed by ronnywang
* add where, atan2, median 0d ut
* add where, atan2, median 0d ut
* update
* update
* update
-
Committed by Zhang Jun
* update trt engine to set in/out data type
* update
* Update engine.cc
* Update engine.cc
* update
* set engine output type before freeze the network
* update
* update trt autoscan ut
* update
* update ut
* fix equal bug, update ut
* fix cast and equal ut
* update cast ut using TRT < 8.4
* set datatype from scope
* check output var is nullptr
* Update op_converter.h
* update tensorrt_engine_op_test ut
* update
-
Committed by duanyanhui
* clear ProcessGroupCustom manually
* fix bug
* fix bug
* move destroy ProcessGroup to ProcessGroupIdMap
* enable destroy to all device
* remove unused comments
* change to internal api
* Update process_group.cc
* Update process_group.cc
-
Committed by duanyanhui
* update get_device to custom
* add custom_device api
* rm is_compiled_with_custom_device from framework
* add todo comments
-
Committed by Jiabin Yang
* support elementwise base func
* fix compiling error and add test
* remove additional param
* support vjp for div using comp
* remove additional change
* fix dy2st error with magic num
* fix dy magic num
* another magic
* another magic
* add more test
* fix windows problem
* another magic
* fix windows compile
* invoke ci
* add skip rename strategy
* support add vjp
* fix test_tanh
* support add with new axis cal
* fix resnet and some test
* add composite log
* support sub vjp
-
Committed by wuhuachaocoding
-
Committed by Weilong Wu
* [Eager] polish some apis logic
* polish api logic
-
Committed by wuhuachaocoding
-
Committed by jameszhang
* kunlun add support for c_concat and c_split
* replace mutable_data() and ShareDataWith()
-
Committed by ykkk2333
-
Committed by jameszhang
* fix xpu unittest issue: zero_dim_tensor
* deal with leftout issue introduced by #49470
-
Committed by Leo Guo
-
Committed by zhouweiwei2014
-
Committed by wangshengxiang
-
Committed by Weilong Wu
* [PHI] rrelu add yaml
* polish
* polish
-
Committed by wuhuachaocoding
-