- 24 11月, 2021 12 次提交
-
-
由 Aurelius84 提交于
-
由 Leo Chen 提交于
-
由 YuanRisheng 提交于
* elementwise_mul refactor * perfect code in test * delete redundant code * fix bugs when run test_multiply * adjust the location of macro * fix bugs when run ci
-
由 zyfncg 提交于
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment
-
由 Wangzheee 提交于
* matmul_convert_int8 * matmul_convert_int8 * matmulconvert_int8 * Matmul_int8_convert: tensor*tensor * Matmul_int8_convert: tensor*tensor * Matmul_int8_convert: tensor*tensor
-
由 zhaoyingli 提交于
* adapt auto search * adapt auto search * fix matmulv2 compatible * del debug
-
由 Aurelius84 提交于
-
由 0x45f 提交于
* run dy2stat pure fp16 in Linear model * no use self._pure_fp16_inputs * add test and fix Adam error in dy2stat pure fp16 training * use paddle.optimizer.Adam * run test in gpu * change test time for CI * enlarge atol for test_resnet_pure_fp16 * refine code and enlarge atol * make custom_white_list and custom_black_list take effect for AMP and pure fp16 * check tracer is not None * use default atol * change filter_size * change atol and add some NOTE
-
由 zhupengyang 提交于
-
由 feng_shuai 提交于
-
由 WangXi 提交于
-
由 Jiabin Yang 提交于
* Add EagerTensor and tests * remove useless enforce * remove comment in cmake * support autograd meta * support grad node info test * support grad_node_info * add more edge test * remove Python.h * add tensor wrapper with tests * support compute require grad and stop gradient * support sync methods and global utils * support pure cpu test * refine error msg * refine error msg * refine error info * fix npu error
-
- 23 11月, 2021 19 次提交
-
-
由 pangyoki 提交于
* fix inplace bug * fix custom grad input error * add unittest * fix inplace bug
-
由 Qi Li 提交于
* [XPU] Reorganize xpu device codes in platform, test=develop * fix xpu_header.h, test=develop
-
由 Li Min 提交于
Add support for bias is none for fused_attention op.
-
由 wanghuancoder 提交于
-
由 Yuang Liu 提交于
-
由 Feiyu Chan 提交于
-
由 wangxinxin08 提交于
* modify code about fp16 of dcnv2 trt
-
由 Zhanlue Yang 提交于
-
由 Leo Chen 提交于
* sync scope and variable_scope when init executor * set var_desc for new var
-
由 Chen Weihang 提交于
-
由 Wangzheee 提交于
* fix_nearest * fix_nearest * fix_nearest * fix_nearest
-
由 zmx 提交于
-
由 sneaxiy 提交于
* enhance scatter err msg check * fix ci error
-
由 YuanRisheng 提交于
* elementwise_div refactor * fix compile bugs in windows ci
-
由 Jiabin Yang 提交于
* Add EagerTensor and tests * remove useless enforce * remove comment in cmake * support autograd meta * support grad node info test * support grad_node_info * add more edge test * remove Python.h * refine error code * add error type in error msg * given default null name for tensor
-
由 ronnywang 提交于
* Added HCCL backend support in dynamic graph mode * fix segmentation fault * add ut
-
由 Zhanlue Yang 提交于
* Bug fix for snapshoting VariableWrapper with initialized tensor but empty allocation * Added unittest for inplace&clear_gradient
-
由 Chen Weihang 提交于
* adapt to inference api dir for pten * fix conflit with develop * fix test_egr_ds_eager_tensor compile failed
-
由 Aurelius84 提交于
* Add transfer_layout/dtype op * clean useless codes * fix unused var * add optest in white.txt * split into data_transfer.cc * fix cmake * modify according reviewer comment * replace cast_op with transfer_dtype_op
-
- 22 11月, 2021 9 次提交
-
-
由 Feiyu Chan 提交于
* disable copying of datatype when sharing buffer between two tensors. * fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.
-
由 andyjpaddle 提交于
* add isclose op, test=develop * add isclose op, test=develop * add isclose api, test=develop * rm useless code * rm useless code * update python api of isclose * add some unittest of isclose op, test=develop
-
由 zhupengyang 提交于
-
由 zyfncg 提交于
* support zero dim for slice op * support zero dim Tensor in set_value op * polish some debug log
-
由 zyfncg 提交于
-
由 Zhanlue Yang 提交于
-
由 Weilong Wu 提交于
* Removed one ENFORCE statement * Changed func name to _share_buffer_to * Improve error reporting information * Updated the logic of _is_share_buffer_to func
-
由 Jiabin Yang 提交于
* Add EagerTensor and tests * remove useless enforce * remove comment in cmake * fix test_error * add depends on python * Remove python.h * Merge develop and add Eager tensor with test back
-
由 chentianyu03 提交于
* add cast kernel * add cast cuda kernel * add cast kernel * make cast kernel output dtype undefined * get cast dtype from vardesc * move cast to manipulation and add test case * add castinfershape * avoid reinitilaze variable * InitializeVariable support datatype * merge develop branch * fix merge bug * revert modify initializeVariable * revert modify on InitializeVariable * revert modify on InitializeVariable * mutable support reset dtype * enable make pten tensor from variable when def_arg.type is undefined * fix build pten ctx start_idx error * copy pten out tensor to variable * merge develop branch * fix non pten kernel cast failed * add reset allocation place for remake tensor * fix inplace realloc error * add mutable on pten kernles and remove unused cast files * rename function names * fix output type error * fix conflict with develop branch * set data type to variable with pten's dtype * fix test_cast_api type mismatch * densorTensro mutable_data support 0 bytes value * fix the inplace bug of reshape kernel * fix pten.backend != variable.place when moving storage, palce mismatch bug * fix conflict with develop branch * Fix bug of paddle::experimental::MovesStorage * fix ReMakePtenDenseTensor place mismatch bug * Revert "fix ReMakePtenDenseTensor place mismatch bug" This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a. * fix ReMakePtenDenseTensor place mismatch bug * reverts the set_lod interface, test=develop * modify by the review options * modify error message * add & for const input arguments * add reference in params * elementwise_sub add mutable_data * fix ResetHolderWithType check size bug * add dependence pten_tensor to test_cast_api object * remove unused code to pass ci coverage Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-