- 29 Nov, 2021 16 commits
-
-
Committed by chentianyu03
* add cuda implementation of cast kernel * remove bfloat16 when defined paddle_with_hip
-
Committed by zhangbo9674
* amp.decorate optimizers set to None is ok * refine unittest * add unittest and refine example code * refine unittest
-
Committed by Sing_chan
-
Committed by wanghuancoder
* support fetch lodtensor array, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
Committed by Yuang Liu
-
Committed by Thunderbrook
* auc temp * cuballocator * code format * code format
-
Committed by WangXi
-
Committed by Aganlengzi
-
Committed by Sing_chan
-
Committed by Weilong Wu
* native commit for triple grad of sigmoid * Updated unittests files * init functional jacobian api * Updated trible_test func * Updated gradient_checker & test_script * finish test with dtype float32 * add float64 test case * polish code * use atol=1e-5 with dtype float64 * fix for ci * set timeout for test_jacobian * fix dygraph grad to support high differential * polish API docstring * Updated gradient checker and some related files * fix double grad strip error for high differential * fix double grad strip error for high differential * Add Sigmoid triple grad tests * fix dygraph double grad dtype error when calling for high differential scenario * Updated triple grad tests func * Use np.random to initialize ddx * Updated triple_grad_check func * add todo for gradient checker and refine some comments * remove additional code * add test for warning in backward.py * format python code * support multi input in triple gradient checker * Add matmul triple grad kernel * Updated comments of TODO * Supported some special tests * Change code-format to follow CI std * Updated gradient_checker.py * Fix conflicts * Removed unnecessary printing log * Change code style to follow CI std * support batch in jacobian and hessian * add batch jacobian and batch hessian * Add batch_jacobian test, draft version * [New features] Add elementwise_mul triple grad kernel (#37152) * Add elementwise_mul triple grad kernel * Removed InplaceInferer and polished code * Add numerical_batch_jacobian, numerical_batch_hessian and tests * Support batch_jacobian and batch_numerical * Use pre-commit to check code format * Update doc, polish code, add unit test * Reset the TIMEOUT properties of test_jacobian to pass CI Co-authored-by: levi131 <limaolin01@baidu.com> Co-authored-by: Jiabin Yang <360788950@qq.com>
-
Committed by Baibaifan
-
Committed by xiaoxiao-luomu
-
Committed by Zhanlue Yang
-
Committed by Zhanlue Yang
-
Committed by 李季
Co-authored-by: Chen Long <1300851984@qq.com>
-
Committed by Wilber
-
- 28 Nov, 2021 1 commit
-
-
Committed by Zhanlue Yang
-
- 27 Nov, 2021 5 commits
-
-
Committed by Yulong Ao
* [Auto Parallel] Add the unified cluster representation * [Auto Parallel] Add the graph class for physical mapping * [Auto Parallel] Add the simple physical mapper * Set the timeout of the mapper * Merge the upstream develop unittests cmake files * Fix a bug of the process group * Remove mapper unittest from platforms which is not GPU * Move the instantiation of process group after resharding * Add the local id for devices * Update the rank mapping format * Add some comments * Remove the related files about mapping * Remove unused rank_mapping unittest * Improve the unittest coverage
-
Committed by Zhanlue Yang
-
Committed by Zhanlue Yang
-
Committed by JingZhuangzhuang
-
Committed by Aganlengzi
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 26 Nov, 2021 18 commits
-
-
Committed by zmx
-
Committed by Zhanlue Yang
-
Committed by Steffy-zxf
* fix data parallel when VOCAB var in program
-
Committed by YUNSHEN XIE
-
Committed by wanghuancoder
* clear local scope every step, test=develop * refine,test=develop * refine, test=develop
-
Committed by wanghuancoder
-
Committed by zhaocaibei123
* test * test * rm test * update * update * update * add unittest * update * update save
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
-
Committed by YuanRisheng
* Support parse kernel key by multi-inputs * optimize code according to reviewer
-
Committed by Li Min
* Fix bugs when bias is none for static graph for fused_attention op.
-
Committed by Zhanlue Yang
reset_inplace_version removes all in-place-related records from VarBase/VariableWrapper. Its purpose is to let you use an in-place operation as if it were its non-in-place version, which will cause unexpected consequences if not used with care. This is essentially a hack interface added to satisfy one specific request.
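The risk described in this commit message can be illustrated with a toy model. This is only a sketch, not Paddle's implementation: `ToyTensor`, `ToySavedRecord`, and their methods are hypothetical names invented here, and the sketch assumes that in-place edits are tracked by a per-tensor version counter that is compared against a snapshot taken when the tensor is saved for backward.

```python
class ToyTensor:
    """Hypothetical stand-in for a tensor that counts in-place edits."""

    def __init__(self, value):
        self.value = value
        self.inplace_version = 0  # assumed bookkeeping, bumped on every in-place op

    def add_(self, delta):
        # In-place update: mutate the value and record that it happened.
        self.value += delta
        self.inplace_version += 1
        return self

    def reset_inplace_version(self):
        # Toy analogue of the interface in the commit: forget all in-place records.
        self.inplace_version = 0


class ToySavedRecord:
    """Hypothetical snapshot taken when a tensor is saved for backward."""

    def __init__(self, tensor):
        self.tensor = tensor
        self.saved_version = tensor.inplace_version

    def check(self):
        # Guard: refuse to use a tensor that changed after it was saved.
        if self.tensor.inplace_version != self.saved_version:
            raise RuntimeError("tensor was modified in place after being saved")
        return self.tensor.value


x = ToyTensor(1.0)
record = ToySavedRecord(x)  # saved while x.value == 1.0
x.add_(2.0)                 # in-place edit, version becomes 1

try:
    record.check()
    guarded = False
except RuntimeError:
    guarded = True          # the guard catches the stale tensor

x.reset_inplace_version()   # wipe the record of the in-place edit
silently_used = record.check()  # guard passes, but the value is the modified one
```

In this toy, the guard fires before the reset and passes after it, even though the saved tensor now holds the modified value. That silent pass is the kind of unexpected consequence the commit message warns about.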
-
Committed by Yuang Liu
-
Committed by wangzhen38
* add tdm sample * add tdm sample in c++ * update tdm sample * modify sample count * fix conflict * add set_date * fix cmake error * fix bug of proto * update index_dataset proto * update cmake * fix error cmake * fix cmake mkldnn * fix cmake proto * update cmake proto * update cmake * update rec * update dataset * update dataset * update dataset * update dataset * update dataset * update coverage * update ci * goback4 * fix npu ci * add xxhash dep
-
Committed by smallv0221
* fix dropout static when axis != None * update dropout test * add dropout test * fix test * Update test_dropout_op.py * Update test_dropout_op.py * fix testcase * fix testcase * Update test_dropout_op.py * fix testcase * fix testcase * optimize perf * add new test * fix testcase
-
Committed by zyfncg
-
Committed by Chen Weihang
-
Committed by Yuang Liu
-