- 13 2月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* fix copysign compile error on Windows * fix more files' macro
-
- 11 2月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
- 31 1月, 2023 1 次提交
-
-
由 姜永久 提交于
* rm flags_retain grad in pybind * retain grads for xpu test * set retain grad for xpu * rm flag * lint --------- Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
-
- 01 12月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* do not link python lib in tensor wrapper
-
由 HongyuJia 提交于
* fix typo error * pass CI-coverage
-
- 24 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* rm dependence to "convert_utils.h" in some files * fix bugs * replace DataType2String with DataTypeToString * replace framework::DataTypeSize with phi::SizeOf * mv convert_function from fluid to phi and rm old map * recommit with pre-commit * repalce ProtoVarType with ProtoDataType and update comment. * fix error about include "dnnl.hpp" * revert add dep mkldnn to convert_utils in phi * add mkldnn deps in convert_utils.h in phi * move deps to convert_utils.h in phi
-
- 17 11月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
Support bfloat16 for adamw and adam optimizer. Fit the lr for pure bf16 training with tensor fusion. (#48041) * add bfloat16 for adamw * set lr not to bfloat16 for pure bf16 training * update the logic * update the adamw optimizer * support bfloat for adam
-
- 16 11月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 WangZhen 提交于
Get grads types from cpp for adam to speed up
-
- 09 11月, 2022 1 次提交
-
-
由 WangZhen 提交于
* Get params and grads in cpp to avoid gpu idel time * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py * Get grads from cpp and construct params_grads on python * Check meta and remove comments
-
- 07 11月, 2022 1 次提交
-
-
由 WangZhen 提交于
* Get three grad lists in CPP to avoid gpu idle time * Support legacy mode
-
- 28 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
- 27 9月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* refine gil use
-
- 15 9月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* saved_tensors_hooks
-
- 06 9月, 2022 1 次提交
-
-
由 OccupyMars2025 提交于
-
- 09 8月, 2022 1 次提交
-
-
由 WangZhen 提交于
* Pybind Fucniton and hide ExecutorEngine and PEEngine * Remove FunctionNames in compilation_unit
-
- 08 8月, 2022 3 次提交
-
-
由 WangZhen 提交于
* Polish function code * Rename funciton to engine * Fix Log msg and doc * Rename Function to Engine and using new Function class to warp Engine * Rename EngineInfo * Adjust member variable order
-
由 Chen Weihang 提交于
-
由 Weilong Wu 提交于
* [Eager] fix to_uva_tensor without specific gpu number * Update test_tensor_uva.py update test case
-
- 29 7月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Jiabin Yang 提交于
-
- 19 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* release gil when run backward
-
- 13 7月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [JIT]Layer support eager dygraph mode and polish Function interface * remove usless code * fix #define
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* fix custom op error * fix code error
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 23 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove storage * add glog include * add glog include * add glog include
-
- 11 5月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
-
- 06 5月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* fix pylayer_memleak * inc ref before return Py_None * refine * refine * refine * refine
-
- 29 4月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
-
- 24 4月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 14 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 04 4月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Eager] support tensor uva, test=windows_ci * Add headers to fix CI, test=windows_ci * Expose _uva python interface, Fix windows ci issue
-
- 03 4月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* Update ResNet test cases * [Eager] Support uva, adamax, fill_diagonal_, to_list and so on. * Fix CI * Updated CUDA defined statement * Fix CI * Update headers, Fix CI * Remove useless setting * Updated func name to Fix windows-CI * Remove tensor uva related codes * Remove uva related code * recover original test
-
- 01 4月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 24 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 22 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* replace Backend by Place in C++ API * fix left code * fix test_to_api bug
-
- 18 3月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Supported Complex2Real Conversion for Eager Dygraph * Supported Complex2Real Conversion for Eager Dygraph * Enabled complex type promotion test for matmul_v2 * Fix CI issues * Merged adj_edges_ with GradSlotMeta * Fixed monir issue * Adjusted num runs * Recovered Eager performance tests configurations * Recovered Eager performance tests configurations * Adjusted performance tests configurations * Fixed Minor Issues with performance tests * Moved out Edge from GradSlotMeta * Fixed issues from merge * Fixed typo * Addressed review comments * Fixed minor issues
-
- 17 3月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Eager] Support eager grad interface, draft version * Support eager grad interface with allow_unused and multi startup_op * Fix code format * Fix allow_unused case, return PyNone if tensor not initialize * Support output's stop_gradient related to create_graph * Support grad exception case in eager mode, fix coverage CI * Update ToPyObject, return PyNone if not initialize * AccumulationNode add FLAGS_retain_grad_for_all_tensor * Fix ci issue * Fix CI issue * fix, use core.eager.Tensor * Add func SetBufferSlotRankZeros for GradTensorHolder * Support retain_graph by using ClearTensorWrappers * Support retain_graph by using ClearTensorWrappers * Update retain_graph and no_grad_vars related test case * Update code gen logic for ClearTensorWrappers * Fix by override statement * fix override func args * Support retain_graph, update unit tests * Updated ClearTensorWrappers logic * fix grad python interface * Use deep copy and update unit tests * Polish code * Polish code * Fix CI issue, Deep copy only use when user set grad_tensors * Fix CI, use Backward instead RunBackward * Fix CI, Declare kernel explicitly in test file * Polish, remove vector of TensorWrapper * Refactor the logic of grad/backward, polish codes * Update code after merge upstream develop * Polish after merge upstream develop * Update to adapt new GradNodeBase superclass * Fix error introduced during conflict resolution * Update purify potential_startup_nodes logic * Fix errors * Polish code * Remove useless args for ToPyObject * Remove useless TensorWrappersSet * Fix code-format, re-install pre-commit * Fix pre-process logic for potential_startup_ops * Update unit tests, use eager mode
-