- 16 Nov, 2022 1 commit
-
-
Committed by Leo Chen
-
- 19 Sep, 2022 1 commit
-
-
Committed by niuliling123
-
- 30 Aug, 2022 1 commit
-
-
Committed by zyfncg
* add runtime config in phi
* add runtime attr for op desc and op
* fix no proto error
* adjust opdesc set_attr impl
* try to remove conv_op extra attrs
* add init runtime attr map
* change extra header path
* fix runtime_attr
* fix trace_op
* fix bug of pass
* fix merge conflict
* fix dygraph attrs
* fix bug of pass
* fix dygraph bug
* fix unittest module
* delete extra attr default
* fix dropout kernel
* polish code
* fix extra output of instance_norm
* fix merge conflict
* fix op_desc bug
* add extra attr in yaml for conv3d_transpose
* don't remove extra input and output
* fix save_inference_model
* fix bug of batch_norm
* revert some change
* polish log
* polish code
* add code comment
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
-
- 11 Jul, 2022 1 commit
-
-
Committed by Aganlengzi
-
- 07 Jul, 2022 1 commit
-
-
Committed by Allen Guo
* feat(): dynamic_to_static support for ipu.
* fix(): format fix.
* fix format
* fix cpplint error
* use phi::errors
* fix format
* fix format
* fix(): add api to restore patched function.
* fix(): identity_loss uses cpu place as expected kernel type.
* doc(): add IPU dy2static related docs.
* fix(): combine test cases.
* fix format
* fix comment
* fix format
* apply comment
* fix compiling
* fix(): align docs.
* fix(): fix identity_loss function docs.
* fix(): adjust mean and sum in identity_loss.
* fix(): minor docs.
* move API to paddle.incubate.identity_loss
* fix UT
Co-authored-by: zhaorui chen <zhaoruic@graphcore.ai>
-
- 26 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 23 Jun, 2022 1 commit
-
-
Committed by niuliling123
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 16 May, 2022 1 commit
-
-
Committed by Chen Weihang
-
- 28 Apr, 2022 1 commit
-
-
Committed by zyfncg
-
- 27 Apr, 2022 1 commit
-
-
Committed by zyfncg
* optimize performance of dygraph
* optimize performance of dygraph and elementwise_add
* optimize the trace op
* fix bug
* fix bug
* fix unittest bug
* fix code format
-
- 26 Apr, 2022 1 commit
-
-
Committed by Aurelius84
* [Performance] Remove redundant op_type in RecordEvent
* [Performance] Remove redundant op_type in RecordEvent
* [Performance] Remove redundant op_type in RecordEvent
-
- 19 Apr, 2022 1 commit
-
-
Committed by Zhang Ting
-
- 07 Apr, 2022 1 commit
-
-
Committed by chenjian
* no
* maintain old profiler
* fix old dygraph record event
-
- 23 Mar, 2022 1 commit
-
-
Committed by wanghuancoder
* fix some slice bug, test=develop
* eager slice, test=develop
* eager slice, test=develop
* refine, test=develop
* refine, test=develop
* fix bug, test=develop
* refine, test=develop
* rename function name, test=develop
-
- 15 Mar, 2022 2 commits
-
-
Committed by xiongkun
* run python api in eager model and filter the out in argument list
* fix code
-
Committed by Zhanlue Yang
* Added more signposts to dygraph profiling
* Fixed minor issues
* Refactored signpost names
* Fixed typo
* Removed debug codes
* Fixed typo
* Adjusted signpost names
* Fixed issues from branch merge
-
- 07 Mar, 2022 1 commit
-
-
Committed by xiongkun
* add python api test in TestOp
* test_python_api if self.python_api is set
* fix code by CR
-
- 03 Mar, 2022 1 commit
-
-
Committed by ronnywang
-
- 28 Feb, 2022 2 commits
-
-
Committed by liutiexing
* add align for WorkQueue
* add spinlock
* merge develop
* merge
* Add EventsWaiter
* Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
* Add host_trace_level env variable
* Revert "Optimize perf of softmax_with_cross_entropy (#39553)" This reverts commit bbe5228c.
Co-authored-by: liutiexing <liutiexing@google.com>
Co-authored-by: ZzSean <18818272991@163.com>
-
Committed by zhangbo9674
* refine bf16 amp-o1 logic
* refine amp GLOG
* refine unittest
* refine unittest
-
- 22 Feb, 2022 1 commit
-
-
Committed by Leo Chen
-
- 21 Feb, 2022 1 commit
-
-
Committed by chenjian
* fix RecordEvent interface
* modify default level to 4
* update interface use
* add const default trace level
* update record event interface using
* update record event interface using
* update operator.cc
* update part2
* update part1
* fix include profiler.h header in ps server
* fix include profiler.h header in ps server
* fix profiler.h header
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi
* rename namespace to phi
* rename infrt pten dir to phi
* resolve conflict
* rename pten to phi in cmake
* revert all infrt change
* change needed files
* fix infrt failed
* fix inference failed
-
- 18 Feb, 2022 1 commit
-
-
Committed by zhangbo9674
* support dtype param for auto_cast
* add amp_dtype for tracer
* add unsupported bf16 list
* support bf16 amp for O2
* refine python interface for bfloat16
* refine code
* refine code
* refine unittest
* refine code
* refine code
* add bf16 o1
* refine code by comment
* add gradient accumulator
* add recompute
-
- 16 Feb, 2022 1 commit
-
-
Committed by Jiabin Yang
* merge legacy to fluid
* Remove legacy code
* Remove legacy code
* Remove DataType test
* Using Tensor directly instead of using EagerTensor
* support gradient_accumulation
* make test_imperative_lod_tensor_to_selected_rows longer
* make test_imperative_lod_tensor_to_selected_rows longer
* refine code
* Rename all EagerTensor to Tensor
* Rename some EagerTensor to Tensor
* rename EagerTensor to EagerVariable
* add more test
* merge develop and refine code
-
- 15 Feb, 2022 1 commit
-
-
Committed by ronnywang
* [CustomRuntime] Add DeviceManager
* [CustomRuntime] Add DeviceInterface
* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
* [CustomRuntime] Add plug-in device
* [CustomRuntime] Memory module support PluggableDevice
* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
* update
* [API] update API doc based on comments, test=develop
Co-authored-by: qili93 <qili93@qq.com>
-
- 09 Feb, 2022 1 commit
-
-
Committed by Jiabin Yang
* merge legacy to fluid
* Remove legacy code
* Remove legacy code
* Remove DataType test
* Using Tensor directly instead of using EagerTensor
* support gradient_accumulation
* make test_imperative_lod_tensor_to_selected_rows longer
* make test_imperative_lod_tensor_to_selected_rows longer
-
- 02 Feb, 2022 1 commit
-
-
Committed by Jiabin Yang
-
- 26 Jan, 2022 1 commit
-
-
Committed by Leo Chen
-
- 17 Jan, 2022 1 commit
-
-
Committed by Wilber
* add pten::Place data structure.
* update ci problem
* fix ci problem
* update
* using platform::Place=pten::Place
* remove BOOST_GET_CONST for CPUPlace and GPUPlace
* compile pass 25%.
* compile pass 45%
* compile pass 60%
* remove boost_get for xpu npu mlu and ipu
* compile pass on cpu and gpu.
* fix compile problem
* fix compile error.
* update
* fix ci problem
* update
* ci approve
* fix ci problem
* fix ci eager test problem
* remove BOOST_GET_CONST
* fix npu compile
-
- 15 Jan, 2022 1 commit
-
-
Committed by Chen Weihang
* remove cached kernel context
* revert dataloader format change
-
- 20 Dec, 2021 1 commit
-
-
Committed by fwenguang
-
- 10 Nov, 2021 1 commit
-
-
Committed by Chen Weihang
* resolve conflict with develop
* cache kernel context in tracer for perf up
* replace densetensor when build kernel context
* fix detail compile error
* append impl to static mode
* fix conflict error
* clear attrs after run kernel
* fix coverage failed
* fix cycle compile error
* remove multi-in&out adapt code
* remove tensor meta utils
* clear data when throw exception
-
- 13 Oct, 2021 1 commit
-
-
Committed by Leo Chen
* refine amp level
* fix typo
* update tracer._amp_level
-
- 17 Sep, 2021 1 commit
-
-
Committed by zhangbo9674
* add pure fp16 major function in auto_cast & tracer
* support master weight in dygraph for pure fp16
* check mix dtype of fp16&fp32 for check_finite_and_unscale op
* change pure fp16 function name
* refine some bug in auto_cast
* refine auto_cast interface logic
* add param _casted_by_pure_fp16 for class Layer
* support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
* refine pure_fp16_decorator as decorator
* add unittest
* add comment
* add comment
* support recompute
* add comment for auto_cast and decorator
* support to_static_state_dict for paddle.jit.save
* unlimite models num and optimizers num
* add lookup_table in black_list
* fix momentum and layer state_dict
* fix bug in layer state_dict
* fix bug in layer state_dict_helper
* refine unittest
* refine test_momentun_op
* refine interface and some code
* refine amp_decorator interface
* refine pure fp16 interface
* refine master weight interface
-
- 05 Aug, 2021 1 commit
-
-
Committed by Zeng Jinle
-
- 30 Jun, 2021 1 commit
-
-
Committed by houj04
* support set_device for NPU.
* minor update doc and add more unit test.
-
- 24 Jun, 2021 1 commit
-
-
Committed by houj04
* in NPU environment, use CPUPlace for missing operators.
* in NPU environment, use CPUPlace for missing operators.
* fix TensorCopy bug and add unit test.
* fix code style.
* add more unit tests.
-
- 23 Jun, 2021 1 commit
-
-
Committed by wanghuancoder
* optimize attr default value, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* fix bug in AttrReader, test=develop
* fix bug, test=develop
* fix double_grad, test=develop
* refine, test=develop
* refine, test=develop
* fix checker null, test=develop
* for test, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
-