- 28 4月, 2022 2 次提交
-
-
由 zyfncg 提交于
* Optimize performance of dygraph (v4) (#42196) * optimize performance of dygraph * optimize performance of dygraph and elementwise_add * optimize the trace op * fix bug * fix bug * fix unittest bug * fix code format * fix cherry-pick problem
-
由 zyfncg 提交于
* Optimize the performanece of sum api (#42231) * optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log * Add move construct for KernelSignature (#42253) * add move construct for KernelSignature * add noexcept * fix cherry-pick problem
-
- 27 4月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* Remove std::type_index in AttributeArdDef (#42122) * polish some impl * add lost attr type * polish details * fix error type * polish in name lists * add double attr * adapt infrt attr parse * add attr type test (#42263) * opt attr eaque perf (#42272)
-
由 Aurelius84 提交于
* [Performance]Remove redundant op_type in RecordEvent * [Performance]Remove redundant op_type in RecordEvent * [Performance]Remove redundant op_type in RecordEvent
-
由 Chen Weihang 提交于
* Change small vector size (#42202) * change samll vector size * Update type_defs.h * Optimize dygraph InferShape perf (#42155) * init commit * remove two hash impl * fix bug * polish details * fix compile failed * fix compile failed * fix compile failed * add default kernel sig cache * fix get kernel arg defs error * remove kernel arg defs cache * fix origin op execute
-
- 26 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* Add paddle::variant and replace paddle::any (#42139) * add variant and replace any * split attribute * Optimize dygraph GetExpectedKernelType perf (#42154) * opt dygraph scheduling * revert part impl * fix variant compile error (#42203) * replace any by variant in infermeta (#42181)
-
- 25 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimiaze performance of PreparePhiData (#42093) * Dygraph performance optimization (v2) (#42103) * optimiaze performance of PreparePhiData * dygraph performance optimization * optimize performance of dygraph (#42137)
-
- 21 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) * support setting vector out size in yaml * support setting size of vector<tensor> for out in yaml * resolve conflict Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
-
- 20 4月, 2022 1 次提交
-
-
由 Zhang Ting 提交于
cherry-pick #41884
-
- 19 4月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
Cherry-pick #40338 #41741 #41313
-
- 15 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* fix data transform problem for cudnn backend (#41622) * Fix problem of infermeta with vector output (#41646) * remove stack_grad infershape * fix bug of output with null * fix bug
-
- 12 4月, 2022 1 次提交
-
-
由 chenjian 提交于
* no * maintain old profiler * fix old dygraph record event
-
- 11 4月, 2022 1 次提交
-
-
由 Liu-xiandong 提交于
fix bug when TruncatedNormal cannot fall back in cpu
-
- 05 4月, 2022 1 次提交
-
-
由 Zhang Ting 提交于
* switch autotune * implement AutoTuneCache * implement AutoTuneCache class * add pybind api * add dygraph test * support static mode and eager mode and improve unittests * rename the SwitchAutoTune Class and improve tests * improve AutoTuneStatus and reduce the cost of tests
-
- 02 4月, 2022 1 次提交
-
-
由 Liu-xiandong 提交于
* [KP] fix bug in phi static graph mode * modify the useless code
-
- 01 4月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
由 chentianyu03 提交于
* add interploate cpu kernel * fix nullptr bug * add interpolate gpu kernel * fix unit test error * remove raw kernels * add cuda kernel impl * add infermeta * recover accidentally deleted kernels in interpolate op * fix grad x_grad name error * remove interpolate_v2_op.h * rm unused codes * fix xpu build error * fix build error * fix namespace error * add register header for nup * fix infermeta error * modify by review * add the missing args in test_trt_convert_nearest_interp_v2
-
由 Liu-xiandong 提交于
* fix bug in activation xpu kp kernel * delete useless comment
-
- 31 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
由 Liu-xiandong 提交于
* [KP] fix bug in phi kp * delete useless comment * update * update * choose the xpu kp kernel in phi
-
- 28 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* remove in_dtype, out_dtype in redcue grad * set the dtype and layout in noneedbufferInputs func
-
- 24 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
- 23 3月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* fix some slice bug, test=develop * eager slice, test=develop * eager slice, test=develop * refine, test=develop * refine, test=develop * fix bug, test=develop * refine, test=develop * rename function name, test=develop
-
由 Zhanlue Yang 提交于
* Removed redundant use of declarations.h * Fixed minor bug
-
- 21 3月, 2022 1 次提交
-
-
由 zn 提交于
-
- 18 3月, 2022 1 次提交
-
-
由 Zhang Zheng 提交于
* first commit * fix compile error * support std::vector<std::srting> * fix * fix op support on GPU by chenweihang * pass test * infershape * add set_dtype * fix order * fix * unify the impl of dt and sr * fix
-
- 17 3月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* move assign kernel init commit * change vec<tensor> to vec<tensor*> * support tensor array * support api declare * fix test_list failed * fix npu and xpu failed * fix infrt failed * remove assign array size in operator * move assign sr header into sr dir * add infermeta for assign * test op success * fix test_list failed * fix kunlun failed * add set host allocator in tests * support tensor array in arg ctx * open set layout in share_meta * fix meta tensor layout error * fix test failed
-
由 Qi Li 提交于
-
- 16 3月, 2022 3 次提交
-
-
由 zyfncg 提交于
* move roi_align_grad kernel * move roi_align grad kernel and infershape to phi * remove roi_align infershape
-
由 Liu-xiandong 提交于
* [kp]fix bug that cannot fallback to CPU normally in XPU KP * fix bug in static graph
-
由 qipengh 提交于
-
- 15 3月, 2022 4 次提交
-
-
由 xiongkun 提交于
* run python api in eager model and filter the out in argument list * fix code
-
由 furnace 提交于
* [NPU] add AMP O1 support * [NPU] fix NOTE and warnings
-
由 Zhanlue Yang 提交于
* Added more signposts to dygraph profiling * Fixed minor issues * Refactored signpost names * Fixed typo * Removed debug codes * Fixed typo * Adjusted signpost names * Fixed issues from branch merge
-
由 hong 提交于
* move one hot to phi; test=develop * fix bugs; test=develop * fix bugs; test=develop * add infer meta; test=develop * fix bugs; test=develop * resolve confilct * resolve confilct * fix bug; * fix error; test=develop * update; test=develop * polish code; test=develop * add one api in eager mode; test=develop * add one hot test; test=develop * remove use less code; test=develop * fix bug; test=develop * polish code; test=develop * polish code; test=develop
-
- 14 3月, 2022 1 次提交
-
-
由 From00 提交于
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-
- 12 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix eager benchmark test failed * fix test_tracer failed
-
- 11 3月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* remove needless deps in unittests * add gpu marco * fix other unittests * fix kernel name error * fix test_prepare_op * fix failed dygraph unittests * fix gpu failed tests * fix cinn test failed * fix cinn test failed * fix dropout tests
-
由 chentianyu03 提交于
* add reduce_sum grad kernel * add reduce_grad * modify reduce grad * update reduce grad functions * fix build error * add argument mapping * move cast input after grad * add dims.size=1 cpu reduce_sum grad compute method * update reduce grad GPU * remove raw reduce_sum_grad kernel * modify header files * add namespace funcs for reduce_grad_funcstions
-
- 10 3月, 2022 1 次提交
-
-
由 Lijunhui 提交于
-