1. 16 Mar, 2022 1 commit
  2. 15 Mar, 2022 4 commits
    • run python api in eager model and filter the out in argument list (#40523) · 4d886f75
      xiongkun committed
      * run python api in eager model and filter the out in argument list
      
      * fix code
    • [NPU] add AMP O1 support (#40362) · 69dd43d1
      furnace committed
      * [NPU] add AMP O1 support
      
      * [NPU] fix NOTE and warnings
    • Added more profile signposts to dygraph (#40201) · 36db75b4
      Zhanlue Yang committed
      * Added more signposts to dygraph profiling
      
      * Fixed minor issues
      
      * Refactored signpost names
      
      * Fixed typo
      
      * Removed debug codes
      
      * Fixed typo
      
      * Adjusted signpost names
      
      * Fixed issues from branch merge
    • Move one hot to phi (#39876) · 7701db37
      hong committed
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve conflict
      
      * resolve conflict
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
  3. 14 Mar, 2022 1 commit
  4. 12 Mar, 2022 1 commit
  5. 11 Mar, 2022 2 commits
    • [Phi] Remove needless deps in unittests (#40256) · 89ed57e2
      Chen Weihang committed
      * remove needless deps in unittests
      
      * add gpu macro
      
      * fix other unittests
      
      * fix kernel name error
      
      * fix test_prepare_op
      
      * fix failed dygraph unittests
      
      * fix gpu failed tests
      
      * fix cinn test failed
      
      * fix cinn test failed
      
      * fix dropout tests
    • [Phi] Reduce grad (#40263) · f452ad5c
      chentianyu03 committed
      * add reduce_sum grad kernel
      
      * add reduce_grad
      
      * modify reduce grad
      
      * update reduce grad functions
      
      * fix build error
      
      * add argument mapping
      
      * move cast input after grad
      
      * add dims.size=1 cpu reduce_sum grad compute method
      
      * update reduce grad GPU
      
      * remove raw reduce_sum_grad kernel
      
      * modify header files
      
      * add namespace funcs for reduce_grad_functions
  6. 10 Mar, 2022 1 commit
  7. 09 Mar, 2022 1 commit
  8. 08 Mar, 2022 1 commit
  9. 07 Mar, 2022 2 commits
  10. 03 Mar, 2022 2 commits
  11. 02 Mar, 2022 3 commits
  12. 01 Mar, 2022 1 commit
  13. 28 Feb, 2022 3 commits
  14. 24 Feb, 2022 2 commits
  15. 23 Feb, 2022 2 commits
  16. 22 Feb, 2022 3 commits
  17. 21 Feb, 2022 2 commits
    • [pten]rm reduce_sum and reduce_mean raw kernel (#39484) · 2bb5aae8
      chentianyu03 committed
      * rm reduce_sum raw kernel
      
      * remove reduce_mean kernel
      
      * remove reduce_mean kernel
      
      * reduce support int and int64_t
      
      * mean support int and int64_t type
    • Update record interface using part2 (#39694) · c984cd85
      chenjian committed
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
  18. 20 Feb, 2022 1 commit
  19. 19 Feb, 2022 2 commits
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 committed
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
    • fix RecordEvent interface (#39675) · 019a552b
      chenjian committed
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update operator.cc
  20. 18 Feb, 2022 4 commits
    • [Pten] blas and lapack migration (#39587) · 8c7ee8c2
      Feiyu Chan committed
      * move blas related files
      * move lapack related files
    • [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
      zhangbo9674 committed
      * support dtype param for auto_cast
      
      * add amp_dtype for tracer
      
      * add unsupported bf16 list
      
      * support bf16 amp for O2
      
      * refine python interface for bfloat16
      
      * refine code
      
      * refine code
      
      * refine unittest
      
      * refine code
      
      * refine code
      
      * add bf16 o1
      
      * refine code by comment
      
      * add gradient accumulator
      
      * add recompute
    • [MLU]add matmul and matmul_v2 op (#39539) · 229ec32a
      qipengh committed
      * [MLU]add matmul and matmul_v2 op
      
      * [MLU] fix data_type and del matmul
      
      * [MLU] fix compile error
      
      * [MLU] fix ci_check error
    • [Bug Fix]Fix gradient accumulator (#39577) · a7cbd3ef
      Jiabin Yang committed
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * fix different device gradient_accumulator bug
      
      * merge develop
      
      * remove useless tests
  21. 16 Feb, 2022 1 commit