1. 19 1月, 2023 2 次提交
    • J
      [KUNLUN] add op: maxpool_with_index (#49505) · f71f77e9
      jameszhang 提交于
      * [KUNLUN] add op: maxpool_with_index
      
      * use DeviceContext::Alloc() instead of DenseTensor::mutable_data()
      
      * fix file format
      
      * solve clip unittest failure
      
      * minor fix
      
      * Revert "solve clip unittest failure" since the issue is fixed
      in #49535
      
      This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.
      
      * align with xdnn on the definition of mask in max_pool_with_index
      
      * minor
      f71f77e9
    • H
      [Paddle Inference]Support PaddlePaddle Backend on Triton (#49758) · e3f39833
      heliqi 提交于
      * support PaddlePaddle Backend on Triton
      
      * fix test cases
      
      * fix Codestyle
      
      * add test case
      
      * add test case
      e3f39833
  2. 18 1月, 2023 6 次提交
  3. 17 1月, 2023 10 次提交
    • J
      Add more dy2st ut2 (#49881) · 2242136a
      Jiabin Yang 提交于
      * add test for composite with dy2st
      
      * add more log
      2242136a
    • Z
      Refine munmap freq for RefcountedMemoryMapAllocation (#49691) · 3fdc105f
      zhangbo9674 提交于
      * refine munmap freq for ref_cnt_mmap_allocator
      
      * add shm reuse logic
      
      * fix compile bug
      
      * fix compile bug
      
      * fix bug of file refcount
      
      * fix compile bug
      
      * fix compile bug
      
      * refine code for delete shm case
      
      * polish code
      
      * refine shm cache pool size setting logic
      
      * set buffer is 2
      
      * refine shm cache size logic
      
      * refine max shm cache
      
      * refine shm cache size
      3fdc105f
    • P
      Rewrite mat reshape transpose testers (#49580) · d9d47dc6
      Paulina Gacek 提交于
      * reshape_transpose_matmul_pass_tester rewritten
      
      * matmul_transpose_reshape_pass_tester rewritten
      
      * mkldnn to onednn
      d9d47dc6
    • P
      support CUDA Graph for new executor (#49708) · 8e5ed04d
      pangyoki 提交于
      * new exe supports CUDA Graph
      
      * fix
      
      * fix
      
      * fix
      
      * fix FLAGS_use_stream_safe_cuda_allocator in unittest
      
      * insert output of coalesce_tensor op to skip_gc_var
      
      * fix
      8e5ed04d
    • X
      Prim api gen (#49654) · 813e27c9
      xiaoguoguo626807 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * origin_yaml codegen success
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * clear #
      
      * merge develop
      
      * clear #
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * clear prim_gen
      
      * recover git ignore for generated op
      
      * parse_yaml success
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * code_style
      
      * revert parse_utils+ clear prim_gen
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      
      * modify gen cmake
      
      * add more api gen
      
      * add header
      
      * modify static
      
      * add static expand unsqueeze
      
      * comments
      
      * modify compopmaker
      
      * revert
      
      * modify gen name
      Co-authored-by: NJiabinYang <360788950@qq.com>
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      813e27c9
    • Y
      [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      YuanRisheng 提交于
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according comment
      
      * remove visitor in feed_op
      
      * modify according comment
      
      * perfect code according comment
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
      f7f1dc03
    • J
      add test for composite with dy2st (#49873) · b927ce81
      Jiabin Yang 提交于
      b927ce81
    • W
      [Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
      WangZhen 提交于
      * Support call backward() without params in dy2st
      2f24b2d8
    • L
      Modified compute and amplifier interceptor (#42044) · 989e39a5
      LiYuRio 提交于
      989e39a5
    • X
      【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a
      Xiaoxu Chen 提交于
      * support elementwise base func
      
      * fix compiling error and add test
      
      * support vjp for div using comp
      
      * remove additional change
      
      * fix dy2st error with magic num
      
      * fix dy magic num
      
      * another magic
      
      * another magic
      
      * another magic
      
      * add skip rename strategy
      
      * support add vjp
      
      * support add with new axis cal
      
      * support sub vjp
      
      * [prim] add multiply vjp rules
      
      * [prim] add multiply vjp rules
      
      * [prim] fix no infershape with composite in _append_backward_ops
      
      * [prim] add expand vjp rule
      
      * [prim] add exp vjp rule
      
      * uncomment infer shape for reshape/sum static prim api
      
      * [prim] fix tanh nullptr error
      
      * remove some print message
      
      * fix magic number in run_program relative tests @JiaBinYang
      
      * [prim] add expand,multiply,exp vjp rules
      
      * fix only support single direction reduce error
      
      * infer reduce dims using out dims
      Co-authored-by: NJiabinYang <360788950@qq.com>
      39c6765a
  4. 16 1月, 2023 12 次提交
  5. 15 1月, 2023 2 次提交
    • R
      support mp on xpu (#49815) · 6a56bce7
      Roc 提交于
      1 update xccl lib
      2 when using comm_ctx, the allocator should be set manually.
      6a56bce7
    • J
      【Prim】Enhance tests (#49814) · 090aa45d
      Jiabin Yang 提交于
      * support elementwise base func
      
      * fix compiling error and add test
      
      * remove additional param
      
      * support vjp for div using comp
      
      * remove additional change
      
      * fix dy2st error with magic num
      
      * fix dy magic num
      
      * another magic
      
      * another magic
      
      * add more test
      
      * fix windows problem
      
      * another magic
      
      * fix windows compile
      
      * invoke ci
      
      * add skip rename strategy
      
      * support add vjp
      
      * fix test_tanh
      
      * support add with new axis cal
      
      * fix resnet and some test
      
      * add composite log
      
      * support sub vjp
      
      * enhance_tests
      
      * support more dtype for full
      090aa45d
  6. 13 1月, 2023 8 次提交