1. 27 4月, 2022 19 次提交
  2. 26 4月, 2022 21 次提交
    • Q
      support nhwc format for kunlun conv/batch_norm (#42195) · 88d68c08
      QingshuChen 提交于
      * support nhwc format for kunlun conv/batch_norm
      *test=kunlun
      
      * minor
      *test=kunlun
      88d68c08
    • B
      【PaddlePaddle Hackathon 2】29、为 Paddle 新增 PixelUnshuffle 组网 API (#40728) · 5be9b824
      BrilliantYuKaimin 提交于
      * 增加PixelUnshuffle的形状推断
      
      * 增加PixelUnshuffle的算子注册
      
      * 增加PixelUnshuffle及其梯度的核函数
      
      * 增加PixelUnshuffle算子的描述
      
      * 增加PixelUnshuffle算子的签名
      
      * 在Python层面增加PixelUnshuffle
      
      * 增加PixelUnshuffle的单测
      
      * Update test_pixel_unshuffle.py
      
      * test=document_fix
      
      * Update test_pixel_unshuffle.py
      
      增加对extra_repr的测试
      
      * 修正代码格式
      
      * Update test_pixel_unshuffle.py
      
      修正对extra_repr的测试
      
      * 修改pixel_unshuffle核函数的实现位置
      
      * 修正代码格式
      
      * 完善对输入的检查
      
      * Update test_pixel_unshuffle.py
      
      * 完善pixel_unshuffle的输入检查
      
      * Update pixel_unshuffle_op.cc
      
      * Update unary.cc
      
      * add pixel_unshuffle
      
      * Update test_pixel_unshuffle.py
      
      * Update vision.py
      
      * 调整代码格式
      
      * Update vision.py
      
      * Delete extra spaces
      
      * Update pixel_unshuffle_sig.cc
      
      * Update vision.py
      
      * Update vision.py
      
      * add PixelUnshuffleGradInferMeta
      
      * remove PixelUnshuffleOpArgumentMapping
      
      * Update pixel_unshuffle_op.cc
      
      * 调整pixel_unshuffle及其梯度的核函数的实现位置
      
      * Update pixel_unshuffle_op.cc
      5be9b824
    • S
      3cdc7a01
    • C
      add attr type test (#42263) · eb64983a
      Chen Weihang 提交于
      eb64983a
    • A
      [Performance]Remove redundant op_type in RecordEvent (#42246) · 64d88e71
      Aurelius84 提交于
      * [Performance]Remove redundant op_type  in RecordEvent
      
      * [Performance]Remove redundant op_type  in RecordEvent
      
      * [Performance]Remove redundant op_type  in RecordEvent
      64d88e71
    • W
    • W
      [Eager] Fix final state adam in selected rows case (#42219) · 12311ddc
      Weilong Wu 提交于
      * [Eager] Support final_state_adam when argument grad (position 1) is selected_rows
      
      * Remove needless code
      
      * Add adam_dense_param_sparse_grad kernel
      12311ddc
    • W
    • D
      replace the numpy with paddle for the data generation in code; test=document_fix (#42259) · 30838aa6
      David Nicolas 提交于
      * replace the numpy with paddle_tensor for the data generation in code example
      
      * Create manipulation.py
      
      replace name as: https://github.com/PaddlePaddle/docs/blob/develop/docs/templates/common_docs.py#L9
      
      * for CI;test=document_fix
      
      * for CI;test=document_fix
      Co-authored-by: NChen Long <1300851984@qq.com>
      30838aa6
    • N
      update (#42248) · 9ee1dc53
      niuliling123 提交于
      9ee1dc53
    • S
      optimize graph_engine pybind (#42192) · 1bf08eca
      seemingwang 提交于
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add dsm sample method
      
      * add graph_neighbor_sample_v2
      
      * Add graph_neighbor_sample_v2
      
      * fix for loop
      
      * add cpu sample interface
      
      * fix kernel judgement
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * change index settings
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * move cudamemcpy after cuda stream sync
      
      * fix linking problem
      
      * remove comment
      
      * add cpu test
      
      * test
      
      * add cpu test
      
      * change comment
      
      * combine feature table and graph table
      
      * test
      
      * test
      
      * pybind
      
      * test
      
      * test
      
      * test
      
      * test
      
      * pybind
      
      * pybind
      
      * fix cmake
      
      * pybind
      
      * fix
      
      * fix
      
      * add pybind
      
      * add pybind
      
      * optimize pybind
      
      * test
      
      * fix pybind
      
      * fix
      Co-authored-by: NDesmonDay <908660116@qq.com>
      1bf08eca
    • S
      Make size op do no data transformation (#42204) · 06694953
      sneaxiy 提交于
      * fix size op
      
      * update
      06694953
    • Z
      fix heter_client&heter_server (#42188) · 27cb52a4
      ziyoujiyi 提交于
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud commm ready
      
      * .
      
      * .
      
      * arm_brpc compile
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * only output is ok
      
      * base is ok
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * add switch server bin
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * adapt brpc ssl
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * fix heter_server & heter_client
      
      * .
      
      * .
      
      * int->int64_t
      
      * .
      27cb52a4
    • F
      Adapt BKCL comm for XPUPS (#42168) · fccb0819
      Fan Zhang 提交于
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3nd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12nd version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13nd version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      
      * Adapt XPUPS - adapt BKCL comm for XPUPS - 4.24
      
      * update by compilation
      
      * Adapt XPUPS - register PSGPUTrainer for XPUPS - 4.25
      
      * update device_worker_factory
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      fccb0819
    • S
      fix bug: arange can not return shape when enable_static (#42182) · d5b4570d
      ShiningZhang 提交于
      * fix bug: arange can not return shape when enable_static
      
      * fix bug: test_arange
      d5b4570d
    • Z
      Optimize the performanece of sum api (#42231) · 2fe4bf2f
      zyfncg 提交于
      * optimize the performanece of sum api
      
      * optimize IsDenseTensorInput
      
      * remove debug log
      2fe4bf2f
    • D
      align the API parameter “name” annotation in math.py; test=document_fix (#42200) · 51ea349c
      David Nicolas 提交于
      * align the api name parameter annotation in math.py; test=document_fix
      
      * Update math.py
      
      * Update math.py
      
      * for CI;test=document_fix
      Co-authored-by: NChen Long <1300851984@qq.com>
      51ea349c
    • C
      Remove std::type_index in AttributeArdDef (#42122) · 996358a1
      Chen Weihang 提交于
      * polish some impl
      
      * add lost attr type
      
      * polish details
      
      * fix error type
      
      * polish in name lists
      
      * add double attr
      
      * adapt infrt attr parse
      996358a1
    • L
      fit for printing cinn_launch op (#42141) · ee56906e
      Leo Chen 提交于
      * fit for printing cinn_launch op
      
      * update boost::variant caster for bytes
      ee56906e
    • Z
      Add Sparse MaxPool3D (#42130) · 18e9aafb
      zhangkaihuo 提交于
      18e9aafb
    • X
      Add C++ EinsumOp which support 2 operands einsum. (#42105) · c7302f96
      xiongkun 提交于
      * full api fix
      
      * when out is None, go old dygraph mode
      
      * by static check
      
      * first version: support 2-inputs forwards. TODO: 1. backward  2. BroadCast  3. MultiVariable
      
      * time out -> 120
      c7302f96