1. 07 5月, 2022 4 次提交
    • P
      [dockerfile] update go version and delete useless package in dockerfile (#36809) · afcf6bd0
      pangyoki 提交于
      * add cuda11.4 develop docker
      
      * change default python from 2.7 to 3.7
      
      * change base image for cpu docker
      
      * fix gcc bug
      
      * fix whl package name
      
      * update go version and delete useless package in dockerfile
      
      * fix release18 error
      
      * fix wget sqlite problem
      
      * update go version
      
      * update go version in dev dockerfile
      
      * fix CI error
      
      * install zstd
      
      * fix CI error
      
      * add --no-check-certificate when install go
      
      * python2.7 do not install requirements
      
      * fix CI Coverage error
      
      * coverage==5.5
      
      * fix test_activation ut
      
      * let numpy < 1.22 to pass test_activation_op unittest
      
      * fix test_python_bf16_numpy_datatype unittest
      
      * change paddle-bfloat==0.1.3
      
      * recover version of paddle-bfloat
      afcf6bd0
    • A
      sync misc changes (#42534) · 37580838
      Allen Guo 提交于
      37580838
    • S
      support set cuda_arch_name in pipeline (#42498) · bb5a14dd
      Sing_chan 提交于
      * set auto to reduce core_avx/noavx.pyd size
      
      * set CUDA_ARCH_NAME in each case
      bb5a14dd
    • F
      Reduce the number of threads per block of deformable_psroi_pooling to solve... · 8c1b2fa6
      FlyingQianMM 提交于
      Reduce the number of threads per block of deformable_psroi_pooling to solve the bug where too many resources requested for launch (#42531)
      
      8c1b2fa6
  2. 06 5月, 2022 26 次提交
    • E
      bind elementwise_mod_op_xpu (#42175) · 6ea2f049
      enzodechine 提交于
      * bind elementwise_mod_op_xpu *test=kunlun
      
      * add more supported dtypes and UTs *test=kunlun
      
      * fix datatype error
      
      * add op to in xpu1_op_list
      
      * Update Mac cmake version >=3.15 (#41456)
      
      * Update Mac cmake version >=3.15
      
      * notest;read test1
      
      notest;read test2
      
      notest;read test3
      
      * fix inference link error
      
      * fix inference link error
      
      * fix windows link error
      
      * fix cmake_policy
      
      * fix build big size
      
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * disable unittest failed in eager CI in temporary (#42101)
      
      * test=py3-eager
      
      * test=py3-eager
      
      * test=py3-eager
      
      * combine graph_table and feature_table in graph_engine (#42134)
      
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add dsm sample method
      
      * add graph_neighbor_sample_v2
      
      * Add graph_neighbor_sample_v2
      
      * fix for loop
      
      * add cpu sample interface
      
      * fix kernel judgement
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * change index settings
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * move cudamemcpy after cuda stream sync
      
      * fix linking problem
      
      * remove comment
      
      * add cpu test
      
      * test
      
      * add cpu test
      
      * change comment
      
      * combine feature table and graph table
      
      * test
      
      * test
      
      * pybind
      
      * test
      
      * test
      
      * test
      
      * test
      
      * pybind
      
      * pybind
      
      * fix cmake
      
      * pybind
      
      * fix
      
      * fix
      
      * add pybind
      
      * add pybind
      Co-authored-by: NDesmonDay <908660116@qq.com>
      
      * [CustomDevice] add eager mode support (#42034)
      
      * fix FlattenContiguousRangeOpConverter out dim error (#42087)
      
      * fix FlattenContiguousRangeOpConverter out dim error
      
      * update code
      
      * fix python3.10 compile bug on windows (#42140)
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix incorrect usages of std::move and other compile errors (#41045)
      
      * fix bug of std::move and others
      
      * fix an compile error in debug mode
      
      * fix wrong copy assignment operator
      Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>
      
      * reformat
      Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>
      
      * reformat
      Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>
      
      * fix ArrayRef constructor following llvm
      
      * fix format
      
      * fix conflict with master
      
      * fix variant compile error (#42203)
      
      * [Eager] Support numpy.ndarry in CastNumpy2Scalar (#42136)
      
      * [Eager] Remove redundancy code, fix fp16 case (#42169)
      
      * [Eager] Support div(scalar) in eager mode (#42148)
      
      * [Eager] Support div scalar in eager mode
      
      * Updated and remove debug logs
      
      * Remove list, use 'or' directly
      
      * Remove useless statement
      
      * fix recompute (#42128)
      
      * fix recompute
      
      * modify return
      
      * add LICENSE in wheel dist-info package (#42187)
      
      * replace any by variant in infermeta (#42181)
      
      * 【PaddlePaddle Hackathon 2】24、为 Paddle 新增 nn.ChannelShuffle 组网 API (#40743)
      
      * Add infermeta for ChannelShuffle
      
      * Create channel_shuffle_grad_kernel.h
      
      * Create channel_shuffle_kernel.h
      
      * Create channel_shuffle_sig.cc
      
      * Create channel_shuffle_op.cc
      
      ChannelShuffle算子的描述
      
      * Create channel_shuffle_kernel_impl.h
      
      ChannelShuffle核函数的实现
      
      * Create channel_shuffle_grad_kernel_impl.h
      
      ChannelShuffle反向核函数的实现
      
      * Add kernel register of channel shuffle and grad
      
      注册ChannelShuffle及其反向的核函数
      
      * add nn.functional.channel_shuffle
      
      * add nn.ChannelShuffle
      
      * Create test_channel_shuffle.py
      
      * Update example of ChannelShuffle in vision.py
      
      * Update test_channel_shuffle.py
      
      * 修改channel_shuffle核函数的实现位置
      
      * 修正代码格式
      
      * 删除多余空格
      
      * 完善channel_shuffle的错误检查
      
      * Update unary.cc
      
      * Update channel_shuffle_op.cc
      
      * Update test_channel_shuffle.py
      
      * Update unary.cc
      
      * add channel_shuffle
      
      * Update test_channel_shuffle.py
      
      * Update vision.py
      
      * 调整代码格式
      
      * Update channel_shuffle_sig.cc
      
      * 更新ChannelShuffle的文档
      
      * 更新channel_shuffle的文档
      
      * remove ChannelShuffleOpArgumentMapping
      
      * add ChannelShuffleGradInferMeta
      
      * Update channel_shuffle_op.cc
      
      * 调整channel_shuffle及其梯度的核函数的位置
      
      * Do not reset default stream for StreamSafeCUDAAllocator (#42149)
      
      * remove redundant computation in Categorical.probs (#42114)
      
      * Downloading data for test_analyzer_vit_ocr (#42041)
      
      * Change server URL
      
      * update config
      
      * add test to parallel UT rule
      
      * add checksum to ensure files are downloaded
      
      * change downloading target
      
      * reuse existing variable
      
      * change target directory
      
      * fix en docs of some Apis (gradients, scope_guard, cuda_places, name_scope, device_guard, load_program_state, scale, ParamAttr and WeightNormParamAttr) (#41604)
      
      * Update scope_guard; test=document_fix
      
      * gradients; test=document_fix
      
      * gradients; test=document_fix
      
      * name_scope; test=document_fix
      
      * cpu_places; test=document_fix
      
      * WeightNormParamAttr; test=document_fix
      
      * cuda_places; test=document_fix
      
      * load_program_state; test=document_fix
      
      * device_guard; test=document_fix
      
      * device_guard; test=document_fix
      
      * ParamAttr; test=document_fix
      
      * scale; test=document_fix
      
      * scale; test=document_fix
      
      * update code example;test=document_fix
      Co-authored-by: NChen Long <1300851984@qq.com>
      
      * fix datatype error
      
      add op to in xpu1_op_list
      
      *test=kunlun
      
      * fix elementwise_mod op path error  *test=kunlun
      
      * fix elementwise_mod UT error  *test=kunlun
      
      * fix datatype error
      
      add op to in xpu1_op_list
      
      *test=kunlun
      
      add op to in xpu1_op_list
      
      fix elementwise_mod op path error  *test=kunlun
      
      fix elementwise_mod UT error  *test=kunlun
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: Npangyoki <pangyoki@126.com>
      Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com>
      Co-authored-by: NDesmonDay <908660116@qq.com>
      Co-authored-by: Nronnywang <524019753@qq.com>
      Co-authored-by: Nbaoachun <962571062@qq.com>
      Co-authored-by: zhouweiwei2014's avatarZhou Wei <1183042833@qq.com>
      Co-authored-by: Ntiancaishaonvjituizi <452565578@qq.com>
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      Co-authored-by: NRoc <30228238+sljlp@users.noreply.github.com>
      Co-authored-by: NBrilliantYuKaimin <91609464+BrilliantYuKaimin@users.noreply.github.com>
      Co-authored-by: NRuibiao Chen <chenruibiao@baidu.com>
      Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      Co-authored-by: NYilingyelu <103369238+Yilingyelu@users.noreply.github.com>
      Co-authored-by: NChen Long <1300851984@qq.com>
      6ea2f049
    • T
      Revert "Del github api token (#42330)" (#42544) · e3e3fa60
      tianshuo78520a 提交于
      This reverts commit 2c5cecb1.
      e3e3fa60
    • Z
      fix conv3d backward (#42502) · 503569a0
      zhangkaihuo 提交于
      503569a0
    • A
      [NPU] support model PPO (#42484) · d73eb38c
      Aganlengzi 提交于
      d73eb38c
    • A
      [NPU] add clip_by_norm op (#42411) · 1588e7e7
      Aganlengzi 提交于
      * [NPU] add clip_by_norm op
      
      * fix
      
      * update
      1588e7e7
    • F
      [XPUPS] Register pull_box_sparse op under XPU_KP compilation (#42354) · 63067e90
      Fan Zhang 提交于
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3nd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12nd version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13nd version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      
      * Adapt XPUPS - adapt BKCL comm for XPUPS - 4.24
      
      * update by compilation
      
      * Adapt XPUPS - register PSGPUTrainer for XPUPS - 4.25
      
      * update device_worker_factory
      
      * Adapt XPUPS - split heter_ps into .cu and .cc - 4.27
      
      * Adapt XPUPS - register pull_box_sparse op under XPU_KP - 4.28
      
      * update
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      63067e90
    • C
      [Eager] Polish generated code details (#42512) · 3da97b45
      Chen Weihang 提交于
      * polish code details
      
      * remove needless prefix
      
      * revert needless change
      
      * polish grad func generated format
      3da97b45
    • Z
    • T
      Del github api token (#42330) · 2c5cecb1
      tianshuo78520a 提交于
      2c5cecb1
    • L
      add send/recv for ProcessGroupHeter (#42318) · df113208
      lilong12 提交于
      df113208
    • W
      [Eager] inc ref before return Py_None (#42505) · a384828d
      wanghuancoder 提交于
      * fix pylayer_memleak
      
      * inc ref before return Py_None
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      a384828d
    • S
      Fix some typos. (#42407) · d7728051
      Shuangchi He 提交于
      d7728051
    • T
      fix false positive warning of gcc>=9 (#42265) · f5689178
      tiancaishaonvjituizi 提交于
      * fix false positive warning in gcc>=9
      
      * use more aggressive way
      
      * Add comments
      f5689178
    • A
      update UTs 3 (#42519) · 94acf7c8
      Allen Guo 提交于
      94acf7c8
    • J
      Fix stray error (#42509) · 832e58d6
      Jiabin Yang 提交于
      * fix @ stray error in dygraph
      
      * fix @ stray error in dygraph
      832e58d6
    • Y
      fix dataset ut (#42504) · 06927016
      yaoxuefeng 提交于
      * fix dataset ut
      
      * fix seed state ut
      06927016
    • Z
      b5b02f3f
    • L
      skip bf16 test if not supported (#42503) · 69b5d74d
      Leo Chen 提交于
      69b5d74d
    • A
      [IPU] remove transfer cast pass (#42520) · 09a13294
      Allen Guo 提交于
      * rm transfer_cast_op_pass
      
      * rm header
      09a13294
    • A
      update UTs 2 (#42518) · 001dab0b
      Allen Guo 提交于
      001dab0b
    • A
      update UTs 1 (#42517) · 063a3509
      Allen Guo 提交于
      063a3509
    • A
      [IPU] update UTs 0 (#42516) · 63d4d05a
      Allen Guo 提交于
      * update UTs 0
      
      * fix ci
      
      * fix ci 3
      63d4d05a
    • A
      [IPU] clean ipu related code (#42511) · 1b5647d7
      Allen Guo 提交于
      * clean code
      
      * fix ci
      
      * fix ci
      
      * fix ci 2
      1b5647d7
    • W
      6ff35e17
    • Z
      [AutoParallel] adapt for 2d laplace (#41601) · c043a21b
      zhaoyingli 提交于
      * add default_ctx in backward.py
      
      * record grad_var_to_var with grad_times
      
      * fix backward
      
      * update annotation
      
      * add complete_high_order_grad in complete_forward
      
      * add dist slice op
      
      * update grad_var_to_var type
      
      * update partition_block init mapping before loss op
      
      * update compatible for 'XShape' & update 'allreduce_vars'
      
      * add dist reshape op when input dim equal to output dim
      
      * update 'set_grad_var_shape' with grad_var_to_var
      
      * fix dist slice
      
      * fix set_grad_var_shape
      
      * add dist pnorm op
      
      * fix dist pnorm dist_attr
      
      * fix engine startprogram & adapt highorder grad
      
      * fix set_grad_var_shape when mp
      
      * update unittest
      
      * update cmakelist
      
      * default strategy in engine: dp
      
      * bug fix
      
      * tiny fix
      
      * flatten outputs
      
      * fix default strategy
      
      * init default ctx
      
      * tiny fix
      
      * test=allcase
      c043a21b
    • W
      fix pylayer_memleak (#42501) · 2d29d833
      wanghuancoder 提交于
      2d29d833
  3. 05 5月, 2022 10 次提交