1. 26 11月, 2021 5 次提交
    • Z
      upgrade async distributed training in pscore (#37515) · 74605fc2
      zhaocaibei123 提交于
      * test
      
      * test
      
      * rm test
      
      * update
      
      * update
      
      * update
      
      * add unittest
      
      * update
      
      * update save
      74605fc2
    • L
      Fix bugs when bias add none in static graph for fused_attention op. (#37566) · 097e098d
      Li Min 提交于
      * Fix bugs when bias is none for static graph for fused_attention op.
      097e098d
    • Z
      Added interface reset_grad_inplace_version (#37573) · dcb91fd7
      Zhanlue Yang 提交于
      reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care.
      
      This is essentially a hack interface to satisfy one specific request
      dcb91fd7
    • W
      TDM2 (#37044) · 4826167c
      wangzhen38 提交于
      * add tdm sample
      
      * add tdm sample in c++
      
      * update tdm sample
      
      * modify sample count
      
      * fix conflict
      
      * add set_date
      
      * fix cmake error
      
      * fix bug of proto
      
      * update index_dataset proto
      
      * update cmake
      
      * fix error cmake
      
      * fix cmake mkldnn
      
      * fix cmake proto
      
      * update cmake proto
      
      * update cmake
      
      * update rec
      
      * update dataset
      
      * update dataset
      
      * update dataset
      
      * updata dataset
      
      * updata dataset
      
      * updata coverage
      
      * updata ci
      
      * goback4
      
      * fix npu ci
      
      * add xxhash dep
      4826167c
    • S
      Fix dropout static when axis != None (#37223) · f25fda37
      smallv0221 提交于
      * fix dropout static when axis != None
      
      * update dropout test
      
      * add dropout test
      
      * fix test
      
      * Update test_dropout_op.py
      
      * Update test_dropout_op.py
      
      * fix testcase
      
      * fix testcase
      
      * Update test_dropout_op.py
      
      * fix testcase
      
      * fix testcase
      
      * optimize perf
      
      * add new test
      
      * fix testcase
      f25fda37
  2. 25 11月, 2021 7 次提交
  3. 24 11月, 2021 4 次提交
    • T
      [GpuPs]pybind core (#37287) · d69daed1
      Thunderbrook 提交于
      * pybind core
      
      * set use psgpu
      d69daed1
    • W
      [Paddle-Inference] Matmul_int8_convert: tensor*tensor (#37285) · 16590799
      Wangzheee 提交于
      * matmul_convert_int8
      
      * matmul_convert_int8
      
      * matmulconvert_int8
      
      * Matmul_int8_convert: tensor*tensor
      
      * Matmul_int8_convert: tensor*tensor
      
      * Matmul_int8_convert: tensor*tensor
      16590799
    • Y
      [Auto Parallel] Add the unified cluster representation (#37091) · db727551
      Yulong Ao 提交于
      * [Auto Parallel]  Add the unified cluster representation
      
      * Add the local id for devices
      
      * Add some comments
      db727551
    • 0
      [Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a
      0x45f 提交于
      * run dy2stat pure fp16 in Linear model
      
      * no use self._pure_fp16_inputs
      
      * add test and fix Adam error in dy2stat pure fp16 training
      
      * use paddle.optimizer.Adam
      
      * run test in gpu
      
      * change test time for CI
      
      * enlarge atol for test_resnet_pure_fp16
      
      * refine code and enlarge atol
      
      * make custom_white_list and custom_black_list take effect for AMP and pure fp16
      
      * check tracer is not None
      
      * use default atol
      
      * change filter_size
      
      * change atol and add some NOTE
      52edad6a
  4. 23 11月, 2021 7 次提交
  5. 22 11月, 2021 9 次提交
  6. 19 11月, 2021 6 次提交
    • W
      Add dygraph triple grad test, broadcast case (#37377) · bb2733fa
      Weilong Wu 提交于
      bb2733fa
    • zhouweiwei2014's avatar
      add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
      zhouweiwei2014 提交于
      * add new API paddle.nn.initializer.Orthogonal and calculate_gain
      
      * fix comment
      
      * fix comment
      62ad3594
    • W
      Add fuse_resnet_unit pass (#36818) · 3cd3bf29
      wuhuanzhou 提交于
      * GeneratePass support attr condition and mapping, test=develop
      
      * fix coverage, test=develop
      
      * Add fuse_resnet_unit pass, test=develop
      
      * fix CI errors, test=develop
      
      * fix CI errors, test=develop
      
      * fix unittest error when compiling without CUDA, test=develop
      
      * fix static ci error, test=develop
      
      * limit kernel size must equal 1, test=develop
      3cd3bf29
    • S
      Add paddle.incubate.graph_send_recv API (#37205) · 39012536
      Siming Dai 提交于
      * add cpu version, using set: sum, min, max
      
      * add cpu version: mean
      
      * improve cpu code and fix dynamic memory allcation problem
      
      * fix arg error, add index judge, delete fp16
      
      * fix bug in CudaAtomicMax and CudaAtomicMin
      
      * add CUDA version
      
      * fix grad_op bug for index
      
      * add op test, add correct cpu grad op
      
      * Add correct CUDA Mean grad
      
      * [Add] Successful MEAN and SUM
      
      * [Add] Successful MIN and MAX in CPU
      
      * [Add] Successful MIN and MAX in CUDA
      
      * fix windows dtype ci
      
      * fix ROCM ci by adding HIP flag
      
      * rename fused_gather_scatter to send_recv
      
      * unify name as send and recv
      
      * change zero index return time
      
      * add send_recv incubate api
      
      * fix index data type, add unittest case for API
      
      * delete redundant input tensor
      
      * fix en example and docs, add default value in pool_type
      
      * add shape judge and max grid judge
      
      * fix comment
      
      * fix index type bug
      
      * add const &
      
      * fix en docs
      
      * delete numpy in examples
      
      * add unittest for int input
      
      * fix send_recv comment
      
      * change send_recv to graph_send_recv
      39012536
    • Y
      [fleet_executor] Parse pipeline config (#37319) · ca088f92
      Yuang Liu 提交于
      ca088f92
    • 0
      [Dy2stat]Support `for i in [1,2,3]` statements in dy2stat (#37259) · d772a9aa
      0x45f 提交于
      * support `for i in [1,2,3]` statements in dy2stat
      
      * add test case
      
      * fix ci
      
      * remove wrong code
      d772a9aa
  7. 18 11月, 2021 2 次提交
    • Z
      [heterps]change default executor for heter trainer (#37314) · c98d175d
      zmx 提交于
      * fix pslib. test=develop
      
      * add device to train_from_dataset. test=develop
      
      * refine fleet.stop_worker. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix executor & ut. test=develop
      
      * fix executor & ut. test=develop
      
      * fix executor & ut. test=develop
      c98d175d
    • X
      Optimize fleet elastic scale in/out (#37177) · 6d34d266
      xiayanming 提交于
      * fleet support elastic train
      
      * fleet support elastic train
      
      * support elastic
      
      * add unittest
      
      * fix unitest bug
      
      * fix unittest bug
      
      * fix unittest bug
      
      * fix unittest coverage
      
      * fix unittest coverage
      
      * fix unittest coverage
      
      * fix unittest coverage
      
      * fix unittest coverage
      
      * fix elastic bug
      
      * fix ci fail
      
      * fix ci fail
      
      * fix elastic bug
      
      * fix elastic bug
      
      * fix joint debugging bug
      
      * fix joint debugging bug
      
      * fix windows ci failed
      
      * fix windows ci failed
      
      * Optimize fleet elastic scale in/out
      
      * elastic support pre hook
      
      * add prehook unittest
      6d34d266