1. 24 Oct, 2022 4 commits
  2. 21 Oct, 2022 6 commits
    • [Paddle-TRT] Fix conv2d (#47034) · d42a1dc3
      Committed by zhoutianzi666
      * Forbid Conv2D from entering Paddle-TensorRT when the weight is not persistable.
    • support qat in sharding stage2 (#47169) (#47240) · 281891c5
      Committed by Haohongxiang
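    • [CustomDevice] turn on WITH_CUSTOM_DEVICE when WITH_PYTHON=ON (#47165) · d1fedc54
      Committed by ronnywang
      cherry pick #47108

      Previously, the default on/off value of WITH_CUSTOM_DEVICE followed the ON_INFER switch. Because the training and inference packages are now released together, the training package is built with ON_INFER turned on, which made WITH_CUSTOM_DEVICE default to off and left the custom device feature unavailable.

      The default on/off value of WITH_CUSTOM_DEVICE now follows the WITH_PYTHON switch instead.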
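The change of default described in this commit can be sketched as two small decision functions. This is only an illustration in Python of the policy stated in the commit message, not the project's CMake code. The function names are hypothetical, and the old rule (default is the inverse of ON_INFER) is an assumption inferred from the description above.

```python
def old_default_with_custom_device(on_infer: bool) -> bool:
    # Assumed old policy: WITH_CUSTOM_DEVICE defaults to ON only when ON_INFER is OFF.
    return not on_infer


def new_default_with_custom_device(with_python: bool) -> bool:
    # New policy per the commit title: ON whenever WITH_PYTHON=ON.
    return with_python


# A training package that is now built with ON_INFER=ON and WITH_PYTHON=ON.
training_build = {"ON_INFER": True, "WITH_PYTHON": True}

old = old_default_with_custom_device(training_build["ON_INFER"])
new = new_default_with_custom_device(training_build["WITH_PYTHON"])
print(f"old default: {old}, new default: {new}")
```

Under these assumptions the old rule disables custom device support for such a build, while the new rule keeps it enabled.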
    • add pdsa-2022-001, test=document_fix (#47228) · 001c8a6a
      Committed by Vigi Zhang
      Add PDSA-2022-001 security advisory
    • Add infer prune function (#47047) · 8739497c
      Committed by JingZhuangzhuang
      * Add infer prune function

      * add fusion op
    • Add paddle audio dataset & backend (#45939) (#47230) · 29c9f027
      Committed by YangZhou
      * add audio feature dataset
      
      * fix coding style
      
      * fix coding style2
      
      * rm librosa
      
      * rm voxceleb
      
      * rm librosa in test
      
      * add scipy fftpack
      
      * add functional
      
      * fix setup
      
      * fix setup2
      
      * rm colorlog
      
      * refactor dataset __init__.py
      
      * fix coverage
      
      * fix librosa import error
      
      * fix windows test
      
      * fix windows ci
      
      * rm datasets
      
      * fix setup
      
      * remove testdata
      
      * add librosa in requirement
      
      * add librosa in requirement2
      
      * change librosa to 0.8.1
      
      * update ci docker
      
      * fix ci error
      
      * fix ci error2
      
      * fix ci coverage
      
      * fix coverage
      
      * fix coverage
      
      * rm audio_base in test, notest,test=coverage
      
      * fix copyright
      
      * rm backend
      
      * add dataset in __init__
      
      * rm compliance&&add function test
      
      * fix setup
      
      * fix windows
      
      * fix windows2
      
      * fix test timeout
      
      * add backend & datasets
      
      * fix bugs
      
      * fix ci time issue
      
      * add dataset test
      
      * rm test_audio_feature
      
      * avoid windows issue, tmp

      * note windows issue
      
      * skip windows issue
      
      * refactor dataset test
      
      * add dataset.py
      
      * fix dtype in layers.mfcc
      
      * fix ci-static-check
      
      * fix dtype in layers.mfcc && fix ci-static-check
      
      * add relative accuracy
      
      * modify API.spec
      
      * skip cuda11.2 test
      
      * skip cuda11.2 test2
      
      * skip cuda11.2
      
      * change dataset name
      
      * fix format
      
      * update api.spec
      
      * update api.spec2
      
      * fix coverage
      
      * add dataset test
      
      * rm download load dict
      
      * rm download load dict in init
      
      * update api.spec3
      
      * fix dataset coverage
      
      * fix coverage
      
      * fix coverage2
      
      * restore api.spec
      
      * restore api.spec2
      
      * fix api-spec 3
      
      * fix api-spec 4
      
      * fix api.spec
      
      * fix api.spec6
      
      * refactor init_backend
      
      * fix typo
      
      * change paddleaudio backend set
      
      * fix get_current_audio_backend()
      
      * fix format
      
      * fix format2
      
      * remove format in parameters
      
      * fix format2
      
      * add warning message in wave_backend && remove redundant audio util
      
      * rm audio util in print_signatures
      
      * fix format3
      
      * add tess dataset license
      
      * format warning
      
      * add more info in warning msg
      
      * add paddleaudio version check
      
      * replace dataset esc50 with tess
      
      * add tess dataset && rm numpy transform in dataset.py
      
      * fix set audio backend bug
      
      * fix equal error
      
      * fix format && coverage error
      
      * add api example
      
      * fix format
      
      * fix error
      
      * fix typo
      
      * add noqa in __init__
      
      * fix backend doc example error
      
      * rm seed in dataset
      
      * update backend example
      
      * fix typo
      
      * fix typo
      
      * fix example err
      
      * fix typo
      
      * fix ci dataset test
      
      * fix example fil
      
      * try to fix ci
      
      * clean dataset doc
      
      * change get_current_audio_backend to get_current_backend
      
      * replace paddle.audio.backends.info with paddle.audio.info, same with load, save
      
      * fix ci error
      
      * replace api in test_audio_backend
      
      * fix save&&set_backend example
  3. 20 Oct, 2022 13 commits
  4. 19 Oct, 2022 9 commits
    • [Cherry-Pick][AutoParallel] auto_parallel cherry-pick to release2.4 (#47145) · 90b31790
      Committed by zhaoyingli
      * [Auto Parallel] Make Engine class callable (#46416)
      
      * [Auto Parallel] Improve the user-defined fetches and logging
      
      * [Auto Parallel] Make Engine class callable
      
      * [Auto Parallel] Update the data loading of tuner
      
      * Print IPS in auto parallel Engine (#46554)
      
      * [AutoParallel] fix dist_split (#46505)
      
      * [AutoParallel] fix dist_split
      
      * add unittest
      
      * update cmakelist
      
      * [AutoParallel] fix sharding (#46572)
      
      * [AutoParallel] fix process_mesh (#46583)
      
      * [AutoParallel] fix reshard when train with eval (#46605)
      
      * [AutoParallel] fix reshard when train with eval
      
      * fix mppp
      
      * [AutoParallel] fix amp when predict (#46637)
      
      * [Auto Parallel]Update comp cost and completion for gpt auto search (#46387)
      
      * update comp cost and completion for gpt auto search
      
      * add unittest
      
      * [Auto Parallel] Fix bugs caused by the inconsistent outputs of Engine API (#46633)
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Improve the fine-grained APIs (#46552)
      
      * [Auto Parallel] Support different dataloaders
      
      * [Auto Parallel] Add num_shards config for dataset
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Add the prepare API and replace __call__ with run
      
      * [Auto Parallel] Improve the private implementations of Engine
      
      * [Auto Parallel] Set capacity of dataloader for opt tuning
      
      * [Auto Parallel] [WIP] Change the fine-grained API
      
      * [Auto Parallel] Improve APIs to support different user cases
      
      * [Auto Parallel] Add removed config
      
      * [Auto Parallel] Add imports
      
      * [Auto Parallel] Fix bugs for to_static
      
      * [Auto Parallel] Remove unnecessary imports
      
      * bugfix (#46921)
      
      * [Auto Parallel] Fix the bug for None labels (#46987)
      
      * [AutoParallel] adapt for gpt-gen (#46771)
      
      * for gpt-gen
      
      * fix reshard
      
      * adapt assign and shape op
      
      * add dist_assign & unittest
      
      * add conditional block unittest
      
      * rename unittest
      
      * [Auto Parallel] Fix the bug of completion (#47056)
      
      * [Auto Parallel] Fix the bug for None labels
      
      * [Auto Parallel] Fix the completion bug
      
      * [AutoParallel] add callbacks (#47014)
      
      * [AutoParallel] add callbacks
      
      * fix unittest
      
      * fix dist_context
      
      * fix engine
      
      * fix cmakelist
      
      * fix unittest's returns
      
      * fix cmakelist
      
      * [Auto Parallel] Add cost interface (#47043)
      
      * add cost interface
      
      * update interface and add unittest
      
      * update unittest
      
      * update interface
      
      * [Auto Parallel]Add parallel tuner (#46189)
      
      * add parallel tuner
      
      * add unittest
      
      * fix unittest
      
      * set timeout of unittest
      
      * set unittest timeout
      
      * fix auto_mode setting
      
      * update unittest
      
      * sync from develop and update unittest
      
      * remove unused import
      
      * update unittest
      
      * update cmakelist
      
      * add unittests
      Co-authored-by: Yulong Ao <aoyulong@baidu.com>
      Co-authored-by: Ruibiao Chen <chenruibiao@baidu.com>
      Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
      Co-authored-by: JZ-LIANG <jianzhongliang10@gmail.com>
    • [cherry-pick] strided_slice grad add fp16 support (#47159) · 23f2a4ea
      Committed by Zhang Ting
      * strided_slice grad add fp16 support
    • Add unsigned int8 scale propagation (#46378) (#47156) · 66dccd7d
      Committed by yeliang2258
      * Add unsigned int8 propagation
      
      * Add or modify unit tests
      
      * Correct concat scale checking
      
      * Apply review suggestions
      
      * Corrections
      Co-authored-by: joanna.wozna.intel <joanna.wozna@intel.com>
    • [Dy2Stat]Polish @to_static temporary file directory to speed up transformation (#47102) (#47144) · 5a9befea
      Committed by Aurelius84
      Polish @to_static temporary file directory to speed up transformation
    • [CherryPick] Support TypeHint for function decorated by @to_static (#47147) · 247ef477
      Committed by xiongkun
      * [Dy2Static] Support TypeHint for function decorated by @to_static (#47121)
      
      * Add TypeHint Transformer
      
      * add unittest for typehint transformer
      
      * [Dy2Static] Remove GradTransformer (#47063)
      
      * [Dy2Static] Remove GradTransformer
      1. fix einsum infershape bugs.
      2. remove grad_transformer and unify paddle.grad and paddle.static.gradient.
      3. add dygraph_and_dy2static_only decorator for dy2static.
      
      * fix bugs
      
      * rename
    • Add enable_partial_send_recv switch in pipeline_configs (#46992) (#47083) · 1d015f12
      Committed by Ghost Screaming
      * Fix a bug in the reduce_sum op: when input.numel() > INT32_MAX, its result
      is wrong.

      * Support an allow_partial switch, which can be configured in
      pipeline_configs. If the tensors sent from different hosts are not
      the same, they shouldn't be sent partially and then concatenated
      into a whole tensor.

      * Change name allow_partial to enable_partial_send_recv.

      * Add global variable _enable_partial_send_recv
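The reasoning behind the switch in this commit can be illustrated with a toy simulation. The sketch below is plain Python over lists, not Paddle's pipeline code: the function `send_recv` and its data layout are hypothetical, it assumes the tensor length divides evenly by the number of sender ranks, and the whole-tensor path is simplified to delivering rank 0's tensor.

```python
from typing import List


def send_recv(tensors_per_rank: List[List[int]],
              enable_partial_send_recv: bool) -> List[int]:
    """Simulate what a receiver ends up with.

    tensors_per_rank[r] is the tensor held by sender rank r.
    """
    num_ranks = len(tensors_per_rank)
    if not enable_partial_send_recv:
        # Whole-tensor path (simplified): rank 0's full tensor arrives unchanged.
        return list(tensors_per_rank[0])

    # Partial path: rank r sends only its r-th slice, and the receiver
    # concatenates the slices in rank order.
    chunk = len(tensors_per_rank[0]) // num_ranks
    received: List[int] = []
    for rank, tensor in enumerate(tensors_per_rank):
        received.extend(tensor[rank * chunk:(rank + 1) * chunk])
    return received


identical = [[1, 2, 3, 4], [1, 2, 3, 4]]
different = [[1, 2, 3, 4], [5, 6, 7, 8]]

same_partial = send_recv(identical, enable_partial_send_recv=True)
mixed_partial = send_recv(different, enable_partial_send_recv=True)
whole = send_recv(different, enable_partial_send_recv=False)
print(same_partial, mixed_partial, whole)
```

When every rank holds the same tensor, the concatenated slices reproduce it. When the tensors differ, the partial path yields a mixture that matches no sender's tensor, which is the case the switch lets users turn off.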
    • [Dy2St]Fix recurrent op eager deletion pass error in dy2st (#47105) (#47134) · 69515e90
      Committed by WangZhen
      [CherryPick][Dy2St]Fix recurrent op eager deletion pass error in dy2st
    • update audio api examples (#46938) (#47126) · f08c1041
      Committed by YangZhou
      * update audio api examples
      
      * fix format
      
      * format
      
      * fix
      
      * test api
      
      * fix format
      
      * fix static check error
      
      * fix doc error
      
      * fix ci
      
      * fix api error
      
      * update api.spec
      
      * fix ci
      
      * fix typo in window gaussian
    • [ cherrypick] Construct exec and ctx only once in cond op to speed up (#47012) · fcb9c0b5
      Committed by Hui Zhang
      Construct exec and ctx only once in cond op to speed up
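The general idea of building an expensive object once and reusing it on later runs can be sketched as follows. This is a generic Python illustration of lazy, one-time construction, not the actual cond op implementation; `ExpensiveContext` and `CondOp` are hypothetical stand-ins.

```python
class ExpensiveContext:
    """Stand-in for an executor/context pair that is costly to build."""

    instances_built = 0  # counts how many times construction happened

    def __init__(self) -> None:
        ExpensiveContext.instances_built += 1

    def run(self, branch: str) -> str:
        return f"ran {branch} branch"


class CondOp:
    def __init__(self) -> None:
        self._ctx = None  # built lazily on the first run, then reused

    def run(self, pred: bool) -> str:
        if self._ctx is None:
            self._ctx = ExpensiveContext()
        return self._ctx.run("true" if pred else "false")


op = CondOp()
results = [op.run(pred) for pred in (True, False, True)]
print(results, ExpensiveContext.instances_built)
```

Three runs of the op share a single context, so the construction cost is paid once rather than on every execution.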
  5. 18 Oct, 2022 7 commits
  6. 17 Oct, 2022 1 commit
    • [Cherry-pick] Collective communication APIs (#46922) · 5fba2a98
      Committed by Wen Sun
      * Support both use_calc_stream and sync_op in send recv APIs (#46023)
      
      * Support both use_calc_stream and sync_op in allgather API (#46295)
      
      * Support both use_calc_stream and sync_op in collective communication API (#46761)
      
      * Move group and all reduce from collective to communication (#45848)
      
      * Completes bfloat16 dtype for collective api in eager mode (#45844)
      
      * Fix collective APIs cannot be recognized when building docs (#46962)
      Co-authored-by: LiYuRio <63526175+LiYuRio@users.noreply.github.com>