1. 17 10月, 2021 1 次提交
  2. 14 10月, 2021 1 次提交
    • littletomatodonkey's avatar
      fix readme (#5356) · 58f21273
      littletomatodonkey 提交于
      * fix readme
      
      * fix reprod
      
      * Update ThesisReproduction_CV.md
      
      * fix number
      
      * add grad print example
      
      * add content
      
      * add content
      
      * fix
      58f21273
  3. 12 10月, 2021 1 次提交
  4. 09 10月, 2021 1 次提交
  5. 08 10月, 2021 1 次提交
  6. 29 9月, 2021 1 次提交
  7. 23 9月, 2021 1 次提交
  8. 25 8月, 2021 1 次提交
  9. 15 8月, 2021 1 次提交
    • R
      Update PaddleAudio transforms and functionals (#5334) · 9cab6c61
      ranchlai 提交于
      * added reverb/noisify/AudioReader/RandomChoice/RandomApply
      
      * bug fixed
      
      * transform name changes
      
      * work around for bug in paddle's groupnorm
      
      * upgraded to use float64 inside for high numerical acc
      
      * fixed docstring, add nn.Layer as super for Noisify
      
      * fixed docstring
      
      * added mfcc func/trans and dct function
      
      * updated unit test
      
      * add dtype to control datatype in win function
      
      * add dtype control in transforms
      
      * add dtype control in functionals
      
      * updated test
      
      * added dtype control, updated test
      9cab6c61
  10. 26 7月, 2021 2 次提交
  11. 20 7月, 2021 1 次提交
    • R
      Export parameters for setting power_to_db in LogMel transform (#5326) · 6ec25e3c
      ranchlai 提交于
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
      
      * fixed import
      
      * fixed duplicate output in logging
      
      * add code examples, shape info, etc
      
      * add doc for public functions
      
      * make backend controllable
      
      * fixed coding stype in docstring
      
      * export parameters for power_to_db LogMel transform
      
      * default to_db to False to be consistent with functional
      
      * fixed typo in docstring
      6ec25e3c
  12. 16 7月, 2021 1 次提交
    • R
      Update doc string with examples/shapes, controllable backends, and some bug fixed (#5324) · ad8856aa
      ranchlai 提交于
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
      
      * fixed import
      
      * fixed duplicate output in logging
      
      * add code examples, shape info, etc
      
      * add doc for public functions
      
      * make backend controllable
      
      * fixed coding stype in docstring
      ad8856aa
  13. 14 7月, 2021 1 次提交
    • R
      Add transform/functional to paddleaudio. (#5319) · a9cd9789
      ranchlai 提交于
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
      a9cd9789
  14. 15 6月, 2021 1 次提交
    • R
      Add wav2vec 2.0 (#5313) · ff1273ea
      ranchlai 提交于
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      Co-authored-by: Nranchlai <=ranchlai@163.com>
      ff1273ea
  15. 30 5月, 2021 1 次提交
    • K
      Add aishell and librispeech dataset (#5312) · 98fa5803
      KP 提交于
      * Add aishell and librispeech dataset
      
      * Add aishell and librispeech dataset
      
      * Add aishell and librispeech dataset
      
      * Add UrbanAudioVisualScenes Dataset
      
      * Update features api
      98fa5803
  16. 25 5月, 2021 1 次提交
    • L
      Add SMOKE model (#5308) · bbdb65ed
      Liu Yi 提交于
      * add SMOKE
      
      * add deployment
      
      * add pretrained link.
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update config
      
      * add figure
      
      * fix a typo
      
      * delete unused codes
      
      * add reference links
      
      * resolved problems
      
      * change to 2.1
      Co-authored-by: Nliuyi22 <liuyi22@baidu.com>
      bbdb65ed
  17. 23 5月, 2021 4 次提交
  18. 22 5月, 2021 1 次提交
    • R
      PaddleAudio framework alpha version (#5311) · 7260766f
      ranchlai 提交于
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      Co-authored-by: Nranchlai <=ranchlai@163.com>
      7260766f
  19. 08 5月, 2021 2 次提交
  20. 30 4月, 2021 2 次提交
  21. 28 4月, 2021 1 次提交
  22. 23 4月, 2021 1 次提交
  23. 15 4月, 2021 1 次提交
  24. 23 2月, 2021 1 次提交
  25. 08 2月, 2021 1 次提交
  26. 05 2月, 2021 7 次提交
    • J
      Add Distill Bi-LSTM (#5158) · 4f11f93a
      Jiaqi Liu 提交于
      * add distill lstm
      
      * update distill and small model
      
      * add data augmentation for distill Bi-LSTM
      
      * update readme
      
      * load embedding from pre-trainined files
      
      * add some dataloader, modify model to support QQP
      
      * update README
      
      * upload senta word dict
      
      * fix qqp distill data loader
      
      * update data augmentation method
      
      * Delete senta_word_dict.txt
      
      * fix chinese data augmentation bug
      
      * update some codes according to review, update readme
      
      * use args, update readme, delete run_bert_finetune.py
      
      * fix utils bugs
      
      * add args
      
      * update distill readme
      
      * update readme
      
      * rename readme's headline
      
      * convert bert to uppercase
      4f11f93a
    • K
      optimize lac and msra_ner example (#5270) · 352d9fb8
      kinghuin 提交于
      * optimize lac and msra_ner example
      
      * optimize ner and lac
      
      * rename filename to lexical_analysis_dataset_tiny_path0
      
      * modify lac dataset url
      352d9fb8
    • J
      Add some lr schedulers with warmup (#5256) · fee616e8
      Jiaqi Liu 提交于
      * add -linear-schedule-with-warmup
      
      * update lr scheduler usage in run_glue
      
      * add import info
      
      * add other 5 scheduler with warmup
      
      * add copyright, update usage in run_glue
      
      * simplify argument, make warmup arg support float and int
      
      * Add some LambdaDecay scheduler with warmup, update usage in run_glue
      
      * update classname and unify two cosine decays into one
      
      * update usage in run_glue
      
      * fix typo
      
      * update WarmUp to Warmup, and update class name about Const, and update doc, and usage
      
      * update usage of decay class
      
      * update usage of decay class
      fee616e8
    • J
      add seq2seq infer (#5205) · a22fa4b3
      Jiaqi Liu 提交于
      * add seq2seq infer
      
      * update argument description, remove useless import
      
      * add deploy directory
      
      * add deploy directory, add relative import
      
      * update arg usage in README, fix import order.
      a22fa4b3
    • Z
      Quick fix typo (#5273) · 052da3d7
      Zhong Hui 提交于
      052da3d7
    • X
      Fix DGU bug (#5272) · 82367481
      xiemoyuan 提交于
      82367481
    • L
      Transformer dy2sta and inference support (#5209) · 09490523
      liu zhengxi 提交于
      * dy2sta and inference support
      
      * delete useless code
      
      * update dir
      
      * sys
      
      * delete useless import
      09490523
  27. 04 2月, 2021 2 次提交