1. 20 Jul, 2021 1 commit
      Export parameters for setting power_to_db in LogMel transform (#5326) · 6ec25e3c
      ranchlai committed
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
      
      * fixed import
      
      * fixed duplicate output in logging
      
      * add code examples, shape info, etc
      
      * add doc for public functions
      
      * make backend controllable
      
      * fixed coding stype in docstring
      
      * export parameters for power_to_db LogMel transform
      
      * default to_db to False to be consistent with functional
      
      * fixed typo in docstring
  2. 16 Jul, 2021 1 commit
      Update doc string with examples/shapes, controllable backends, and some bug fixed (#5324) · ad8856aa
      ranchlai committed
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
      
      * fixed import
      
      * fixed duplicate output in logging
      
      * add code examples, shape info, etc
      
      * add doc for public functions
      
      * make backend controllable
      
      * fixed coding stype in docstring
  3. 14 Jul, 2021 1 commit
      Add transform/functional to paddleaudio. (#5319) · a9cd9789
      ranchlai committed
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      
      * add stft,spectrogram, melspectrogram, log-melspectrogram
      
      * add _internal, transoform, functional to imports
      
      * add new module: functional
      
      * add new module: window.py to _internel/
      
      * add correspoding new unit-test for the new modules
      
      * added ISTFT
      
      * clean code and docstring, clean unit test
      
      * clean code and docstring
      
      * functional
      
      * added back preprocessing
      
      * add README.md
      
      * remove preprocessing for now
      
      * clean code, add doc
      
      * change _internal to signal
      
      * add new transoforms
      
      * add new functionals
      
      * add eps to amsoftmax, return the prediction
      
      * add ffmpeg backend
      
      * remove dithering in depth-convert, add ffmpeg to backend
      
      * add Mudecode/enccode/RandomCodec
      
      * changed variable name, fixed bug
      
      * use namedtuple for returning
      
      * refactor utils
      
      * refactor
      
      * add melspectrogram/spectrogram, add doc string
      
      * add doc string, clean code
      
      * rename window to windowing
      
      * updated docstring, minor bug fixed
      
      * move losses.py to future examples
      
      * remove mu_encode/decode
      
      * refactor
      
      * move metrics to future examples
      
      * remove features/
      
      * naming changes for mu law algorithms
      
      * update test, add testing utils
      
      * fixed import
  4. 15 Jun, 2021 1 commit
      Add wav2vec 2.0 (#5313) · ff1273ea
      ranchlai committed
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      
      * added dcase task1b example
      
      * update README.md
      
      * code fixed for last review
      
      * fixed level string formating
      
      * fixed according to PR reviews
      
      * added wav2vec2.0
      
      * restore datatsets
      
      * add liscense, remove scipy, move test_audio to cloud
      
      * remove 3rd-party dependency:pathos
      
      * add testing for wav2vec2
      
      * update README.md
      
      * updated README.md, added librispeech results
      
      * Revert "updated README.md, added librispeech results"
      
      This reverts commit da4012958e8e0bf2d7f4b608f74518583dd7d73b.
      
      * code fixed from reviews
      
      * add librispeech test
      
      * remove pathos imports
      
      * updated README.md
      
      * update README.md
      
      * minor-fix according to code reviews
      
      * updated README_LP.MD
      
      * fixed according to code review
      
      * fixed according to code review
      
      * added preprocessing example
      
      * removed dcase2021_task1b from examples
      
      * remove preprocessing from examples
      
      * added amsoftmax to losses
      
      * added eer/min_dcf to metrics
      
      * updated __init__.py
      Co-authored-by: Nranchlai <=ranchlai@163.com>
  5. 30 May, 2021 1 commit
      Add aishell and librispeech dataset (#5312) · 98fa5803
      KP committed
      * Add aishell and librispeech dataset
      
      * Add aishell and librispeech dataset
      
      * Add aishell and librispeech dataset
      
      * Add UrbanAudioVisualScenes Dataset
      
      * Update features api
  6. 22 May, 2021 1 commit
      PaddleAudio framework alpha version (#5311) · 7260766f
      ranchlai committed
      * added sound classication
      
      * added liscense, clean code, add pre-commit
      
      * update req
      
      * moved to PaddlePaddle-models
      
      * code re-structure
      
      * update README.md
      
      * update README.md
      
      * Update README.md
      
      * add audioset training
      
      * default resample mode to kaiser_fast
      
      * delete some comments
      
      * precommit check
      
      * sha->rev
      
      * add config.ymal
      
      * remove SoundClassification from paddlespeech, since it's in PaddleAudio now
      
      * add labels
      
      * remove old labels
      
      * update code
      
      * empty
      
      * #5300
      
      * add evaluate, etc
      
      * remove trace|
      
      * import evaluate
      
      * path update
      
      * precommit check
      
      * recover slowfast
      
      * restore README.md to paddle:develop
      
      * refactor
      
      * update readme
      
      * update README.md
      
      * refactor
      
      * refactor
      
      * refactor
      
      * refactor
      
      * precommit fixed
      
      * update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update train.py
      
      changed prefixed, removed some comments
      
      * add wav file for testing
      
      * bug fixed eval,new checkpoint map=0.416
      
      * Update README.md
      Co-authored-by: Nranchlai <=ranchlai@163.com>
  7. 30 Apr, 2021 1 commit
  8. 28 Apr, 2021 1 commit
  9. 23 Apr, 2021 1 commit