Created by: xinghai-sun
resolve #90 (closed)
- Refactor the data preprocessor with newly added classes, e.g.
AudioSegment
,SpeechSegment
,TextFeaturizer
,AudioFeaturizer
,SpeechFeaturizer
etc. - Add data augmentation interfaces and classes e.g.
AugmentorBase
,AugmentationPipeline
,VolumePerturbAugmentor
etc., to make it easier to add more data augmentation models. - Separate normalizer's mean-std computing from
DataGenerator
. AddFeatureNormalizer
. - - Add an independent tool
compute_mean_std.py
for users to create mean_std file before training. - Re-organize
data
directory intodatasets
anddata_utils
. - Add module, class, function docs, and update README.md.