Commit 3eb7f5e4 authored by: H HypoX64

Better Data Augmentation

Parent: bb5ad100
README.md
@@ -6,25 +6,69 @@
| English | [中文版](./README_CN.md) |<br><br>
A time series signal analysis and classification framework.<br>
It contains multiple networks and provides data preprocessing, data augmentation, training, evaluation, testing and other functions.<br>
Some output examples: [heatmap](./image/heatmap_eg.png) [running_loss](./image/running_loss_eg.png) [log.txt](./docs/log_eg.txt)<br>
## Features
### Data preprocessing
General signal preprocessing methods.
* Normliaze
5_95 | maxmin | None
* filter
fft | fir | iir | wavelet | None
### Data augmentation
Various data augmentation methods; see the usage example after this list.<br>[[Time Series Data Augmentation for Deep Learning: A Survey]](https://arxiv.org/pdf/2002.12478.pdf)
* Base
scale, warp, app, aaft, iaaft, filp, crop
* Noise
spike, step, slope, white, pink, blue, brown, violet
* Gan
dcgan
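
For example, augmentations are selected by name via the ```--augment``` option defined in [options.py](./util/options.py), e.g. ```--augment scale,warp,white```; the strength of the noise-type augmentations is controlled by ```--augment_noise_lambda```.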
### Network
Various networks for evaluation.
>1d
>
>>lstm, cnn_1d, resnet18_1d, resnet34_1d, multi_scale_resnet_1d, micro_multi_scale_resnet_1d, autoencoder, mlp

>2d (stft spectrum)
>
>>mobilenet, resnet18, resnet50, resnet101, densenet121, densenet201, squeezenet, dfcnn, multi_scale_resnet
### K-fold
Use k-fold to make the results more reliable.
```--k_fold``` & ```--fold_index```<br>
* --k_fold
```python
# fold_num of k-fold. If 0 or 1, no k-fold; cut 80% of the data to train and the rest to eval.
```
* --fold_index
```python
"""--fold_index
When --k_fold != 0 or 1:
Cut dataset into sub-set using index , and then run k-fold with sub-set
If input 'auto', it will shuffle dataset and then cut dataset equally
If input: [2,4,6,7]
when len(dataset) == 10
sub-set: dataset[0:2],dataset[2:4],dataset[4:6],dataset[6:7],dataset[7:]
-------
When --k_fold == 0 or 1:
If input 'auto', it will shuffle dataset and then cut 80% dataset to train and other to eval
If input: [5]
when len(dataset) == 10
train-set : dataset[0:5] eval-set : dataset[5:]
"""
```
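A minimal sketch of how an explicit index list is interpreted (illustrative only, not the framework's actual loader):
```python
import numpy as np

# --k_fold 5 --fold_index [2,4,6,7] with len(dataset) == 10
dataset = np.arange(10)
bounds = [0, 2, 4, 6, 7, 10]
subsets = [dataset[bounds[k]:bounds[k + 1]] for k in range(len(bounds) - 1)]
# Each k-fold run then holds one subset out for eval and trains on the rest.
print(subsets)  # [array([0, 1]), array([2, 3]), array([4, 5]), array([6]), array([7, 8, 9])]
```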
## An example: Use EEG to classify sleep stages
[sleep-edfx](https://github.com/HypoX64/candock/tree/f24cc44933f494d2235b3bf965a04cde5e6a1ae9)<br>
Thanks to [@swalltail99](https://github.com/swalltail99) for catching the bug. In order to load the sleep-edfx dataset, please install mne==0.18.0<br>
```bash
pip install mne==0.18.0
```
## Getting Started
### Prerequisites
- Linux, Windows, macOS
@@ -32,11 +76,11 @@ pip install mne==0.18.0
- Python 3
- PyTorch 1.0+
### Dependencies
This code depends on torchvision, numpy, scipy, pywt and matplotlib, all available via pip install.<br>
For example:<br>
```bash
pip install matplotlib
```
### Clone this repo:
```bash
@@ -64,7 +108,7 @@ python3 simple_test.py --label 50 --input_nc 1 --model_name micro_multi_scale_re
## Training with your own dataset
* step1: Generate signals.npy and labels.npy in the following format.
```python
#1.type:numpydata   signals:np.float32   labels:np.int64
#2.shape  signals:[num,ch,length]   labels:[num]
#num:samples_num, ch:channel_num, length:length of each sample
#for example:
@@ -73,28 +117,4 @@ labels = np.array([0,0,0,0,0,1,1,1,1,1]) #0->class0 1->class1
```
* step2: input ```--dataset_dir "your_dataset_dir"``` when running code.
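
As a minimal sketch, a toy dataset in this format could be generated like this (shapes and dtypes follow step1; the directory is whatever you pass to ```--dataset_dir```):
```python
import numpy as np

# 10 single-channel samples of length 3000, two balanced classes (toy data)
signals = np.random.randn(10, 1, 3000).astype(np.float32)
labels = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1], dtype=np.int64)

np.save('your_dataset_dir/signals.npy', signals)
np.save('your_dataset_dir/labels.npy', labels)
```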
### [More training options](./util/options.py)
README_CN.md
@@ -6,18 +6,61 @@
| [English](./README.md) | 中文版 |<br><br>
A general framework for one-dimensional time-series signal analysis and classification.<br>
It contains multiple network architectures and provides data preprocessing, data augmentation, training, evaluation, testing and other functions.<br>
Some example outputs from training: [heatmap](./image/heatmap_eg.png) [running_loss](./image/running_loss_eg.png) [log.txt](./docs/log_eg.txt)<br>
## Features
### Data preprocessing
General signal preprocessing methods.
* Normliaze
5_95 | maxmin | None
* filter
fft | fir | iir | wavelet | None
### Data augmentation
A variety of data augmentation methods. Note: choose them according to the physical characteristics of your data.<br>[[Time Series Data Augmentation for Deep Learning: A Survey]](https://arxiv.org/pdf/2002.12478.pdf)
* Base
scale, warp, app, aaft, iaaft, filp, crop
* Noise
spike, step, slope, white, pink, blue, brown, violet
* Gan
dcgan
### Network
Various networks for evaluation.
>1d
>
>>lstm, cnn_1d, resnet18_1d, resnet34_1d, multi_scale_resnet_1d, micro_multi_scale_resnet_1d, autoencoder, mlp

>2d (stft spectrum)
>
>>mobilenet, resnet18, resnet50, resnet101, densenet121, densenet201, squeezenet, dfcnn, multi_scale_resnet
### K-fold
Use k-fold to make the results more reliable.
```--k_fold``` & ```--fold_index```<br>
* --k_fold
```python
# fold_num of k-fold. If 0 or 1, no k-fold; cut 80% of the data to train and the rest to eval.
```
* --fold_index
```python
"""--fold_index
When --k_fold != 0 or 1:
Cut dataset into sub-set using index , and then run k-fold with sub-set
If input 'auto', it will shuffle dataset and then cut dataset equally
If input: [2,4,6,7]
when len(dataset) == 10
sub-set: dataset[0:2],dataset[2:4],dataset[4:6],dataset[6:7],dataset[7:]
-------
When --k_fold == 0 or 1:
If input 'auto', it will shuffle dataset and then cut 80% dataset to train and other to eval
If input: [5]
when len(dataset) == 10
train-set : dataset[0:5] eval-set : dataset[5:]
"""
```
## An example on EEG sleep-stage data
To adapt to new projects, the code has been changed substantially and can no longer run sleep datasets such as sleep-edfx out of the box. If you still need to run them, please load the data yourself following the input format described below; I will fix this when I have time.
Of course, you can also load the sleep datasets directly with the [old version](https://github.com/HypoX64/candock/tree/f24cc44933f494d2235b3bf965a04cde5e6a1ae9)<br>
@@ -75,27 +118,4 @@ labels = np.array([0,0,0,0,0,1,1,1,1,1]) #0->class0 1->class1
```
* step2: pass ```--dataset_dir "your_dataset_dir"``` when running the code.
### [More options](./util/options.py)
data/augmenter.py
@@ -2,24 +2,29 @@ import os
import time
import random  # needed by base2d below
import numpy as np
import scipy.signal
import scipy.fftpack as fftpack
import pywt
import torch
from torch import nn, optim
from multiprocessing import Process, Queue
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt
# import torch.multiprocessing as mp
import warnings
warnings.filterwarnings("ignore")
import sys
sys.path.append("..")
from util import util,plot,options,dsp
from util import array_operation as arr
from . import transforms,dataloader,statistics,surrogates,noise
from models.net_1d.gan import Generator,Discriminator,GANloss,weights_init_normal
from models.core import show_paramsnumber

def dcgan(opt,signals,labels):
    print('Augment dataset using gan...')
    if opt.gpu_id != -1:
        os.environ["CUDA_VISIBLE_DEVICES"] = str(opt.gpu_id)
@@ -118,9 +123,113 @@ def gan(opt,signals,labels):
    # return signals,labels
    return out_signals,out_labels
def base1d(opt,data,test_flag):
    """
    data : batchsize,ch,length
    """
    batchsize,ch,length = data.shape
    random_list = np.random.rand(15)
    threshold = 1/(len(opt.augment)+1)

    if test_flag:
        move = int((length-opt.finesize)*0.5)
        result = data[:,:,move:move+opt.finesize]
    else:
        result = np.zeros((batchsize,ch,opt.finesize))
        for i in range(batchsize):
            for j in range(ch):
                signal = data[i][j]
                _length = length

                # Time Domain
                if 'scale' in opt.augment and random_list[0]>threshold:
                    beta = np.clip(np.random.normal(1, 0.1),0.8,1.2)
                    signal = arr.interp(signal, int(_length*beta))
                    _length = signal.shape[0]
                if 'warp' in opt.augment and random_list[1]>threshold:
                    pos = np.sort(np.random.randint(0, _length, 2))
                    if pos[1]-pos[0]>10:
                        beta = np.clip(np.random.normal(1, 0.1),0.8,1.2)
                        signal = np.concatenate((signal[:pos[0]], arr.interp(signal[pos[0]:pos[1]], int((pos[1]-pos[0])*beta)), signal[pos[1]:]))
                        _length = signal.shape[0]

                # Noise
                if 'spike' in opt.augment and random_list[2]>threshold:
                    std = np.std(signal)
                    spike_indexs = np.random.randint(0, _length, int(_length*np.clip(np.random.uniform(0,0.05),0,1)))
                    for index in spike_indexs:
                        signal[index] = signal[index] + std*np.random.randn()*opt.augment_noise_lambda
                if 'step' in opt.augment and random_list[3]>threshold:
                    std = np.std(signal)
                    step_indexs = np.random.randint(0, _length, int(_length*np.clip(np.random.uniform(0,0.01),0,1)))
                    for index in step_indexs:
                        signal[index:] = signal[index:] + std*np.random.randn()*opt.augment_noise_lambda
                if 'slope' in opt.augment and random_list[4]>threshold:
                    slope = np.linspace(-1, 1, _length)*np.random.randn()
                    signal = signal+slope*opt.augment_noise_lambda
                if 'white' in opt.augment and random_list[5]>threshold:
                    signal = signal+noise.noise(_length,'white')*(np.std(signal)*np.random.randn()*opt.augment_noise_lambda)
                if 'pink' in opt.augment and random_list[6]>threshold:
                    signal = signal+noise.noise(_length,'pink')*(np.std(signal)*np.random.randn()*opt.augment_noise_lambda)
                if 'blue' in opt.augment and random_list[7]>threshold:
                    signal = signal+noise.noise(_length,'blue')*(np.std(signal)*np.random.randn()*opt.augment_noise_lambda)
                if 'brown' in opt.augment and random_list[8]>threshold:
                    signal = signal+noise.noise(_length,'brown')*(np.std(signal)*np.random.randn()*opt.augment_noise_lambda)
                if 'violet' in opt.augment and random_list[9]>threshold:
                    signal = signal+noise.noise(_length,'violet')*(np.std(signal)*np.random.randn()*opt.augment_noise_lambda)

                # Frequency Domain
                if 'app' in opt.augment and random_list[10]>threshold:
                    # amplitude and phase perturbations
                    signal = surrogates.app(signal)
                if 'aaft' in opt.augment and random_list[11]>threshold:
                    # Amplitude Adjusted Fourier Transform
                    signal = surrogates.aaft(signal)
                if 'iaaft' in opt.augment and random_list[12]>threshold:
                    # Iterative Amplitude Adjusted Fourier Transform
                    signal = surrogates.iaaft(signal,10)[0]

                # crop and filp
                if 'filp' in opt.augment and random_list[13]>threshold:
                    signal = signal[::-1]
                if _length >= opt.finesize:
                    move = int((_length-opt.finesize)*np.random.random())
                    signal = signal[move:move+opt.finesize]
                else:
                    signal = arr.pad(signal, opt.finesize-_length, mod = 'repeat')

                result[i,j] = signal

    return result
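
# Usage sketch (illustrative, not from the original code): base1d expects `opt`
# to carry at least opt.augment (list of augmentation names),
# opt.augment_noise_lambda and opt.finesize, and `data` shaped
# [batchsize, ch, length], e.g.:
#   batch = np.random.randn(4, 1, 3000)
#   out = base1d(opt, batch, test_flag=False)   # -> shape (4, 1, opt.finesize)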
def base2d(img,finesize = (224,244),test_flag = True):
    h,w = img.shape[:2]
    if test_flag:
        h_move = int((h-finesize[0])*0.5)
        w_move = int((w-finesize[1])*0.5)
        result = img[h_move:h_move+finesize[0],w_move:w_move+finesize[1]]
    else:
        #random crop
        h_move = int((h-finesize[0])*random.random())
        w_move = int((w-finesize[1])*random.random())
        result = img[h_move:h_move+finesize[0],w_move:w_move+finesize[1]]
        #random flip
        if random.random()<0.5:
            result = result[:,::-1]
        #random amp
        result = result*random.uniform(0.9,1.1)+random.uniform(-0.05,0.05)
    return result
def augment(opt,signals,labels):
    pass
data/dataloader.py
@@ -5,8 +5,11 @@ import random
import scipy.io as sio
import numpy as np
import sys
sys.path.append("..")
from . import transforms,statistics
from util import dsp
from util import array_operation as arr

def del_labels(signals,labels,dels):
@@ -23,7 +26,7 @@ def del_labels(signals,labels,dels):
def segment_traineval_dataset(signals,labels,a=0.8,random=True):
    length = len(labels)
    if random:
        transforms.shuffledata(signals, labels)
    signals_train = signals[:int(a*length)]
    labels_train = labels[:int(a*length)]
    signals_eval = signals[int(a*length):]
@@ -123,6 +126,6 @@ def loaddataset(opt):
    signals = new_signals
    if opt.fold_index == 'auto':
        transforms.shuffledata(signals,labels)
    return signals.astype(np.float32),labels.astype(np.int64)
"""
clone from
https://github.com/scivision/soothing-sounds
https://github.com/python-acoustics
LICENSE: GPL-3.0
"""
"""
# Generator
The generator module provides signal generators.
The following functions calculate `N` samples and return an array containing the samples.
For indefinitely long iteration over the samples, consider using the output of these functions in `itertools.cycle`.
## Noise
In general, noise with spectrum S(f) is generated by taking uniform white noise
and filtering with filter response H(f) to get the desired noise spectrum.
Color | Power/octave | Power density/octave
-------|-------|--------------
White | +3 dB | 0 dB
Pink | 0 dB | -3 dB
Blue | +6 dB | +3 dB
Brown | -3 dB | -6 dB
Violet | +9 dB | +6 dB
-------|-------|--------------
"""
import numpy as np
import scipy
from scipy.fftpack import rfft, irfft


def ms(x: np.ndarray) -> np.ndarray:
    return (np.abs(x)**2).mean()


def rms(x: np.ndarray) -> np.ndarray:
    return np.sqrt(ms(x))


def normalise(y: np.ndarray, x: np.ndarray = 1.) -> np.ndarray:
    return y * np.sqrt(ms(x) / ms(y))


def noise(N: int, color: str = 'white') -> np.ndarray:
    """Noise generator.
    * N: Amount of samples.
    * color: Color of noise.
    https://github.com/python-acoustics
    """
    noise_generators = {
        'white': white,
        'pink': pink,
        'blue': blue,
        'brown': brown,
        'violet': violet
    }
    return noise_generators[color](N)
def white(N: int) -> np.ndarray:
    """
    White noise.
    * N: Amount of samples.
    White noise has a constant power density.
    Its narrowband spectrum is therefore flat.
    The power in white noise will increase by a factor of two for each octave band,
    and therefore increases with 3 dB per octave.
    https://github.com/python-acoustics
    """
    return np.random.randn(N).astype(np.float32)


def pink(N: int) -> np.ndarray:
    """
    Pink noise.
    * N: Amount of samples.
    Pink noise has equal power in bands that are proportionally wide.
    Power density decreases with 3 dB per octave.
    https://github.com/python-acoustics
    """
    # This method uses a filter with the following coefficients:
    # b = np.array([0.049922035, -0.095993537, 0.050612699, -0.004408786])
    # a = np.array([1, -2.494956002, 2.017265875, -0.522189400])
    # return lfilter(b, a, np.random.randn(N))
    # Another way is using the FFT:
    x = white(N)
    X = rfft(x) / N
    S = np.sqrt(np.arange(X.size)+1.)  # +1 to avoid divide by zero
    y = irfft(X/S).real[:N]
    return normalise(y)  # values are extremely tiny (~1e-9) without normalisation


def blue(N: int) -> np.ndarray:
    """
    Blue noise.
    * N: Amount of samples.
    Power increases with 6 dB per octave.
    Power density increases with 3 dB per octave.
    https://github.com/python-acoustics
    """
    x = white(N)
    X = rfft(x) / N
    S = np.sqrt(np.arange(X.size))  # Filter
    y = irfft(X*S).real[:N]
    return normalise(y)


def brown(N: int) -> np.ndarray:
    """
    Brown noise.
    * N: Amount of samples.
    Power decreases with -3 dB per octave.
    Power density decreases with 6 dB per octave.
    https://github.com/python-acoustics
    """
    x = white(N)
    X = rfft(x) / N
    S = np.arange(X.size)+1  # Filter
    y = irfft(X/S).real[:N]
    return normalise(y)


def violet(N: int) -> np.ndarray:
    """
    Violet noise.
    * N: Amount of samples.
    Power increases with +9 dB per octave.
    Power density increases with +6 dB per octave.
    https://github.com/python-acoustics
    """
    x = white(N)
    X = rfft(x) / N
    S = np.arange(X.size)  # Filter
    y = irfft(X*S).real[0:N]
    return normalise(y)
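

if __name__ == '__main__':
    # Illustrative self-check (added; not part of the cloned module): every
    # colour is normalised to roughly unit RMS, so it can be mixed into a
    # signal at a chosen level, e.g. x + 0.1*np.std(x)*noise(len(x), 'pink').
    for color in ('white', 'pink', 'blue', 'brown', 'violet'):
        y = noise(8192, color)
        print(color, y.shape, round(float(rms(y)), 3))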
data/statistics.py
import numpy as np
import os
import sys
sys.path.append("..")
from util import plot,util

def label_statistics(labels):
    labels = (np.array(labels)).astype(np.int64)
data/surrogates.py
# -*- coding: utf-8 -*-
# This code is cloned from https://github.com/manu-mannattil/nolitsa
# BSD-3 LICENSE : https://github.com/manu-mannattil/nolitsa/blob/master/LICENSE
# Changed np.fft to scipy.fftpack
"""Functions to generate surrogate series.

This module provides a set of functions to generate surrogate series
from a given time series using multiple algorithms.

Surrogates Generation
---------------------
* ft -- generates Fourier transform surrogates.
* aaft -- generates amplitude adjusted Fourier transform surrogates.
* iaaft -- generates iterative amplitude adjusted Fourier transform
  surrogates.

Utilities
---------
* mismatch -- finds the segment of a time series with the least
  end-point mismatch.
"""
import numpy as np
import scipy.fftpack  # plain `import scipy` does not guarantee the fftpack submodule is loaded


def rescale(x, interval=(0, 1)):
    """Rescale the given scalar time series into a desired interval.

    Rescales the given scalar time series into a desired interval using
    a simple linear transformation.

    Parameters
    ----------
    x : array_like
        Scalar time series.
    interval: tuple, optional (default = (0, 1))
        Extent of the interval specified as a tuple.

    Returns
    -------
    y : array
        Rescaled scalar time series.
    """
    x = np.asarray(x)
    if interval[1] == interval[0]:
        raise ValueError('Interval must have a nonzero length.')
    return (interval[0] + (x - np.min(x)) * (interval[1] - interval[0]) /
            (np.max(x) - np.min(x)))
def ft(x):
    """Return simple Fourier transform surrogates.

    Returns phase randomized (FT) surrogates that preserve the power
    spectrum (or equivalently the linear correlations), but completely
    destroy the probability distribution.

    Parameters
    ----------
    x : array
        Real input array containing the time series.

    Returns
    -------
    y : array
        Surrogates with the same power spectrum as x.
    """
    y = scipy.fftpack.rfft(x)
    phi = 2 * np.pi * np.random.random(len(y))
    phi[0] = 0.0
    if len(x) % 2 == 0:
        phi[-1] = 0.0
    y = y * np.exp(1j * phi)
    return scipy.fftpack.irfft(np.real(y), n=len(x))
def aaft(x):
    """Return amplitude adjusted Fourier transform surrogates.

    Returns phase randomized, amplitude adjusted (AAFT) surrogates with
    crudely the same power spectrum and distribution as the original
    data (Theiler et al. 1992).  AAFT surrogates are used in testing
    the null hypothesis that the input series is correlated Gaussian
    noise transformed by a monotonic time-independent measuring
    function.

    Parameters
    ----------
    x : array
        1-D input array containing the time series.

    Returns
    -------
    y : array
        Surrogate series with (crudely) the same power spectrum and
        distribution.
    """
    # Generate uncorrelated Gaussian random numbers.
    y = np.random.normal(size=len(x))

    # Introduce correlations in the random numbers by rank ordering.
    y = np.sort(y)[np.argsort(np.argsort(x))]
    y = ft(y)

    return np.sort(x)[np.argsort(np.argsort(y))]
def iaaft(x, maxiter=1000, atol=1e-8, rtol=1e-10):
    """Return iterative amplitude adjusted Fourier transform surrogates.

    Returns phase randomized, amplitude adjusted (IAAFT) surrogates with
    the same power spectrum (to a very high accuracy) and distribution
    as the original data using an iterative scheme (Schreiber & Schmitz
    1996).

    Parameters
    ----------
    x : array
        1-D real input array of length N containing the time series.
    maxiter : int, optional (default = 1000)
        Maximum iterations to be performed while checking for
        convergence.  The scheme may converge before this number as
        well (see Notes).
    atol : float, optional (default = 1e-8)
        Absolute tolerance for checking convergence (see Notes).
    rtol : float, optional (default = 1e-10)
        Relative tolerance for checking convergence (see Notes).

    Returns
    -------
    y : array
        Surrogate series with (almost) the same power spectrum and
        distribution.
    i : int
        Number of iterations that have been performed.
    e : float
        Root-mean-square deviation (RMSD) between the absolute squares
        of the Fourier amplitudes of the surrogate series and that of
        the original series.

    Notes
    -----
    To check if the power spectrum has converged, we see if the absolute
    difference between the current (cerr) and previous (perr) RMSDs is
    within the limits set by the tolerance levels, i.e., if abs(cerr -
    perr) <= atol + rtol*perr.  This follows the convention used in
    the NumPy function numpy.allclose().

    Additionally, atol and rtol can be both set to zero in which
    case the iterations end only when the RMSD stops changing or when
    maxiter is reached.
    """
    # Calculate the "true" Fourier amplitudes and sort the series.
    ampl = np.abs(scipy.fftpack.rfft(x))
    sort = np.sort(x)

    # Previous and current error.
    perr, cerr = (-1, 1)

    # Start with a random permutation.
    t = scipy.fftpack.rfft(np.random.permutation(x))

    for i in range(maxiter):
        # Match power spectrum.
        s = np.real(scipy.fftpack.irfft(ampl * t / np.abs(t), n=len(x)))

        # Match distribution by rank ordering.
        y = sort[np.argsort(np.argsort(s))]
        t = scipy.fftpack.rfft(y)
        cerr = np.sqrt(np.mean((ampl ** 2 - np.abs(t) ** 2) ** 2))

        # Check convergence.
        if abs(cerr - perr) <= atol + rtol * abs(perr):
            break
        else:
            perr = cerr

    # Normalize error w.r.t. mean of the "true" power spectrum.
    return y, i, cerr / np.mean(ampl ** 2)
def app(x,alpha=1.0):
    """Amplitude and phase perturbations.

    Parameters
    ----------
    x : array
        1-D real input array of length N containing the time series.
    alpha : float
        Strength of the perturbation.

    Returns
    -------
    y : array
        1-D output time series.
    """
    length = x.shape[0]
    x_fft = scipy.fftpack.fft(x)
    amp = np.abs(x_fft)
    phase = np.angle(x_fft)
    amp_std = np.std(amp)
    phase_std = np.std(phase)

    pos_a = np.sort(np.random.randint(0, length, 2))
    pos_p = np.sort(np.random.randint(0, length, 2))
    if pos_a[1]-pos_a[0]>10 and pos_p[1]-pos_p[0]>10:
        amp[pos_a[0]:pos_a[1]] = amp[pos_a[0]:pos_a[1]] + np.random.normal(0,alpha,pos_a[1]-pos_a[0])*amp_std
        phase[pos_p[0]:pos_p[1]] = phase[pos_p[0]:pos_p[1]] + np.random.normal(0,alpha,pos_p[1]-pos_p[0])*phase_std

    fft_re = amp*np.exp(1j*phase)
    y = scipy.fftpack.ifft(fft_re)
    return np.real(y)
def mismatch(x, length=None, weight=0.5, neigh=3):
    """Find the segment that minimizes end-point mismatch.

    Finds the segment in the time series that has minimum end-point
    mismatch.  To do this we calculate the mismatch between the end
    points of all segments of the given length and pick the segment with
    the least mismatch (Ehlers et al. 1998).  We also enforce the
    condition that the difference between the first derivatives at the
    end points must be a minimum.

    Parameters
    ----------
    x : array
        Real input array containing the time series.
    length : int, optional
        Length of segment.  By default the largest possible length which
        is a power of one of the first five primes is selected.
    weight : float, optional (default = 0.5)
        Weight given to discontinuity in the first difference of the
        time series.  Must be between 0 and 1.
    neigh : int, optional (default = 3)
        Number of end points over which the discontinuity statistic
        should be computed.

    Returns
    -------
    ends : tuple
        Indices of the end points of the segment.
    d : float
        Discontinuity statistic for the segment.

    Notes
    -----
    Both the time series and its first difference are linearly rescaled
    to [0, 1].  Thus the discontinuity statistic varies between 0 and 1
    (0 means no discontinuity and 1 means maximum discontinuity).
    """
    # Calculate the first difference of the time series and rescale it
    # to [0, 1].
    dx = rescale(np.diff(x))
    x = rescale(x)[1:]

    n = len(x)
    if not length:
        primes = np.array([2, 3, 5, 7, 11])
        i = np.argmax(primes ** np.floor(np.log(n) / np.log(primes)) - n)
        length = int(primes[i] ** (np.floor(np.log(n) / np.log(primes[i]))))

    d = np.zeros(n - (length + neigh))
    for i in np.arange(n - (length + neigh)):
        d[i] = ((1 - weight) * (np.mean((x[i:i + neigh] -
                x[i + length:i + length + neigh]) ** 2.0)) +
                weight * (np.mean((dx[i:i + neigh] -
                dx[i + length:i + length + neigh]) ** 2.0)))

    return (1 + np.argmin(d), 1 + np.argmin(d) + length), np.min(d)
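
# Usage sketch (illustrative): surrogates keep chosen properties of x while
# randomizing the rest, e.g.
#   x = np.random.randn(1024)
#   y = aaft(x)                       # same value distribution as x
#   z, n_iter, err = iaaft(x, 100)    # spectrum matched to within RMSD `err`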
############### Code I reimplemented myself; this iaaft is wrong ###############
"""
def rank_like(src,dst):
    src = np.sort(src)
    sort_index = np.argsort(dst)
    src_new = np.zeros_like(src)
    src_new[sort_index] = src[:]
    return src_new

def aaft(signal):
    # step 1
    Xs = np.random.randn(len(signal))
    # step 2
    Xs = rank_like(Xs, signal)
    # step 3
    Xs_fft = fft(Xs)
    Xs_angle = np.angle(Xs_fft)
    np.random.shuffle(Xs_angle)
    Xs_fft_re = np.abs(Xs_fft)*np.exp(1j*Xs_angle)
    Xs_re = ifft(Xs_fft_re)
    # step 4
    signal_new = rank_like(signal, Xs_re)
    return signal_new

def iaaft(signal,iter=10):
    Ck = np.argsort(signal)
    Ak = fft(signal)
    Ak_abs = np.abs(Ak)
    #Pk = np.angle(X_fft)
    Sn = aaft(signal)
    for i in range(iter):
        Sk = fft(Sn)
        Sk_angle = np.angle(Sk)
        Sk_1 = Ak_abs*np.exp(1j*Sk_angle)
        Sn = ifft(Sk_1)
        Sn = rank_like(Sn, signal)
    return Sn
"""
data/transforms.py
@@ -2,15 +2,18 @@ import os
import random
import numpy as np
import torch

import sys
sys.path.append("..")
from util import dsp
from util import array_operation as arr
from . import augmenter

def shuffledata(data,target):
    state = np.random.get_state()
    np.random.shuffle(data)
    np.random.set_state(state)
    np.random.shuffle(target)
    # return data,target

def k_fold_generator(length,fold_num,fold_index = 'auto'):
    sequence = np.linspace(0,length-1,length,dtype='int')
@@ -45,93 +48,38 @@ def batch_generator(data,target,sequence,shuffle = True):
    for i in range(batchsize):
        out_data[i] = data[sequence[i]]
        out_target[i] = target[sequence[i]]
    return out_data,out_target

def ToTensor(data,target=None,gpu_id=0):
    if target is not None:
        data = torch.from_numpy(data)
        target = torch.from_numpy(target)
        if gpu_id != -1:
            data = data.cuda()
            target = target.cuda()
        return data,target
    else:
        data = torch.from_numpy(data)
        if gpu_id != -1:
            data = data.cuda()
        return data
def ToInputShape(opt,data,test_flag = False):
    if opt.model_type == '1d':
        result = augmenter.base1d(opt, data, test_flag = test_flag)
    elif opt.model_type == '2d':
        _batchsize,_ch,_size = data.shape
        result = []
        h,w = opt.stft_shape
        for i in range(_batchsize):
            for j in range(opt.input_nc):
                spectrum = dsp.signal2spectrum(data[i][j],opt.stft_size,opt.stft_stride, opt.stft_n_downsample, not opt.stft_no_log)
                spectrum = augmenter.base2d(spectrum,(h,int(w*0.9)),test_flag=test_flag)
                result.append(spectrum)
        result = (np.array(result)).reshape(_batchsize,opt.input_nc,h,int(w*0.9))

    return result.astype(np.float32)
models/core.py
@@ -11,7 +11,8 @@ warnings.filterwarnings("ignore")
import sys
sys.path.append("..")

from util import util,plot,options
from data import augmenter,transforms,dataloader,statistics
from . import creatnet

def show_paramsnumber(net,opt):
@@ -74,10 +75,10 @@ class Core(object):
        _times = np.ceil(len(sequences)/self.opt.batchsize).astype(np.int)
        for i in range(_times):
            if i != _times-1:
                signal,label = transforms.batch_generator(signals, labels, sequences[i*self.opt.batchsize:(i+1)*self.opt.batchsize])
            else:
                signal,label = transforms.batch_generator(signals, labels, sequences[i*self.opt.batchsize:])
            signal = transforms.ToInputShape(self.opt,signal,test_flag=self.test_flag)
            self.queue.put([signal,label])

    def start_process(self,signals,labels,sequences):
@@ -130,7 +131,7 @@ class Core(object):
            self.optimizer.zero_grad()
            signal,label = self.queue.get()
            signal,label = transforms.ToTensor(signal,label,gpu_id=self.opt.gpu_id)
            output,loss,features,confusion_mat = self.forward(signal, label, features, confusion_mat)
            epoch_loss += loss.item()
@@ -138,7 +139,8 @@ class Core(object):
            self.optimizer.step()

        self.plot_result['train'].append(epoch_loss/(i+1))
        if self.epoch%10 == 0:
            plot.draw_loss(self.plot_result,self.epoch+(i+1)/(sequences.shape[0]/self.opt.batchsize),self.opt)
        # if self.opt.model_name != 'autoencoder':
        #     plot.draw_heatmap(confusion_mat,self.opt,name = 'current_train')
@@ -153,7 +155,7 @@ class Core(object):
        self.process_pool_init(signals, labels, sequences)
        for i in range(np.ceil(len(sequences)/self.opt.batchsize).astype(np.int)):
            signal,label = self.queue.get()
            signal,label = transforms.ToTensor(signal,label,gpu_id=self.opt.gpu_id)
            with torch.no_grad():
                output,loss,features,confusion_mat = self.forward(signal, label, features, confusion_mat)
            epoch_loss += loss.item()
models/net_1d/lstm.py
@@ -23,7 +23,7 @@ class lstm_block(nn.Module):
        return x

class lstm(nn.Module):
    def __init__(self,input_size,time_step,input_nc,num_classes,Hidden_size=256,Num_layers=3):
        super(lstm, self).__init__()
        self.input_size=input_size
        self.time_step=time_step
@@ -31,7 +31,7 @@ class lstm(nn.Module):
        self.point = input_size*time_step
        for i in range(input_nc):
            exec('self.lstm'+str(i) + '=lstm_block(input_size, time_step, '+str(Hidden_size)+','+str(Num_layers)+')')
        self.fc = nn.Linear(Hidden_size*input_nc, num_classes)

    def forward(self, x):
simple_test.py
@@ -3,7 +3,8 @@ import numpy as np
import torch
import matplotlib.pyplot as plt

from util import util,options
from data import augmenter,transforms,dataloader,statistics
from models import creatnet

'''
@@ -24,9 +25,9 @@ if not opt.gpu_id:
    net.cuda()

for signal,true_label in zip(signals, labels):
    signal = signal.reshape(1,1,-1).astype(np.float32)  # batchsize,ch,length
    true_label = true_label.reshape(1).astype(np.int64)  # batchsize
    signal,true_label = transforms.ToTensor(signal,true_label,gpu_id=opt.gpu_id)
    out = net(signal)
    pred_label = torch.max(out, 1)[1]
    pred_label = pred_label.data.cpu().numpy()
@@ -7,7 +7,9 @@ import numpy as np
import sys
sys.path.append("..")

from util import util,plot,options
from data import augmenter,transforms,dataloader,statistics
from util import array_operation as arr
from models import creatnet,core
train.py
@@ -7,15 +7,15 @@ from torch import nn, optim
import warnings
warnings.filterwarnings("ignore")

from util import util,plot,options
from data import augmenter,transforms,dataloader,statistics
from models import core

opt = options.Options().getparse()

"""Use your own data to train
* step1: Generate signals.npy and labels.npy in the following format.
# 1.type:numpydata   signals:np.float32   labels:np.int64
# 2.shape  signals:[num,ch,length]  labels:[num]
# num:samples_num, ch:channel_num, length:length of each sample
# for example:
@@ -28,11 +28,11 @@ labels = np.array([0,0,0,0,0,1,1,1,1,1]) #0->class0 1->class1

t1 = time.time()
signals,labels = dataloader.loaddataset(opt)
if opt.gan:
    signals,labels = augmenter.dcgan(opt,signals,labels)
label_cnt,label_cnt_per,label_num = statistics.label_statistics(labels)
util.writelog('label statistics: '+str(label_cnt),opt,True)
opt = options.get_auto_options(opt, signals, labels)
train_sequences,eval_sequences = transforms.k_fold_generator(len(labels),opt.k_fold,opt.fold_index)
t2 = time.time()
print('Cost time: %.2f'% (t2-t1),'s')
util/array_operation.py
@@ -17,11 +17,18 @@ def pad(data,padding,mod='zero'):
        for i in range(repeat_num):
            out_data = np.append(out_data, data)
        pad_data = data[:padding-repeat_num*len(data)]
        out_data = np.append(out_data, pad_data)
        return out_data

    elif mod == 'reflect':
        length = data.shape[0]
        pad_data = data[::-1][:padding]
        out_data = np.append(data, pad_data)
        if padding < length:
            return out_data
        else:
            return pad(out_data,padding-length,mod='reflect')
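
# Usage sketch (illustrative), assuming out_data starts as the input array:
#   pad(np.array([1, 2, 3]), 4, mod='repeat')  -> [1 2 3 1 2 3 1]
#   pad(np.array([1, 2, 3]), 2, mod='reflect') -> [1 2 3 3 2]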
def normliaze(data, mode = 'norm', sigma = 0, dtype=np.float32, truncated = 2):
    '''
util/options.py
@@ -2,7 +2,11 @@ import argparse
import os
import time
import numpy as np
from . import util,dsp,plot

import sys
sys.path.append("..")
from data import statistics

class Options():
    def __init__(self):
@@ -37,9 +41,13 @@ class Options():
        # ------------------------Data Augmentation------------------------
        # base
        self.parser.add_argument('--augment', type=str, default='scale',
            help='all | scale,warp,app,aaft,iaaft,filp,spike,step,slope,white,pink,blue,brown,violet ; enter some of them')
        self.parser.add_argument('--augment_noise_lambda', type=float, default = 0.1, help='noise level (spike,step,slope,white,pink,blue,brown,violet)')
        # fft channel --> use fft to improve frequency domain information.
        self.parser.add_argument('--augment_fft', action='store_true', help='if specified, use fft to improve frequency domain information')
        # self.parser.add_argument('--augment_times', type=float, default=10, help='how many times that will be augmented')

        # for gan: only supported when fold_index = 1 or 0 for now
        # only supported when k_fold = 0 or 1
@@ -52,22 +60,22 @@ class Options():
        # ------------------------Dataset------------------------
        """--fold_index
        When --k_fold != 0 or 1:
            Cut the dataset into sub-sets at the given indices, then run k-fold over the sub-sets.
            If 'auto', the dataset is shuffled and cut into equal parts.
            If input: [2,4,6,7]
            when len(dataset) == 10
            sub-set: dataset[0:2],dataset[2:4],dataset[4:6],dataset[6:7],dataset[7:]
        -------
        When --k_fold == 0 or 1:
            If 'auto', the dataset is shuffled and then 80% is used to train, the rest to eval.
            If input: [5]
            when len(dataset) == 10
            train-set : dataset[0:5]  eval-set : dataset[5:]
        """
        self.parser.add_argument('--k_fold', type=int, default=0,help='fold_num of k-fold. If 0 or 1, no k-fold; cut 80% to train and the rest to eval')
        self.parser.add_argument('--fold_index', type=str, default='auto',
            help='where to fold, eg. when 5-fold and input: [2,4,6,7] -> sub-set: dataset[0:2],dataset[2:4],dataset[4:6],dataset[6:7],dataset[7:]')
        self.parser.add_argument('--dataset_dir', type=str, default='./datasets/simple_test',help='your dataset path')
        self.parser.add_argument('--save_dir', type=str, default='./checkpoints/',help='save checkpoints')
        self.parser.add_argument('--load_thread', type=int, default=8,help='how many threads when load data')
@@ -149,7 +157,7 @@ class Options():
            self.opt.fold_index = (np.load(os.path.join(self.opt.dataset_dir,'index.npy'))).tolist()

        if self.opt.augment == 'all':
            self.opt.augment = ['scale','warp','spike','step','slope','white','pink','blue','brown','violet','app','aaft','iaaft','filp']
        else:
            self.opt.augment = str2list(self.opt.augment)
util/plot.py
import os
import numpy as np
import matplotlib
# matplotlib.use('Agg')
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D