Created by: pkuyym
- Use `codecs` to make sure that all text, including the vocabulary and transcriptions, is handled as UTF-8.
- Optimize the error rate calculation, mainly by reducing the space requirement from O(m*n) to O(min(m, n)).
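
As a rough illustration of both changes, here is a minimal sketch. It is not the code from this PR: the function names `read_utf8_lines`, `levenshtein_distance`, and `word_error_rate` are hypothetical, and the two-row dynamic programming scheme is one common way to reach O(min(m, n)) space, assumed here for illustration.

```python
import codecs


def read_utf8_lines(path):
    """Read a text file as UTF-8 (hypothetical helper, not from the PR)."""
    with codecs.open(path, "r", "utf-8") as f:
        return [line.rstrip("\r\n") for line in f]


def levenshtein_distance(ref, hyp):
    """Edit distance between two sequences using O(min(m, n)) extra space."""
    # Edit distance is symmetric, so put the shorter sequence along the row.
    # Each row then holds min(m, n) + 1 entries.
    if len(ref) < len(hyp):
        ref, hyp = hyp, ref
    if len(hyp) == 0:
        return len(ref)

    # Only the previous and current rows of the DP table are kept.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr[j] = min(prev[j] + 1,         # drop an item from the longer sequence
                          curr[j - 1] + 1,     # drop an item from the shorter sequence
                          prev[j - 1] + cost)  # match or substitute
        prev = curr
    return prev[-1]


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return levenshtein_distance(ref_words, hyp_words) / float(len(ref_words))
```

Keeping two rows instead of the full table does not change the O(m*n) running time; only the memory footprint shrinks.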
 