提交 d88ceef7 编写于 作者: J JiehangXie

Fix punctuation bug

修复顿号和英文冒号停顿和分句的问题
上级 6065b1b6
...@@ -51,7 +51,7 @@ from .quantifier import replace_temperature ...@@ -51,7 +51,7 @@ from .quantifier import replace_temperature
class TextNormalizer(): class TextNormalizer():
def __init__(self): def __init__(self):
self.SENTENCE_SPLITOR = re.compile(r'([::、,;。?!,;?!][”’]?)') self.SENTENCE_SPLITOR = re.compile(r'([:、,;。?!,;?:!][”’]?)')
def _split(self, text: str, lang="zh") -> List[str]: def _split(self, text: str, lang="zh") -> List[str]:
"""Split long text into sentences with sentence-splitting punctuations. """Split long text into sentences with sentence-splitting punctuations.
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册