提交 · 888272b54742c4ab70eb4152f357c9658fde92f9 · BaiXuePrincess / Paddle

08 11月, 2022 1 次提交
- N
  [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition (#47642) · 888272b5
  由 Nyakku Shigure 提交于 11月 08, 2022
```
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition

* fix an increment
```
  888272b5
23 10月, 2022 1 次提交
- N
  [CodeStyle][black] use black instead of yapf (#46014) · 7097630f
  由 Nyakku Shigure 提交于 10月 23, 2022
```
* update config

* re-blacken python code

* temporarily disable date and diff_py_file

* skip a format
```
  7097630f
11 10月, 2022 1 次提交
- N
  [CodeStyle] use built-in `open` instead of `io.open` (#46751) · 75528ad6
  由 Nyakku Shigure 提交于 10月 11, 2022
```
* [CodeStyle] use built-in `open` instead of `io.open`

* revert flake8 config changes
```
  75528ad6
10 10月, 2022 1 次提交
- N
  [CodeStyle][F401] remove unused imports in unittests/distributed_passes,tokenizer,sequence (#46793) · ab1babbb
  由 Nyakku Shigure 提交于 10月 10, 2022
```
* [CodeStyle][F401] remove unused imports in unittests/distributed_passes,tokenizer,sequence

* add noqa after required imports
```
  ab1babbb
14 9月, 2022 1 次提交
- N
  [CodeStyle][W291] trim trailing whitespace in python file (#45937) · de8c0ba5
  由 Nyakku Shigure 提交于 9月 14, 2022
```
* trim trailing whitespace

* fix `.cmake-format.py`

* revert npu ut changes, avoid npu ci error
```
  de8c0ba5
05 6月, 2022 1 次提交

【code format check upgrade】 step2：yapf (#42944) · a072fca8

由 Sing_chan 提交于 6月 05, 2022

* use yapf to format all python file

* yapf exclude two unittests file for they rely on writing and reading file, and format will break them

* disable diff_py_file because too many diff files cause command following failed

a072fca8

20 10月, 2021 1 次提交

Add FasterTokenizer Operator (#34491) · 3f2d6a3f

由 Steffy-zxf 提交于 10月 20, 2021

Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.

* support the text string as an input Tensor
* support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
* Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
* It first applies basic tokenization, followed by wordpiece tokenization.

3f2d6a3f

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致