Commit 3370e5f6 · Author: Hongyu Li · Committer: Yibing Liu

Minor bug fix (#2443)

Parent b91cf1fc
@@ -82,6 +82,7 @@ Where `parameters_to_restore` is the model parameters used in the evaluatation (
| :------------- | :---------: | :----------: | :---------: | :----------: | :---------: | :----------: |:----------: |
| baseline + EMA | 70.2/54.7 | 57.3/47.5 | 64.1/52.8 | 51.7/37.2 | 87.9/77.7 | 63.1/53.5 | 65.7 |
| baseline woEMA | 69.9/54.6 | 57.0/47.3 | 64.0/52.8 | 51.8/37.4 | 87.8/77.6 | 63.0/53.4 | 65.6 |
Note that we turn on exponential moving average (EMA) during training by default (in most cases EMA can improve performance) and save EMA parameters into the final checkpoint files. The predicted answers using EMA parameters are saved into `ema_predictions.json`.
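For readers unfamiliar with EMA, the note above can be illustrated with a minimal sketch. It is not the repository's implementation, which this diff does not show; the class name `EMA`, the `decay` value, and the dict-of-floats parameter layout are all assumptions made for illustration.

```python
class EMA:
    """Hypothetical helper: keeps an exponential moving average of parameters."""

    def __init__(self, params, decay=0.999):
        # decay is an assumed value; the source does not state the one it uses.
        self.decay = decay
        # Shadow copy holds the averaged parameters (what would be checkpointed).
        self.shadow = dict(params)

    def update(self, params):
        # shadow <- decay * shadow + (1 - decay) * current
        for name, value in params.items():
            self.shadow[name] = (
                self.decay * self.shadow[name] + (1.0 - self.decay) * value
            )


# Usage: call update() after each training step, then evaluate with ema.shadow.
params = {"w": 1.0}
ema = EMA(params, decay=0.9)
params["w"] = 2.0  # pretend an optimizer step changed the weight
ema.update(params)
```

The averaged values move more slowly than the raw parameters, which is why predictions made with them are stored separately from the non-EMA ones.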
@@ -103,4 +104,4 @@ Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
\ No newline at end of file
@@ -48,7 +48,7 @@ python -u src/run_mrqa.py --use_cuda true\
  --max_seq_len 512 \
  --bert_config_path ${BERT_BASE_PATH}/bert_config.json \
  --predict_file ${DATA_PATH_dev}/mrqa-combined.raw.json \
- --do_lower_case false \
+ --do_lower_case true \
  --doc_stride 128 \
  --train_file ${DATA_PATH_train}/mrqa-combined.raw.json \
  --learning_rate 3e-5 \
......