文本分类示例问题
Created by: tyabs
按照 http://www.paddlepaddle.org/docs/develop/models/text_classification/README.html 中的自定义数据格式组织样本数据,中文切分成字符,然后指定test_data_dir参数,跑模型得出如下输出,
[INFO 2018-03-15 10:03:03,443 train.py:81] class number is : 2. [INFO 2018-03-15 10:03:03,444 train.py:101] length of word dictionary is : 197. I0315 10:03:03.702831 214 Util.cpp:166] commandline: --use_gpu=False --trainer_count=1 I0315 10:03:03.768990 214 GradientMachine.cpp:94] Initing parameters.. I0315 10:03:03.780875 214 GradientMachine.cpp:101] Init parameters done. [INFO 2018-03-15 10:03:05,168 train.py:134] Pass 0, Batch 0, Cost 0.723217, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.71875}
[INFO 2018-03-15 10:03:20,437 train.py:134] Pass 0, Batch 100, Cost 0.002880, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:03:35,861 train.py:134] Pass 0, Batch 200, Cost 0.000867, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:03:51,160 train.py:134] Pass 0, Batch 300, Cost 0.000377, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:04:05,879 train.py:134] Pass 0, Batch 400, Cost 0.000256, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:04:20,718 train.py:134] Pass 0, Batch 500, Cost 0.000153, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:04:36,332 train.py:134] Pass 0, Batch 600, Cost 0.000151, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:04:51,897 train.py:134] Pass 0, Batch 700, Cost 0.000118, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:05:07,207 train.py:134] Pass 0, Batch 800, Cost 0.000073, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
[INFO 2018-03-15 10:05:21,856 train.py:134] Pass 0, Batch 900, Cost 0.000046, {'auc_evaluator_0': 0.0, 'classification_error_evaluator': 0.0}
接下里都是的这两个参数都是0,直到cost也到达0. 请问是原因是?看了其他issue,数据已经平衡,且直接用的text_classificationdemo代码,label数据已经是integer_value格式。