*`--data_type` {criteo,synthetic}: The dataset type. Currently the criteo and synthetic datasets are supported.
*`--data_path` : The path of the data file. (Default: ./criteo_data/)
*`--dense_dim` : The number of continuous (dense) fields.
*`--slot_dim` : The number of sparse fields, also called categorical features.
*`--threshold` : Feature values whose frequency is below this value are treated as out-of-vocabulary (OOV). This reduces the vocabulary size.
*`--train_line_count`: The number of examples in your dataset.
*`--skip_id_convert`: 0 or 1. If set to 1, ID conversion is skipped and the original IDs are used as the final IDs.
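
As a rough illustration of how `--threshold` and `--skip_id_convert` relate, the sketch below builds a vocabulary from value frequencies and maps rare values to a shared OOV id. It is not the preprocessing script's actual implementation: `build_vocab`, `convert_ids`, and `OOV_ID` are hypothetical names, and reserving id 0 for OOV is an assumption made for this example.

```python
from collections import Counter

OOV_ID = 0  # assumption: id 0 is reserved for out-of-vocabulary values


def build_vocab(values, threshold):
    """Assign ids (starting at 1) to values seen at least `threshold` times."""
    counts = Counter(values)
    kept = sorted(v for v, c in counts.items() if c >= threshold)
    return {v: i for i, v in enumerate(kept, start=1)}


def convert_ids(values, vocab, skip_id_convert=0):
    """Return final ids; with skip_id_convert=1 the original ids pass through."""
    if skip_id_convert == 1:
        return list(values)
    # Values missing from the vocabulary fell below the threshold.
    return [vocab.get(v, OOV_ID) for v in values]


raw = ["a", "b", "a", "c", "a", "b"]
vocab = build_vocab(raw, threshold=2)
ids = convert_ids(raw, vocab)
print(vocab)
print(ids)
```

Here `"c"` appears only once, so with a threshold of 2 it is mapped to the OOV id, while `"a"` and `"b"` receive their own ids. A higher threshold shrinks the vocabulary at the cost of merging more rare values into one id.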
## Dataset
Commonly used benchmark datasets are used for model training and evaluation.
### Generate the Synthetic Data
The following command will generate 40 million lines of click data, in the format of "label\tdense_feature[0]\tdense_feature[1]...\tsparse_feature[0]\tsparse_feature[1]...".