未验证 提交 2ae46434 编写于 作者: J Jiawei Wang 提交者: GitHub

Update CUBE_QUANT.md

上级 47ec8010
......@@ -32,7 +32,7 @@ seq_generator ctr_serving_model/SparseFeatFactors ./cube_model/feature 8 #quanti
```
This command will convert the sparse parameter file SparseFeatFactors in the ctr_serving_model directory into a feature file (Sequence File format) in the cube_model directory. At present, the quantization tool only supports 8-bit quantization. In the future, it will support higher compression rates and more types of quantization methods.
## Launch Serving by quantized model
## Launch Serving by Quantized Model
In Serving, a quantized model is used when using general_dist_kv_quant_infer op to make predictions. See python/examples/criteo_ctr_with_cube/test_server_quant.py for details. No changes are required on the client side.
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册