Deepspeed quantization library v0.1 (#2450)
* Initial commit Deepspeed quantization library
* Match function signatures
* Add Quantization Kernel
* adding offset comparision and precommit changes
* format fixes
* FIle name changes
* pt_binding_changes
* test name change
* Integer quantization, minor refactors
* Add directed test_case
* format fixes
* Move param calculation to constructor of params class
* Use local function and add elemsPerBlock
* change function to be specalized
* sub block reduce
* add new schedule
* Add new schedule test case
* fix illegal writes in sch1
* Style fixes in comments
Co-authored-by: NConnor Holmes <connorholmes@microsoft.com>
Showing
csrc/includes/quantization.h
0 → 100644
csrc/quantization/quantize.cu
0 → 100644
想要评论请 注册 或 登录