Hands-on 2: Quantized Inference [2]

Run training inside tmux, so the job keeps running if the terminal or SSH connection drops.

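$ tmux new -s finetune       # create a session named "finetune"
$ tmux attach -t finetune    # reattach to that session later
Ctrl+b, then d               # key binding (not a shell command): detach and leave the job running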

16-bit inference is slow; use 4-bit quantized inference instead.
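
To make the 4-bit idea concrete, here is a minimal sketch of symmetric absmax quantization in plain Python. It is an illustration only, not how any particular framework implements it: real libraries typically use per-block scales and optimized kernels, and the function names below (quantize_4bit, dequantize_4bit, pack_nibbles) are hypothetical. The point it shows is that each weight is stored in 4 bits instead of 16, so the weights take roughly a quarter of the memory, at the cost of a small rounding error.

```python
# Illustrative sketch only: symmetric absmax 4-bit quantization.
# Function names are hypothetical, not taken from any library.

def quantize_4bit(weights):
    """Map floats to signed integer codes in [-7, 7] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    codes = [max(-7, min(7, round(w / scale))) for w in weights]
    return codes, scale

def dequantize_4bit(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]

def pack_nibbles(codes):
    """Store two 4-bit codes per byte (offset by 8 so each fits in 0..15)."""
    padded = codes + [0] * (len(codes) % 2)
    return bytes(((a + 8) << 4) | (b + 8)
                 for a, b in zip(padded[0::2], padded[1::2]))

weights = [0.7, -0.3, 0.1, 0.0]
codes, scale = quantize_4bit(weights)
packed = pack_nibbles(codes)           # 4 weights -> 2 bytes
restored = dequantize_4bit(codes, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

Here four weights fit in two bytes, versus eight bytes at 16-bit precision, and the round-trip error per weight stays within half of the scale step.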

Quantization frameworks

[Most inference frameworks include built-in quantization support.]