Why not use FP2 or IQ2 as kTransformers does?
#11
by
ghostplant - opened
I found kTransformers has supported IQ2, which can run with very lite GPUs. To make it faster, I think IQ2 must be a better choice.
ghostplant changed discussion status to
closed