This works with a quantsized version?
#4
by khronnuz - opened
I am running an FP8 version of Gemma 4, would this work or only works with the original version from google?
Seconding this, would like to know ii it works with quantization
Yes it works but you may get a lower token acceptance rate.