This works with a quantsized version?

#4
by khronnuz - opened

I am running an FP8 version of Gemma 4, would this work or only works with the original version from google?

Seconding this, would like to know ii it works with quantization

Yes it works but you may get a lower token acceptance rate.

Sign up or log in to comment