qaz@lemmy.world to Programmer Humor@programming.dev (English) · 30 days ago · "Sept" (image post, lemmy.world)
mcv@lemmy.zip · 28 days ago: But if that's how you're going to run it, why not also train it in that mode?
Xylight@lemdro.id · 28 days ago: That is a thing, and it's called quantization-aware training. Some open-weight models like Gemma do it. The problem is that you need to re-train the whole model for it, and if you also want a full-quality version you have to train even more. It's still less precise, so the quality will still be worse than full precision, but it does reduce the effect.
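For anyone wondering what that looks like in practice, here's a rough sketch of the "fake quantization" trick QAT is usually built on (not from anyone in this thread, just an illustration; the layer, sizes, and training step are made up): the forward pass uses rounded int8-style weights while gradients flow through unrounded, so the model learns weights that still work after quantization.

```python
import torch
import torch.nn as nn

def fake_quantize(w: torch.Tensor, bits: int = 8) -> torch.Tensor:
    # Round weights to a small number of levels (int8-style) in the
    # forward pass only; the straight-through estimator below lets
    # gradients pass as if no rounding had happened.
    qmax = 2 ** (bits - 1) - 1                            # 127 for int8
    scale = w.detach().abs().max().clamp(min=1e-8) / qmax  # per-tensor scale
    q = torch.round(w / scale).clamp(-qmax, qmax) * scale
    return w + (q - w).detach()

class QATLinear(nn.Module):
    # Hypothetical layer for illustration only.
    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Train against the quantized weights so the model learns
        # parameters that survive the rounding.
        return x @ fake_quantize(self.weight).t() + self.bias

# One tiny training step; data and sizes are made up.
model = QATLinear(16, 4)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)
x, target = torch.randn(32, 16), torch.randn(32, 4)
loss = nn.functional.mse_loss(model(x), target)
loss.backward()
opt.step()
```

Because the weights are updated through the quantization step, the whole model has to be trained (or re-trained) this way, which is the extra cost mentioned above.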
Your response reeks of AI slop
4/10 bait
mudkip@lemdro.id · 25 days ago: Is it, or is it not, AI slop? Why are you using markdown formatting so heavily? That is a telltale sign of an LLM being involved.
Xylight@lemdro.id · 25 days ago: I am not using an LLM, but holy bait. Hop off the Reddit voice.