How to compres f16/Q8 quant to q4 ? meinamix model sd1,5
A user on Reddit is seeking to reduce the file size of a Stable Diffusion model by quantizing it further. They have already converted a safetensor model to GGUF format but found the f16 GGUF file too large. The user is attempting to downsize it to a Q4 quantization level using command-line tools but is encountering issues, with the tool instead producing an image rather than the desired quantized model file. AI