Ggml-medium.bin -
The "medium" refers to the size of the original Whisper model by OpenAI. Whisper comes in five sizes:
Choosing "medium" is a trade-off. It is significantly more accurate than "small" or "base" for transcribing accents, background noise, or technical jargon, but it requires roughly 2-3 GB of RAM to run, whereas "large" requires 5+ GB. ggml-medium.bin
To understand the file, one must break down its name into three distinct components: The "medium" refers to the size of the
While variations exist depending on who quantized the model (e.g., community members on Hugging Face), a typical ggml-medium.bin file exhibits the following characteristics: Choosing "medium" is a trade-off