Add support for llama.cpp as an inference backend for Granite Switch models.
Add support for llama.cpp as an inference backend for Granite Switch models.