
Provide model info in chat ui & allow multiple models #598

@sallyom

Description


Rather than serving every model under the generic model.file filename, ramalama should provide more information about the currently served or loaded models.

It could also allow passing a model-config file, making it easy to switch between models with a single server instance.
https://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support
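A minimal sketch of what such a config might look like, loosely following the multi-model configuration format described in the llama-cpp-python docs linked above. The file paths, aliases, and launch command below are illustrative assumptions, not ramalama's actual layout:

```python
# Sketch of a multi-model server config, based on the llama-cpp-python
# "Configuration and Multi-Model Support" docs linked above.
# Paths, aliases, and the launch command are illustrative assumptions.
import json

config = {
    "host": "0.0.0.0",
    "port": 8080,
    "models": [
        {
            # Actual model file on disk...
            "model": "/var/lib/ramalama/models/granite-code.gguf",
            # ...and the human-readable name reported by the models
            # endpoint and shown in the chat UI.
            "model_alias": "granite-code",
        },
        {
            "model": "/var/lib/ramalama/models/mistral-7b-instruct.gguf",
            "model_alias": "mistral-7b-instruct",
        },
    ],
}

with open("ramalama-server-config.json", "w") as f:
    json.dump(config, f, indent=2)

# The server could then be started with something like:
#   python3 -m llama_cpp.server --config_file ramalama-server-config.json
```

With per-model aliases in place, a single server instance could expose several models and the chat UI could offer them by name instead of a generic filename.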

Here, I've renamed the file ramalama downloaded to granite-code so it shows up at llamacpp:port/models:

[screenshot]

Currently, any model served by ramalama is listed as model.file:

[screenshot]
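For reference, this listing comes from the server's OpenAI-compatible models endpoint. A quick way to inspect it is something like the following; the host, port, and exact endpoint path (/v1/models) are assumptions about how ramalama exposes the llama.cpp server:

```python
# Quick check of what the server reports for its loaded models.
# Host, port, and endpoint path are assumptions; adjust as needed.
import json
import urllib.request

with urllib.request.urlopen("http://localhost:8080/v1/models") as resp:
    models = json.load(resp)

# With the generic filename, the "id" field shows up as "model.file";
# after renaming the download (or setting a model_alias), it would
# show a meaningful name such as "granite-code".
for entry in models.get("data", []):
    print(entry["id"])
```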
