I’m using this one because before they ceased the Open LLM Leaderboard, it was the highest rated 14B model that can run on a single GPU with 10GB VRAM.
Newbie here. I’m not sure if the documentation tells me if it can run with ollama. If I understand correctly you have to build it «by hand»? I mainly use ollama/models on the official website and I’m too scared to plunge deeper into the mechanics haha.
I’m using this one because before they ceased the Open LLM Leaderboard, it was the highest rated 14B model that can run on a single GPU with 10GB VRAM.
Newbie here. I’m not sure if the documentation tells me if it can run with ollama. If I understand correctly you have to build it «by hand»? I mainly use ollama/models on the official website and I’m too scared to plunge deeper into the mechanics haha.
Not for GGUF comverted models.
Just run the following command in ollama
That’s awesome thank you.