I’m new to this but I’m curious why not Llama 3.1? Are Mistral-Nemo and Mistral-Small superior?
I’m running a GTX 4070 Super with Llama 3.1 8b and Llama 3.2 and like the results. But I’m open to a higher fidelity model that works with my GPU at a reasonable speed.