Software

Framework:

  • LLM: llama.cpp
  • Stable Diffusion: the web UI? Open questions: how to support multiple users, and whether any program can do multi-GPU. Might not be feasible.
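
One possible answer to the multi-user question, sketched under assumptions: put a single queue in front of the GPU so requests from several users run one at a time. This is not a feature of the web UI or of llama.cpp; `gpu_worker`, `submit`, and `fake_generate` are hypothetical names, and `fake_generate` stands in for whatever call actually produces an image.

```python
import asyncio


async def gpu_worker(queue, generate):
    """Pull jobs off the queue one at a time so only one job uses the GPU."""
    while True:
        prompt, future = await queue.get()
        try:
            # Run the blocking generation call in a thread so the event loop
            # stays free to accept new requests while the GPU is busy.
            future.set_result(await asyncio.to_thread(generate, prompt))
        except Exception as exc:
            future.set_exception(exc)
        finally:
            queue.task_done()


async def submit(queue, prompt):
    """Called once per user request; waits until the worker finishes the job."""
    future = asyncio.get_running_loop().create_future()
    await queue.put((prompt, future))
    return await future


def fake_generate(prompt):
    # Hypothetical placeholder for the real image-generation call.
    return f"image for: {prompt}"


async def demo():
    queue = asyncio.Queue()
    worker = asyncio.create_task(gpu_worker(queue, fake_generate))
    # Three "users" submit at once; the worker processes them sequentially.
    results = await asyncio.gather(
        *(submit(queue, p) for p in ["cat", "dog", "bird"])
    )
    worker.cancel()
    return results


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

This only serializes access to one GPU. It does not address multi-GPU, which stays an open question.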

LLM model:

  • Gemma 2 27B for speed
  • Llama 3.1 70B for quality
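
A rough way to compare the two models against available GPU memory is to estimate the size of the weights alone: parameter count times bits per weight, divided by 8. The sketch below assumes the nominal parameter counts from the model names (27 and 70 billion) and treats 4.5 bits per weight as an assumed average for a 4-bit quantization; real file sizes vary by quantization type, and the KV cache and runtime overhead are not included.

```python
def weight_memory_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights only, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Nominal parameter counts taken from the model names (assumption).
MODELS = {"Gemma 2 27B": 27.0, "Llama 3.1 70B": 70.0}

# Bits per weight; the 4.5 figure is an assumed average, not a measured one.
BITS = {"16-bit": 16.0, "8-bit": 8.0, "~4.5-bit": 4.5}


def estimate_table() -> dict:
    """Return {model: {precision: GiB}} rounded to one decimal place."""
    return {
        name: {
            label: round(weight_memory_gib(params, bits), 1)
            for label, bits in BITS.items()
        }
        for name, params in MODELS.items()
    }


if __name__ == "__main__":
    for name, row in estimate_table().items():
        print(name, row)
```

At the same precision the 70B weights take about 2.6 times the memory of the 27B weights, which is the main cost behind the speed versus quality trade-off noted above.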