KoboldCpp Single-binary llama.cpp fork with a lightweight built-in UI; fast local inference for RP with long context options. 0380 Roleplay Frontends (Local/Self-Hosted UI)# adult# GGUF# inference