Gemma 2 Google’s open-weight 9B/27B models; practical to run locally and widely adopted for fine-tuning and apps. 0700 Open-source Models# 27B# 9B# Gemma 2
Falcon 2 11B TII’s newer open series (text+VLM variants) optimized for efficient inference; Apache-style release. 0630 Open-source Models# 11B# Falcon 2# local
Text Generation WebUI (oobabooga) All-in-one local web UI for LLMs with chat, memory, extensions and RP-friendly presets; runs GGUF/llama.cpp and more. 0570 Roleplay Frontends (Local/Self-Hosted UI)# adult# extensions# GGUF
MiniCPM-V 2.6 Edge-friendly multimodal 8B model (images/video) with quantized variants for low-VRAM local inference. 0530 Open-source Models# edge# int4# local
TinyLlama 1.1B Compact 1.1B Llama-compatible model; popular GGUF quantizations make it fast on CPUs/GPUs locally. 0510 Open-source Models# 1.1B# GGUF# Llama-compatible
LLaVA-OneVision 1.5 Fully open multimodal (images/video+text) models & training stack; strong results and reproducible recipes. 0470 Open-source Models# LLaVA# local# multimodal
Phi-4 Reasoning Microsoft’s 14B open-weight reasoning model yielding strong complex-task performance with modest hardware. 0470 Open-source Models# 14B# local# open weights
H2O GPT Private, self-hostable chat UI/server (Apache-2.0) supporting local models & Ollama—usable for RP and long-form chats. 0460 Roleplay Frontends (Local/Self-Hosted UI)# adult# local# NSFW
Ollama Library Pull open-weight LLMs locally with one command; browse popular chat and embedding models. 0440 Model Hubs# embeddings# GGUF# LLMs
OLMo 2 (AI2) Fully open training data, code, and checkpoints; transparent 7B/13B/32B family for reproducible research and apps. 0440 Open-source Models# AI2# fully open# local
KoboldAI Classic RP/writing client with story, adventure, and chat modes; supports local and remote backends and lorebooks. 0430 Roleplay Frontends (Local/Self-Hosted UI)# adult# local# lorebook
SillyTavern Power-user local frontend for roleplay chats with character cards, lorebooks, multi-model connectors, TTS/vision/image support. 0430 Roleplay Frontends (Local/Self-Hosted UI)# adult# characters# local
Ollama + Open WebUI (Bundle) Community bundle marrying Ollama models with Open WebUI for a quick-start local RP chat experience. 0420 Roleplay Frontends (Local/Self-Hosted UI)# adult# bundle# local
DBRX Instruct (HF) Hugging Face repo for DBRX Instruct checkpoints under an open license for local inference and finetuning. 0410 Open-source Models# DBRX Instruct# Hugging Face# local
Open Interpreter Local agent that runs code (Python/JS/Shell) from natural language—terminal-first workflow. 0410 Code Assistants# agent# JavaScript# local