Google ShieldGemma Open-weight safety classifiers for inputs/outputs; tune to your policies. 0460 Guardrails & Moderation# classifier# Gemma# google
DBRX Instruct (HF) Hugging Face repo for DBRX Instruct checkpoints under an open license for local inference and finetuning. 0400 Open-source Models# DBRX Instruct# Hugging Face# local
SmolLM2 Ultra-small open models (135M/360M/1.7B) tailored for on-device and constrained local deployments. 0490 Open-source Models# on-device# open weights# small LLM
InternVL 2.5 Open multimodal family (1B–78B); 78B surpasses 70% on MMMU; broad image/video understanding. 0480 Open-source Models# InternVL# MMMU# multimodal
QwQ-32B (Qwen Reasoning) Open-weight 32B RL-trained reasoning model reported to rival larger systems while remaining locally runnable. 0370 Open-source Models# open weights# QwQ-32B# reasoning
Yi-1.5 01.AI’s upgraded Yi family (6B/34B) with improved instruction following and coding; widely used in local stacks. 0480 Open-source Models# 01.AI# 34B# instruction
Phi-4 Reasoning Microsoft’s 14B open-weight reasoning model yielding strong complex-task performance with modest hardware. 0450 Open-source Models# 14B# local# open weights
Gemma 2 Google’s open-weight 9B/27B models; practical to run locally and widely adopted for fine-tuning and apps. 0640 Open-source Models# 27B# 9B# Gemma 2
Qwen2.5 (Alibaba Qwen) Latest Qwen series from 0.5B–72B; dense decoder models with strong general, coding and math variants for local use. 0350 Open-source Models# 32B# 72B# 7B
Llama 3.2 (Meta) Family of open-weight models (1B–90B, some with vision) designed for local and edge deployment and strong instruction following. 0380 Open-source Models# edge AI# Llama 3.2# local inference