Open-Weight LLM Catalog
Discover the best open-weight large language models for self-hosted deployment. Filter by capability, compare specifications, and deploy to your infrastructure in minutes.
Model Filters
1B 1026B
4K 10M
| Model | Organization | Parameters | Context | Capabilities | Quants |
|---|---|---|---|---|---|
| Kimi K2 Instruct | Moonshot AI | 1026B | 128K | Code Tool Calls | 18 |
| Kimi K2.5 | Moonshot AI | 1016B | 256K | Code Thinking Tool Calls Vision | 18 |
| DeepSeek V3.2 | DeepSeek | 685B | 160K | Multilingual Tool Calls | 18 |
| DeepSeek V3.1 | DeepSeek | 685B | 160K | Code Multilingual Tool Calls | 18 |
| Mistral Large 3 675B Instruct 2512 | Mistral AI | 675B | 288K | Code Multilingual Tool Calls | 20 |
| Llama 4 Maverick 17B 128E Instruct | Meta | 397B | 1M | Code Multilingual Tool Calls Vision | 19 |
| GLM 4.7 | Zai Org | 358B | 198K | Code Thinking Tool Calls | 18 |
| Qwen3 235B A22B | Qwen | 235B | 40K | Code Multilingual Tool Calls | 17 |
| MiniMax M2 | MiniMax | 229B | 192K | Code Thinking Tool Calls | 18 |
| Kimi K2 Thinking | Moonshot AI | 170B | 256K | Code Thinking Tool Calls | 18 |
| Devstral 2 123B Instruct 2512 | Mistral AI | 125B | 256K | Code Multilingual Tool Calls | 18 |
| Mistral Large Instruct 2411 | Mistral AI | 123B | 128K | Multilingual Tool Calls | 14 |
| GPT OSS 120B | OpenAI | 120B | 128K | Multilingual Thinking Tool Calls | 17 |
| Qwen3 Next 80B A3B Thinking | Qwen | 81B | 256K | Code Multilingual Thinking Tool Calls | 18 |
| Qwen3 Next 80B A3B Instruct | Qwen | 81B | 256K | Code Multilingual Tool Calls | 18 |
| Qwen3 Coder Next | Qwen | 80B | 256K | Code Multilingual Tool Calls | 18 |
| Qwen2.5 72B Instruct | Qwen | 73B | 32K | Code Multilingual Tool Calls | 9 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 71B | 128K | Code Multilingual Thinking | 19 |
| Llama 3.3 70B Instruct | Meta | 70B | 128K | Code Multilingual Tool Calls | 20 |
| Meta Llama 3.1 70B Instruct | Meta | 70B | 128K | Code Multilingual Tool Calls | 15 |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | 33B | 128K | Code Multilingual Thinking | 8 |
| NVIDIA Nemotron 3 Nano 30B A3B | NVIDIA | 32B | 256K | Code Multilingual Thinking Tool Calls | 17 |
| GLM 4.7 Flash | Zai Org | 31B | 198K | Code Thinking Tool Calls | 16 |
| Mistral Small 24B Instruct 2501 | Mistral AI | 24B | 32K | Code Multilingual Tool Calls | 19 |
| GPT OSS 20B | OpenAI | 22B | 128K | Multilingual Thinking Tool Calls | 17 |
| Llama 4 Scout 17B 16E Instruct | Meta | 17B | 10M | Code Multilingual Tool Calls Vision | 19 |
| DeepSeek Coder V2 Lite Instruct | DeepSeek | 16B | 160K | Code Multilingual | 5 |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | 15B | 128K | Code Multilingual Thinking | 8 |
| Qwen2.5 14B Instruct | Qwen | 15B | 32K | Code Multilingual Tool Calls | 9 |
| Phi 4 | Microsoft | 15B | 16K | Code | 14 |
| DeepSeek R1 0528 Qwen3 8B | DeepSeek | 8B | 128K | Code Multilingual Thinking Tool Calls | 18 |
| Meta Llama 3.1 8B Instruct | Meta | 8B | 128K | Code Multilingual Tool Calls | 19 |
| DeepSeek R1 Distill Qwen 7B | DeepSeek | 8B | 128K | Code Multilingual Thinking | 8 |
| Qwen2.5 7B Instruct | Qwen | 8B | 32K | Code Multilingual Tool Calls | 9 |
| Granite 4.0 Tiny Base Preview | IBM | 7B | 128K | Code Multilingual | 15 |
| Phi 3 mini 4k instruct | Microsoft | 4B | 4K | Code | 2 |
| LFM2.5 1.2B Thinking | Liquid AI | 1B | 125K | Multilingual Thinking Tool Calls | 6 |