Open-Weight LLM Catalog

Discover the best open-weight large language models for self-hosted deployment. Filter by capability, compare specifications, and deploy to your infrastructure in minutes.

Model Filters

Parameters: 1B to 1026B
Context: 4K to 10M
| Model | Organization | Parameters | Context | Capabilities | Quants |
| --- | --- | --- | --- | --- | --- |
| Kimi K2 Instruct | Moonshot AI | 1026B | 128K | Code, Tool Calls | 18 |
| Kimi K2.5 | Moonshot AI | 1016B | 256K | Code, Thinking, Tool Calls, Vision | 18 |
| DeepSeek V3.2 | DeepSeek | 685B | 160K | Multilingual, Tool Calls | 18 |
| DeepSeek V3.1 | DeepSeek | 685B | 160K | Code, Multilingual, Tool Calls | 18 |
| Mistral Large 3 675B Instruct 2512 | Mistral AI | 675B | 288K | Code, Multilingual, Tool Calls | 20 |
| Llama 4 Maverick 17B 128E Instruct | Meta | 397B | 1M | Code, Multilingual, Tool Calls, Vision | 19 |
| GLM 4.7 | Zai Org | 358B | 198K | Code, Thinking, Tool Calls | 18 |
| Qwen3 235B A22B | Qwen | 235B | 40K | Code, Multilingual, Tool Calls | 17 |
| MiniMax M2 | MiniMax | 229B | 192K | Code, Thinking, Tool Calls | 18 |
| Kimi K2 Thinking | Moonshot AI | 170B | 256K | Code, Thinking, Tool Calls | 18 |
| Devstral 2 123B Instruct 2512 | Mistral AI | 125B | 256K | Code, Multilingual, Tool Calls | 18 |
| Mistral Large Instruct 2411 | Mistral AI | 123B | 128K | Multilingual, Tool Calls | 14 |
| GPT OSS 120B | OpenAI | 120B | 128K | Multilingual, Thinking, Tool Calls | 17 |
| Qwen3 Next 80B A3B Thinking | Qwen | 81B | 256K | Code, Multilingual, Thinking, Tool Calls | 18 |
| Qwen3 Next 80B A3B Instruct | Qwen | 81B | 256K | Code, Multilingual, Tool Calls | 18 |
| Qwen3 Coder Next | Qwen | 80B | 256K | Code, Multilingual, Tool Calls | 18 |
| Qwen2.5 72B Instruct | Qwen | 73B | 32K | Code, Multilingual, Tool Calls | 9 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 71B | 128K | Code, Multilingual, Thinking | 19 |
| Llama 3.3 70B Instruct | Meta | 70B | 128K | Code, Multilingual, Tool Calls | 20 |
| Meta Llama 3.1 70B Instruct | Meta | 70B | 128K | Code, Multilingual, Tool Calls | 15 |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | 33B | 128K | Code, Multilingual, Thinking | 8 |
| NVIDIA Nemotron 3 Nano 30B A3B | NVIDIA | 32B | 256K | Code, Multilingual, Thinking, Tool Calls | 17 |
| GLM 4.7 Flash | Zai Org | 31B | 198K | Code, Thinking, Tool Calls | 16 |
| Mistral Small 24B Instruct 2501 | Mistral AI | 24B | 32K | Code, Multilingual, Tool Calls | 19 |
| GPT OSS 20B | OpenAI | 22B | 128K | Multilingual, Thinking, Tool Calls | 17 |
| Llama 4 Scout 17B 16E Instruct | Meta | 17B | 10M | Code, Multilingual, Tool Calls, Vision | 19 |
| DeepSeek Coder V2 Lite Instruct | DeepSeek | 16B | 160K | Code, Multilingual | 5 |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | 15B | 128K | Code, Multilingual, Thinking | 8 |
| Qwen2.5 14B Instruct | Qwen | 15B | 32K | Code, Multilingual, Tool Calls | 9 |
| Phi 4 | Microsoft | 15B | 16K | Code | 14 |
| DeepSeek R1 0528 Qwen3 8B | DeepSeek | 8B | 128K | Code, Multilingual, Thinking, Tool Calls | 18 |
| Meta Llama 3.1 8B Instruct | Meta | 8B | 128K | Code, Multilingual, Tool Calls | 19 |
| DeepSeek R1 Distill Qwen 7B | DeepSeek | 8B | 128K | Code, Multilingual, Thinking | 8 |
| Qwen2.5 7B Instruct | Qwen | 8B | 32K | Code, Multilingual, Tool Calls | 9 |
| Granite 4.0 Tiny Base Preview | IBM | 7B | 128K | Code, Multilingual | 15 |
| Phi 3 mini 4k instruct | Microsoft | 4B | 4K | Code | 2 |
| LFM2.5 1.2B Thinking | Liquid AI | 1B | 125K | Multilingual, Thinking, Tool Calls | 6 |
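
For readers who want to reproduce the catalog's filtering offline, the following is a minimal Python sketch, not the catalog's own implementation. The `Model` dataclass, the `CATALOG` list, and the `filter_models` helper are hypothetical names introduced for illustration. The rows are a small subset copied from the listing above, with parameters in billions and context in thousands of tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    """One catalog row (hypothetical structure, not the site's schema)."""
    name: str
    organization: str
    params_b: int            # parameter count in billions, as listed
    context_k: int           # context window in thousands of tokens
    capabilities: frozenset  # capability tags, as listed
    quants: int              # value of the "Quants" column


# A handful of rows copied from the listing above; not the full catalog.
CATALOG = [
    Model("Kimi K2.5", "Moonshot AI", 1016, 256,
          frozenset({"Code", "Thinking", "Tool Calls", "Vision"}), 18),
    Model("GLM 4.7", "Zai Org", 358, 198,
          frozenset({"Code", "Thinking", "Tool Calls"}), 18),
    Model("GPT OSS 120B", "OpenAI", 120, 128,
          frozenset({"Multilingual", "Thinking", "Tool Calls"}), 17),
    Model("Llama 3.3 70B Instruct", "Meta", 70, 128,
          frozenset({"Code", "Multilingual", "Tool Calls"}), 20),
    Model("Phi 4", "Microsoft", 15, 16, frozenset({"Code"}), 14),
    Model("LFM2.5 1.2B Thinking", "Liquid AI", 1, 125,
          frozenset({"Multilingual", "Thinking", "Tool Calls"}), 6),
]


def filter_models(models, required=(), max_params_b=None, min_context_k=None):
    """Return models that have every required capability and fall within
    the optional size and context bounds, largest model first."""
    required = set(required)
    selected = [
        m for m in models
        if required <= m.capabilities
        and (max_params_b is None or m.params_b <= max_params_b)
        and (min_context_k is None or m.context_k >= min_context_k)
    ]
    return sorted(selected, key=lambda m: m.params_b, reverse=True)


# Example: reasoning models with tool calling, at most 400B parameters,
# and at least a 128K context window.
matches = filter_models(
    CATALOG,
    required={"Thinking", "Tool Calls"},
    max_params_b=400,
    min_context_k=128,
)
for m in matches:
    print(f"{m.name} ({m.organization}): {m.params_b}B, {m.context_k}K context")
```

With the six sample rows, this query selects GLM 4.7 and GPT OSS 120B. Capabilities are stored as sets so that "has every required tag" becomes a subset check, which mirrors how a multi-select capability filter would typically behave; whether the catalog itself uses AND or OR semantics is an assumption here.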