LLaMA 2 7B Chat (Hugging Face)

Meta’s 7B-parameter instruction-tuned model optimized for chat, dialogue, and assistant-style applications.

1 min
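
For the LLaMA 2 7B Chat entry above, here is a minimal sketch of loading the chat variant from the Hugging Face Hub with the transformers pipeline API. It assumes access has been granted to the gated model ID meta-llama/Llama-2-7b-chat-hf and that suitable hardware is available; it is an illustration, not the entry's own walkthrough.

```python
# Minimal sketch: load the gated Llama 2 7B Chat checkpoint and generate a reply.
from transformers import pipeline

chat = pipeline(
    "text-generation",
    model="meta-llama/Llama-2-7b-chat-hf",
    torch_dtype="auto",   # use the checkpoint's native precision
    device_map="auto",    # place weights on available GPU/CPU automatically
)

# Llama 2 chat models expect user turns wrapped in [INST] ... [/INST].
prompt = "[INST] Explain what an instruction-tuned model is in one sentence. [/INST]"
print(chat(prompt, max_new_tokens=64)[0]["generated_text"])
```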

LLaMA 4 Maverick 17B 128E (Original)

Meta’s LLaMA 4-series sparse Mixture-of-Experts (MoE) model with 17 billion active parameters and 128 experts, built around sparse expert routing for efficient large-scale scaling.

1 min

LLaMA 4 Scout 17B 16E

Meta’s LLaMA 4-series MoE model with 17 billion active parameters and 16 experts, using sparse expert routing (sketched below) to scale total capacity without a matching increase in per-token compute.

1 min
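
Both LLaMA 4 entries above describe sparse expert routing. The toy sketch below illustrates the general idea only: a router scores experts per token and dispatches each token to its top-k experts. The expert count, top_k value, hidden size, and function names are illustrative assumptions, not Meta's published configuration.

```python
# Toy sketch of top-k sparse Mixture-of-Experts routing (illustrative sizes).
import torch

num_experts, top_k, d_model = 16, 2, 64
router = torch.nn.Linear(d_model, num_experts)   # one routing logit per expert
experts = torch.nn.ModuleList(
    [torch.nn.Linear(d_model, d_model) for _ in range(num_experts)]
)

def moe_forward(x: torch.Tensor) -> torch.Tensor:
    """Route each token to its top-k experts and mix their outputs."""
    logits = router(x)                                          # (tokens, num_experts)
    weights, chosen = torch.topk(logits.softmax(dim=-1), top_k, dim=-1)
    out = torch.zeros_like(x)
    for t in range(x.shape[0]):                 # per-token dispatch (clear, not fast)
        for slot in range(top_k):
            e = int(chosen[t, slot])
            out[t] += weights[t, slot] * experts[e](x[t])       # weighted expert output
    return out

tokens = torch.randn(4, d_model)                # 4 example token embeddings
print(moe_forward(tokens).shape)                # torch.Size([4, 64])
```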

Meta Llama 3 8B

A next-generation 8-billion-parameter open-weight language model from Meta, optimized for reasoning and general-purpose tasks.

1 min

Mixtral 8x7B Instruct v0.1

A powerful sparse Mixture-of-Experts (MoE) instruction-tuned language model by Mistral AI, combining efficiency and performance for chat and task-oriented generation.

1 min
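
As a companion to the Mixtral entry above, this is a minimal sketch, under the assumption that the mistralai/Mixtral-8x7B-Instruct-v0.1 checkpoint is used through transformers, of prompting the instruct model via its chat template instead of hand-writing the [INST] wrapping. The hardware needed to run the full model is substantial and not addressed here.

```python
# Minimal sketch: format a chat turn with the model's template and generate.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize what a Mixture-of-Experts layer does."}]
# apply_chat_template produces the [INST] ... [/INST] formatting for us.
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
output = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```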

MPT-30B

A 30-billion-parameter open-source language model from MosaicML — a strong, general-purpose LLM balancing scale, performance, and inference efficiency.

2 min

OpenAI o1

OpenAI’s reasoning-focused model, designed to spend additional inference-time compute working through problems before responding, with strong performance on math, coding, and science benchmarks.

1 min

QwQ-32B

A 32-billion-parameter reasoning-focused language model from the Qwen team at Alibaba Cloud, designed to work through problems step by step, with strong instruction following and multilingual chat capabilities.

1 min