Provider: Meta AI
License: Meta Llama 3 Community License (permits research and commercial use, subject to conditions)
Access: Open weights (available via Hugging Face and other platforms)
Architecture: Decoder-only Transformer
Parameters: 8 billion
Overview
Meta Llama 3 8B is part of Meta's third-generation LLM series and targets reasoning, coding, and general-purpose text generation. Released alongside the 70B variant, it offers a strong performance-to-size ratio, which makes it suitable for both research and production deployment.
Key highlights:
- Improved Reasoning Capabilities: Outperforms many earlier open models in logical and structured tasks
- Context Window: Supports context lengths up to 8,192 tokens
- Foundation for Chat Models: Serves as the base model for the instruction-tuned Meta Llama 3 8B Instruct variant
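The 8,192-token limit covers the prompt and the generated tokens together, so a caller has to budget between the two. A minimal sketch of that arithmetic follows; the helper names `max_prompt_tokens` and `truncate_left` are hypothetical, and dropping the oldest tokens is only one possible truncation policy.

```python
CONTEXT_LENGTH = 8192  # context length stated in the highlights above


def max_prompt_tokens(max_new_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many prompt tokens fit once room is reserved for generation."""
    if not 0 <= max_new_tokens <= context_length:
        raise ValueError("max_new_tokens must be between 0 and context_length")
    return context_length - max_new_tokens


def truncate_left(token_ids, max_new_tokens: int, context_length: int = CONTEXT_LENGTH):
    """Keep only the most recent tokens so prompt plus generation fits the window."""
    budget = max_prompt_tokens(max_new_tokens, context_length)
    if budget == 0:
        return []
    # A negative slice start keeps the last `budget` items (or all, if shorter).
    return list(token_ids[-budget:])
```

For example, reserving 512 tokens for generation leaves 7,680 tokens for the prompt.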
Technical Details
- Architecture: Transformer decoder (GPT-style)
- Parameters: 8B
- Context Length: 8,192 tokens
- Tokenizer: New tokenizer with a 128K-token vocabulary, supporting improved multilingual capabilities
- Training Data: Over 15 trillion tokens from publicly available sources, including web data and code
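The parameter count translates directly into a lower bound on the memory needed just to hold the weights. The sketch below works through that arithmetic, assuming a nominal 8 billion parameters and the usual storage sizes per data type. The name `weight_memory_gib` is hypothetical, and the estimate deliberately ignores activations, the KV cache, and framework overhead.

```python
N_PARAMS = 8_000_000_000  # nominal count from the list above; the exact figure differs slightly

# Storage cost per parameter, in bytes, for common precisions.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gib(n_params: int = N_PARAMS, dtype: str = "bfloat16") -> float:
    """Estimate the memory needed to store the weights alone, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 2**30


for name in BYTES_PER_PARAM:
    print(f"{name:>8}: {weight_memory_gib(dtype=name):.1f} GiB")
```

At bfloat16 this comes to roughly 14.9 GiB, which is why 4-bit quantization (about 3.7 GiB) is a common choice on consumer GPUs.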
Deployment
- Hugging Face Repo: meta-llama/Meta-Llama-3-8B
- Compatibility: Works with Hugging Face Transformers, text generation pipelines, and popular chat UIs
- Fine-Tuning: Supports parameter-efficient fine-tuning methods such as LoRA and QLoRA, for example through the Hugging Face PEFT library
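The deployment points above can be sketched with the Transformers and PEFT libraries. This is a minimal illustration, not an official recipe. It assumes `torch`, `transformers`, `accelerate`, and `peft` are installed and that access to the gated repo has been granted. The helper names and the LoRA hyperparameters in `LORA_SETTINGS` are illustrative assumptions rather than values recommended by Meta.

```python
MODEL_ID = "meta-llama/Meta-Llama-3-8B"  # repo name from the list above

# Illustrative LoRA settings (assumptions, chosen only for the example).
LORA_SETTINGS = {
    "r": 16,                                 # rank of the low-rank update matrices
    "lora_alpha": 32,                        # scaling factor applied to the update
    "lora_dropout": 0.05,
    "target_modules": ["q_proj", "v_proj"],  # attention projections in Llama models
    "task_type": "CAUSAL_LM",
}


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and model; the weights are downloaded on first use."""
    # Imported lazily so the module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # halves memory compared with float32
        device_map="auto",           # needs the accelerate package
    )
    return tokenizer, model


def complete(tokenizer, model, prompt: str, max_new_tokens: int = 64) -> str:
    """Continue `prompt` with the base model and return the decoded text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def add_lora_adapters(model, settings=LORA_SETTINGS):
    """Wrap the model so that only the LoRA adapter weights are trainable."""
    from peft import LoraConfig, get_peft_model

    return get_peft_model(model, LoraConfig(**settings))
```

A typical session would call `tokenizer, model = load_model()`, then `complete(tokenizer, model, "The capital of France is")` for plain text completion, or `add_lora_adapters(model)` before handing the model to a training loop. Because this is the base model rather than the instruction-tuned one, prompts work best when phrased as text to be continued.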