DINOv2 ViT-L/14

A self-supervised vision foundation model from Meta AI: a large Vision Transformer with 14x14 patches, trained without task-specific labels, that produces general-purpose image embeddings for downstream vision tasks.

1 min

FLUX.1 [dev]

A 12-billion-parameter rectified flow transformer capable of generating images from text descriptions.

1 min

MPT-30B

A 30-billion-parameter open-source language model from MosaicML: a general-purpose LLM that balances scale, performance, and inference efficiency.

2 min

Stable Diffusion v1.4

A text-to-image latent diffusion model trained on subsets of LAION-5B, including LAION-2B-en, that generates 512x512 images from text prompts.

1 min

Stable Diffusion XL Base 1.0

Stability AI's flagship text-to-image latent diffusion model, with improved realism and composition over earlier Stable Diffusion versions and native 1024x1024 image generation.

1 min

Whisper Large v3

A multilingual speech recognition and speech-to-English translation model from OpenAI, supporting roughly 100 languages with improved robustness over earlier Whisper versions.

1 min