Laskamp4

Designed for efficiency, this model has 17 billion active parameters and fits on a single H100 GPU. It is optimized for high-speed inference (more than 460 tokens per second) and long-document reasoning.

Unlike previous versions, which relied on "bolted-on" vision components, Llama 4 was trained from the start on text, images, and video frames.

Previews suggest this is Meta's most powerful model yet. It serves as a "teacher" for smaller models through distillation.

Reception and Performance