Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL!

Published on January 29, 2025

We release Qwen2.5-VL, the new flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL. To try the latest model, feel free to visit Qwen Chat and choose Qwen2.5-VL-72B-Instruct. Also, we open both base and instruct models in 3 sizes, including 3B, 7B, and 72B, in both Hugging Face and ModelScope.

The key features include:

Understand things visually: Qwen2.5-VL is not only proficient in recognizing common objects such as flowers, birds, fish, and insects, but it is highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Being agentic: Qwen2.5-VL directly plays as a visual agent that can reason and dynamically direct tools, which is capable of computer use and phone use.

Read the full article here.

Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL!

Lockheed Martin Integrates AI-Controlled Drones with F-35 to Enhance Air Dominance

Exclusive: US Army 101st Airborne Division Develops 3D-Printed Small Drones for Air Assault Operations.