All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

Alibaba's Qwen3.7-Plus combines visual AI with autonomous agent capabilities for coding and app navigation

By

Jonathan Kemper

17d ago· 3 min readenNews

Summary

Alibaba's Qwen team has released Qwen3.7-Plus, a proprietary multimodal AI model that combines visual perception with agent capabilities like coding, tool use, and GUI navigation. Built on the text-only Qwen3.7, it functions as a "multimodal interactive hybrid agent" capable of recognizing real-world scenes, reading screens, operating interfaces, writing code from visual templates, and navigating mobile apps. In a demo, an agent built on the model autonomously developed a vocabulary learning app over eleven hours, producing over 10,000 lines of code across 1,000 agent calls. The model leads on-screen understanding in Qwen's benchmarks but shows mixed overall performance. It is priced well below Western frontier models and does not have open weights.

Source

bskyAlibaba's Qwen3.7-Plus combines visual AI with autonomous agent capabilities for coding and app navigationthe-decoder.com

Key quotes

· 3 pulled
Billed as a 'multimodal interactive hybrid agent,' the model is designed to recognize real-world scenes, read screen content, operate graphical interfaces, write code from visual templates, and navigate mobile apps end to end.
Using Qwen3.7-Plus, the team had a hybrid agent system build...
Qwen3.7-Plus is a proprietary offering with no open weights, priced well below Western frontier models.
Snippet from the RSS feed
Alibaba's Qwen team has released Qwen3.7-Plus, a multimodal agent model that combines visual perception, GUI operation, and coding in a single agent loop. In a demo, an agent built on the model autonomously developed a vocabulary learning app, producing o

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.