Alibaba Releases Qwen3.5 Medium AI Models with Open Source Licensing and Near Sonnet 4.5 Performance

By lostmsu · 3mo ago · 4 min read · en · News

Summary

Alibaba's Qwen AI team has released the Qwen3.5 Medium Model series: four new large language models with agentic tool calling. Three are available for commercial use under the open source Apache 2.0 license, while the fourth, Qwen3.5-Flash, is proprietary and available only through the Alibaba Cloud Model Studio API. The models are reported to keep near-lossless accuracy under 4-bit weight and KV cache quantization, which lets developers process large datasets without server-grade infrastructure. Running on local computers, their performance is described as approaching that of Anthropic's Sonnet 4.5.
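
To give a rough sense of why 4-bit weights matter for local hardware, here is a minimal back-of-the-envelope sketch. The parameter count is a hypothetical placeholder, not a published Qwen3.5 figure, and the estimate covers weights only: it ignores the KV cache, activations, and the small per-group scale overhead that real quantization formats carry.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory needed for model weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Hypothetical parameter count, for illustration only (not a Qwen3.5 figure).
HYPOTHETICAL_PARAMS = 30e9

fp16_gib = weight_memory_gib(HYPOTHETICAL_PARAMS, 16)
int4_gib = weight_memory_gib(HYPOTHETICAL_PARAMS, 4)

print(f"16-bit weights: {fp16_gib:.1f} GiB")
print(f"4-bit weights:  {int4_gib:.1f} GiB")
print(f"Reduction:      {fp16_gib / int4_gib:.0f}x")
```

Whatever the real parameter count, going from 16-bit to 4-bit weights cuts weight storage by a factor of four, which is the kind of saving that can bring a model within reach of consumer memory sizes.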

Key quotes
Alibaba's now famed Qwen AI development team has done it again: a little more than a day ago, they released the Qwen3.5 Medium Model series consisting of four new large language models (LLMs) with support for agentic tool calling
three of which are available for commercial usage by enterprises and indie developers under the standard open source Apache 2.0 license
A fourth model, Qwen3.5-Flash, appears to be proprietary and only available through the Alibaba Cloud Model Studio API, but still offers a strong advantage in cost comp
This leap is made possible by near-lossless accuracy under 4-bit weight and KV cache quantization, allowing developers to process massive datasets without server-grade infrastructure