StepFun Releases Step 3.5 Flash: 196B Sparse MoE Model for OpenClaw Agents

Step 3.5 Flash is StepFun’s 196B sparse MoE model that activates only 11B parameters per token. It delivers frontier reasoning and strong agentic performance with high efficiency. Seamless native…

Read the full article

Zac Zuo1mo ago4 min readenProduct

technology artificial intelligence programming open source software

You might also wanna read

GLM-4.7-Flash: Z.ai's 30B-A3B MoE Model for Lightweight AI Deployment

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co·5mo ago

LongCat-Flash: Meituan's 560B-Parameter MoE Language Model with Dynamic Computation and Open-Source Release

We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and a

arxiv.org·17d ago

Step 3.7 Flash: A High-Efficiency Multimodal AI Model for Real-World Applications

Understands images across the full range — product UIs, documents, charts, and natural scenes — then writes code or calls tools to act on wh

static.stepfun.com·1mo ago

Mach-Mind-4-Flash Technical Report

arXiv:2607.09375v1 Announce Type: new Abstract: We present Mach-Mind-4-Flash, a 35B-parameter Mixture-of-Experts (MoE) agentic model with 3B

machinebrief.com·4d ago

DeepSeek-V4 Series Preview: Million-Token Context MoE Models with 1.6T Parameters

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co·2mo ago

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

The surgence of Mixture of Experts (MoE) in Large Language Models promises a small price of execution cost for a much larger model parameter

arxiv.org·1y ago

Comments

No comments yet. Be the first.