NVIDIA Releases Nemotron-TwoTower-30B-A3B: A Block-Wise Diffusion Language Model

8d ago · 7 min read

Summary

NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, a block-wise autoregressive diffusion language model built on the Nemotron-3-Nano-30B-A3B backbone. Where a conventional autoregressive model generates one token at a time, this model produces text by iteratively denoising blocks of tokens in parallel. It was developed between September 2025 and April 2026, with a pre-training data cutoff of June 25, 2025. The Hugging Face model page covers the architecture, a comparison with the autoregressive baseline, and links to the model weights.
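
To make "iteratively denoising blocks of tokens in parallel" concrete, here is a toy sketch of the general idea. It is not NVIDIA's implementation: `toy_denoiser`, the `<mask>` token, the tiny vocabulary, and the confidence-based unmasking schedule are all hypothetical stand-ins chosen for illustration, and the source does not say which schedule the real model uses. The outer loop moves block by block (the autoregressive part), while the inner loop fills in several masked positions of the current block per step (the diffusion-style part).

```python
import random

MASK = "<mask>"  # hypothetical placeholder for a not-yet-generated position


def toy_denoiser(context, block, rng):
    """Hypothetical stand-in for a model call.

    Proposes a (token, confidence) pair for every masked slot in the block.
    A real model would condition on `context` and the partly filled block;
    this toy version just samples at random.
    """
    vocab = ["the", "model", "denoises", "blocks", "in", "parallel"]
    return {
        i: (rng.choice(vocab), rng.random())
        for i, tok in enumerate(block)
        if tok == MASK
    }


def generate(num_blocks, block_size, steps_per_block, seed=0):
    rng = random.Random(seed)
    output = []
    for _ in range(num_blocks):  # autoregressive across blocks
        block = [MASK] * block_size
        for step in range(steps_per_block):  # iterative denoising within a block
            proposals = toy_denoiser(output, block, rng)
            if not proposals:
                break  # block is already fully unmasked
            remaining_steps = steps_per_block - step
            # Commit ceil(masked / remaining_steps) slots, most confident first,
            # so the final step always fills whatever is left.
            k = -(-len(proposals) // remaining_steps)
            best = sorted(
                proposals.items(), key=lambda kv: kv[1][1], reverse=True
            )[:k]
            for i, (tok, _) in best:
                block[i] = tok
        output.extend(block)  # finished block becomes context for the next one
    return output


if __name__ == "__main__":
    print(" ".join(generate(num_blocks=2, block_size=4, steps_per_block=2)))
```

With `block_size=4` and `steps_per_block=2`, each block is completed in two denoising steps instead of four sequential token steps, which is the kind of parallelism the summary describes.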

Source

Twitter / X · NVIDIA Releases Nemotron-TwoTower-30B-A3B: A Block-Wise Diffusion Language Model · huggingface.co

Key quotes

Nemotron-TwoTower-30B-A3B-Base-BF16 is a block-wise autoregressive diffusion language model built on the NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 backbone.
It generates text by iteratively denoising blocks of tokens in parallel rather than one token at a time.
We're on a journey to advance and democratize artificial intelligence through open source and open science.
