Non-Autoregressive Language Models for Fast and Flexible Text Generation

Submissions Due

Jun 30, 2026

via OpenReview · AoE

Notifications

Jul 24, 2026

accept / reject

Camera-ready

TBA

non-archival

Workshop

Oct 9, 2026

San Francisco

News

Announcements

Jun 2026Submissions are open: the OpenReview portal is live, and we are recruiting reviewers — sign up to volunteer.
Jun 2026First invited talk announced: Shansan Gong (University of Hong Kong) on flexible generation order in diffusion language models. See the program.
May 2026NonAR-LM is confirmed as an official workshop at COLM 2026 in San Francisco.

About

Language generation beyond next-token prediction

Autoregressive next-token prediction has long been the dominant paradigm for language modeling, thanks to its simplicity, scalability, and strong empirical performance. Yet the left-to-right factorization imposes constraints that limit efficiency, controllability, and global coherence.

Recent advances in non-autoregressive modeling offer a fundamentally different approach to discrete sequence generation. Instead of committing to a fixed left-to-right order, these models enable parallel decoding, generate tokens in any order, and can revise earlier decisions. They span masked and uniform-state diffusion, discrete flow matching, and any-order autoregression, and are now competitive at scale and increasingly deployed in industry systems. This workshop brings the community together around three core challenges:

Sequential decoding bottleneck

Tokens are generated one at a time, preventing parallelism across the sequence and leaving hardware underutilized.

Limited controllability

Conditioning on global constraints or future tokens is indirect, often needing complex prompting, rejection sampling, or constrained decoding.

Limited global consistency

Local, token-level decisions can drift into incoherence over long horizons, since the model cannot revise earlier outputs.

Topics

Call for contributed work

We invite contributions on training and/or inference of non-autoregressive language models — including diffusion, flow matching, and any-order autoregression.

01

Modeling & Training

New model classes and training objectives — discrete diffusion, uniform-state diffusion, flow-based, and any-order approaches.

02

Inference & Sampling

Inference-time algorithms: iterative refinement, parallel decoding, controllable and constrained generation, planning and correction.

03

Evaluation & Efficiency

Evaluation beyond left-to-right likelihood, plus parallel generation, latency-constrained inference, and systems for scaling.

04

Applications

Applications across language, code, and biological sequences — including comparative studies of when iterative models help.

Call for Papers

Submit your work

Submissions may present new results, works in progress, negative results, empirical evaluations, or forward-looking position papers relevant to the workshop themes.

Up to 9 pages, excluding references and an optional appendix; shorter submissions are equally welcome.
Use the COLM 2026 or NeurIPS 2026 template. Format submissions with the official COLM 2026 or NeurIPS 2026 LaTeX style files, submission mode.
Non-archival. Submitting does not preclude publishing elsewhere.
Double-blind review. Each submission receives at least three reviews.
Six spotlight (contributed) talks selected from submissions; all accepted work is presented as posters.
Submissions due June 30, 2026; notifications by July 24, 2026.

Call for Reviewers

Join the review committee

We are recruiting reviewers to help evaluate submissions and shape the program. Volunteering is a hands-on way to engage closely with the newest work in the field.

Light load. Reviewers handle only a small number of non-archival submissions.
Reviewing in July 2026. Submissions close June 30 and notifications go out July 24, so reviews fall in early through mid-July.
Who should sign up. Students and early-career researchers working on or curious about non-autoregressive, diffusion, flow-based, or any-order generation.
Why it matters. Reviews directly inform which submissions are accepted and spotlighted.

Program

Schedule

A full day of 6 invited talks, 6 contributed spotlight talks, 2 poster sessions, and a panel discussion (times in Pacific Time).