Subquadratic SubQ: An LLM That Reads 12 Million Tokens in One Prompt

Miami startup Subquadratic has launched SubQ, an LLM with a 12-million-token context window. It uses sparse attention that scales linearly with context length, and the company claims it needs almost 1,000x less compute than frontier models at full context.
artificial-intelligence
Author

Kabui, Charles

Published

2026-05-28

Keywords

long-context-llm, sparse-attention, subquadratic-attention, 12m-token-context-window, subq