Speech ReaLLM revolutionizes real-time speech recognition by combining “decoder-only” ASR with RNN-T, enabling continuous audio processing without explicit endpointing. The approach achieves impressive results in real-time streaming and showcases the potential of multimodal LLMs in speech recognition tasks.