Interfaze Ships diffusion-gemma-asr-small, an Open-Source Diffusion ASR Model Transcribing Six Languages via DiffusionGemma’s Parallel Denoising Decoder

Interfaze has open-sourced diffusion-gemma-asr-small, a multilingual automatic speech recognition (ASR) model that transcribes via diffusion rather than autoregression. It adds audio input to Google's frozen DiffusionGemma through a ~42M-parameter adapter. A single adapter covers six languages, and transcription cost is set by the number of denoising steps, not by transcript length.
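The point about cost following denoising steps rather than transcript length can be illustrated with a toy count of sequential decoder passes. This is a rough sketch, not the model's actual decoding code: the function names are hypothetical, the step count of 8 is an illustrative assumption rather than a figure from the article, and the count ignores that each individual pass still gets more expensive as the sequence grows.

```python
def autoregressive_passes(num_tokens: int) -> int:
    """Autoregressive decoding: one sequential forward pass per generated token."""
    return num_tokens


def diffusion_passes(num_tokens: int, denoising_steps: int) -> int:
    """Parallel denoising: one forward pass per step, each step updating
    every token position at once, so the pass count ignores num_tokens."""
    return denoising_steps


# Compare a short and a long transcript (token counts are made up for illustration).
for num_tokens in (20, 200):
    ar = autoregressive_passes(num_tokens)
    diff = diffusion_passes(num_tokens, denoising_steps=8)  # 8 is an assumed value
    print(f"tokens={num_tokens:>3}  autoregressive={ar:>3}  diffusion={diff}")
```

Under these assumptions the autoregressive count grows with the transcript while the diffusion count stays fixed at the chosen number of steps, which is the trade-off the summary describes.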
This is a summary curated by AIFuture. Read the full story on MarkTechPost.