REPOGEO REPORT · LITE
kyutai-labs/delayed-streams-modeling
Default branch main · commit 4c4f65e1 · scanned 5/27/2026, 1:18:12 AM
GitHub: 2,923 stars · 307 forks
Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface kyutai-labs/delayed-streams-modeling, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.
Action plan — copy-paste fixes
3 prioritized changes generated by gemini-2.5-flash. Mark items done after you ship the fix.
- #1 · high · topics: Add relevant topics to the repository
Why:
COPY-PASTE FIX: speech-to-text, text-to-speech, stt, tts, real-time, streaming, low-latency, deep-learning, pytorch, kyutai
- #2 · high · readme: Clarify the README's opening statement to emphasize STT/TTS models
Why:
CURRENT:
# Delayed Streams Modeling: Kyutai STT & TTS
This repo contains instructions and examples of how to run [Kyutai Speech-To-Text](#kyutai-speech-to-text) and [Kyutai Text-To-Speech](#kyutai-text-to-speech) models. See also Unmute, a voice AI system built using Kyutai STT and Kyutai TTS. But wait, what is "Delayed Streams Modeling"? It is a technique for solving many streaming X-to-Y tasks (with X, Y in `{speech, text}`) that formalize the approach we had with Moshi and Hibiki. See our pre-print about DSM.
COPY-PASTE FIX:
# Delayed Streams Modeling: Kyutai STT & TTS
This repository showcases Kyutai's advanced Speech-To-Text (STT) and Text-To-Speech (TTS) models, which are built using our innovative Delayed Streams Modeling (DSM) framework. DSM is a powerful technique for solving various streaming X-to-Y tasks (with X, Y in `{speech, text}`), formalizing approaches seen in projects like Moshi and Hibiki. Here you will find comprehensive instructions and examples for deploying and running these models, including those powering the Unmute voice AI system.
- #3 · medium · homepage: Add a homepage URL to the repository
Why:
COPY-PASTE FIX: https://huggingface.co/collections/kyutai/speech-to-text-685403682cf8a23ab9466886
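Fixes #1 and #3 can also be applied programmatically rather than through the repository settings page. The following is a minimal sketch, assuming the GitHub REST API endpoints `PUT /repos/{owner}/{repo}/topics` and `PATCH /repos/{owner}/{repo}` and a personal access token with permission to administer the repository. The helper `build_request` and the `<YOUR_TOKEN>` placeholder are hypothetical names introduced for illustration; the script only builds the requests and does not send them.

```python
import json
import urllib.request

API = "https://api.github.com"
REPO = "kyutai-labs/delayed-streams-modeling"

# Values copied from the copy-paste fixes #1 and #3 above.
TOPICS = [
    "speech-to-text", "text-to-speech", "stt", "tts", "real-time",
    "streaming", "low-latency", "deep-learning", "pytorch", "kyutai",
]
HOMEPAGE = (
    "https://huggingface.co/collections/kyutai/"
    "speech-to-text-685403682cf8a23ab9466886"
)


def build_request(method, path, payload, token):
    """Hypothetical helper: build (but do not send) a JSON request to the GitHub API."""
    return urllib.request.Request(
        url=API + path,
        data=json.dumps(payload).encode("utf-8"),
        method=method,
        headers={
            "Accept": "application/vnd.github+json",
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# PUT replaces the full topic list, so include any topics you want to keep.
topics_request = build_request(
    "PUT", f"/repos/{REPO}/topics", {"names": TOPICS}, token="<YOUR_TOKEN>"
)

# PATCH updates only the fields present in the payload (here, the homepage URL).
homepage_request = build_request(
    "PATCH", f"/repos/{REPO}", {"homepage": HOMEPAGE}, token="<YOUR_TOKEN>"
)

# To apply a change, send it with: urllib.request.urlopen(topics_request)
```

Because the topics endpoint overwrites the existing list, it is worth reading the current topics first and merging them into `TOPICS` before sending.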
Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash
Category visibility — the real GEO test
Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?
- NVIDIA NeMo · recommended 1×
- Whisper · recommended 1×
- Kaldi · recommended 1×
- Mozilla DeepSpeech · recommended 1×
- Hugging Face Transformers · recommended 1×
- CATEGORY QUERY: What are the best open-source models for real-time streaming speech-to-text with low latency? You: not recommended. AI recommended (in order):
- NVIDIA NeMo
- Whisper
- Kaldi
- Mozilla DeepSpeech
- Hugging Face Transformers
- SpeechBrain
AI recommended 6 alternatives but never named kyutai-labs/delayed-streams-modeling. This is the gap to close.
- CATEGORY QUERY: How to implement an efficient speech recognition system that provides word-level timestamps? You: not recommended. AI recommended (in order):
- OpenAI Whisper (openai/whisper)
- Google Cloud Speech-to-Text
- AssemblyAI
- AWS Transcribe
- Mozilla DeepSpeech (mozilla/DeepSpeech)
- Montreal Forced Aligner (MFA) (MontrealCorpusTools/Montreal-Forced-Aligner)
- Picovoice Rhino Speech-to-Text
- Kaldi (kaldi-asr/kaldi)
AI recommended 8 alternatives but never named kyutai-labs/delayed-streams-modeling. This is the gap to close.
Objective checks
Rule-based audits of metadata signals AI engines weight most.
- Metadata completeness: warn
Suggestion:
- README presence: pass
Self-mention check
Does AI even know your repo exists when asked about it directly?
- Compared to common alternatives in this category, what is the core differentiator of kyutai-labs/delayed-streams-modeling? pass · AI did not name kyutai-labs/delayed-streams-modeling; likely talking about a different project
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
- If a team adopts kyutai-labs/delayed-streams-modeling in production, what risks or prerequisites should they evaluate first? pass · AI named kyutai-labs/delayed-streams-modeling explicitly
- In one sentence, what problem does the repo kyutai-labs/delayed-streams-modeling solve, and who is the primary audience? pass · AI did not name kyutai-labs/delayed-streams-modeling; likely talking about a different project
Embed your GEO score
Drop this badge into the README of kyutai-labs/delayed-streams-modeling. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.
<a href="https://repogeo.com/en/r/kyutai-labs/delayed-streams-modeling"><img src="https://repogeo.com/badge/kyutai-labs/delayed-streams-modeling.svg" alt="RepoGEO" /></a>
Subscribe to Pro for deep diagnoses
kyutai-labs/delayed-streams-modeling — Lite scans stay free; this card itemizes Pro deep limits vs Lite.
- Deep reports: 10 / month
- Brand-free category queries: 5 vs 2 in Lite
- Prioritized action items: 8 vs 3 in Lite