alicloud-ai-audio-asr-realtime
v1.0.0Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents.
Installation
Please help me install the skill `alicloud-ai-audio-asr-realtime` from SkillHub official store.
npx skills add cinience/alicloud-ai-audio-asr-realtime
Category: provider
Model Studio Qwen ASR Realtime
Validation
mkdir -p output/alicloud-ai-audio-asr-realtime
python -m py_compile skills/ai/audio/alicloud-ai-audio-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/alicloud-ai-audio-asr-realtime/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-asr-realtime/validate.txt is generated.
Output And Evidence
- Save session payloads and response samples under
output/alicloud-ai-audio-asr-realtime/.
Critical model names
Use one of these exact model strings:
- qwen3-asr-flash-realtime
- qwen3-asr-flash-realtime-2026-02-10
Use cases
- Realtime subtitles and captions
- Voice-agent duplex input
- Streaming speech-to-text in browser or terminal clients
Prerequisites
- Set
DASHSCOPE_API_KEYin your environment, or adddashscope_api_keyto~/.alibabacloud/credentials. - Realtime sessions generally require WebSocket or streaming session handling in the client.
Normalized interface (asr.realtime)
Request
model(string, optional): defaultqwen3-asr-flash-realtimelanguage_hints(array, optional) format(string, optional): e.g.pcm,wavsample_rate(int, optional): e.g.16000chunk_ms(int, optional): frame size in milliseconds
Response
text(string): recognized transcript fragmentis_final(bool): finalization markerusage(object, optional)
Quick start
Generate a request template:
python skills/ai/audio/alicloud-ai-audio-asr-realtime/scripts/prepare_realtime_asr_request.py
--output output/alicloud-ai-audio-asr-realtime/request.json
Operational guidance
- Prefer 16kHz mono PCM unless your client stack requires another format.
- Keep chunks small enough for responsive partial results.
- If you only have recorded files, use
skills/ai/audio/alicloud-ai-audio-asr/instead.
References
references/sources.md