paraformer-8k-v2 is the telephone scenario version, supports Chinese only, 8kHz sample rateAdd to request header:
Authorization: Bearer YOUR_API_KEY
Paraformer audio file recognition model
Options:
| Value | Description |
|---|---|
paraformer-v2 | General version, supports multiple languages, any sample rate |
paraformer-8k-v2 | Telephone scenario, Chinese only, 8kHz sample rate |
paraformer-v2, paraformer-8k-v2 "paraformer-v2"
Audio file URL list
Notes:
1 - 100 elements["https://example.com/audio/meeting.wav"]Language hints for recognition
Notes:
paraformer-v2zh (Chinese), en (English), ja (Japanese), yue (Cantonese), ko (Korean), de (German), fr (French), ru (Russian)["zh", "en"]Audio track index
Notes:
[0] means the first track[0] (only process the first track)[0]Recognition configuration
Notes:
Speaker diarization configuration
Notes:
Task created successfully
Task creation timestamp
1757165031
Task ID
"task-unified-1757165031-uyujaw3d"
Actual model name used
Specific task type
audio.generation.task Task progress percentage (0-100)
0 <= x <= 1000
Task status
pending, processing, completed, failed "pending"
Asynchronous task info
Task output type
audio "audio"