Skip to content

feat: add qwen3-tts text-to-speech example#23

Merged
deanq merged 2 commits intomainfrom
timpietrusky/add-qwen3-tts-example
Feb 13, 2026
Merged

feat: add qwen3-tts text-to-speech example#23
deanq merged 2 commits intomainfrom
timpietrusky/add-qwen3-tts-example

Conversation

@TimPietruskyRunPod
Copy link
Member

@TimPietruskyRunPod TimPietruskyRunPod commented Feb 6, 2026

Summary

  • Add new example 02_ml_inference/01_text_to_speech/ using Qwen3-TTS-12Hz-1.7B-CustomVoice
  • 9 voices (English, Chinese, Japanese, Korean), 11 languages, emotion/style control via natural language instructions
  • Runs on RTX 4090 (24GB VRAM) with auto-scaling 0-3 workers

Endpoints

  • POST /gpu/tts — JSON response with base64-encoded WAV audio
  • POST /gpu/tts/audio — Direct WAV file download
  • GET /gpu/voices — List available voices and languages

Test plan

  • Scaffolded with flash init (runpod-flash v1.0.0)
  • Tested with flash run locally — generated WAV file successfully
  • make consolidate-deps run — root pyproject.toml updated
  • Run flash run from repo root to verify unified app discovery
  • Test all three endpoints via /docs Swagger UI

Add text-to-speech example using Qwen3-TTS-12Hz-1.7B-CustomVoice model
running on RTX 4090 GPUs. Supports 9 voices, 11 languages, and
emotion/style control via natural language instructions.
@deanq deanq merged commit 7666ca8 into main Feb 13, 2026
6 checks passed
@deanq deanq deleted the timpietrusky/add-qwen3-tts-example branch February 13, 2026 01:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants