Compose dialogue, attach optional voice prompts, and generate audio (CUDA graphs enabled by default).
Timestamps