Text-to-speech narration and podcast generation via ElevenLabs voices with emotion control and SSML tags.