🖼️ → 🎬 Slideshow Maker

Per-image audio: upload audio files, one (or more) per image (matched by filename or order).
Per-image TTS (multiline): write blocks separated by blank lines; lines inside a block are spoken sequentially for that image.
TTS voices: pick from Coqui VCTK multi-speaker voices (male/female) or use gTTS as a lightweight fallback.

Result

Tips

Multiline per image: separate image blocks with a blank line. Within each block, lines are spoken in order.
Coqui per-line speaker: prefix a line with speaker| text, e.g., p225| Hello there.
Sync option: turn it on to make each image stay up for the full duration of its own audio.