I collected text-to-speech audio, image, and text data through my Zaps, but I cannot combine them into a single image with voiceover and subtitles. What would you recommend for this? I want the process to be autonomous. I tried an application called Creatomate: the Zap test passes, but it does not produce any output. I would appreciate any help.
Question
How do I combine text-to-speech, image, and text into a single image with voiceover and subtitles?


