Please read:
A face in the image is enough.
The more clearly the subject is visible in the original image, the higher the chance of a very good result.
Enter the language (one word) the video should be in.
For example: english, japanese, chinese, etc
There is an automatic prompt generator; it will do what it needs to do.
P.S.: The 5s version of this app may sometimes be dialogue-less.