Use Ltx-2.3 to generate videos with audio from images and prompts. The models used are as follows:
・(https://www.seaart.ai/ja/models/detail/d72vbste878c7395fr80)
・ltx-2.3-22b-distilled-1.1_lora-dynamic_fro09(https://www.seaart.ai/ja/models/detail/d7ffgnle878c73eeep8g)
Options
・There are 2-pass and 1-pass modes. In general, 2-pass is more efficient and suited for high resolutions, while 1-pass is suited for low resolutions. In 2-pass, the first pass generates a half-size video, and the second pass upscales it by 2×.
Other
・The input fields “shorter side length of the video,” “seconds,” and “FPS” affect processing time.