le meng: "Supports text, image, audio, and video references with precise motion, consistency, and immersive audio-visual output. Create cinematic AI videos with unified multimodal control. https://www.xmk.com/s"