SkyReels V4 takes the #1 spot in Text to Video with Audio in the Artificial Analysis Video Arena, surpassing Kling 3.0 and Veo 3.1!

SkyReels V4 is the latest video generation model from @Skywork_ai, marking a major shift from their previous avatar-focused models to a full multimodal video generation system supporting Text, Image, Video, and Audio inputs. The model generates videos of up to 15 seconds at 1080p resolution with native audio support.

SkyReels V4 also performs strongly across other modalities, ranking #2 in Text to Video without Audio, #4 in Image to Video with Audio, and #7 in Image to Video without Audio.

The model is priced at $7.20 per minute with audio and $8.40 per minute without audio, positioning it below Kling 3.0 1080p Pro (~$20/min with audio) and Veo 3.1 ($24/min with audio), though at a premium over Grok Imagine at $4.20/min with audio.

SkyReels V4 is available via the @SkyReels website, with both a web app and API access. SkyReels V4 Omni will be released soon.

See below for example generations of SkyReels V4 in the Artificial Analysis Video Arena 🧵
Example Generations from SkyReels V4