this week there were some major developments on the AI video Frontier Google's V2 image to video mode is now released on free pick V2 was showing next gen performance and working almost like a cinematic physics engine we will see if it can repeat the same performance with image to video Additionally the state-ofthe-art open source AI video model Alibaba 1 2.1 has a new version one pro one pro generates high quality 1080p videos at 30 frames per second with up to 6 seconds duration so in this epic prompt battle I will test Google V2 against clink
1.6 one pro Hario Minimax and Lumar Ray 2 let's get started in this first challenge initially we are starting with the blue butterfly this is going to be our first frame and after that we have a cinematic Focus change and we switch to a wolf walking into the scene here we are testing the prompt understanding of the models how well they can Implement cinematic camera movements I was very impressed with the V2s result how smoothly wolf enters to the scene butterflies natural flutter and just flying away it looked very cinematic and cool and in the
end of the shot you will realize that a second wolf enters to the scene if I want to go perfectionist on that I could say I was looking for a single wolf walking into the scene but this is not a major problem I was also very impressed that VI gave me the dynamic Focus Shift from butter fly to wolf's head clink 1.6 didn't give me the wolf entering to the scene just keept zooming in to my butterfly I was expecting much better from clink on this challenge it surprised me a little bit but it didn't
actually give me what I asked for one pro output is impressive I really liked how amazingly light and shadows and rendered in this shot butterfly looks really alive and natural and I like that both anwers to the scene exactly how I described I have even a Focus Shift which is really impressive one issue I realized with one it tend to desaturate the initial image always it always decreases the brightness a little bit pliio gave me T Wings the T Wings appeared so we lost the coherence right away at the beginning of the shot you will
realize that in the end of the shot it actually gave me the wol but if I would compare this to for example V2 output or one pro output it doesn't look as cinematic as the other output but from prompt understanding perspective halio did a good job A2 output unfortunately didn't give me the Focus Shift instead again a common mistake in this AI video tools they tend to give user a scene shift instead of a Focus Shift so in the second part of the prompt there was a wol and then scene changes to a distinct separate
wolf scenery so among all the results I think Lumar had the worst result from my perspective in the next challenge we have a Battlefield this is a combination of natural elements of water fire lightning we have metaphysical elements as well as soldiers and Banners we asked in the prompt that we want the highspeed POV shot that I wanted camera to FL through the battlefield show me the banners and flags and then we would jump into the Vortex and go through different universes at least this is what I dreamed of one major problem with V2 output
was initially soldiers looked Frozen this is a common mistake for image to video models sometimes you can realize that the initial image frame couple of seconds can be Frozen and after that model figures out what to do I am impressed with how bears are moving in the wind the flames you will realize that lightning is static and it's not moving and after that everything starts moving and then we got also a Vortex coming in it didn't do a horrible job except the initial frame freeze the clink output game the flly through of the battlefield it
gave me the vortex and I like how smooth the motion was you can see moving flags and Flames so overall I'm happy with the clink Result One Pro also gave me exactly what I asked for the PV camera flying through and I liked how Vortex looks really Dynamic and the whole shot looked really Dynamic same issue again that everything gets super dark in one output brightness drop is massive in this particular shot it actually worked to our advantage it made the vortex even more obvious and beautiful to look at in other times this can be
little distracting I think Lumar Ray was really committed to give me multiple universes I got a flly through from Medieval Age to Space Age with a particularly interesting helicopter looking toy car unfortunately in comparison to other outputs it is difficult to say that Luma tried here I would prefer a fly through in the battlefield I think it would look much more cinematic before entering to the vortex I'm happy with the minx's output it looks pretty cinematic it gave me renders of natural elements like fire and power surge and even lightning so I'm overall very happy
with the Minimax result in the next challenge we have a female runner at the Olympics in this particular challenge I'm trying to understand the physics understanding of these models and a lot of course and how well they can render human anotomy starting with V2 you will realize few coherence issues both in the leg in the upper body as well on the background of the initial frame quality dropped really major you can almost not see anything else there's a strong motion blur as we ask for overall it's a good result with some coherence issues on the
upper body in particular and one time in the legs as well you can realize the motion and movement in in the hair the clink output decided to give me a bit of a slow motion so in comparison to V2 this shot looks a little bit more static but the muscles and the movement of the runner and the coherence is simply fantastic it kept the coherence of the anatomy and the movement throughout the shot which was very impressive I like that one output rendered the background elements a little bit better but running doesn't really look as
natural as V2 and clink output this little bit disappointed me about one it is not horrible but there are some coherence issues the whole movement looks little bit jumpy to me in comparison to impressive anatomical coherence and natural running M of the first two models it's from my perspective little behind the halio minimix gave me a much better anatomy and the natural movement of running only problem when it comes to halio is the quality here the details of the body and the muscles are not as visible as clink output or view output but still impressive
result Lumar Ray rendered the whole scene really well it's just randomly decided to empty the whole stadium couldn't really render audience as good as vo and clink but overall I think it did a good job it kept the coherence of the anatomy and I'm not disappointed with the Lumar Ray result in the last challenge we have a warrior attacking with a blade and we have a circling orbiting camera after that attack I would like to see de breeze are lifting into the air with the force of the attack it's a complex prompt and there's a
lot going on here it's not a straightforward one but a good challenge for our models Google's V2 output again decided to freeze the first frame it didn't give me the attack with the blades until the very end of the shot and I didn't get the camera circling our character unfortunately the physics of the debris the explosion and moving objects look pretty awesome but overall I was underwhelmed with this shot clink gave me a much more Dynamic shot a circling camera and a nice swing of the blade in the middle of the shot we lost coherence
of our character got it back later in the last frame but overall for me this shot look much more Dynamic than Google's V2 yet of course it definitely requires a reroll or maybe the first part of the shot can be used until the decoherence part one pro output looked more like a dance for me or almost like a bit of like anime fight it can be also taught as he's burning and he's in pain or this intense fire causing him to invent a new dance we can make many comments for me it's not the best
result among the other outputs here I like the halio output because it gave me what I asked for there's an attack there is some camera movement a bit of circling not much but it looked cool and cinematic it's just the explosion on the background looks pretty much Frozen I like that debris are moving around overall I think it did a fantastic job here Lumar A2 lost the coherence at the beginning of the shot and instead of a single blade it gave me two swords by Magic if this would be in an online game I would
definitely accept this offer I mean replacing one blade with two sword is a good deal in this particular scenario lumaray output didn't really get gave us what we asked for so my final verdict and these are my personal opinions just as a reminder I'm not associated with any of these Brands Google V2 is delivering exceptional visual quality and motion diversity but there is initial frame freeze problem and this requires Improvement it definitely has potential to catch up with clink 1.6 if it can solve this initial frame problem and here and there I observed some minor
coherence issues with some optimization I think it would be a fantastic competition for clink one pro is a very good model but coherence seems far behind clink VI to and high Mini Max it also makes the footages little bit darker because of some reason this probably requires further optimization but overall it's a model with a massive Potential from my perspective the Lumar R 2 is far behind all models mentioned about hopefully this video was truly helpful for you don't forget to give a thumbs up and subscribe for more indepth tutorials and prompt battles if you
want to learn more about creative intelligence click here