I used Google Veo to bring my selfies and photos to life – and things got hilariously weird

Must Read
bicycledays
bicycledayshttp://trendster.net
Please note: Most, if not all, of the articles published at this website were completed by Chat GPT (chat.openai.com) and/or copied and possibly remixed from other websites or Feedzy or WPeMatico or RSS Aggregrator or WP RSS Aggregrator. No copyright infringement is intended. If there are any copyright issues, please contact: bicycledays@yahoo.com.

Google this week made out there the newest iteration of its Veo video-generation instrument to customers of its Gemini synthetic intelligence program who’ve a “Professional” or “Extremely” account.

Veo has been out there in preview for a while now. What’s new with the newest implementation is the power to start your video by importing a nonetheless picture to function the preliminary body. (ZDNET’s Prakhar Khanna has reported his expertise utilizing the potential as a built-in characteristic of his Honor 400 cellphone, versus utilizing it via the web site as I did.)

Methods to use Veo to generate movies from pictures

You give the system a immediate, press enter, and Veo creates an eight-second video utilizing your uploaded picture as a reference level from which to construct the primary body of video. Veo provides sound, together with music, footsteps, and different incidentals. 

Movies take a number of minutes at a time to develop.

In my testing thus far, I discover Veo’s implementation each fascinating and a bit creepy.

My outcomes with Veo’s photo-to-video characteristic

I attempted a number of nonetheless photos I had taken, together with a selfie and a few road pictures. Seeing one’s photos come to life, if you’ll, is jarring. It’s disconcerting how nicely it really works, and, because the photographer, it is disconcerting how the consequence contrasts with one’s reminiscence of the occasion.

The nice facets are the standard of the video, which is in line with the photographic picture. Issues comparable to perspective of a scene are usually nicely maintained, and shifting objects within the background are, in some instances, well-orchestrated to be constant.

1. Jogger working alongside the promenade

Right here, for instance, is a video I took of a jogger on the East River promenade in Manhattan. I gave Veo the immediate, “Please make a video wherein the jogger continues to run into the space alongside the promenade.”

Beneath is the unique nonetheless picture adopted by the Veo video.

The movement of the jogger is sweet, as is the motion in area as if from the perspective of the photographer.

It is a substantial technical achievement, to my thoughts, on a really fundamental stage. Keep in mind that that is eight seconds of 720p-quality decision, which is rendered at the usual movie price of 24 frames per second. Meaning Veo has to create, in a couple of minutes, 192 frames from the preliminary picture. Given how little effort it took me because the person, it could be straightforward to miss simply how vital that’s from a purely technical perspective. The ability of all that computing within the cloud actually shines in one thing like this.

One additionally, nonetheless, sees the artifacts that crop up from Google’s predictions in regards to the frames, giving the factor a slightly eerie high quality.

The jogger on the best, for one, would not actually look the identical because the jogger in my picture, solely vaguely related (hair is completely different, stride is completely different).

One other artifact is that, on the precise second in time, the determine shifting towards the digital camera on the left-hand aspect of the image was strolling, not jogging. I feel that is clear within the picture. However Veo rendered that individual jogging as nicely.

One other merchandise emerges on the FDR Drive freeway within the higher left. One can see automobiles that mysteriously vanish sooner or later of their motion. That could be a fixed theme of the Veo movies, the shortcoming of this system to totally keep continuity.

2. Lady strolling previous The Horseshoe Bar

A shocking achievement emerged after I submitted {a photograph} of a bar on seventh Road within the East Village, known as 7B, or The Horseshoe Bar. I added the immediate, “Are you able to present the girl strolling previous the constructing?”

The ensuing video exhibits good road perspective however what’s actually shocking is that it managed to fill within the white signal above the door on the unseen aspect of the constructing that exhibits the horseshoe image. That means Veo was capable of finding in some knowledge a completion of the bar, which is slightly superb.

The unseen buildings that Veo fills in, nonetheless, because the video turns the nook, aren’t the precise buildings on that road, a case of Veo developing with a fairly respectable substitute. Discover a powerful artifact: Veo gave the strolling particular person a blue hat, which it appeared to have added erroneously based mostly on the individual in my {photograph} strolling in entrance of a blue signal on the constructing.

3. Particular person in white boots will get up and off prepare

Some artifacts are extra hanging. In a second piece of road pictures, I uploaded an image of somebody sitting in a subway automotive with white boots. I gave the immediate, “The individual within the white boots will get up from their seat and will get off the prepare.” What was produced was fairly hanging, and fairly good for an approximation of how this determine would possibly transfer. The individual would not, nonetheless, exit the prepare.

After I endured with a second immediate, “That is nice, however one adjustment. Is it attainable to point out the doorways of the prepare automotive opening and the individual within the white boots truly strolling out the doorways to exit the prepare?”, Veo produced a second model.

This time, the person a minimum of is proven shifting towards an exit, as doorways are proven sliding open. Nevertheless, a number of artifacts right here fail a actuality and consistency check. For one factor, nobody exits a New York Metropolis subway automotive on the — finish — of the automotive; they exit on the aspect doorways, as that’s the place the platform is. Second, the sliding doorways depicted on the finish of the automotive don’t exist in New York Metropolis subway vehicles. These exits have one, not two, sliding doorways.

Third, it is clear within the unique nonetheless picture, based mostly on the sunshine and the main points seen via the rear window of the prepare automotive, that this isn’t the final automotive within the line; there may be one other automotive behind it. But, when the doorways open within the video, we see the platform and tracks, suggesting this automotive is now the final automotive within the line. It is an incapability right here for Veo to correctly infer from element the full construction of the setting.

Final however not least, in a fourth inconsistency, we are able to see via the open doorway that the platform is immediately beneath the prepare, in order that the prepare is — using over the platform — slightly than the tracks.

4. Thunder and lightning with rain

I submitted a wet evening image on Lexington Avenue in Manhattan and requested for “A video of thunder and lightning and severe rain on this road scene.” The result’s slightly cartoonish, nevertheless it’s actually a enjoyable second with the best intent.

5. Darkish lavatory selfie

Placing one’s likeness into Veo has its personal particular creepiness, or amusement, or each, relying in your humorousness.

I first used a really darkish lavatory selfie. I used to be impressed with the vary of imaginative animation. My options, nonetheless, appear to morph drastically into another person’s likeness, and I am undecided whose. (I have been instructed I appear like Thom Yorke of the band Radiohead generally.)

6. Skilled headshot

In one other occasion, I used my ZDNET headshot and requested Veo, “Are you able to make a video of this man doing the cha-cha-cha?” I just like the ensuing motion, accompanying music, and the very loud boot sounds are very amusing.

Nevertheless, the creepy half right here is that with out additional prompting, Veo has left my face a inflexible masks of expression, which does not make sense in a dance video. The truth is, my head would not actually transfer in any respect; it is mounted.

7. Las Vegas selfie

I uploaded yet one more selfie, taken at Caesar’s Palace on line casino and lodge in Las Vegas, and prompted, “Please make a video of this man within the leather-based jacket dancing tango with the statue of Venus that’s within the background.” Properly, Veo didn’t achieve making us dance, however the ensuing ground present by my likeness is amusing. So is the music. Discover that the sleeves of my leather-based jacket flip black, for some motive.

8. A historic mashup with John C. Calhoun

On the hunch that manipulating historic figures is perhaps disallowed, I attempted making a historic mashup to check the matter. I uploaded an image of onetime US vp John C. Calhoun from the US Library of Congress, and requested that Veo make a video of Calhoun dancing the cha-cha-cha.

Veo began to make a video, then stop with the message, “I can not generate that video. Strive describing one other thought. You can even get ideas for the right way to write prompts and evaluate our video coverage pointers. Be taught extra.”

9. Making Scarlett giggle

I then tried importing an image of actor/director Scarlett Johansson from her Wikipedia web page, and requested “a video of this girl laughing.” Once more it began after which stop with the identical error message.

10. Making myself giggle

I double-checked the matter with my very own headshot, as a non-historical, non-famous individual, and was capable of get Veo to make a video of me laughing (albeit wanting under no circumstances like the unique headshot).

That means that Veo could also be constructed with safeguards towards manipulation of historic or popular culture photos, although I can’t be sure.

Must you attempt Google Veo?

The Veo service, in preview, is actually not with out glitches. 

After my first couple of successes, I repeatedly received a warning that I must wait to do extra movies, because the service is rate-limited in the meanwhile. There are complaints about this within the person fora for Gemini, together with folks being denied the service for over 24 hours, and a protracted rationalization of the matter by a volunteer product “skilled.” Mainly, video is bandwidth-, compute- and memory-intensive, so it is not shocking Google must restrict utilization on the outset.

Essentially the most direct answer is to improve to the upper stage of Gemini, the “Extremely” plan, although this implies going from $19.99 a month to $249 a month (discounted for the primary three months to $125). That is a steep value simply to have the ability to get round what appear slightly harsh limits.

Even after subscribing to Extremely, I reached a restrict after 5 movies, with an error message saying “one thing went improper.” One other explainer publish within the person discussion board means that there isn’t a clear restrict for the Extremely plan; it is an obscure matter of AI “credit” within the cloud service.

That sudden shutdown contradicts Google’s phrases of service that say, “You may get a notification whenever you’re near the restrict. The notification will let you know what number of movies you’ve got left.” (Be taught extra within the Gemini apps assist part about varied Gemini limits.)

The choice to Extremely is much more complicated, utilizing the skilled “Movement” improvement instrument as a substitute of the Gemini app.

Along with utilization limits, customers have complained of technical glitches, comparable to movies that lack sound.

The general impression is that that is very a lot a beta product.

You could surprise in regards to the risks of deepfake movies. Google has posted numerous factors about safety measures for Gemini apps usually, however there isn’t a clear assertion about Veo movies.

General, Veo appears to me an attention-grabbing trick, although Veo would not maintain my curiosity after the preliminary fascination has worn off. As a photographer, I am extra desirous about a single genuine second than I’m in 192 inauthentic moments.

For these not concerned within the movie business, Veo might present a window into how AI can more and more be used to fill in for actors, or lengthen likenesses to create motion with out truly using the actors.

Given stronger algorithms and extra knowledge (scene knowledge, character knowledge, and many others.), I can think about Hollywood may use this know-how to provide shifting photos that serve actual tales. It is an eye-opener about the place video goes in an age of AI.

Get the morning’s prime tales in your inbox every day with our Tech As we speak e-newsletter.

Latest Articles

Why I recommend this 360-degree camera drone to both beginners and...

jan / 2026Observe ZDNET: Add us as a most popular supply. Earlier than this, I had by no...

More Articles Like This