Earlier this week, the team at DeepMind, Google’s AI research lab, unveiled a new framework dubbed Transframer, which allows AI to generate 30-second videos from a single image input. It’s a nifty little trick at first glance, but the implications are much larger than an interesting GIF.
“Transframer is state-of-the-art on a variety of video generation benchmarks, and… can generate coherent 30 second videos from a single image without any explicit geometric information,” the DeepMind research team explains. Basically, all Transframer needs is a single photo, which it analyzes to identify the picture’s framing, i.e., clues like a table, a hallway, or a street. After predicting a subject’s surroundings using these “context images,” it then envisions (and subsequently shows) what that target would look like from various angles. DeepMind’s team illustrates the procedure with targets like a chair, a laptop, a glass of water, and even a GRE textbook.
“Given a collection of context images with associated annotations (time-stamps, camera viewpoints, etc.), and a query annotation, the task is to predict a probability distribution over the target image,” continues the team. “This framework supports a range of visual prediction tasks, including video modelling, novel view synthesis, and multi-task vision.”
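To make that abstract description a little more concrete, here is a rough, runnable Python sketch of the interface the quote describes: condition on (image, annotation) pairs plus a query annotation, get back a distribution over the unseen target image, and roll the prediction forward to stretch one photo into a short clip. DeepMind has not released a public Transframer API, so every name below (Annotation, ContextFrame, ToyFramePredictor, and the crude Gaussian stand-in for the predicted distribution) is an illustrative assumption, not the real model.

```python
import numpy as np
from dataclasses import dataclass
from typing import List, Tuple

# Illustrative sketch only: these types mirror the *interface* the paper
# quote describes, not DeepMind's actual (unreleased) implementation.

@dataclass
class Annotation:
    timestamp: float          # e.g. seconds into the clip
    viewpoint: np.ndarray     # e.g. a camera-pose vector

@dataclass
class ContextFrame:
    image: np.ndarray         # H x W x 3 array of pixel values in [0, 1]
    annotation: Annotation

class ToyFramePredictor:
    """Stand-in for the learned model: returns parameters (mean, scale)
    of a crude distribution over the target image so the loop runs."""
    def predict(self, context: List[ContextFrame],
                query: Annotation) -> Tuple[np.ndarray, float]:
        mean = context[-1].image  # naive: expect the latest frame, roughly
        # uncertainty grows the further the query is from known context
        scale = 0.05 * abs(query.timestamp - context[-1].annotation.timestamp)
        return mean, scale

def generate_video(model, photo: np.ndarray, n_frames: int = 30):
    """Query the model at successive timestamps, sample a frame from the
    predicted distribution, and feed it back in as new context."""
    rng = np.random.default_rng(0)
    context = [ContextFrame(photo, Annotation(0.0, np.zeros(3)))]
    frames = [photo]
    for t in range(1, n_frames + 1):
        query = Annotation(float(t), np.zeros(3))  # assumed query format
        mean, scale = model.predict(context, query)
        frame = np.clip(mean + rng.normal(0.0, scale, mean.shape), 0.0, 1.0)
        frames.append(frame)
        context.append(ContextFrame(frame, query))
    return frames

video = generate_video(ToyFramePredictor(),
                       np.random.default_rng(1).random((64, 64, 3)))
print(len(video), video[0].shape)  # 31 frames of 64x64 RGB
```

The toy predictor just copies the latest frame and adds noise; the real system replaces it with a learned network. But the loop itself, sample a frame, append it to the context, and query the next annotation, is the kind of rollout that would be needed to extend a single image into 30 seconds of coherent video.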
As noted by Futurism, Transframer could one day offer an entirely new avenue for the video game industry by using machine learning to build digital environments rather than relying on more time-consuming rendering methods. As the technology progresses, Transframer-style training could also open new possibilities for art, scientific analysis, and further AI development. Additionally, one Twitter user envisioned piggybacking OpenAI’s DALL-E pictures on top of the Transframer program to create stacked AI creations, as if those images couldn’t get any more surreal.