Google Enhances Veo 3.1 AI Video Model with Improved Reference Image Processing and Vertical Video Support
By
Jess Weatherbed
Plain bagel done well. Pleasantly substantive.
Summary
Google is enhancing its Veo 3.1 AI video model with improved visual capabilities for the 'Ingredients to Video' tool, which allows users to generate videos based on up to three reference images. The update focuses on better attention to reference images, expanded native vertical video support, and resolution upscaling features, giving users more control over character subjects, backgrounds, and textures in generated videos.
Key quotes
· 3 pulledGoogle is making its Veo 3.1 AI video model pay closer attention to the reference images you want generated clips to be based on.
The company is releasing new visual improvements for the 'Ingredients to Video' tool that was introduced last year, alongside expanding native vertical video support and resolution upscaling features.
The Ingredients to Video tool allows Veo users to generate videos based on up to three reference images, pulling in materials like character subjects, backgrounds, and textures to have more control over how the results will look.
You might also wanna read
Google Launches Veo 3.1 AI Video Generation Model with Enhanced Creative Controls
Google has launched Veo 3.1, an updated AI video generation model that enables filmmakers, storytellers, and developers to create stunningly
Google is using YouTube videos to train its AI video generator
Google Launches Gemini 2.5 Flash Image: Advanced AI Image Generation and Editing Model
Google has launched Gemini 2.5 Flash Image, a new state-of-the-art image generation and editing model that enables blending multiple images,
Velvet: Integrated Platform for AI Video Generation and Editing
Velvet is an integrated platform that enables users to create AI-generated videos for companies in just one minute using the world's best vi
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
Ovi is a multimodal AI model developed by Character AI that simultaneously generates both video and audio content from text or text+image in
