Amazon unveils surprise new video and image AI models to compete with the
best on the market
Date:
Tue, 03 Dec 2024 19:26:58 +0000
Description:
Amazon Nova will offer new video and image generation AI models.
FULL STORY ======================================================================Amazon unveils new image and video creation AI tools Amazon Nova Canvas and Nova
Reel look to help ecommerce sellers Both new Nova models set to launch in
2025
Amazon has announced new image and video generation models as it steps up its fight to become an AI heavyweight.
The company unveiled Amazon Nova Canvas and Nova Reel at its AWS re:Invent 2024 event in Las Vegas, with CEO Andy Jassy revealing the launch as part of
a new Nova series of AI models.
Both new models will be available in mid 2025, with the launches set to take Amazon into direct competition with the likes of OpenAI and Grok when it
comes to image and video creation. Amazon Nova Canvas and Reel
The new models look to initially target sellers and other users on Amazon's ecommerce platform, allowing them to quickly and cheaply create media content to enrich their pages.
Amazon didn't reveal too much in the way of specifics when it came to the new offerings, but did reveal Nova Canvas will allow users to create and edit images using natural language text inputs, and Nova Reel can provide "studio-quality" video, with features such as camera motion control, 360-degree rotation, and zoom.
In a blog post announcing the news, the company noted that customers on its Amazon Ads platform using the new models advertised five times more products and twice as many images per advertised product, widening their reach to buyers across the globe.
Looking forward, Jassy also revealed Amazon will be launching a Speech-to-Speech generation model in early 2025, followed by an "Any-to-Any" model in mid-2025.
The former will be able to analyse and understand streaming speech input in natural language, with the ability to interpret verbal and nonverbal cues
such as tone and cadence, to reply in a natural, human-esque way.
The latter, which Jassy described as a true multimodal to multimodal model, will be able to take in text, images, audio, and video, before outputting in whichever mode is required. You may also like AI reckons it can do all jobs, even those thought previously 'safe' Weve listed the best AI writers around today Check out our roundup of the best productivity tools
======================================================================
Link to news story:
https://www.techradar.com/pro/amazon-unveils-surprise-new-video-and-image-ai-m odels-to-compete-with-the-best-on-the-market
--- Mystic BBS v1.12 A47 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)