VideoPoet
Appearance
"A dog eating popcorn at the cinema" "A teddy bear with a cap, sunglasses, and leather jacket playing drums" Example videos generated by the model from texts | |
Developer(s) | |
---|---|
Initial release | February 8, 2024 |
Type | Large language model |
VideoPoet is a large language model developed by Google Research in 2023 for video making.[1][2][3][4] It can be asked to animate still images.[5] The model accepts text, images, and videos as inputs, with a program to add feature for any input to any format generated content.[4] VideoPoet was publicly announced on December 19, 2023.[1] It uses an autoregressive language model.
References
- ^ a b Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". Analytics India Magazine. Retrieved April 29, 2024.
- arXiv:2312.14125 [cs.CV].
- ^ "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. December 21, 2023.
- ^ a b "VideoPoet". Google Research. Retrieved April 29, 2024.
- ^ Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". VentureBeat.
External links
Media related to VideoPoet at Wikimedia Commons