Google DeepMind has shared details on how its visual language model, Flamingo, is now being used to generate descriptions for YouTube Shorts.
Flamingo is trained on large-scale web data that pairs images and video with text, which lets it learn the relationship between visual content and language.
This allows it to generate text descriptions that are stored in Shorts’ metadata, making the videos more searchable.