Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text
- Paper
- Apr 14, 2023
- #ComputerScience
In-context vision and language models like Flamingo support arbitrarily interleaved sequences of images and text as input. This format not only enables few-shot learning via interle...
Show More