The Blog



Share

Efficient Video-Text Learning with Iterative Co-tokenization