The Blog



Share

Vid2Seq: a pretrained visual language model for describing multi-event videos