Choose the caption that most accurately describes major objects and actions from the video. Common words are in green, unique words are in orange.