"text-video representation learning" Papers

1 paper found