"vision-language benchmarks" Papers

4 papers found