"cross-attention layer" Papers

3 papers found