Victor Veitch
12 papers, 677 total citations

Papers (12)
The Linear Representation Hypothesis and the Geometry of Large Language Models (ICML 2024; arXiv; 363 citations)
Sense and Sensitivity Analysis: Simple Post-Hoc Analysis of Bias Due to Unobserved Confounding (NeurIPS 2020; arXiv; 61 citations)
Concept Algebra for (Score-Based) Text-Controlled Generative Models (NeurIPS 2023; arXiv; 60 citations)
On the Origins of Linear Representations in Large Language Models (ICML 2024; arXiv; 58 citations)
Invariant and Transportable Representations for Anti-Causal Domain Shifts (NeurIPS 2022; arXiv; 45 citations)
Transforming and Combining Rewards for Aligning Large Language Models (ICML 2024; arXiv; 26 citations)
Causal Context Connects Counterfactual Fairness to Robust Prediction and Group Fairness (NeurIPS 2023; arXiv; 21 citations)
Uncovering Meanings of Embeddings via Partial Orthogonality (NeurIPS 2023; arXiv; 18 citations)
Using Embeddings for Causal Estimation of Peer Influence in Social Networks (NeurIPS 2022; arXiv; 15 citations)
Does Editing Provide Evidence for Localization? (ICLR 2025; arXiv; 9 citations)
RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals (ICML 2025; arXiv; 1 citation)
Counterfactual Invariance to Spurious Correlations in Text Classification (NeurIPS 2021; 0 citations)