Instrumental Convergence and Power-Seeking

Date: September 12, 2025 (Friday)

Time: 15:30 – 17:00

Venue: Room 4.36, 4/F, Run Run Shaw Tower, Centennial Campus, HKU

Speaker:

Professor David Thorstad

Associate Professor of Philosophy

Vanderbilt University

Abstract:

Recent years have seen increasing concern that artificial intelligence may soon pose an existential risk to humanity. One leading ground for concern is that artificial agents may be power-seeking, aiming to acquire power and in the process disempowering humanity. I show how the argument from power-seeking rests on a strong version of a claim known as the instrumental convergence thesis. I explore leading defenses of the instrumental convergence thesis and argue that none establishes the thesis in a strong enough form to ground the argument from power-seeking. I discuss implications for longtermism, the governance of artificial intelligence, and the methodology of studying risks posed by artificial agents.