Date: 12 September 2025 (Friday)
Time: 1530 – 1700
Venue: Room 10.13, 10/F, Seminar Room, Run Run Shaw Tower, Centennial Campus, HKU
Registration: Here
Speakers:
Prof David Thorstad, Philosophy, Vanderbilt University
Abstract:
Recent years have seen increasing concern that artificial intelligence may soon pose an existential risk to humanity. One leading ground for concern is that artificial agents may be power-seeking, aiming to acquire power and in the process disempowering humanity. I show how the argument from power-seeking rests on a strong version of a claim known as the instrumental convergence thesis. I explore leading defenses of the instrumental convergence thesis and argue that none establishes the thesis in a strong enough form to ground the argument from power-seeking. I discuss implications for longtermism, the governance of artificial intelligence, and the methodology of studying risks posed by artificial agents.

