Date: September 12, 2025 (Friday)
Time: 15:30 – 17:00
Venue: Room 4.36, 4/F, Run Run Shaw Tower, Centennial Campus, HKU
Speaker:
Professor David Thorstad
Associate Professor of Philosophy
Vanderbilt University
Abstract:
Recent years have seen increasing concern that artificial intelligence may soon pose an existential risk to humanity. One leading ground for concern is that artificial agents may be power-seeking, aiming to acquire power and in the process disempowering humanity. I show how the argument from power-seeking rests on a strong version of a claim known as the instrumental convergence thesis. I explore leading defenses of the instrumental convergence thesis and argue that none establishes the thesis in a strong enough form to ground the argument from power-seeking. I discuss implications for longtermism, the governance of artificial intelligence, and the methodology of studying risks posed by artificial agents.

