Upcoming Event

A Matter of Principle? AI alignment as the Fair Treatment of Claims (co-authored with Geoffrey Keeling)

Abstract: The normative challenge of AI alignment centres upon what goals or values to encode in AI systems to govern their behaviour. A number of answers have been proposed, including the notion that AI must be aligned with human intentions or that it should aim to be helpful, honest and harmless. Nonetheless, both accounts suffer …

A Matter of Principle? AI alignment as the Fair Treatment of Claims (co-authored with Geoffrey Keeling) Read More »

Evaluating AI Agents for Dangerous (Cognitive) Capabilities

Abstract: AI agents based on Large Language Models (LLMs) demonstrate human-level performance at some theory of mind (ToM) tasks (Kosinski 2024; Street et al. 2024). Here ToM is roughly the ability to predict and explain behaviour by attributing mental states to oneself and others. ToM capabilities matter for AI safety because, at least in humans, …

Evaluating AI Agents for Dangerous (Cognitive) Capabilities Read More »

Philosophical Commitments to LLM Evaluations: The Problem of Moving Goalposts and Observational Relativity

Date: February 7, 2025 (Friday) Time: 13:00 – 15:00 Venue: Rm 10.13, Run Run Shaw Tower, Centennial Campus, The University of Hong Kong Registration: here Speaker: Ms Ninell Oldenburg, University of Copenhagen Chair: Dr Frank Hong, The University of Hong Kong Abstract: While most technical and philosophical research on LLMs tries to find a translation from human functions to computational …

Philosophical Commitments to LLM Evaluations: The Problem of Moving Goalposts and Observational Relativity Read More »

Future science and artificial consciousness

Date: February 21, 2025 (Friday) Time: 13:00 – 15:00 Venue: 3/F, MPZ Room 2, HKU Main Library Registration: here Speaker: Dr Leonard Dung, Ruhr-University Bochum Chair: Dr Frank Hong, The University of Hong Kong Abstract: Does consciousness require biology or can systems made out of other materials be conscious? I develop an argument for the view that it is (nomologically) …

Future science and artificial consciousness Read More »

The Ethics of Amplification

Date: February 28, 2025 (Friday) Time: 13:00 – 15:00 Venue: 3/F, MPZ Room 2, HKU Main Library Registration: here Speaker: Dr Jeffrey Howard, University College London Chair: Dr Frank Hong, The University of Hong Kong Abstract: Social media platforms’ AI systems learn from user data to predict what content will keep users engaged. This content is subsequently amplified, increasing …

The Ethics of Amplification Read More »

The Philosophy of AI: Themes from Iason Gabriel

Date: February 14, 2025 (Friday) Time: 13:00 – 18:00 Venue: 11/F, Cheng Yu Tung Tower, The University of Hong Kong   Registration: here Organizers: AIH Lab, Programme on Artificial Intelligence and the Law and Hong Kong Ethics Lab A Matter of Principle? AI alignment as the Fair Treatment of Claims (co-authored with Geoffrey Keeling) Iason Gabriel, Staff Research Scientist, Google Deepmind Evaluating AI Agents …

The Philosophy of AI: Themes from Iason Gabriel Read More »

Alignment

Abstract: The speaker will distinguish some different conceptions of alignment, exploring how each conception relates to safety and existential risk.

Scroll to Top