Measuring Capabilities and Safety

January 29, 2024 / Upcoming Event

Speaker: Dr Dan Hendrycks, Centre for AI Safety

Abstract

Dr Hendrycks will discuss principles for measuring capabilities of AI systems, and walk through popular general capabilities benchmarks.

He will then discuss how general capabilities can be separated from the measurement of their safety, then overview new ways to measure safety.