Speaker: Dr Dan Hendrycks, Center for AI Safety
Abstract
Dr Hendrycks will discuss principles for measuring the capabilities of AI systems and walk through popular general-capabilities benchmarks.
He will then discuss how measuring general capabilities can be separated from measuring safety, and give an overview of new ways to measure safety.