Measuring Capabilities and Safety
Speaker: Dr Dan Hendrycks, Centre for AI Safety Abstract Dr Hendrycks will discuss principles for measuring capabilities of AI systems, and walk through popular general capabilities benchmarks. He will then discuss how general capabilities can be separated from the measurement of their safety, then overview new ways to measure safety.
Measuring Capabilities and Safety Read More »