Moocable is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Forecasting and Aligning AI - Jacob Steinhardt

Description

Modern ML systems sometimes undergo qualitative shifts in behavior simply by “scaling up” the number of parameters and training examples. Given this, how can we extrapolate the behavior of future ML systems and ensure that they behave safely and are aligned with humans? I’ll argue that we can often study (potential) capabilities of future ML systems through well-controlled experiments run on current systems, and use this as a laboratory for designing alignment techniques. I’ll also discuss some recent work on “medium-term” AI forecasting.

Online Courses

YouTube

Free

55 minutes

Forecasting and Aligning AI - Jacob Steinhardt

Affiliate notice

  • Type
    Online Courses
  • Provider
    YouTube
  • Pricing
    Free
  • Duration
    55 minutes

Modern ML systems sometimes undergo qualitative shifts in behavior simply by “scaling up” the number of parameters and training examples. Given this, how can we extrapolate the behavior of future ML systems and ensure that they behave safely and are aligned with humans? I’ll argue that we can often study (potential) capabilities of future ML systems through well-controlled experiments run on current systems, and use this as a laboratory for designing alignment techniques. I’ll also discuss some recent work on “medium-term” AI forecasting.