Instrumental Incentives | Arkose

AI RISK INTERVIEW PERSPECTIVES

Generally Capable AI

Within 50 years

More than 50 years

Why these systems might come soon

Biology is special

No true creativity

Understand the brain first

We wouldn’t want that

Can’t see it based on current progress

Need AI paradigm shift

People would stop this

We need embodiment

AI cannot be conscious

The Alignment Problem

Test before deploying

Be careful with reward function

Alignment is easy

Alignment will progress automatically

Need to know what type of AGI

Misuse is a bigger problem

Other global risks are more dangerous

Humans have alignment problems too

Instrumental Incentives

Consciousness required

Stop it physically

Current systems don’t do that

Wouldn’t design it that way

Human oversight

Pursuing Safety Work

Policymakers will resolve this

Work not useful currently

Work not urgent currently