Learn more context on large-scale risks from advanced AI

These opportunities focus on AI safety work aimed at preventing loss of human control of very capable AI systems. To maximize your eligibility for these opportunities, we recommend gaining context on the perspectives of this subfield, e.g. by skimming pertinent AI safety papers.

AI Safety Papers

Opportunities

Last updated: 05/27/25

Job Opportunities

Anthropic (roles)

Frontier Red Team
Responsible Scaling Policy Team
Alignment Stress Testing Team
Interpretability Team
Dangerous Capability Evaluations Team
Assurance Team
Security Team

Google DeepMind (roles)

AI Safety and Alignment Team (Bay Area)
Scalable Alignment Team
Frontier Model and Governance Team
Mechanistic Interpretability Team
Responsibility & Safety Team

FAR.AI (roles)
Redwood Research (roles)
UK AI Security Institute (AISI) (roles)
Model Evaluations and Threat Research (METR) (roles)
Apollo Research (roles)
RAND (roles)

Note that RAND's Technology and Security Policy Fellowship is not just for policy research; ML engineers, software engineers with either infrastructure or front-end experience, and technical program managers are also encouraged to apply via this Fellowship.

Postdoctoral Positions and PhDs

You can use the filtered view of our database to find professors with open positions of any seniority, or the unfiltered view to find potential collaborators.
We'd also like to highlight the Centre for Human-Compatible AI's Research Fellowship and Research Collaborator positions.

Funding Opportunities

Open Philanthropy

Request for proposals: AI governance (In the "technical governance" section, examples include: compute governance, model evaluations, technical safety and security standards for AI developers, cybersecurity for model weights, and privacy-preserving transparency mechanisms. See also the Governance and Policy section below)
Career development and transition funding
Course development grants
Funding for work that builds capacity to address risks from transformative AI

Centre for Security and Emerging Technology: Risks from internal deployment of frontier AI models (deadline: 06/30/25)
NSF Secure and Trustworthy Cyberspace Grants
Foresight Institute: Grants for Security, Cryptography & Multipolar Approaches to AI Safety (quarterly applications)
Long-Term Future Fund
Currently Closed Funding Opportunities

UK AI Security Institute (AISI): Challenge Fund
Schmidt Sciences: Safety in the Inference-Time Compute Paradigm (deadline: 04/30)
Anthropic Model Evaluation Initiative (accepting EOIs for their next round)
ARIA's Safeguarded AI Program (accepting EOIs for their next round)

Safeguarded AI aims to provide quantitative safety guarantees for AI. Their current funding round is for demonstrations that AI systems with such guarantees are useful and profitable in safety-critical contexts (e.g. optimising energy networks, clinical trials, or telecommunications).

AI Safety Fund: RFP for Bio- and Cyber- Security and AIs
Cooperative AI Foundation Concordia Contest 2024
Cooperative AI Foundation Research Grants
Future of Life Institute: PhD Fellowships
Future of Life Institute Postdoctoral Fellowships (deadline: 01/06/25)
Future of Life Institute: How to Mitigate AI-driven Power Concentration
Open Philanthropy

Request for proposals: benchmarking LLM agents on consequential real-world tasks
Request for proposals: technical AI safety research (deadline: 4/15)
Request for proposals: improving capability evaluations (deadline: 4/1)

Schmidt Sciences: Safety Assurance through Fundamental Science in Emerging AI
Survival and Flourishing Fund (SFF): Grant Round with additional Freedom and Fairness tracks

Note: SFF gives grants to universities. Alternatively, SFF requires that you have a 501c3 charity (i.e. your nonprofit has 501c3 status or you have a fiscal sponsor that has 501c3 status).

SafeBench Competition (deadline: 2/25/2025; $250k in prizes)
Call for Research Ideas: Expanding the Toolkit for Frontier Model Releases from CSET
OpenAI: Research into Agentic AI Systems, Superalignment Fast Grants, OpenAI Cybersecurity Grants (assumed closed)
NSF: Safe Learning-Enabled Systems and Responsible Design, Development, and Deployment of Technologies
Center for Security and Emerging Technology (CSET): Foundational Research Grants

Compute Opportunities

National Deep Inference Fabric (NDIF), can request early access to a research computing project for interpretability research
Cohere for AI, subsidized access to APIs

AI Safety Programs / Fellowships / Residencies / Collaborations

Constellation (extended visits and residencies at an AI safety organization in Berkeley)

Visiting Fellows: a 3-6 month (unpaid) visit at the Constellation office (Berkeley, CA) for researchers, engineers, entrepreneurs, and other professionals working on their focus areas. Applications open for the winter cohort (beginning January 6th).
Residencies: a year-long salaried position ($100K-$300K) for experienced researchers, engineers, entrepreneurs, and other professionals to pursue self-directed work on one of Constellation's focus areas in the Constellation office (Berkeley, CA).
Workshops: Constellation also expect to offer 1-2 day intensive workshops for experts working in or transitioning into their focus areas.

Workshops and Community

Expression of Interest form for FAR.AI's Alignment Workshop. Recordings from the previous workshop are also available on the website.
Expression of Interest form for Constellation Workshops. Constellation expects to offer 1–2 day intensive workshops for people working in or transitioning into their focus areas.
Expression of Interest form for events by the AI Security Forum
Past Workshops

NeurIPS 2024

ICLR 2024

ML Safety Social hosted by the Center for AI Safety
Secure and Trustworthy Large Language Models
How Far Are We From AGI?
Reliable and Responsible Foundation Models
ME-FoMo: Mathematical and Empirical Understanding of Foundation Models

Alignment Workshops (recordings available)

Researchers working in AI safety

Arkose's Database of AI Safety Professionals
AI Existential Safety Community from Future of Life Institute
See speakers from the Alignment Workshop series (SF 2023, NOLA 2023)

AISafety.com's List of AI Safety Communities
Interested in working in China?

Contact Concordia AI 安远AI
International Dialogues on AI Safety
AI Alignment: A Comprehensive Survey
Newsletters: AI Safety in China, ChinAI Newsletter

Open Source Projects

Job Board

Filtered from the 80,000 Hours Job Board

Alternative Technical Opportunities

Theoretical Research

Alignment Research Center (roles)

Information Security

Information Security roles
Overview from a security engineer at Google
Jason Clinton's recommended upskilling book

Forecasting (see especially Epoch)
Software Engineering
AI Governance and Policy

AI governance is focused on developing global norms, policies, and institutions to increase the chances that advanced AI is beneficial for humanity.

The Horizon Fellowship (Deadline 08/30/2024)

The Horizon Fellowship places experts in emerging technologies in federal agencies, congressional offices, and thinktanks in Washington DC for up to two years.

Future of Life Institute: AI Compute Security & Governance Technical Program Manager

The Future of Life Institute is looking to hire someone with experience in both hardware engineering and project management to lead a new initiative in technical AI governance.

Summer Webinar Series on Careers in Emerging Technology Policy (mid-July - end of August)

The series is designed to help individuals interested in federal AI and biosecurity policy decide if they should pursue careers in these fields. Each session features experienced policy practitioners who will discuss what it’s like to work in emerging technology policy and provide actionable advice on how to get involved. Some of the sessions will be useful for individuals from all fields and career stages, while others are focused on particular backgrounds and opportunities. You may choose to attend all or only some of the sessions.

AI Governance Curriculum by BlueDot Impact
AI Policy Resources by Emerging Technology Policy Careers
Several organizations working in the space:

Center for Long-Term Resilience (CLTR)
RAND's Technology and Security Policy work
Horizon Institute for Public Service
Institute for AI Policy and Strategy
Center for Security and Emerging Technology (CSET)
Frontier AI Task Force
Center for the Governance of AI (GovAI)
Industry AI Governance teams
Center for AI Policy