MindXO Insight | Article

What +1,200 AI incidents tell us about AI risks

The MIT AI Risk Repository brings together more than 1,200 reported incidents and over 2,000 distinct risks. It reads them through two lenses: how a risk emerges, and what harm it produces. Across both, one pattern holds. AI risk shows up mostly once systems are live, so enterprise governance has to work proactively, tell system risks apart from human ones, and stay active across the full system lifecycle.

By Myriam Ayada · MindXO · February 2026

Get new articles and frameworks by email

One email when we publish something new. Nothing else. To subscribe without JavaScript, write to contact@mind-xo.com.

Key takeaways

AI risk is overwhelmingly operational. About 98% of documented incidents occur after deployment. The risk shows up once systems are live.
One label hides many phenomena. “AI risk” collapses causation and harm into a single word. That is why it feels ungovernable.
A shared structure exists. The MIT AI Risk Repository organizes 1,200+ incidents and 2,000+ risks through two lenses: how a risk emerges, and what harm it produces.
Governance must be proactive and differentiated. Incidents split roughly 50/50 intentional vs. unintentional and 67/33 system vs. human. Monolithic, reactive controls cannot address that.

Four signals from 1,200+ incidents

How, 67 / 33. Incident cause splits between AI system behavior (67%) and human action (33%). AI risk is socio-technical.
Why, 50 / 50. Incidents split almost evenly between intentional misuse and unintentional failure. Governance has to work proactively.
When, ~98%. Of reported incidents occur after deployment. AI risk is overwhelmingly operational.
Where, 75%. Concentrate in four domains: misuse, system safety, discrimination, misinformation.

Source: MIT AI Risk Repository · 1,200+ reported incidents, 2,000+ distinct risks.

Why AI risk feels so confusing

Over the past two years, “AI risk” has become a ubiquitous phrase. It appears in board presentations, regulatory consultations, strategy decks, and media headlines. Depending on who is speaking, it can mean hallucinating chatbots, biased credit decisions, intellectual-property leakage, large-scale misinformation, or even existential threat. These concerns are all legitimate, and they differ in nature, origin, and how they manifest.

When a single label describes such different phenomena, it becomes hard to reason about risk in a disciplined way, let alone govern it. There is a reason for the confusion: years of parallel work across academia, industry, and policy, each approaching AI risk from a different angle. To cut through it, a large research effort led by MIT reviewed existing AI risk frameworks and real-world incidents.

One repository, more than one way to classify risk

The MIT AI Risk Repository brings together over 1,200 reported AI incidents and more than 2,000 distinct risks, drawn from hundreds of academic, industry, and policy sources. Its aim is to give organizations a common frame of reference for AI risk.

Its central finding is straightforward. AI risks need to be read through more than one lens, and confusion sets in when those lenses collapse into a single list. The repository separates two questions that often get merged:

How a risk emerges: the causal lens. Who or what caused it, was it intentional, and when did it arise?
What harm it produces: the impact lens. What gets harmed when the risk materializes?

Lens one: how the risk emerges

The first lens is about causality. It asks who or what caused the risk (a human, an AI system, or external conditions), whether the outcome was intentional or accidental, and whether it arose before deployment or only in operational use. This mirrors how other safety-critical domains reason about causation, where understanding how a failure comes into existence is essential for accountability and prevention.

The repository’s incident data shows that roughly 67% of incidents are attributed primarily to AI system behavior and about 33% to human action, with a small residual of ambiguous cases. That balance reflects how socio-technical AI risk is: system behavior encodes upstream human decisions in design, training, configuration, and deployment, which then interact with real-world conditions.

Intent is almost evenly divided. Roughly half of incidents stem from deliberate misuse, and half arise unintentionally through system behavior, misalignment, or unforeseen interactions. Controls aimed only at misuse leave the other half unaddressed. Timing sharpens the picture: about 97.5% of reported incidents occur after deployment, and fewer than 3% surface before systems go live. Most AI risk emerges once a system is running in the real world.

Lens two: what kind of harm results

The second lens classifies risks by impact domain, the type of harm produced: discrimination, misinformation, safety failures, privacy violations, or broader societal effects. It answers a different question. What gets harmed when the risk materializes? This view is intuitive and dominant in policy and ethics discussion.

Reported incidents are highly concentrated. About three-quarters fall into four domains: malicious use (around 34%), AI system safety failures and limitations (around 23%), discrimination and toxicity (around 19%), and misinformation (around 13%). A single incident can carry more than one domain label, so the individual shares add up to more than the 75% aggregate. The remaining domains, including privacy and security, human-computer interaction, and socioeconomic and environmental harms, account for a much smaller share of reported incidents.

Those four domains mark where AI systems are most exposed today: at scale, in open environments, and in direct contact with users. Whether they are the harms that matter most is a separate question the data leaves open. The split also makes MIT’s core point concrete. Causal explanations and impact domains do different jobs: calling something a “bias risk” says nothing about how it emerged, and labelling a failure “accidental” says nothing about who was harmed.

Why this matters for organizations

Treated as a single block, AI risk produces long, overlapping risk registers, unclear ownership, and controls that are either excessively heavy or dangerously thin. The repository’s empirical signals point to three practical implications.

First, governance has to work proactively. With incidents split almost evenly between intentional misuse and unintentional failure, post-incident response and misuse controls only ever cover half the ground. Organizations need to anticipate both deliberate abuse and ordinary operational drift.

Second, governance has to distinguish between risk sources. The roughly 67/33 split between system behavior and human action describes two different problems with two different owners. System-level risks call for technical controls, testing, monitoring, and lifecycle gates tied to specific AI systems. Organization-wide risks call for policies, training, access controls, and cultural safeguards that shape how people use AI. Treating them as one blurs accountability.

Third, governance has to run across the lifecycle. With more than 97% of incidents happening after deployment, controls that concentrate upstream and ease off once a system is live are aimed at the wrong phase. The weight belongs on continuous monitoring, feedback loops, and periodic reassessment after systems are deployed and scaled.

From confusion to governability

Together, these findings describe what AI governance has to be: an ongoing risk-management discipline that anticipates problems, separates system risks from human ones, and stays active while systems run. Compliance documentation and design reviews have their place inside that discipline, though they cannot stand in for it.

AI risk is more tractable than the noise around it suggests, and it does not call for a new discipline. It follows familiar logic: events, probabilities, and impacts. What has been missing is a shared structure. The MIT AI Risk Repository supplies that structure and gives organizations a foundation to move from anxiety to informed decisions. The work left to each organization is to turn that clarity into governance structures, controls, and decision processes that match how AI risks actually show up.

Sources

MIT AI Risk Repository, a common frame of reference synthesizing 1,200+ incidents and 2,000+ risks.
Slattery et al. (2024), “The AI Risk Repository”, systematic meta-review, database, and taxonomy. arXiv:2408.12622.
NIST AI Risk Management Framework (AI RMF 1.0, NIST AI 100-1).
ISO/IEC 42001:2023, Artificial intelligence management system requirements.

Frequently asked questions

What is the MIT AI Risk Repository?

It is a systematic synthesis of more than 1,200 reported AI incidents and over 2,000 distinct risks, drawn from hundreds of academic, industry, and policy sources. It gives organizations a common frame of reference for AI risk, organized around two taxonomies: how a risk emerges (causal) and what harm it produces (domain).

Why distinguish how a risk emerges from what harm it produces?

Because they answer different questions. Causation explains who or what caused a risk, whether it was intentional, and when it arose; impact explains what gets harmed when it materializes. Calling something a “bias risk” says nothing about how the risk emerged, and labelling a failure “accidental” says nothing about who was harmed. Merging the two into one list is a main source of confusion about AI risk.

What share of AI incidents happen after deployment?

About 97.5% of reported incidents occur after deployment, and fewer than 3% are identified before a system goes live. That makes AI risk overwhelmingly operational, so governance that concentrates controls upstream and eases oversight once systems are live is aimed at the wrong phase.

Are AI incidents caused by the AI or by people?

Both. Roughly 67% of reported incidents are attributed primarily to AI system behavior and about 33% to human action, with a small residual of ambiguous cases. That balance reflects how socio-technical AI risk is: system behavior encodes upstream human decisions in design, training, and deployment, which then interact with real-world conditions.

Which harms are most common in reported incidents?

About three-quarters of documented incidents fall into four domains: malicious use (around 34%), AI system safety failures (around 23%), discrimination and toxicity (around 19%), and misinformation (around 13%). A single incident can carry more than one label. The pattern shows where AI is most exposed today: at scale, in open environments, and in direct contact with users. Whether these are the harms that matter most is a separate question.

What does the evidence mean for enterprise AI governance?

Three things. Governance has to work proactively, because incidents split almost evenly between intentional misuse and unintentional failure. It has to separate system-level risks from organization-wide ones, because they have different owners and different controls. And it has to operate across the lifecycle, because most risk emerges after deployment and needs continuous monitoring.

About MindXO

MindXO is an independent AI governance and risk management practice. We research emerging AI risks and help organizations design governance frameworks, manage risk, and scale AI responsibly.

Explore more on the Insight Hub

Visit Insight Hub · Get in touch