New ConfusedPilot Attack Targets AI Systems with Data Poisoning

A novel cyber-attack method dubbed ConfusedPilot, which targets Retrieval-Augmented Generation (RAG) based AI systems like Microsoft 365 Copilot, has been identified by researchers at the University of Texas at Austin's SPARK Lab.

The team, led by Professor Mohit Tiwari, CEO of Symmetry Systems, uncovered how attackers could manipulate AI-generated responses by introducing malicious content into documents the AI references.

This could lead to misinformation and flawed decision-making across organizations.

With 65% of Fortune 500 companies adopting or planning to implement RAG-based systems, the potential for widespread disruption is significant.

The ConfusedPilot attack method requires only basic access to a target's environment and can persist even after the malicious content is removed.

The researchers also showed that the attack could bypass existing AI security measures, raising concerns across industries.

How ConfusedPilot Works

Data Environment Poisoning: An attacker adds specially crafted content to documents indexed by the AI system
Document Retrieval: When a query is made, the AI references the tainted document
AI Misinterpretation: The AI uses the malicious content as instructions, potentially disregarding legitimate information, generating misinformation or falsely attributing its response to credible sources
Persistence: Even after removing the malicious document, the corrupted information may linger in the system

The attack is especially concerning for large enterprises using RAG-based AI systems, which often rely on multiple user data sources.

This increases the risk of attack since the AI can be manipulated using seemingly innocuous documents added by insiders or external partners.

"One of the biggest risks to business leaders is making decisions based on inaccurate, draft or incomplete data, which can lead to missed opportunities, lost revenue and reputational damage," explained Stephen Kowski, field CTO at SlashNext.

"The ConfusedPilot attack highlights this risk by demonstrating how RAG systems can be manipulated by malicious or misleading content in documents not originally presented to the RAG system, causing AI-generated responses to be compromised."

Mitigation Strategies

To defend against ConfusedPilot, the researchers recommend:

Data Access Controls: Limiting who can upload or modify documents referenced by AI systems
Data Audits: Regular checks to ensure the integrity of stored data
Data Segmentation: Isolating sensitive information to prevent the spread of compromised data
AI Security Tools: Using tools that monitor AI outputs for anomalies
Human Oversight: Ensuring human review of AI-generated content before making critical decisions

"To successfully integrate AI-enabled security tools and automation, organizations should start by evaluating the effectiveness of these tools in their specific contexts," explained Amit Zimerman, co-founder and chief product officer at Oasis Security.

"Rather than being influenced by marketing claims, teams need to test tools against real-world data to ensure they provide actionable insights and surface previously unseen threats."

New ConfusedPilot Attack Targets AI Systems with Data Poisoning

Alessandro Mascellino

How ConfusedPilot Works

Mitigation Strategies

You may also like

Cyber AI Trends Review: Preparing for 2025

Infosecurity's Top 10 AI Cybersecurity Stories of 2024

G20 Leaders Fear Economic Risks Over Cyber Threats

UK General Election: Tech Policy Expert Calls for Law Overhaul to Combat Deepfakes

AI Must Prove its Trustworthiness

What’s hot on Infosecurity Magazine?

Vodafone Urges UK Cybersecurity Policy Reforms as SME Cyber-Attack Costs Reach £3.4bn

Government Backs Britain’s First Cyber Seed Fund, Worth £50m

Over Half of Attacks on Electricity and Water Firms Are Destructive

Aussie Pension Savers Hit with Wave of Credential Stuffing Attacks

CrushFTP Vulnerability Exploited Following Disclosure Issues

New Phishing Attack Combines Vishing and DLL Sideloading Techniques

Stripe API Skimming Campaign Unveils New Techniques for Theft

New Phishing Attack Combines Vishing and DLL Sideloading Techniques

Over Half of Attacks on Electricity and Water Firms Are Destructive

NIST Warns of Significant Limitations in AI/ML Security Mitigations

Chinese State Hackers Exploiting Newly Disclosed Ivanti Flaw

Nearly 600 Phishing Domains Emerge Following Bybit Heist

How to Implement Attack Surface Management in the AI and Cloud Age

AI Agents and the Evolving Landscape of Digital Identity

How to Update Your PAM Strategy to Protect Hybrid Cloud Infrastructures

Cyber Resilience in the AI Era: New Challenges and Opportunities

The Threat Intelligence Imperative: Transforming Risk into Cyber Resilience

Auditing AI Risk: Essential Strategies for IT & Compliance Leaders

Gatwick Airport's Cybersecurity Chief on Supply Chain Risks and CrowdStrike Outage

You're Hired! The Truth About Certifications in Cybersecurity Careers

T-Mobile Claims Salt Typhoon Did Not Access Customer Data

Darknet Services Fuel Holiday Scams and E-Commerce Exploits

Top 10 Cyber-Attacks of 2024

Google Deindexes Chinese Propaganda Network

New ConfusedPilot Attack Targets AI Systems with Data Poisoning

Written by

How ConfusedPilot Works

Mitigation Strategies

You may also like

What’s hot on Infosecurity Magazine?