Misinformation Propagation in Benign Multi-Agent Systems

2026-06-15Multiagent Systems

Multiagent SystemsComputation and Language
AI summary

The authors studied how misinformation affects systems where many AI language agents work together to solve problems. They found that false information can hurt performance in both single agents and groups, but group discussions help reduce mistakes if most agents have correct information. How well groups handle misinformation depends on how they share and decide on answers. The study shows that group makeup and decision rules play important roles in resisting misleading information.

multi-agent systemslarge language modelsmisinformationreasoning tasksknowledge tasksalignmentconsensusvotingpeer interactionsdecision aggregation
Authors
Jonas Becker, Jan Philip Wahle, Terry Ruas, Bela Gipp
Abstract
Multi-agent systems, in which multiple large language model agents solve problems through turn-based interaction, are increasingly deployed in high-stakes settings such as medical diagnosis, legal analysis, and forensic decision-making. Their reliability can be at risk when single agents reason from incorrect or misleading context, e.g., from tool calls, since errors may propagate through agent interactions. This work studies this risk by injecting intent-based misinformation into benign single-agent and multi-agent systems across reasoning, knowledge, and alignment tasks. We find that misinformation can degrade single-agent performance and persists across multi-agent debate, with agents often retaining answers introduced by misinformed peers. Nevertheless, multi-agent debate reduces the resulting performance degradation compared to single-agent prompting, especially when most agents are not exposed to misinformation. Robustness depends on group composition and decision protocol. Consensus can be more stable than voting under peer pressure, while majorities can often steer misinformed agents back toward correct answers. Our results show that misinformation robustness in multi-agent systems depends on the underlying model and also on how agents exchange information and aggregate decisions.