AI as a Partner in Learning about, Doing, and Engaging with Science: Vigilance as the Key to Productive Augmentation
2026-06-15 • Computers and Society
Computers and Society
AI summaryⓘ
The authors explain that when people use AI to learn or do science, success depends on how carefully they check the AI's answers rather than just trusting them. They call this careful checking 'epistemic vigilance' and say it's essential for safely using AI, especially in education. The authors discuss how this vigilance affects learning and highlight that AI's confident writing can trick users into trusting wrong information. They warn that students who are better at this checking may benefit more, potentially increasing learning gaps. Lastly, they suggest gradually reducing help as students become more independent in evaluating AI output.
AI partnershipepistemic vigilancescience educationgenerative AIaugmentationdeep processingconfidence in AIlearning with AIevaluation skillsstudent preparedness
Authors
Marcus Kubsch
Abstract
AI has become a partner in how people learn about, do, and engage with science, and the partnership takes three forms: a scientist works with a co-scientist whose output must be checked; a member of the public looks something up to decide whether a diet works or whether to fit solar panels; and a student takes up an inquiry with AI in a science class. Across all three, one thing decides whether the partnership helps or harms: whether the human evaluates what the AI returns or takes it on trust. I argue that this evaluation -- epistemic vigilance calibrated to how far a fallible source can be trusted -- is, given adequate prior knowledge, the binding constraint on productive augmentation. You can hand the AI a great deal precisely because you stay vigilant; vigilance makes generative partnership safe, so it licenses augmentation rather than restricting it. Vigilance is already invoked in science education but under-specified for the AI case; I specify its components, the mechanism tying it to learning, and a way to measure it without soliciting the evaluation it is meant to detect. What is distinctive is that the machine's fluent, confident prose reads as trustworthy whether or not it is, so its surface works against the human evaluating it. The argument bears hardest on education: the integrated conceptual knowledge instruction aims to foster forms only under deep processing, and vigilance sets how deeply a claim is processed, so it is the precondition for learning with AI. The design factors the field reports matter through whether they engage the learner's evaluation; none works around it. Untested is vigilance as a measured disposition, above all where the AI is confidently wrong. Because it is unevenly distributed, integrating AI uniformly is likely to widen the gap between better- and less-prepared students. I close on how it might be built by fading support as the learner takes over.