Study finds nearly 147,000 AI-hallucinated citations in scientific papers published in 2025

An audit of 2.5 million scientific papers found roughly 146,900 fake AI-generated citations in 2025 across arXiv, bioRxiv, SSRN, and PubMed Central. Most passed through peer review and moderation undetected.

Categorized in: AI News Science and Research
Published on: May 19, 2026
Study finds nearly 147,000 AI-hallucinated citations in scientific papers published in 2025

146,900 Fake Citations Found in Scientific Papers as AI Use Spreads

Researchers auditing millions of scientific papers found an estimated 146,900 hallucinated citations across four major repositories in 2025 alone. The fake references appeared in papers on arXiv, bioRxiv, SSRN, and PubMed Central-spreading across many papers rather than concentrating in a few outliers.

Large language models generate text that sounds plausible but can be entirely fabricated. These systems predict the next word based on patterns in training data, not on actual facts. When researchers use AI to write papers without fact-checking the output, those hallucinations slip into the scientific record.

The Scale of the Problem

The audit examined 111 million references from 2.5 million papers. Researchers used automated checks and manual verification to identify citations that didn't match any real publication. Over 95% of references were successfully matched to actual sources.

The fake citations surged starting in mid-2024, after ChatGPT and similar tools became widely available. To isolate AI's role, researchers compared unmatched citation rates before 2023-when these tools didn't exist-against current rates. The difference was stark.

Early-career scientists and small research teams were most likely to include fake citations. Some of these researchers saw their output increase roughly threefold since AI tools became available.

Gaps in Quality Control

Preprint servers, journal editors, and peer review caught only a fraction of the errors. On arXiv alone, 78.8% of non-existent citations passed through moderation and appeared on the platform.

The hallucinated references showed a pattern: they disproportionately credited already prominent and male scholars. This suggests AI errors may reinforce existing inequalities in how scientific credit gets distributed.

Implications for Research

Scientific progress depends on building on verified prior work. Fake citations undermine that foundation. If researchers cite papers that don't exist, others may waste time trying to find them or, worse, build arguments on false premises.

The problem extends beyond academia. Hallucinations appear in government reports, legal filings, and news articles. Without intervention, the contamination of scientific literature could spread to policy decisions and public understanding.

Scientists need to verify AI-generated content before publication. Journals and preprint servers need stronger detection systems. The responsibility falls on researchers to check their tools' work-something many are not doing now.

To understand how AI systems generate these errors and best practices for using them responsibly in research, consider exploring Generative AI and LLM Courses or AI Research Courses.


Get Daily AI News

Your membership also unlocks:

700+ AI Courses
700+ Certifications
Personalized AI Learning Plan
6500+ AI Tools (no Ads)
Daily AI News by job industry (no Ads)