ArXiv bans researchers for AI-generated errors in papers
The preprint platform arXiv announced researchers will face a year-long ban if their names appear on papers containing obvious errors from artificial intelligence. The ban applies to all authors listed, regardless of who included the problematic AI-generated content.
The move targets a surge in low-quality submissions flooded with AI text. A study released last month found that roughly half of new articles published online are now primarily AI-generated. In academic publishing, the problem has become acute enough that arXiv stopped accepting certain submission types last year.
The quality problem
AI-generated research writing frequently includes hallucinated citations-references to papers that don't exist. These errors slip through because peer reviewers are already stretched thin. The peer review system was struggling before AI arrived; now pressured researchers have even less time for unpaid review work.
A January preprint study estimated around 1 in 8 biomedical papers now contain AI-generated text. ArXiv's policy doesn't prohibit AI use outright. Instead, it targets cases where "authors did not check the results of LLM generation," saying such negligence means "we can't trust anything in the paper."
The punishment problem
A year-long ban for all authors on a paper may be disproportionate, particularly in modern research. Papers today typically list four or five authors-sometimes hundreds. Each researcher works on their specialty and relies on colleagues to check their own contributions.
Banning dozens or hundreds of scientists because one author included a hallucinated reference seems harsh compared to other publishing failures. There are no equivalent sanctions for papers promoting fringe theories, using poor evidence, or making logical errors.
Using AI to catch AI problems
Sanctions for reckless AI use make sense. But the solution may involve AI itself, not just punishment.
AI systems can quickly verify reference lists against published papers available online. Flagged citations can then be checked by humans. Modern AI can also run sense-checks on statistical analysis and other technical claims in minutes.
This approach would improve quality assurance without banning researchers for relatively minor infractions. It treats the underlying problem-poor verification-rather than the tool that exposed it.
Learn more about AI Research Courses and Generative AI and LLM Courses to understand how these tools are reshaping research practices.
Your membership also unlocks: