ArXiv bans authors for a year if they submit papers with unchecked AI-generated content

ArXiv now bans authors for one year if they submit papers with unverified LLM output, such as hallucinated references. Future submissions after the ban must clear peer review first.

Published on: May 17, 2026

ArXiv Tightens Rules on Large Language Model Use in Scientific Papers

ArXiv, the preprint repository used by researchers across computer science and mathematics, is enforcing stricter standards for papers that rely on large language models. Authors whose submissions show evidence that they did not verify LLM output face a one-year ban from the site. After the ban, their future submissions must first be accepted by a peer-reviewed journal.

The policy takes effect immediately. Thomas Dietterich, chair of ArXiv's computer science section, said that "hallucinated references" and visible comments to or from an LLM constitute grounds for enforcement.

What Triggers the Ban

The rule targets carelessness, not LLM use itself. Researchers can use language models in their work, but they must take full responsibility for the output. If a paper contains inappropriate language, plagiarized content, biased material, errors, incorrect references, or misleading information copied directly from an LLM, the authors are held responsible.

Moderators flag potential violations. Section chairs must confirm the evidence before a ban is imposed. Authors can appeal the decision.

Broader Context

ArXiv has already moved to reduce low-quality AI-generated submissions. The site now requires first-time posters to get endorsements from established authors. Last year, the organization separated from Cornell University to become an independent nonprofit, giving it more resources to address problems like AI-generated papers.

Recent research documents a rise in fabricated citations in biomedical literature, likely driven by LLMs that generate plausible-sounding but false references. The problem extends beyond academia: journalists and other professionals have also been caught using AI-fabricated sources.

For researchers working with AI tools, understanding how to verify LLM output and maintain research integrity is now essential.

