Lancet: biomedical research paper analysis finds significant increase in LLM hallucination undermining quality
I was looking into the 1-year bans for hallucinated data and citations on arXiv and stumbled across this..
Given healthcare advances are one of the poster boys for keeping the AI Dumpster Fire on the road and absolving it of its heinous sins... this is worrying... but not unexpected.
https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(26)00603-3/fulltext
quote:
Scientific literature depends on the integrity of its references. Each reference implicitly asserts that a verifiable source exists and supports the claims being made. When references point to non-existent studies, readers, reviewers, and policy makers are unable to evaluate the evidence.
Fabricated references (references whose claimed titles correspond to no existing publication) can arise from paper mill activity, intentional misconduct, or uncritical use of artificial intelligence (AI) writing tools.1 Large language models (LLMs) generate plausible sounding but fictitious references, a well documented failure mode; previous studies estimate that 30–69% of LLM-generated references in biomedical contexts are fabricated.
.....
Among 97·1 million verified references, we identified 4046 fabricated references across 2810 papers (illustrative examples are shown in the appendix p 5–6). In 2023, approximately one in 2828 papers contained at least one fabricated reference. By 2025, this had risen to one in 458 and in the first 7 weeks of 2026, one in 277 papers had at least one fabricated reference. The fabrication rate increased more than 12 times, from approximately four per 10 000 papers in 2023, to 51·3 per 10 000 papers in the fourth quarter of 2025, reaching 56·9 per 10 000 papers in early 2026 (figure).
-----
We recommend four actions. First, publishers should integrate automated reference verification into submission workflows before peer review begins; verification tools exist, and the barrier to adoption is institutional rather than technological. Second, indexing services should add integrity metadata to article records so that downstream users can assess the reliability of references. Third, publishers should retroactively screen existing publications and issue corrections or retractions when fabricated references compromise a paper's conclusions. Fourth, fabricated references do not currently exist as a discrete category in major research integrity databases; establishing this category would enable systematic tracking and accountability.
----