How a Research Lab Using GPT-4o-mini and Llama 3.6 Encountered Conflicting Factuality Scores in April 2025
https://smoothdecorator.com/how-hallucinations-break-production-a-7-point-checklist-for-ctos-engineering-leads-and-ml-engineers/
In March 2025 a university research lab set out to compare factuality across modern large language models