arXiv AI recent: Ensembles of Large Language Models for Identifying EQ-5D Studies in PubMed Based on Their Abstracts
The authors evaluated nine large language models, including Google's Gemini and Gemma, for automatically detecting EQ-5D reporting in PubMed abstracts. A weighted ensemble of gemini-2.5-p...
Manual screening of systematic literature reviews is becoming increasingly resource-intensive, inefficient, and inconsistent due to the rapid growth of scientific publications. The authors created a dataset of PubMed studies manually labeled by two experts for EQ-5D reporting and used it to asses...
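
As a rough illustration of what a weighted ensemble of classifiers can look like, the sketch below combines per-model binary predictions (1 = abstract reports EQ-5D, 0 = it does not) into one label by weighted voting. This is not the authors' method: the model names, weights, and the 0.5 threshold are hypothetical placeholders, and the summary above does not say how the paper's weights were chosen.

```python
from typing import Mapping


def weighted_vote(
    predictions: Mapping[str, int],
    weights: Mapping[str, float],
    threshold: float = 0.5,
) -> int:
    """Combine binary model predictions into a single label.

    Returns 1 when the weighted share of positive votes reaches
    `threshold`, otherwise 0. Only models present in `predictions`
    contribute to the vote.
    """
    total = sum(weights[name] for name in predictions)
    if total <= 0:
        raise ValueError("weights of the voting models must sum to a positive value")
    score = sum(weights[name] * predictions[name] for name in predictions) / total
    return int(score >= threshold)


# Hypothetical models and weights, for illustration only.
weights = {"model_a": 0.5, "model_b": 0.3, "model_c": 0.2}
votes = {"model_a": 1, "model_b": 0, "model_c": 1}
label = weighted_vote(votes, weights)
```

In practice the weights would typically be derived from each model's performance on a labeled validation set, which is one plausible use for an expert-labeled dataset like the one described above.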