Accessibility settings

Published on in Vol 5 (2026)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/83640, first published .
Performance of Large Language Models Under Input Variability in Health Care Applications: Dataset Development and Experimental Evaluation

Performance of Large Language Models Under Input Variability in Health Care Applications: Dataset Development and Experimental Evaluation

Performance of Large Language Models Under Input Variability in Health Care Applications: Dataset Development and Experimental Evaluation

Journals

  1. Harada Y. Safety Audit of a Large Language Model for Lay Self-Triage Using Japanese Symptom Vignettes: Persistent Red-Flag Under-Triage Despite Improved Reproducibility Under Near-Deterministic Decoding. Cureus 2026 View