Abstract
Background: The use of artificial intelligence (AI) can revolutionize health care, but it also raises concerns about risk. It is therefore crucial to understand how clinicians trust and accept AI technology. As an image-based and intervention-heavy specialty, gastroenterology is an area where AI-assisted diagnosis and management can be applied extensively.
Objective: This study aimed to examine how gastroenterologists and gastrointestinal surgeons accept and trust the use of AI in the computer-aided detection (CADe), computer-aided characterization (CADx), and computer-aided intervention (CADi) of colorectal polyps in colonoscopy.
Methods: We conducted a web-based questionnaire from November 2022 to January 2023, involving 5 countries or areas in the Asia-Pacific region. The questionnaire covered participants' background and demographics; intention to use AI; perceived risk; and acceptance of and trust in AI-assisted detection, characterization, and intervention. We presented participants with 3 AI scenarios related to colonoscopy and the management of colorectal polyps. These scenarios reflect existing AI applications in colonoscopy, namely the detection of polyps (CADe), characterization of polyps (CADx), and AI-assisted polypectomy (CADi).
Results: In total, 165 gastroenterologists and gastrointestinal surgeons responded to the web-based survey, which used a structured questionnaire designed by experts in medical communications. Participants had a mean age of 44 (SD 9.65) years, were mostly male (n=116, 70.3%), and mostly worked in publicly funded hospitals (n=110, 66.67%). Participants reported relatively high exposure to AI, with 111 (67.27%) having used AI for the clinical diagnosis or treatment of digestive diseases. Gastroenterologists were highly interested in using AI in diagnosis but showed varying levels of reservation in risk prediction and acceptance of AI. Most participants (n=112, 67.88%) also expressed interest in using AI in their future practice. CADe was accepted by 83.03% (n=137) of respondents, CADx by 78.79% (n=130), and CADi by 72.12% (n=119). CADe and CADx were trusted by 85.45% (n=141) of respondents, and CADi was trusted by 72.12% (n=119). There were no application-specific differences in risk perception, but more experienced clinicians gave lower risk ratings.
Conclusions: Gastroenterologists reported overall high levels of acceptance of and trust in AI-assisted colonoscopy in the management of colorectal polyps. However, this level of trust depends on the application scenario. Moreover, the relationship among risk perception, acceptance, and trust in using AI in gastroenterology practice is not straightforward.
doi:10.2196/50525
Introduction
Artificial intelligence (AI) has made groundbreaking technological advancements in medical image interpretation [ ]; diagnosis assistance; risk assessment for various conditions [ ]; outcome prognostication [ ]; and, in certain areas, treatment suggestion [ ] and participation in surgical intervention [ ].
Studies of AI trust and acceptance among clinicians are becoming increasingly important because trust and acceptance of AI technology are seen as preconditions for clinical workflow integration [
]. Trust has already been demonstrated in several studies to be one of the main determinants driving the adoption of AI in health care [ , ]. One study showed that within a general home-based health care setting, where AI is applied to internet of things-based devices to monitor patients' health, risk perception, acceptance, and trust are related concepts that govern the ultimate use of the developed technology [ ]. A separate study [ ] of an AI-based system, the Blood Utilization Calculator, showed that its trust and use were determined by perceived risk and expectancy (in our context, acceptance); high perceived risk reduced trust and subsequent use.
While clinical evidence of AI's diagnostic and prognostic accuracy is accumulating, the level of trust and acceptance by clinicians requires more attention [
]. We identified gastroenterology, which by its very nature makes heavy use of image-based diagnosis (eg, computed tomography, magnetic resonance imaging, endoscopy, and histology) and surgical or endoscopic intervention, as one of the specialties that may most readily use AI technologies in clinical management [ , ]. Yet, there is little research on AI risk perception, acceptance, and trust among gastroenterologists.
To our knowledge, most published research surveys trust in a general manner. One recent example is a 2022 survey on gastrointestinal (GI) health care, which covered clinicians' perspectives in a general way [
]. However, such surveys lack granularity: it is impossible to know under what circumstances clinicians become less trusting or accepting, or more concerned about the deployment of AI.
Moreover, there is a lack of explicit modeling from collected data to relate patterns of risk perception, acceptance, and trust among practitioners. There are existing models [
, ] that explore parts of the interactions among these 3 factors. However, because these explorations cover only partial relationships and interactions, we feel they may be inadequate for modeling real-world dynamics. More comprehensive models would allow a better understanding of the various factors underpinning how clinicians come to trust, accept, and eventually use AI. This knowledge would help in formulating successful implementations of AI tools in real-world environments.
In this study, we aim to understand trust and acceptance among gastroenterologists, with a specific focus on the Asia-Pacific region. We hypothesize that risk perception, acceptance, and trust will change according to the scenario (computer-aided detection [CADe], computer-aided characterization [CADx], or computer-aided intervention [CADi]), with different levels of invasiveness. A blueprint of a survey that examines contextual responses toward screening colonoscopy with polypectomy in clinical environments is provided. Using the collected data, we attempt to elucidate how the interactions among risk perception, acceptance, and trust can be modeled and studied. These contributions collectively enhance our understanding of the complex factors influencing the integration of AI into medical practice.
Methods
Survey
We used a structured questionnaire (provided in the supplementary file) to conduct a survey in English, inviting gastroenterologists and GI surgeons from the Asia-Pacific region through open invitations to various medical associations. The questionnaire was based on the expectancy-value framework, major constructs of the Theory of Planned Behavior research framework [ ], and Technology Acceptance Model measures [ ]. Items for testing risk perception, acceptance, and trust were adapted from various other studies [ , ], some drawing on validated constructs from existing questionnaires. These questions were then adapted into scenarios covering detection (CADe), characterization (CADx), and intervention (CADi) for colonoscopic detection and polypectomy, each with a different level of invasiveness. Most items were rated on a 7-point Likert scale, where 7 denotes strong agreement. To assess risk perception, acceptance, and trust, we presented participants with 3 different AI applications related to colonoscopy and the management of colorectal polyps. These scenarios, reflecting existing AI applications in GI, involve the detection of polyps (CADe), characterization of the nature of polyps (CADx), and treatment procedures (CADi), respectively (see the scenario descriptions and measurement items below).
In this study, the 3 key elements for assessment are (1) risk perception, (2) acceptance, and (3) trust. Risk perception refers to an individual's subjective assessment of the potential hazards, threats, or uncertainties associated with a particular situation or activity. It involves evaluating and interpreting information about risk, considering factors such as the severity of potential consequences [ , ]. Acceptance is the mental and emotional state of acknowledging and accommodating a new concept or innovation into one's beliefs, behaviors, or practices. Trust is defined as belief or confidence in the reliability, credibility, and integrity of a person, system, or technology, leading to usage or action [ , ]. Acceptance may precede trust in the adoption of new technologies, but trust plays a crucial role in establishing a strong foundation for sustained usage and effective integration of AI into medical practice. Risk perception, acceptance, and trust may interact with each other and with other factors stemming from professional, technological, and personal sources. The conceptual framework we adopted illustrates the intricate interplay among sociodemographic variables, AI acceptance, trust, perceived risk, and outcomes [ ]. Our study aims to contribute to this understanding not by testing individual relationships within this framework but by exploring how trust, risk, and acceptance are possibly interconnected in the context of AI-supported applications in gastroenterology.
Computer-aided detection
- Imagine you are attending an informal meeting of colleagues. Your colleagues are not experts in artificial intelligence and have about the same amount of understanding as you do. The conversation turns to innovation in medicine, especially machine learning algorithms and their potential to assist in the interpretation of medical imagery in the early detection of colon cancer. One of the colleagues speaks about a patient who underwent a colonoscopy which was assisted by a machine learning algorithm. When the algorithm indicated that the patient had a colonic polyp, the colleague asked for an additional biopsy. It turned out that the result produced by the algorithm was correct (use the following scale: 1=have major doubts to 4=neutral to 7=fully believe).
Computer-aided characterization
- The second colleague reported that the machine learning algorithm is also capable of correctly classifying whether the colonic polyp was adenomatous or hyperplastic (use the following scale: 1=have major doubts to 4=neutral to 7=fully believe).
Computer-aided intervention
- Now suppose a third colleague told you that a machine learning algorithm can be applied to guide interventions. Endoscopists need a targeted biopsy from specific locations that harbor the lesion. The third colleague said that the algorithm can guide a biopsy needle more precisely than a human, using ultrasound imaging (use the following scale: 1=have major doubts to 4=neutral to 7=fully believe).
Scenario | Objective
Computer-aided detection: use of AIa to assist in identifying the presence of colorectal polyps and improving the adenoma detection rate. | To evaluate the acceptability of AI to assist in the interpretation of medical imagery in detecting colorectal lesions under different bowel preparations and colonic configurations
Computer-aided characterization: use of AI to classify whether a colonic polyp is adenomatous or hyperplastic. | To evaluate the acceptability of AI to differentiate (without histology) between adenomas (with variable degrees of malignant potential) and hyperplastic polyps (no malignant potential)
Computer-aided intervention: use of AI in endoscopy to guide colonoscopic polypectomy. | To evaluate the acceptability of AI to decide which tool to use in assessing the completeness of polypectomy and the risk of bleeding, perforation, or both
aAI: artificial intelligence.
Measure | Question text
Risk perception | I expect major risks involved with the artificial intelligence diagnosis.
Acceptance | Do you believe that a machine learning algorithm can, in some cases (as in the one described above), perform the task (computer-aided detection, computer-aided characterization, or computer-aided intervention) better than human beings?
Trust | I am ready to try the method myself.
Statistical Analysis
Statistically significant application pairs were identified using the Mann-Whitney U test for independent samples or, for paired (dependent) samples, the Wilcoxon signed-rank test. Statistical significance was established at P<.05. Analyses were conducted in Python using the scipy.stats module (version 1.10.0; the SciPy community), the statsmodels module (version 0.13.5), and the Pingouin statistical package (version 0.5.3), or in SPSS (version 28; IBM Corp).
Correction for multiple testing was performed using the Bonferroni correction: the significance threshold α was divided by the number of tests n, giving an adjusted P value threshold of α/n.
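As a minimal sketch of these comparisons, assuming hypothetical arrays of paired 7-point ratings in place of the actual survey columns, the pairwise scenario test with Bonferroni correction could look like this in scipy.stats:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Hypothetical paired 7-point Likert ratings from the same respondents;
# replace these with the actual survey columns
trust_cade = rng.integers(1, 8, size=165)
trust_cadi = rng.integers(1, 8, size=165)

# Paired (dependent) samples -> Wilcoxon signed-rank test
stat, p = stats.wilcoxon(trust_cade, trust_cadi)

# Bonferroni correction: divide alpha by the number of tests performed
alpha, n_tests = 0.05, 3  # eg, the 3 pairwise scenario comparisons
print(p, p < alpha / n_tests)
```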
Power Analysis
Our hypothesis is that risk perception, acceptance, and trust will change according to the scenario (detection [CADe], characterization [CADx], or intervention [CADi]), with different levels of invasiveness. Based on an estimated effect size of 0.3 for trust, acceptance, and risk perception, with 0.95 power, we can calculate the minimum number of respondents needed to detect significant differences of a given "size" in trust, risk perception, and acceptance across scenarios. Since every individual answers scenarios 1 to 3, the differences in each individual's responses can be tested with a Wilcoxon signed-rank test when comparing pairs of scenarios. The required sample size to detect a small to moderate effect size (Cohen d) of 0.3 with a power of 95% is 154. We recruited 165 participants, which should be sufficient to achieve adequate statistical power.
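One way to reproduce the reported sample size of 154, assuming the common approach of computing paired t-test power and correcting by the Wilcoxon signed-rank test's asymptotic relative efficiency (3/π ≈ 0.955 under normality):

```python
import math
from statsmodels.stats.power import TTestPower

# Paired t-test sample size for d=0.3, two-sided alpha=.05, power=.95
n_ttest = TTestPower().solve_power(effect_size=0.3, alpha=0.05,
                                   power=0.95, alternative="two-sided")

# Correct for the Wilcoxon signed-rank test's asymptotic relative
# efficiency versus the t test (3/pi under normality)
n_wilcoxon = math.ceil(n_ttest / (3 / math.pi))
print(round(n_ttest), n_wilcoxon)  # approximately 147 -> 154
```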
Ethical Considerations
This study was approved by the Nanyang Technological University institutional review board (IRB-2022-756). Informed consent was obtained, with the ability to opt out. Data were anonymized, and no compensation was provided.
Results
Response and Nonresponse Bias
Tracking response rates can help determine the representativeness of a study, but due to the constraints of our institutional review board, we were not allowed to track individual respondents. During the initial phase of the study, we sent the survey to a distribution list of 151 participants on known dates. Applying an approximately 1-month window (October 21, 2022, to November 13, 2022), we obtained 128 responses, giving an estimated response rate of ~85% (n=128). While cleaning and analyzing the data, we continued to accept responses and obtained 37 new responses in the subsequent weeks. To compare early and late respondents, we aggregated the first 130 responses (collected between October 21, 2022, and December 29, 2022) as the early group and the remaining 35 (collected between January 10 and January 19, 2023) as the late group. Comparing the 130 early respondents against the 35 late respondents using a Mann-Whitney U test with a Bonferroni-adjusted α=.0056, we found no significant differences in risk, trust, or acceptance for any of the 3 scenarios, suggesting no significant difference between early and late responses. The lowest P value obtained was .022 (trust in CADx), and the remaining P values were at least .30. We take these results as a proxy indication that nonresponse bias is not a strong concern. We also note that the overall response rate was rather high, even though the survey was sent to various gastroenterology associations as an open invitation without individual follow-up. AI is increasingly seen as transformative and important in gastroenterology, yet there is little work on understanding how perspectives on AI lead to trust and adoption; invitees may therefore have felt strongly about the topic and been more inclined to participate.
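A sketch of the early-versus-late comparison, again with hypothetical data standing in for the survey responses:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
# Hypothetical 7-point ratings split by arrival time of the response;
# the real analysis uses the collected risk, trust, and acceptance items
early = rng.integers(1, 8, size=130)
late = rng.integers(1, 8, size=35)

# Independent groups -> two-sided Mann-Whitney U test
stat, p = stats.mannwhitneyu(early, late, alternative="two-sided")

# 9 tests (3 measures x 3 scenarios) -> Bonferroni-adjusted alpha
print(p, p < 0.05 / 9)  # 0.05/9 ~ .0056, as reported
```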
Unidimensionality and Reliability
Most items in our questionnaire have been used in other questionnaires and can be considered validated. The scenario-based questions used in this study are novel, as we needed to develop new instruments to explore new topics. Participants answered three 7-point items (not at all to wholeheartedly) on whether they accepted, trusted, and perceived risk in the method presented in each scenario. Unidimensionality and reliability were verified and assured using confirmatory factor analysis and omega hierarchical, respectively (see the supplementary file for details).
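As an illustrative sketch, a single-factor model and an omega-type reliability coefficient can be computed in Python as below. The factor_analyzer package used here fits an exploratory single-factor model as a stand-in; the authors report using confirmatory factor analysis, for which a structural equation modeling package could be substituted. The column names are assumptions:

```python
import numpy as np
import pandas as pd
from factor_analyzer import FactorAnalyzer

rng = np.random.default_rng(2)
# Hypothetical item responses for one scenario (column names assumed)
df = pd.DataFrame(rng.integers(1, 8, size=(165, 3)).astype(float),
                  columns=["accept", "trust", "risk"])

# Fit a single-factor model; unidimensionality is supported when one
# factor adequately reproduces the item correlations
fa = FactorAnalyzer(n_factors=1, rotation=None)
fa.fit(df)

loadings = fa.loadings_.ravel()
uniquenesses = fa.get_uniquenesses()

# McDonald's omega for a single-factor model; with one factor, omega
# hierarchical coincides with this omega total
omega = loadings.sum() ** 2 / (loadings.sum() ** 2 + uniquenesses.sum())
print(round(omega, 3))
```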
Cohort Characteristics
In total, 165 clinicians participated in the study, and the survey completion rate was ~99.40% (n=165). Participants' mean age was 44.49 (SD 9.65) years; most were male (n=116, 70.3%) and predominantly specialized in gastroenterology (n=153, 92.72%; see the table below).
The sample comprised gastroenterologists and GI surgeons with varied clinical experience: 93 (56.36%) participants had over 10 years of experience practicing gastroenterology, and 111 (67.27%) were consultants or senior consultants, mostly working in public hospitals (n=110, 66.67%). Most participants reported basic familiarity with AI (n=160, 96.97%; Q1: How familiar are you with AI?). Many had been exposed to AI at work, either directly (n=111, 67.27%; Q2: Have you ever used AI in your occupation?) or indirectly (n=112, 67.88%; Q6: Do you personally know other clinicians who use AI at work?).
Participants rated a mean score of 6.00 (SD 0.95) for intending to use AI when it becomes available in their workplace and a score of 5.50 (SD 1.24) for intending to use it to provide services to their patients. Participants rated a mean score of 5.83 (SD 1.37) for intention to use AI routinely in patient care. These figures suggest generally favorable attitudes toward adopting AI.
Participant | Values (N=165), n (%)
Age (years), mean (SD)a | 44.49 (9.65)
Gendera
Male | 116 (75.32)
Female | 38 (24.68)
Country or areab
Australia | 3 (1.83)
Brunei Darussalam | 7 (4.27)
Hong Kong | 18 (10.98)
India | 6 (3.66)
Indonesia | 6 (3.66)
Japan | 9 (5.49)
New Zealand | 1 (0.61)
People’s Republic of China | 50 (30.49)
Philippines | 1 (0.61)
Republic of Korea | 2 (1.22)
Singapore | 24 (14.63)
Taiwan | 33 (20.12)
Main work settingc
Public hospital | 110 (67.9)
Private hospital | 28 (17.28)
Institute of higher learning | 18 (11.11)
Community health center | 1 (0.62)
Other | 5 (3.09)
Current role at workd
Resident | 19 (11.8)
Fellow | 19 (11.8)
Consultant | 57 (35.4)
Senior consultant | 54 (33.54)
Other | 12 (7.45)
Specialtyc
Gastroenterology | 153 (94.44)
Colorectal surgery | 4 (2.47)
General surgery | 2 (1.23)
Other | 3 (1.85)
Practicing in specialty (years)c
Less than 5 | 39 (24.07)
5-10 | 30 (18.52)
11-20 | 48 (29.63)
Over 20 | 45 (27.78)
a11 participants did not report their ages or gender.
b1 participant did not report their country or area.
c3 participants did not report their main work setting, specialty, and years practicing in a specialty.
d4 participants did not report their current role at work.
Scenario-Based Differentiation
When participants were exposed to 3 scenarios in medical practice, extending from (1) diagnosing and detecting colorectal polyps (CADe) to (2) assessing the pathology of polyps and predicting the risk of malignancy (CADx) to (3) adopting endoscopic or surgical intervention to remove the polyps (CADi), clinicians expressed similar risk perceptions across all applications (median CADe = median CADx = median CADi = 4.0; Wilcoxon CADe vs CADx: P=.09; CADe vs CADi: P=.44; CADx vs CADi: P=.66). However, there were clear application-specific differences in intention to accept AI in practice, with CADe and CADx rated higher than CADi (median CADe = 6.0, median CADx = 6.0, median CADi = 5.0; Wilcoxon CADe vs CADx: P=.031; CADe vs CADi: P=1.6×10–4; CADx vs CADi: P=.02). Similarly for trust, CADe and CADx were rated higher than CADi (median CADe = 6.0, median CADx = 6.0, median CADi = 5.0; Wilcoxon CADe vs CADx: P=.29; CADe vs CADi: P=3.7×10–8; CADx vs CADi: P=4.5×10–8).
Subgroup Analysis for Identification of Confounding Effects and Other Intrinsic Factors
We performed a subgroup analysis to investigate whether factors such as gender, years of experience, and practice environment affect risk perception, acceptance, and trust in AI for gastroenterology practice.
Male and female practitioners held similar risk perceptions, with good concordance in risk perception, acceptance, and trust toward using AI in gastroenterology practice (Figures 3A1, 3B1, and 3C1). Male participants tended to be less accepting and trusting, especially in CADi, although this difference was not statistically significant.
Next, we compared practitioners with 10 or fewer years of clinical experience (n=69) against experienced practitioners with more than 10 years of clinical experience (n=93). While the overall trends of high acceptance and trust showed no difference between the 2 groups, experienced clinicians exhibited consistently lower risk perception than less experienced ones (Figure 3A2). This observation was statistically significant for all 3 scenarios (CADe: P=9.7×10–6; CADx: P=1.7×10–6; CADi: P=3.3×10–4). We also compared practitioners of the rank of senior consultant or consultant (n=111) against residents and fellows (n=38; Figures 3A3, 3B3, and 3C3). Acceptance and trust remained high, and the trends showed good concordance between the 2 groups. Risk perception was lower among senior consultants and consultants than among residents and fellows (CADe: P=.12; CADx: P=.10; CADi: P=.27), but these differences were not statistically significant. Years of experience in clinical practice therefore appeared to have a stronger impact on risk perception than the rank held.
Finally, we compared practitioners from public hospitals with those from private hospitals (Figures 3A4, 3B4, and 3C4). There was no statistically significant difference between private hospital practitioners and their public counterparts, although there was a noticeable difference in acceptance of CADx (Figure 3B4). There was also a lower rate of acceptance and trust in using AI for intervention (CADi) compared with CADe and CADx. Despite not reaching statistical significance, the spread among private hospital respondents tended to exhibit greater variation; in some instances, the distribution for CADi appeared bimodal, suggesting that the private respondents may comprise 2 distinct subgroups.
The correlation among risk perception, acceptance, and trust was further analyzed by incorporating participants' years of practice in gastroenterology. In all 3 scenarios, there was a moderate correlation between acceptance and trust of AI in detecting polyps (CADe) and characterizing polyps (CADx). The influence of risk perception on acceptance and trust appears more diffuse: notably, even when trust and acceptance were both high, this did not always coincide with low risk perception.
We first used contingency tables combined with the Fisher exact test to evaluate the relationship between trust and acceptance, before and after introducing risk perception as an interaction term. This was repeated for each scenario (CADe, CADx, and CADi). Using this approach, we found that after introducing risk perception, the distribution of values still largely followed that of the original data, suggesting that risk does not interact strongly with trust and acceptance. However, this does not mean that risk has no influence on these 2 factors. To investigate further, we performed a 2-way ANOVA to study the influence of risk perception on acceptance and trust. The 2-way ANOVA revealed a statistically significant interaction in CADe (F25=3.37; P=1.6×10–5) but not in CADx (F25=1.40; P=.12) or CADi (F36=1.35; P=.16). Finally, we performed 2 sets of regression analyses with (1) acceptance and risk perception as independent variables and (2) acceptance, risk perception, and an interaction term (the product of acceptance and risk perception). Acceptance had a statistically significant positive influence on trust in all 3 scenarios. Risk perception had a statistically significant negative impact on trust only in the first 2 scenarios (CADe and CADx). When the interaction term was included, only in CADe did all 3 terms have a statistically significant impact on trust; for CADx and CADi, this effect disappeared, and only acceptance retained a statistically significant influence on trust. We therefore believe risk perception has a weak association with trust and acceptance. Taken together, the relationship among trust, acceptance, and risk perception appears complex and is not straightforward.
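A sketch of these analyses with statsmodels and scipy, using hypothetical trust, accept, and risk columns in place of the survey data; the dichotomization cutoff for the Fisher test is an assumption for illustration, and the published 2-way ANOVA treated the ratings as categorical factors, for which the nested-model F test below is an analogous check:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from scipy import stats
from statsmodels.stats.anova import anova_lm

rng = np.random.default_rng(3)
# Hypothetical 7-point ratings for one scenario; replace with survey data
df = pd.DataFrame({"trust": rng.integers(1, 8, 165),
                   "accept": rng.integers(1, 8, 165),
                   "risk": rng.integers(1, 8, 165)})

# Fisher exact test on a 2x2 table of dichotomized trust vs acceptance
# (the >= 5 "high" cutoff is an assumption for illustration)
table = pd.crosstab(df["trust"] >= 5, df["accept"] >= 5)
odds_ratio, fisher_p = stats.fisher_exact(table)

# (1) Main effects only: trust ~ acceptance + risk perception
m1 = smf.ols("trust ~ accept + risk", data=df).fit()

# (2) Acceptance x risk interaction added as a product term
m2 = smf.ols("trust ~ accept * risk", data=df).fit()
print(m2.summary().tables[1])

# F test for the interaction term via nested-model comparison
print(anova_lm(m1, m2))
```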
Discussion
Principal Findings
The findings from our study demonstrate that gastroenterologists are generally familiar with AI and are frequently exposed to AI tools in medical settings. This may be because of the introduction of AI-assisted colonoscopy by industry. In recent years, there have also been numerous publications and seminars in the field of gastroenterology describing the success of AI tools in the diagnosis, risk prediction, and treatment of GI conditions [ ]. This suggests that gastroenterologists have a keen awareness of AI's future potential in clinical applications. However, our findings showed that acceptance is not an all-or-nothing choice; the intention to use AI tools varied with the clinical scenario as well as with the nature and impact of AI participation.
When looking at scenario-specific acceptance and trust in AI, the responses varied. Our survey on AI use in the detection (CADe), characterization (CADx), and intervention (CADi) of colonic polyps revealed a wide acceptance disparity among practitioners. While CADe was widely accepted, CADi was met with much greater resistance. The 3 AI scenarios presented to clinicians in this study varied in the degree of involvement a clinician has in the procedure, and participants preferred CADi the least. These results agree with our hypothesis that trust, acceptance, and risk perception change according to the scenario (detection [CADe], characterization [CADx], or intervention [CADi]), with different levels of invasiveness.
In this study, acceptance appeared to have little correlation with the perceived risk level of the procedures. Although certain case scenarios were considered by some as high risk, this did not necessarily entail low acceptance of or trust in using AI. Hence, the findings highlight the intricate relationship between the complexity of AI technologies and their acceptance. One intriguing finding is that participants with more years of experience appeared to accept the risk and trust the use of AI more than those who were less experienced. This probably indicates that they see the use of AI as an option or recommendation rather than an obligation or necessity. Having more clinical experience may give clinicians greater confidence in their medical expertise and practice, thereby generating more confidence in risk mitigation when new technologies are introduced. Indeed, a study by Lawton et al [ ] revealed that more experienced doctors were much more at ease with uncertainty.
On the other hand, a general lack of AI familiarization and training in medical education may be one reason that less experienced doctors perceive AI as riskier than regular or traditional practice. Chen et al [ ] found that while most physicians and medical students were receptive to the use of AI, most also had concerns about the potential for unpredictable or incorrect results. The same study also noted that respondents were aware of AI's potential but lacked practical experience and related knowledge. Thus, introducing AI literacy and familiarization training early in medical careers may help mitigate risk aversion and promote responsible AI use in clinical practice. Young doctors are also aware of their education gaps: in a study by Civaner et al [ ], medical student respondents acknowledged gaps in “knowledge and skills related to AI applications” (96.2%), “applications for reducing medical errors” (95.8%), and “training to prevent and solve ethical problems that might arise as a result of using AI applications” (93.8%).
Our results suggest that although there is a moderate correlation between trust and acceptance, risk perception appeared invariant, suggesting that the relationship of trust and acceptance with risk perception is not straightforward and may implicate factors and interactions beyond the relationships shown in
the conceptual framework. Indeed, the invariance of risk perception across scenarios, set against acceptance, suggests that other factors influence the acceptance of AI. Among the tested factors, we found that risk perception is confounded with years of experience. Future studies should be conducted to better understand other drivers of and barriers to acceptance, such as the perceived usefulness of AI and whether AI tools may replace the jobs of clinicians in future practice. Qualitative studies, such as focus group discussions, would also be useful to better understand clinicians' specific concerns about using AI and the impact of those concerns on AI use. Quantitatively, more complex data analysis methods may be used in the future to understand the causal relationships between various factors and the acceptance of AI. As we proceed to deeper and larger cohort studies investigating trust and acceptance of AI, the development of powerful network methodologies can yield more insight. Indeed, simple statistical learning and even deep learning methods may soon become limited in their ability to explain complex and directed relationships among factors. We believe that causal analysis methods, such as Bayesian belief networks, will soon become necessary and indispensable for explaining and modeling trust, acceptance, and risk perception regarding medical AI [ ].
Limitations
There are limitations to this study. While it provides invaluable insight into the Asia-Pacific region, we captured only clinicians' perspectives, despite there being other stakeholders whose voices and opinions matter, including nurses, endoscopy assistants, and patients. Future studies should aim to capture their perspectives and better understand how their opinions align or conflict. This will help us navigate complex trust and acceptance issues more realistically and create valuable propositions and effective policies by taking a multistakeholder perspective into consideration [ ]. Participants in this study came from 5 countries or areas, with only 165 respondents. The generalizability of the findings can be strengthened by including more clinicians from different backgrounds and regions of practice. In future implementation studies, it may also be worthwhile to examine additional case scenarios, such as the management of complicated inflammatory bowel disease and the choice of therapy for GI cancers and GI bleeding, together with the corresponding trust, acceptance, and risk perceptions. This additional information will help us better contextualize how risk perception, acceptance, and trust change depending on practice.
Conclusions
This study is one of the first to examine risk perception, acceptance, and trust across different scenarios and one of the earliest reports of AI risk perception, acceptance, and trust among gastroenterologists, with a unique focus on the Asia-Pacific region. We found that gastroenterologists have, in general, high levels of acceptance of and trust in using AI-assisted colonoscopy in the management of colorectal polyps. However, this level of trust depends on the application scenario. Moreover, the relationship among risk perception, acceptance, and trust in using AI in gastroenterology practice is not a straightforward correlation. Future studies are required to identify the factors that influence acceptance and trust of AI in clinical practice.
Acknowledgments
This research/project is supported by the National Research Foundation, Singapore under its AI Singapore Programme (AISG3-GV-2021-009).
Conflicts of Interest
None declared.
Survey questions and supplementary results.
DOCX File, 145 KB
References
- Elmore JG, Lee CI. Artificial intelligence in medical imaging-learning from past mistakes in mammography. JAMA Health Forum. 2022;3(2):e215207. [FREE Full text] [CrossRef] [Medline]
- Ren Y, Loftus TJ, Datta S, Ruppert MM, Guan Z, Miao S, et al. Performance of a machine learning algorithm using electronic health record data to predict postoperative complications and report on a mobile platform. JAMA Netw Open. 2022;5(5):e2211973. [FREE Full text] [CrossRef] [Medline]
- Hoiland RL, Rikhraj KJK, Thiara S, Fordyce C, Kramer AH, Skrifvars MB, et al. Neurologic prognostication after cardiac arrest using brain biomarkers: a systematic review and meta-analysis. JAMA Neurol. 2022;79(4):390-398. [FREE Full text] [CrossRef] [Medline]
- Piette JD, Newman S, Krein SL, Marinec N, Chen J, Williams DA, et al. Patient-centered pain care using artificial intelligence and mobile health tools: a randomized comparative effectiveness trial. JAMA Intern Med. 2022;182(9):975-983. [FREE Full text] [CrossRef] [Medline]
- Loftus TJ, Tighe PJ, Filiberto AC, Efron PA, Brakenridge SC, Mohr AM, et al. Artificial intelligence and surgical decision-making. JAMA Surg. 2020;155(2):148-158. [FREE Full text] [CrossRef] [Medline]
- Asan O, Bayrak AE, Choudhury A. Artificial intelligence and human trust in healthcare: focus on clinicians. J Med Internet Res. 2020;22(6):e15154. [FREE Full text] [CrossRef] [Medline]
- Schepart A, Burton A, Durkin L, Fuller A, Charap E, Bhambri R, et al. Artificial intelligence-enabled tools in cardiovascular medicine: a survey of current use, perceptions, and challenges. Cardiovasc Digit Health J. 2023;4(3):101-110. [FREE Full text] [CrossRef] [Medline]
- Frank DA, Elbæk CT, Børsting CK, Mitkidis P, Otterbring T, Borau S. Drivers and social implications of artificial intelligence adoption in healthcare during the COVID-19 pandemic. PLoS One. 2021;16(11):e0259928. [FREE Full text] [CrossRef] [Medline]
- Alraja MN, Farooque MMJ, Khashab B. The effect of security, privacy, familiarity, and trust on users' attitudes toward the use of the IoT-based healthcare: the mediation role of risk perception. IEEE Access. 2019;7:111341-111354. [FREE Full text] [CrossRef]
- Choudhury A, Asan O, Medow JE. Effect of risk, expectancy, and trust on clinicians' intent to use an artificial intelligence system—blood utilization calculator. Appl Ergon. 2022;101:103708. [CrossRef] [Medline]
- Mori Y, Neumann H, Misawa M, Kudo SE, Bretthauer M. Artificial intelligence in colonoscopy—now on the market. What's next? J Gastroenterol Hepatol. 2021;36(1):7-11. [CrossRef] [Medline]
- Walradt T, Berzin TM. Artificial intelligence in gastroenterology. In: Greenhill AT, Chahal D, Byrne MF, Parsa N, Ahmad O, Bagci U, editors. AI in Clinical Medicine. Hoboken, NJ: Wiley; 2023:176-183.
- van der Zander QEW, van der Ende-van Loon MCM, Janssen JMM, Winkens B, van der Sommen F, Masclee AAM, et al. Artificial intelligence in (gastrointestinal) healthcare: patients' and physicians' perspectives. Sci Rep. 2022;12(1):16779. [FREE Full text] [CrossRef] [Medline]
- Ye T, Xue J, He M, Gu J, Lin H, Xu B, et al. Psychosocial factors affecting artificial intelligence adoption in health care in China: cross-sectional study. J Med Internet Res. 2019;21(10):e14316. [FREE Full text] [CrossRef] [Medline]
- Gupta S, Kamboj S, Bag S. Role of risks in the development of responsible artificial intelligence in the digital healthcare domain. Inf Syst Front. 2021;25(6):2257-2274. [CrossRef]
- Ajzen I. The theory of planned behavior. Organ Behav Hum Decis Process. 1991;50(2):179-211. [CrossRef]
- Davis FD. A technology acceptance model for empirically testing new end-user information systems: theory and results. Massachusetts Institute of Technology. 1985. URL: https://dspace.mit.edu/handle/1721.1/15192 [accessed 2024-01-09]
- Kader R, Baggaley RF, Hussein M, Ahmad OF, Patel N, Corbett G, et al. Survey on the perceptions of UK gastroenterologists and endoscopists to artificial intelligence. Frontline Gastroenterol. 2022;13(5):423-429. [FREE Full text] [CrossRef] [Medline]
- Hah H, Goldin DS. How clinicians perceive artificial intelligence-assisted technologies in diagnostic decision making: mixed methods approach. J Med Internet Res. 2021;23(12):e33540. [FREE Full text] [CrossRef] [Medline]
- Mayer RC, Davis JH, Schoorman FD. An integrative model of organizational trust. Acad Manage Rev. 1995;20(3):709-734. [FREE Full text] [CrossRef]
- Schoorman FD, Mayer RC, Davis JH. An integrative model of organizational trust: past, present, and future. Acad Manage Rev. 2007;32(2):344-354. [CrossRef]
- Solberg E, Kaarstad M, Eitrheim MHR, Bisio R, Reegård K, Bloch M. A conceptual model of trust, perceived risk, and reliance on AI decision aids. Group Organ Manage. 2022;47(2):187-222. [CrossRef]
- Kröner PT, Engels MM, Glicksberg BS, Johnson KW, Mzaik O, van Hooft JE, et al. Artificial intelligence in gastroenterology: a state-of-the-art review. World J Gastroenterol. 2021;27(40):6794-6824. [FREE Full text] [CrossRef] [Medline]
- Lawton R, Robinson O, Harrison R, Mason S, Conner M, Wilson B. Are more experienced clinicians better able to tolerate uncertainty and manage risks? a vignette study of doctors in three NHS emergency departments in England. BMJ Qual Saf. 2019;28(5):382-388. [FREE Full text] [CrossRef] [Medline]
- Chen M, Zhang B, Cai Z, Seery S, Gonzalez MJ, Ali NM, et al. Acceptance of clinical artificial intelligence among physicians and medical students: a systematic review with cross-sectional survey. Front Med (Lausanne). 2022;9:990604. [FREE Full text] [CrossRef] [Medline]
- Civaner MM, Uncu Y, Bulut F, Chalil EG, Tatli A. Artificial intelligence in medical education: a cross-sectional needs assessment. BMC Med Educ. 2022;22(1):772. [FREE Full text] [CrossRef] [Medline]
- Vasudevan RK, Ziatdinov M, Vlcek L, Kalinin SV. Off-the-shelf deep learning is not enough, and requires parsimony, Bayesianity, and causality. npj Comput Mater. 2021;7(1):16. [FREE Full text] [CrossRef]
- Güngör H. Creating value with artificial intelligence: a multi-stakeholder perspective. J Creat Value. 2020;6(1):72-85. [FREE Full text] [CrossRef]
Abbreviations
AI: artificial intelligence
CADe: computer-aided detection
CADi: computer-aided intervention
CADx: computer-aided characterization
GI: gastrointestinal
Edited by K El Emam, B Malin; submitted 05.07.23; peer-reviewed by D Lungu, Z Li; comments to author 01.08.23; revised version received 28.08.23; accepted 23.11.23; published 07.03.24.
Copyright©Wilson WB Goh, Kendrick YA Chia, Max FK Cheung, Kalya M Kee, May O Lwin, Peter J Schulz, Minhu Chen, Kaichun Wu, Simon SM Ng, Rashid Lui, Tiing Leong Ang, Khay Guan Yeoh, Han-mo Chiu, Deng-chyang Wu, Joseph JY Sung. Originally published in JMIR AI (https://ai.jmir.org), 07.03.2024.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR AI, is properly cited. The complete bibliographic information, a link to the original publication on https://www.ai.jmir.org/, as well as this copyright and license information must be included.