Acessibilidade / Reportar erro

Innovations in surgical training: exploring the role of artificial intelligence and large language models (LLM)

ABSTRACT

The landscape of surgical training is rapidly evolving with the advent of artificial intelligence (AI) and its integration into education and simulation. This manuscript aims to explore the potential applications and benefits of AI-assisted surgical training, particularly the use of large language models (LLMs), in enhancing communication, personalizing feedback, and promoting skill development. We discuss the advancements in simulation-based training, AI-driven assessment tools, video-based assessment systems, virtual reality (VR) and augmented reality (AR) platforms, and the potential role of LLMs in the transcription, translation, and summarization of feedback. Despite the promising opportunities presented by AI integration, several challenges must be addressed, including accuracy and reliability, ethical and privacy concerns, bias in AI models, integration with existing training systems, and training and adoption of AI-assisted tools. By proactively addressing these challenges and harnessing the potential of AI, the future of surgical training may be reshaped to provide a more comprehensive, safe, and effective learning experience for trainees, ultimately leading to better patient outcomes. .

Keywords:
Artificial Intelligence; Education, Medical; Learning Curve

RESUMO

O cenário do treinamento cirúrgico está evoluindo rapidamente com o surgimento da inteligência artificial (IA) e sua integração na educação e simulação. Este artigo explora as aplicações e benefícios potenciais do treinamento cirúrgico assistido por IA, em particular o uso de modelos de linguagem avançados (MLAs), para aprimorar a comunicação, personalizar o feedback e promover o desenvolvimento de habilidades. Discutimos os avanços no treinamento baseado em simulação, ferramentas de avaliação impulsionadas por IA, sistemas de avaliação baseados em vídeo, plataformas de realidade virtual (RV) e realidade aumentada (RA), e o papel potencial dos MLAs na transcrição, tradução e resumo do feedback. Apesar das oportunidades promissoras apresentadas pela integração da IA, vários desafios devem ser abordados, incluindo precisão e confiabilidade, preocupações éticas e de privacidade, viés nos modelos de IA, integração com os sistemas de treinamento existentes, e treinamento e adoção de ferramentas assistidas por IA. Ao abordar proativamente esses desafios e aproveitar o potencial da IA, o futuro do treinamento cirúrgico pode ser remodelado para proporcionar uma experiência de aprendizado mais abrangente, segura e eficaz para os aprendizes, resultando em melhores resultados para os pacientes.

Palavras-chave:
Inteligência Artificial; Educação Médica; Curva de Aprendizado

INTRODUCTION

Surgical training has traditionally followed the apprenticeship model, in which trainees learn from mentors by observing and assisting experienced surgeons in the operating room11 Halsted. The training of the surgeon. JAMA. 1904;XLIII(21):1553-4. doi:10.1001/jama.1904.02500210043006.
https://doi.org/10.1001/jama.1904.025002...
. This approach, though valuable, is not without limitations. As surgical procedures have become increasingly complex and specialized, the need for more structured and standardized training methods has grown22 Ortiz C, Vela J, Contreras C, Belmar F, Paul I, Zinco A, et al. A new approach for the acquisition of trauma surgical skills: an OSCE type of simulation training program. Surg Endosc. 2022;36(11):8441-50. doi: 10.1007/s00464-022-09098-w.
https://doi.org/10.1007/s00464-022-09098...
,33 Tejos R, Avila R, Inzunza M, Achurra P, Castillo R, Rosberg A, et al. Impact of a simulated laparoscopic training program in a three-year General Surgery Residency. Arq Bras Cir Dig. 2019;32(2):e1436. doi: 10.1590/0102-672020190001e1436.
https://doi.org/10.1590/0102-67202019000...
. Concerns about patient safety and the need to objectively evaluate trainees’ performance have prompted the search for innovative and reliable training modalities44 Reznick RK, MacRae H. Teaching surgical skills--changes in the wind. N Engl J Med. 2006;355(25):2664-9. doi: 10.1056/NEJMra054785.
https://doi.org/10.1056/NEJMra054785...
,55 Aggarwal R, Mytton OT, Derbrew M, Hananel D, Heydenburg M, Issenberg B, et al. Training and simulation for patient safety. Qual Saf Health Care. 2010;19 Suppl 2(Suppl 2):i34-43. doi: 10.1136/qshc.2009.038562.
https://doi.org/10.1136/qshc.2009.038562...
. The rise of Artificial Intelligence assisted education and simulation has become a relevant component in surgical training, enabling a more comprehensive, safe, and efficient learning experience for trainees66 Guerrero DT, Asaad M, Rajesh A, Hassan A, Butler CE. Advancing Surgical Education: The Use of Artificial Intelligence in Surgical Training. Am Surg. 2023;89(1):49-54. doi: 10.1177/00031348221101503.
https://doi.org/10.1177/0003134822110150...

7 Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
https://doi.org/10.1002/jso.26496...

8 Mirchi N, Bissonnette V, Yilmaz R, Ledwos N, Winkler-Schwartz A, Maestro D. The Virtual Operative Assistant: An explainable artificial intelligence tool for simulation-based training in surgery and medicine. PloS one. 2020;15(2):e0229596. doi: 10.1371/journal.pone.0229596.
https://doi.org/10.1371/journal.pone.022...

9 Bilgic E, Gorgy A, Young M, Abbasgholizadeh-Rahimi S, Harley JM. Artificial intelligence in surgical education: Considerations for interdisciplinary collaborations. Surg Innov. 2022;29(2):137-8. doi: 10.1177/15533506211059269.
https://doi.org/10.1177/1553350621105926...
-1010 Park JJ, Tiefenbach J, Demetriades AK. The role of artificial intelligence in surgical simulation. Front Med Technol. 2022;4:1076755. doi: 10.3389/fmedt.2022.1076755.
https://doi.org/10.3389/fmedt.2022.10767...
.

Artificial intelligence (AI) is a multidisciplinary field of computer science focused on creating intelligent agents capable of performing tasks that typically require human-like cognition1111 Goodfellow I, Bengio Y, Courville A. Deep learning. MIT Press; 2016.. AI systems employ techniques such as machine learning, deep learning, natural language processing, computer vision, and expert systems to perceive, reason, learn, and adapt to new information1111 Goodfellow I, Bengio Y, Courville A. Deep learning. MIT Press; 2016.. Machine learning enables computers to learn from data and make predictions or decisions without explicit programming, while deep learning uses artificial neural networks for sophisticated pattern recognition1212 Murphy KP. Machine Learning: A Probabilistic Perspective. MIT Press; 2012.. Natural language processing allows AI systems to understand and generate human-like text or speech, and computer vision deals with the analysis and interpretation of visual information1313 Hirschberg J, Manning CD. Advances in natural language processing. Science. 2015;349(6245):261-6. doi: 10.1126/science.aaa8685.
https://doi.org/10.1126/science.aaa8685...
. Expert systems involve rule-based decision-making based on predefined knowledge bases1414 Jackson P. Introduction to Expert Systems. Singapore, Singapore: Addison Wesley Longman; 1990.. By integrating these diverse AI methodologies, researchers can develop intelligent agents to tackle complex problems and transform various aspects of human life, including surgical training and education.

While there have been significant advancements in AI-assisted surgical training, the specific use of LLMs in facilitating communication and information processing within surgical training environments has been relatively understudied. Effective communication is a crucial aspect of surgical training, and leveraging LLMs to transcribe and translate feedback inputs from trainers to trainees can greatly improve the learning experience and outcomes.

The paper aims to highlight the potential of LLMs in addressing this communication gap and explores their applications in transcription, translation, summarization of feedback data, and providing real-time corrections and recommendations to instructors. By addressing this gap, the paper contributes to the understanding of how LLMs can enhance surgical education and bridge the communication divide between trainers and trainees.

AI-driven assessment and feedback

Simulation-based training, in particular, has proven to be a powerful adjunct to a traditional apprenticeship. It offers a controlled and risk-free environment where trainees can practice their skills before applying them in the operating room1515 Andersen DK. Fundamentals of surgical simulation: Principles and practices. JAMA. 2012;307(9):974. doi: 10.1001/jama.2012.246.
https://doi.org/10.1001/jama.2012.246...
,1616 Boza C, León F, Buckel E, Riquelme A, Crovari F, Martínez J, et al. Simulation-trained junior residents perform better than general surgeons on advanced laparoscopic cases. Surg Endosc. 2017;31(1):135-41. doi: 10.1007/s00464-016-4942-6.
https://doi.org/10.1007/s00464-016-4942-...
. AI has significantly expanded the potential of simulation-based training in various ways1717 Belmar F, Gaete MI, Escalona G, Carnier M, Durán V, Villagrán I, et al. Artificial intelligence in laparoscopic simulation: a promising future for large-scale automated evaluations. Surg Endosc. 2023;37(6):4942-6. doi: 10.1007/s00464-022-09576-1.
https://doi.org/10.1007/s00464-022-09576...
. One of the key advancements is the ability to provide personalized feedback to surgical trainees66 Guerrero DT, Asaad M, Rajesh A, Hassan A, Butler CE. Advancing Surgical Education: The Use of Artificial Intelligence in Surgical Training. Am Surg. 2023;89(1):49-54. doi: 10.1177/00031348221101503.
https://doi.org/10.1177/0003134822110150...
. By analyzing performance metrics and identifying areas of improvement, AI-powered systems can tailor feedback to individual subjects, addressing their unique strengths and weaknesses1818 Hashimoto DA, Rosman G, Rus D, Meireles OR. Artificial intelligence in surgery: Promises and perils. Ann Surg. 2018;268(1):70-6. doi: 10.1097/SLA.0000000000002693.
https://doi.org/10.1097/SLA.000000000000...
. This customized approach to feedback not only enhances the learning process but also enables trainees to focus on specific areas that require further development.

In addition to personalized feedback, AI is revolutionizing the assessment processes in surgical training. Traditional methods often rely on subjective evaluations by human observers, which can be prone to bias and inconsistency1515 Andersen DK. Fundamentals of surgical simulation: Principles and practices. JAMA. 2012;307(9):974. doi: 10.1001/jama.2012.246.
https://doi.org/10.1001/jama.2012.246...
,1919 Gumbs AA, Frigerio I, Spolverato G, Croner R, Illanes A, Chouillard E, et al. Artificial intelligence surgery: How do we get to autonomous actions in surgery? Sensors (Basel). 2021;21(16):5526. doi: 10.3390/s21165526.
https://doi.org/10.3390/s21165526...
. AI-driven assessment tools, on the other hand, can objectively measure surgical performance by analyzing various data sources, such as motion tracking, force measurements, historic feedback inputs, and video recordings2020 Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
https://doi.org/10.1002/jso.26496...
,2121 Maier-Hein L, Eisenmann M, Sarikaya D, März K, Collins T, Malpani A, et al. Surgical data science - from concepts toward clinical translation. Med Image Anal. 2022;76(102306):102306. doi: 10.1016/j.media.2021.102306.
https://doi.org/10.1016/j.media.2021.102...
. Some of these tools can provide real-time feedback and generate standardized performance scores, thus ensuring a more consistent and reliable evaluation of trainee progress77 Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
https://doi.org/10.1002/jso.26496...
,1818 Hashimoto DA, Rosman G, Rus D, Meireles OR. Artificial intelligence in surgery: Promises and perils. Ann Surg. 2018;268(1):70-6. doi: 10.1097/SLA.0000000000002693.
https://doi.org/10.1097/SLA.000000000000...
.

Moreover, AI has facilitated the rise of video-based surgical assessment systems that leverage computer vision and machine learning techniques to analyze surgical videos and extract valuable insights77 Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
https://doi.org/10.1002/jso.26496...
. These systems offer data-driven feedback and objective evaluations, enabling trainees and experienced surgeons to identify areas for improvement and learn from best practices2222 Twinanda AP, Shehata S, Mutter D, Marescaux J, de Mathelin M, Padoy N. EndoNet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging. 2017;36(1):86-97. doi: 10.1109/tmi.2016.2593957.
https://doi.org/10.1109/tmi.2016.2593957...
,2323 Gumbs A, Grasso V, Bourdel N, Croner R, Spolverato G, Frigerio I, et al. The advances in computer vision that are enabling more Autonomous Actions in surgery: A systematic review of the literature. Sensors (Basel). 2022;22(13):4918. doi: 10.3390/s22134918.
https://doi.org/10.3390/s22134918...
(Table 1).

Table 1
AI-enhanced Video-Based Surgical Assessment and Training Systems.

AI-assisted predicting of performance and skill development

Another promising application of AI in surgical training is the prediction of trainees’ surgical performance and skill development. By utilizing machine learning algorithms, AI systems can analyze historical performance data and identify patterns that can be used to predict future performance and technique progression1818 Hashimoto DA, Rosman G, Rus D, Meireles OR. Artificial intelligence in surgery: Promises and perils. Ann Surg. 2018;268(1):70-6. doi: 10.1097/SLA.0000000000002693.
https://doi.org/10.1097/SLA.000000000000...
. This predictive capability can help trainers identify trainees in need of additional support, as well as optimize training curricula based on individual learning trajectories. Now that learning is moving from fixed training curriculums to competency-based curriculums, AI provides a more objective and less time-consuming assessment of trainees’ learning curves.

AI and immersive technologies

AI is playing a major role in the advancement of virtual reality (VR) and augmented reality (AR) systems, especially in the field of surgical training. These immersive technologies offer realistic and interactive training scenarios that can closely resemble real-life surgical procedures2424 Khan R, Plahouras J, Johnston BC, Scaffidi MA, Grover SC, Walsh CM. Virtual reality simulation training in endoscopy: a Cochrane review and meta-analysis. Endoscopy. 2019;51(7):653-64. doi: 10.1055/a-0894-4400.
https://doi.org/10.1055/a-0894-4400...

25 von Ende E, Ryan S, Crain MA, Makary MS. Artificial intelligence, augmented reality, and virtual reality advances and applications in interventional radiology. Diagnostics (Basel). 2023;13(5). doi: 10.3390/diagnostics13050892.
https://doi.org/10.3390/diagnostics13050...
-2626 Schmidt MW, Köppinger KF, Fan C, Kowalewski K-F, Schmidt LP, Vey J, et al. Virtual reality simulation in robot-assisted surgery: meta-analysis of skill transfer and predictability of skill. BJS Open. 2021;5(2). doi: 10.1093/bjsopen/zraa066.
https://doi.org/10.1093/bjsopen/zraa066...
. AI algorithms can be employed to generate complex, dynamic environments and unique patient-based anatomical models, allowing trainees to practice various surgical techniques and approaches in a controlled and safe setting. Additionally, AI can facilitate real-time adaptation of the VR and AR scenarios based on the trainee’s performance, ensuring a more engaging and effective learning experience2525 von Ende E, Ryan S, Crain MA, Makary MS. Artificial intelligence, augmented reality, and virtual reality advances and applications in interventional radiology. Diagnostics (Basel). 2023;13(5). doi: 10.3390/diagnostics13050892.
https://doi.org/10.3390/diagnostics13050...
. Video-based feedback and evaluation are scaling fast with the help of AI. Video-recorded practices are now automatically assessed by AI algorithms, and “copilot-like” recommendations for instructors when providing feedback to trainees are now possible1717 Belmar F, Gaete MI, Escalona G, Carnier M, Durán V, Villagrán I, et al. Artificial intelligence in laparoscopic simulation: a promising future for large-scale automated evaluations. Surg Endosc. 2023;37(6):4942-6. doi: 10.1007/s00464-022-09576-1.
https://doi.org/10.1007/s00464-022-09576...
,2525 von Ende E, Ryan S, Crain MA, Makary MS. Artificial intelligence, augmented reality, and virtual reality advances and applications in interventional radiology. Diagnostics (Basel). 2023;13(5). doi: 10.3390/diagnostics13050892.
https://doi.org/10.3390/diagnostics13050...
(Table 2).

Table 2
Comparison of some AI-Driven Surgical Simulation Training Systems.

Large Language Models in surgical education

Large language models (LLMs), a subset of generative AI models, have gained significant attention in recent years due to their ability to generate human-like text based on input data. These models are trained on vast amounts of text from diverse sources, enabling them to understand and generate contextually appropriate responses in various languages and domains2727 Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language Models are Few-Shot Learners. arXiv:2005.14165v4. 2020:1-75. doi: 10.48550/arXiv.2005.14165.
https://doi.org/10.48550/arXiv.2005.1416...
. Examples of LLMs include OpenAI’s GPT-3 and the recently launched GPT-4. Despite the numerous advances in AI-assisted surgical training, there is still a notable gap in the literature regarding applying large language models (LLMs) to surgical education, for instance, in facilitating the transcription and translation of feedback inputs from trainers to trainees in simulated and video-based assessment environments. This gap is significant because effective communication is a critical aspect of surgical training, and addressing it could lead to considerable improvements in the learning experience and outcomes for trainees2828 Ötles E, Kendrick DE, Solano QP, Schuller M, Ahle SL, Eskender MH, et al. Using natural language processing to automatically assess feedback quality: Findings from 3 surgical residencies. Acad Med. 2021;96(10):1457-60. doi: 10.1097/acm.0000000000004153.
https://doi.org/10.1097/acm.000000000000...
,2929 Tustumi F, Andreollo NA, Aguilar-Nascimento JE de. Future of the language models in healthcare: The role of ChatGPT. ABCD Arq Bras Cir Dig. 2023;36:e1727. doi: 10.1590/0102-672020230002e1727.
https://doi.org/10.1590/0102-67202023000...
.

LLMs have demonstrated remarkable capabilities in understanding and generating human-like text, making them well-suited for tasks related to communication and information processing2727 Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language Models are Few-Shot Learners. arXiv:2005.14165v4. 2020:1-75. doi: 10.48550/arXiv.2005.14165.
https://doi.org/10.48550/arXiv.2005.1416...
. In surgical training, LLMs could be employed to automatically transcribe spoken feedback provided by trainers during simulations or video-based assessments. This transcription could be used to create a written record of the feedback, allowing trainees to review and reflect on the suggestions and guidance received.

LLMs could also be utilized to translate feedback into different languages even in real time, thus overcoming potential language barriers between trainers and trainees. With surgical training programs increasingly attracting students from different countries, language gaps can be a significant challenge to effective communication. By leveraging LLMs to provide real-time or asynchronous translation, trainers and trainees could communicate more effectively, ensuring that valuable feedback is accurately conveyed and understood3030 Belmar F, Gaete MI, Durán V, Chelebifski S, Jarry C, Ortiz C, et al. Taking advantage of asynchronous digital feedback: development of an at-home basic suture skills training program for undergraduate medical students that facilitates skills retention. Global Surg Educ. 2023;2(1):32. doi: 10.1007/s44186-023-00112-w.
https://doi.org/10.1007/s44186-023-00112...
.

Another potential application of LLMs in surgical training is the extraction and summarization of key insights from large volumes of feedback data3131 Solano QP, Hayward L, Chopra Z, Quanstrom K, Kendrick D, Abbott KL, et al. Natural language processing and assessment of resident feedback quality. J Surg Educ. 2021;78(6):e72-7. doi: 10.1016/j.jsurg.2021.05.012.
https://doi.org/10.1016/j.jsurg.2021.05....
. By analyzing feedback from multiple trainers or across numerous training instances, LLMs could identify recurring themes or patterns and provide students with concise, actionable summaries. This approach could help trainees to prioritize their learning objectives and focus on the most critical areas for improvement.

Similarly, LLM systems have the potential to improve instructors’ performance by providing real-time corrections and feedback, helping them meet minimum parameters and gradually improving their communication skills with trainees. The best clinician expert is not always the best at teaching and communicating learning experiences. However, semantic feedback from an automatic artificial intelligence system may help the sender notice areas for improvement in their teaching abilities.

LLMs challenges in surgical training

Integrating large language models (LLMs) into surgical training comes with various challenges, including ensuring the accuracy and reliability of LLM-generated transcriptions and translations, addressing ethical and privacy concerns, and handling potential biases in the AI models. Below, we explore these challenges in greater detail:

  • Accuracy and reliability: LLM-generated transcriptions and translations must be accurate and reliable to avoid misunderstandings or misinterpretations that could negatively impact the learning process. Inaccurate transcriptions might cause trainees to miss crucial feedback or make incorrect decisions based on flawed information. Ensuring the quality of LLM-generated content requires continuous monitoring, validation, and improvement of the models to minimize errors and maximize the value of the AI-generated outputs.

  • Ethical and privacy concerns: The use of AI-generated feedback in surgical training may raise ethical and privacy concerns, particularly when sensitive patient information is involved. Ensuring compliance with data protection regulations, such as the Health Insurance Portability and Accountability Act (HIPAA) in the United States or the General Data Protection Regulation (GDPR) in the European Union, is essential. Implementing data anonymization techniques, secure storage and transmission protocols, and access controls can help protect sensitive information and maintain patient confidentiality.

  • Bias in AI models: AI algorithms, including LLMs, can inadvertently propagate biases present in the training data. These biases may manifest in the form of gender, racial, or cultural stereotypes, potentially leading to unfair or discriminatory feedback. To address this issue, it is essential to develop transparent and fair AI models by using diverse and representative training data, applying bias-mitigation techniques, and rigorously testing the algorithms to identify and correct potential biases.

  • Integration with existing training systems: Integrating LLMs into surgical training may require substantial modifications to existing training platforms and workflows. This process can be complex and time-consuming, requiring close collaboration between developers, trainers, and trainees to ensure seamless integration and minimize disruptions to the training process.

  • Training and adoption: The successful implementation of LLMs in surgical training requires trainers, trainees, and their institutions to adapt to new technology and its workflows. This may involve providing additional training to familiarize users with the AI tools and addressing any resistance to change. Ensuring user buy-in is crucial for the effective adoption and use of LLMs in surgical training environments.

  • Cost and Talent: Implementing artificial intelligence in surgical training necessitates substantial financial investment and specialized human expertise. To embark on an AI project may entail considerable expenses, making it challenging to conduct a proper cost-benefit assessment initially. Nonetheless, major companies have significantly invested in developing algorithms that can be tailored to specific domains, thereby enhancing their potential. Although the initial costs of these services may be high, as is typically the case with technological evolution, these are expected to decrease over time. At present, the cost and infrastructure needed to incorporate AI into surgical training continue to pose significant challenges.

Addressing these challenges is essential to harness the full potential of LLMs in surgical training and ensure their successful integration into training programs and educational institutions. By proactively identifying and addressing these challenges, it is possible to develop effective AI-assisted surgical training solutions that may improve communication, enhance the learning experience, and ultimately contribute to better patient outcomes.

DISCUSSION AND CONCLUSION

The integration of AI into our society and daily activities has begun progressively. As with other technologies, the surgical field has quickly adopted some of its benefits, and the complexity of surgical training has made AI an interesting venture. We have presented two main ways in which AI-related technologies are becoming part of the teaching experience.

Firstly, AI-enhanced assessment methods are being used. Skill acquisition involves a trainer, a trainee, assessment tools, feedback delivery, and a training instance. We have described some of the available tools or apps that can optimize the integration of these five aspects of surgical training. Video-based platforms are showing promising results, allowing not only AI-driven assessment (using various measurements) but also providing relevant feedback based on previous and contemporary data. On the other hand, LLM-based technologies address communication issues that are critical for the learning process. Even when precise metrics and formidable progress tracking are assured, feedback and knowledge must be delivered correctly to make significant improvements in the trainee. LLMs can help ensure that concepts are conveyed effectively, suggest feedback to trainers, and help overcome language barriers by translating in real-time.

Despite the numerous benefits, implementing AI-based technologies and especially LLMs in surgical training poses significant challenges that can be approached from two perspectives. Firstly, from a technological standpoint, integrating AI-based tools and platforms into daily surgical education requires new “know-how,” and the integration of novel and “old” systems may be disruptive, difficult, and expensive. Careful and gradual integration may reduce resistance to these innovative technologies, especially in more experienced and traditional institutions. Secondly, from an educational perspective, concerns may arise regarding the reliability of “automatic” assessments, feedback, and inputs. Regarding this topic, it is important to understand that AI is a probabilistic model used to analyze data, employing high-powered computer technology to organize information from many sources. Currently, AI does not create new information in the strict sense of the word, so many of its limitations and mistakes are attributable due to the low quality or insufficiency of its sources. Future research and development should aim to identify the best sources and provide filters to prevent inaccurate information from affecting the algorithm’s outcomes. It is our responsibility to develop and apply safety measures to track interactions and identify any issues that may lead to misunderstandings or negatively impact the learning experience. Ensuring data protection, reducing bias, and testing the real educational impact of AI-driven tools are other significant challenges that we will face in adopting these technologies.

In addition to the technical considerations, it is essential to highlight the ethical and legal challenges associated with the use of AI and LLMs in surgical practice.

From an ethical perspective, AI and LLMs in surgery raise several concerns. The first one pertains to patient autonomy and informed consent. It’s crucial to ensure that patients thoroughly understand the nature and extent of AI and LLM involvement in their treatment. Given the complexity of these technologies, conveying this information in a way that a layperson can understand is challenging.

The second ethical concern relates to accountability. In instances where AI or an LLM might make a mistake or provide suboptimal advice, it’s not immediately clear who would be held responsible - the developers, the medical practitioners relying on the technology, or the institutions adopting it. This ambiguity could lead to ethical dilemmas in practice.

Thirdly, there is the question of bias and fairness. AI and LLMs are trained on vast datasets and can inadvertently perpetuate existing biases in these data, leading to unequal treatment outcomes. For instance, if the training data are skewed towards a particular demographic, the AI might be less effective when applied to patients from underrepresented demographics.

Turning to the legal challenges, the application of AI and LLMs in surgical practice sits at the intersection of several legal domains, including data protection, intellectual property rights, and medical malpractice. AI’s ability to process large amounts of patient data raises questions about data privacy and security. Furthermore, who owns the intellectual property rights to AI and LLM-assisted surgical techniques or inventions? This is an area of law that is still evolving and has yet to catch up with the rapid advancements in AI technology.

In the event of an adverse outcome, the legal liability is also a grey area. If an error occurs due to the advice or assistance of an AI or LLM, traditional medical malpractice doctrines may not apply. The use of AI could blur the lines of responsibility, making it difficult to determine who is at fault.

Although the potential benefits of AI and LLMs in surgical training are significant, it is important to recognize and discuss inherent limitations of these technologies. First, AI and LLMs are dependent on the data they are trained on. If the data is incomplete, biased, or not representative of the range of cases a surgeon might encounter, the training and advice given may be suboptimal. Additionally, these technologies may lack the nuanced understanding of a human trainer, who can interpret complex visual cues and adapt to the trainee’s individual learning style in a way that AI and LLMs currently cannot.

Another limitation is the risk of over-reliance on technology. Surgical trainees might rely too heavily on AI and LLMs, which could impair the development of their independent decision-making skills. Furthermore, the high costs associated with the development and implementation of AI and LLMs may make these technologies inaccessible to some institutions, exacerbating existing inequities in surgical training.

Looking to the future, several exciting research directions are emerging. We need rigorous studies evaluating the impact of AI and LLMs on long-term surgical performance and patient outcomes, which can help validate the effectiveness of these technologies. Novel approaches could also be explored, such as integrating AI and LLMs with virtual or augmented reality to create immersive surgical training experiences. Furthermore, research should address the aforementioned limitations. For example, studies could investigate how to effectively combine AI and human mentorship in surgical training, or how to make these technologies more accessible to a broader range of institutions.

As a conclusion, as technology advances and AI becomes more integrated into the medical field, it is crucial to address the various challenges and limitations associated with AI technology while exploring new applications and opportunities in surgical education. By embracing the transformative power of AI, the future of surgical training can be reshaped to provide a more comprehensive, safe, and effective learning experience for trainees.

REFERENCES

  • 1
    Halsted. The training of the surgeon. JAMA. 1904;XLIII(21):1553-4. doi:10.1001/jama.1904.02500210043006.
    » https://doi.org/10.1001/jama.1904.02500210043006
  • 2
    Ortiz C, Vela J, Contreras C, Belmar F, Paul I, Zinco A, et al. A new approach for the acquisition of trauma surgical skills: an OSCE type of simulation training program. Surg Endosc. 2022;36(11):8441-50. doi: 10.1007/s00464-022-09098-w.
    » https://doi.org/10.1007/s00464-022-09098-w
  • 3
    Tejos R, Avila R, Inzunza M, Achurra P, Castillo R, Rosberg A, et al. Impact of a simulated laparoscopic training program in a three-year General Surgery Residency. Arq Bras Cir Dig. 2019;32(2):e1436. doi: 10.1590/0102-672020190001e1436.
    » https://doi.org/10.1590/0102-672020190001e1436
  • 4
    Reznick RK, MacRae H. Teaching surgical skills--changes in the wind. N Engl J Med. 2006;355(25):2664-9. doi: 10.1056/NEJMra054785.
    » https://doi.org/10.1056/NEJMra054785
  • 5
    Aggarwal R, Mytton OT, Derbrew M, Hananel D, Heydenburg M, Issenberg B, et al. Training and simulation for patient safety. Qual Saf Health Care. 2010;19 Suppl 2(Suppl 2):i34-43. doi: 10.1136/qshc.2009.038562.
    » https://doi.org/10.1136/qshc.2009.038562.
  • 6
    Guerrero DT, Asaad M, Rajesh A, Hassan A, Butler CE. Advancing Surgical Education: The Use of Artificial Intelligence in Surgical Training. Am Surg. 2023;89(1):49-54. doi: 10.1177/00031348221101503.
    » https://doi.org/10.1177/00031348221101503
  • 7
    Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
    » https://doi.org/10.1002/jso.26496
  • 8
    Mirchi N, Bissonnette V, Yilmaz R, Ledwos N, Winkler-Schwartz A, Maestro D. The Virtual Operative Assistant: An explainable artificial intelligence tool for simulation-based training in surgery and medicine. PloS one. 2020;15(2):e0229596. doi: 10.1371/journal.pone.0229596.
    » https://doi.org/10.1371/journal.pone.0229596
  • 9
    Bilgic E, Gorgy A, Young M, Abbasgholizadeh-Rahimi S, Harley JM. Artificial intelligence in surgical education: Considerations for interdisciplinary collaborations. Surg Innov. 2022;29(2):137-8. doi: 10.1177/15533506211059269.
    » https://doi.org/10.1177/15533506211059269
  • 10
    Park JJ, Tiefenbach J, Demetriades AK. The role of artificial intelligence in surgical simulation. Front Med Technol. 2022;4:1076755. doi: 10.3389/fmedt.2022.1076755.
    » https://doi.org/10.3389/fmedt.2022.1076755
  • 11
    Goodfellow I, Bengio Y, Courville A. Deep learning. MIT Press; 2016.
  • 12
    Murphy KP. Machine Learning: A Probabilistic Perspective. MIT Press; 2012.
  • 13
    Hirschberg J, Manning CD. Advances in natural language processing. Science. 2015;349(6245):261-6. doi: 10.1126/science.aaa8685.
    » https://doi.org/10.1126/science.aaa8685
  • 14
    Jackson P. Introduction to Expert Systems. Singapore, Singapore: Addison Wesley Longman; 1990.
  • 15
    Andersen DK. Fundamentals of surgical simulation: Principles and practices. JAMA. 2012;307(9):974. doi: 10.1001/jama.2012.246.
    » https://doi.org/10.1001/jama.2012.246
  • 16
    Boza C, León F, Buckel E, Riquelme A, Crovari F, Martínez J, et al. Simulation-trained junior residents perform better than general surgeons on advanced laparoscopic cases. Surg Endosc. 2017;31(1):135-41. doi: 10.1007/s00464-016-4942-6.
    » https://doi.org/10.1007/s00464-016-4942-6
  • 17
    Belmar F, Gaete MI, Escalona G, Carnier M, Durán V, Villagrán I, et al. Artificial intelligence in laparoscopic simulation: a promising future for large-scale automated evaluations. Surg Endosc. 2023;37(6):4942-6. doi: 10.1007/s00464-022-09576-1.
    » https://doi.org/10.1007/s00464-022-09576-1
  • 18
    Hashimoto DA, Rosman G, Rus D, Meireles OR. Artificial intelligence in surgery: Promises and perils. Ann Surg. 2018;268(1):70-6. doi: 10.1097/SLA.0000000000002693.
    » https://doi.org/10.1097/SLA.0000000000002693
  • 19
    Gumbs AA, Frigerio I, Spolverato G, Croner R, Illanes A, Chouillard E, et al. Artificial intelligence surgery: How do we get to autonomous actions in surgery? Sensors (Basel). 2021;21(16):5526. doi: 10.3390/s21165526.
    » https://doi.org/10.3390/s21165526
  • 20
    Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol. 2021;124(2):221-30. doi: 10.1002/jso.26496.
    » https://doi.org/10.1002/jso.26496
  • 21
    Maier-Hein L, Eisenmann M, Sarikaya D, März K, Collins T, Malpani A, et al. Surgical data science - from concepts toward clinical translation. Med Image Anal. 2022;76(102306):102306. doi: 10.1016/j.media.2021.102306.
    » https://doi.org/10.1016/j.media.2021.102306
  • 22
    Twinanda AP, Shehata S, Mutter D, Marescaux J, de Mathelin M, Padoy N. EndoNet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging. 2017;36(1):86-97. doi: 10.1109/tmi.2016.2593957.
    » https://doi.org/10.1109/tmi.2016.2593957
  • 23
    Gumbs A, Grasso V, Bourdel N, Croner R, Spolverato G, Frigerio I, et al. The advances in computer vision that are enabling more Autonomous Actions in surgery: A systematic review of the literature. Sensors (Basel). 2022;22(13):4918. doi: 10.3390/s22134918.
    » https://doi.org/10.3390/s22134918
  • 24
    Khan R, Plahouras J, Johnston BC, Scaffidi MA, Grover SC, Walsh CM. Virtual reality simulation training in endoscopy: a Cochrane review and meta-analysis. Endoscopy. 2019;51(7):653-64. doi: 10.1055/a-0894-4400.
    » https://doi.org/10.1055/a-0894-4400
  • 25
    von Ende E, Ryan S, Crain MA, Makary MS. Artificial intelligence, augmented reality, and virtual reality advances and applications in interventional radiology. Diagnostics (Basel). 2023;13(5). doi: 10.3390/diagnostics13050892.
    » https://doi.org/10.3390/diagnostics13050892
  • 26
    Schmidt MW, Köppinger KF, Fan C, Kowalewski K-F, Schmidt LP, Vey J, et al. Virtual reality simulation in robot-assisted surgery: meta-analysis of skill transfer and predictability of skill. BJS Open. 2021;5(2). doi: 10.1093/bjsopen/zraa066.
    » https://doi.org/10.1093/bjsopen/zraa066
  • 27
    Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language Models are Few-Shot Learners. arXiv:2005.14165v4. 2020:1-75. doi: 10.48550/arXiv.2005.14165.
    » https://doi.org/10.48550/arXiv.2005.14165
  • 28
    Ötles E, Kendrick DE, Solano QP, Schuller M, Ahle SL, Eskender MH, et al. Using natural language processing to automatically assess feedback quality: Findings from 3 surgical residencies. Acad Med. 2021;96(10):1457-60. doi: 10.1097/acm.0000000000004153.
    » https://doi.org/10.1097/acm.0000000000004153
  • 29
    Tustumi F, Andreollo NA, Aguilar-Nascimento JE de. Future of the language models in healthcare: The role of ChatGPT. ABCD Arq Bras Cir Dig. 2023;36:e1727. doi: 10.1590/0102-672020230002e1727.
    » https://doi.org/10.1590/0102-672020230002e1727
  • 30
    Belmar F, Gaete MI, Durán V, Chelebifski S, Jarry C, Ortiz C, et al. Taking advantage of asynchronous digital feedback: development of an at-home basic suture skills training program for undergraduate medical students that facilitates skills retention. Global Surg Educ. 2023;2(1):32. doi: 10.1007/s44186-023-00112-w.
    » https://doi.org/10.1007/s44186-023-00112-w
  • 31
    Solano QP, Hayward L, Chopra Z, Quanstrom K, Kendrick D, Abbott KL, et al. Natural language processing and assessment of resident feedback quality. J Surg Educ. 2021;78(6):e72-7. doi: 10.1016/j.jsurg.2021.05.012.
    » https://doi.org/10.1016/j.jsurg.2021.05.012
  • Funding source:

    The authors would like to acknowledge the National Agency for Research and Development (ANID) under the Scholarship Program DOCTORADO BECAS CHILE/2020 - 21202032, the National Center for Artificial Intelligence (CENIA), part of the Basal ANID funding FB210017 and Chilean Research Grants FONDECYT No 1221490, 190616011 and 11201338.

Publication Dates

  • Publication in this collection
    25 Aug 2023
  • Date of issue
    2023

History

  • Received
    07 June 2023
  • Accepted
    14 July 2023
Colégio Brasileiro de Cirurgiões Rua Visconde de Silva, 52 - 3º andar, 22271- 090 Rio de Janeiro - RJ, Tel.: +55 21 2138-0659, Fax: (55 21) 2286-2595 - Rio de Janeiro - RJ - Brazil
E-mail: revista@cbc.org.br