Fairfield, Connecticut
April 19, 2024
April 19, 2024
April 20, 2024
Diversity
16
10.18260/1-2--45771
https://peer.asee.org/45771
230
Abdullah Aldwean is a Ph.D. candidate in Technology Management and innovation at the University of Bridgeport with 15+ years of professional experience in healthcare industry. His ongoing research explore the application of Generative Artificial Intelligence in healthcare, with particular interest in Generative large Language Models evaluation analysis. Abdullah holds a Master of Business Administration from Saudi Electronic University in association with Colorado State University global campus.
Dr. Tenney is an Assistant Professor at the University of Bridgeport in the Technology Management Department as part of the Engineering School. Dan Tenney worked in various Quality, Technical, and Operational positions in manufacturing divisions of HJ Heinz Company, 3M Company and Nile Spice Foods (acquired by Quaker Oats). For more than 25 years Dan was a member of the executive teams that directed and managed these divisions. Dan’s current focus is strategic technical and business management, application and research. Dan is a Board member on a Child’s Mental Health nonprofit agency where he has facilitated strategic planning and operational management training and guidance. He has published numerous publications on strategic, technology, and business management topics.
Utilizing large language models (LLMs), such as the Bidirectional Encoder Representations Transformer (BERT), presents an opportunity to revolutionize the healthcare experience by enhancing patient engagement, facilitating medical education, and improving the overall healthcare service outcomes. However, integrating large language model solutions in a highly regulated industry such as healthcare poses many challenges to healthcare decision-makers due to the high level of uncertainty, the complexity, and the potential social and ethical implications. Therefore, conducting thorough evaluations of LLM-based systems to ensure their ability to achieve intended goals securely, ethically, and safely is critical for healthcare organizations. In this paper, we reviewed the recent advancements in LLM evaluation fronts, mainly focusing on the performance evaluation of medical LLMs in the healthcare domain. We highlighted the potential opportunities and limitations of utilizing these advanced technologies in the context of clinical services. Additionally, we propose a comprehensive framework that integrates various evaluation aspects to better meet the unique requirements of LLMs adoption in healthcare. This framework aims to facilitate the adoption decision-making process by ensuring the utilization of the LLM's potential while holding high standards of safety, security, and ethical practice. This paper contributes to the knowledge by providing researchers, decision-makers, and healthcare practitioners with valuable insights into important aspects that should be considered in LLMs adoption decisions in the healthcare domain.
Aldwean, A., & Tenney, D. (2024, April), Large Language Models in Healthcare: Bridging the Gap between Performance Evaluation and Socio-Ethical Implications Paper presented at 2024 ASEE North East Section, Fairfield, Connecticut. 10.18260/1-2--45771
ASEE holds the copyright on this document. It may be read by the public free of charge. Authors may archive their work on personal websites or in institutional repositories with the following citation: © 2024 American Society for Engineering Education. Other scholars may excerpt or quote from these materials with the same citation. When excerpting or quoting from Conference Proceedings, authors should, in addition to noting the ASEE copyright, list all the original authors and their institutions and name the host city of the conference. - Last updated April 1, 2015