Explainable hierarchical fine-grained ICD code assignment using large language models: a dissertation in Engineering and Applied Science

Joshua Carberry

doi:10.62791/1994

Back

Dissertation

Open access

Explainable hierarchical fine-grained ICD code assignment using large language models: a dissertation in Engineering and Applied Science

Joshua Carberry

Doctor of Philosophy (PHD), University of Massachusetts Dartmouth

2024

DOI:

https://doi.org/10.62791/1994

Abstract

In healthcare, structured and standardized records are essential to ensure that crucial medical information is well-documented and accessible. Doctor’s notes, an example of unstructured records, encode important information about patient diagnoses and treatments but are presented in unstructured natural language. An important step in the healthcare process is the use of standards such as the International Classification of Diseases (ICD) to annotate doctor’s notes with precise medical codes, providing a common language for accurate communication between healthcare organizations and other sectors such as insurance. Due to the complexity of healthcare, which involves countless unique and often indistinguishable diagnoses, assigning ICD codes is an enormous amount of work that requires skilled experts. In this thesis, we introduce several novel ICD auto-coding techniques that leverage knowledge representations and large language models (LLMs) to achieve high performance and explainable results. The presented auto-coding system is based on a fine-grained approach that reduces the complexity of classification and improves human comprehensibility by locating and labeling one diagnosis at a time rather than processing all notes at once. For each selected diagnosis, ICD code predictions are based not only on the diagnosis itself but also consider semantically related text elsewhere in the notes. This additional evidence simplifies classification and provides a basis for understanding the results of automated coding. We explored related sentence extraction using two approaches: an ontology-based approach (a formal knowledge representation) and an approach using the LLM GPT-4. After extracting semantically related text, LLM-based classifiers are able to predict the correct ICD codes. To improve scalability, we introduce a hierarchical classifier forming a tree of fine-tuned LLMs to handle the large label space and complex classification inherent in the ICD coding task. This hierarchical approach decomposes the ICD coding task into smaller, more manageable subclassification tasks, thereby improving tractability and addressing the challenges posed by the high number of unique labels associated with ICD coding.

Files and links (1)

pdf

Carberry J. COE PhD Dissertation 20241.65 MBDownload View

CC BY-NC-ND V4.0, Open Access

Metrics

23 File views/ downloads

42 Record Views

Details

Title: Explainable hierarchical fine-grained ICD code assignment using large language models
Creators: Joshua Carberry
ORCID: 0000-0002-1957-291X
Contributors: Haiping Xu (Advisor) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Yuchou Chang (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Donghui Yan (Committee Member) - University of Massachusetts Dartmouth, Department of Mathematics
Number of pages: x, 92 pages
Illustrations: illustrations (some color)
Table of contents: Abstract -- Acknowledgements -- Table of contents -- List of figures -- List of tables -- Abbreviations -- Chapter 1. Introduction -- Background and motivations -- Related work -- Contributions -- Chapter 2. Fine-grained ICD coding -- Introduction -- Fine-grained approach -- Ontology-based sentence extraction -- ICD code prediction -- Case studies and discussion -- Summary -- Chapter 3. Large language models for classification -- Introduction -- Pretrained transformer-based LLMs for classification -- Case studies and discussion -- Summary -- Chapter 4. Hierarchical ICD code prediction -- Introduction -- Hierarchical ICD coding with BERT -- Case studies and discussion -- Summary -- Chapter 5. Large language models for sentence extraction -- Introduction -- Generative LLMs -- LLMs for sentence extraction -- Case studies and discussion -- Chapter 6. Conclusions and future work -- References -- Publications of the author.
References: Includes bibliographical references (pages 86-91).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Doctor of Philosophy (PHD)
Degree in: Engineering and Applied Science
Academic Unit: Department of Computer and Information Science
Language: English
Resource Type: Dissertation
DOI: https://doi.org/10.62791/1994
Record Identifier: 9914424794301301