Cynthia Zhao - Student Research and Creativity Forum - Hofstra University

‭ sing Generative Large Language Models for Assertion Detection in Medical Notes‬ U ‭MS Computer Science Thesis Project‬ ‭By: Cynthia Zhao‬ ‭Advisor: Dr. Simona Doboli‬ ‭ he‬ ‭unstructured‬ ‭nature‬ ‭of‬ ‭electronic‬ ‭medical‬ ‭records‬ ‭and‬ ‭clinical‬ ‭notes‬ ‭make‬ ‭it‬ ‭difficult‬ ‭for‬ ‭Natural‬ T ‭Language‬‭Processing‬‭techniques‬‭to‬‭automatically‬‭analyze,‬‭yet‬‭it‬‭is‬‭important‬‭for‬‭healthcare‬‭professionals‬ ‭to‬‭accurately‬‭and‬‭efficiently‬‭understand‬‭clinical‬‭notes‬‭and‬‭determine‬‭the‬‭presence,‬‭absence,‬‭or‬‭possibility‬ ‭of‬ ‭certain‬ ‭medical‬ ‭problems‬ ‭that‬ ‭may‬ ‭exist‬ ‭in‬ ‭their‬ ‭patients.‬ ‭To‬ ‭address‬ ‭this‬ ‭issue,‬ ‭the‬ ‭task‬ ‭of‬ ‭categorizing‬‭different‬‭assertions‬‭(i.e.‬‭statements‬‭expressing‬‭facts)‬‭can‬‭be‬‭employed.‬‭This‬‭is‬‭also‬‭known‬ ‭as‬‭clinical‬‭assertion‬‭detection‬‭and‬‭it‬‭has‬‭been‬‭explored‬‭using‬‭non-generative‬‭models‬‭to‬‭identify‬‭assertion‬ ‭classes‬ ‭of‬ ‭medical‬ ‭entities‬ ‭in‬ ‭unstructured‬ ‭clinical‬‭notes.‬‭Non-generative‬‭models‬‭are‬‭models‬‭that‬‭don’t‬ ‭generate‬‭text‬‭but‬‭are‬‭good‬‭at‬‭sorting‬‭data‬‭and‬‭making‬‭predictions‬‭based‬‭on‬‭patterns.‬‭This‬‭project‬‭differs‬ ‭in‬ ‭that‬ ‭we‬ ‭employ‬ ‭a‬ ‭generative‬ ‭LLM‬ ‭for‬ ‭assertion‬ ‭detection.‬ ‭Generative‬ ‭models‬ ‭have‬ ‭the‬ ‭ability‬ ‭to‬ ‭generate‬ ‭text,‬ ‭enabling‬ ‭the‬‭generation‬‭of‬‭explanations‬‭alongside‬‭the‬‭assertion‬‭classification,‬‭potentially‬ ‭offering‬ ‭more‬ ‭support‬‭for‬‭medical‬‭professionals.‬‭There‬‭are‬‭many‬‭applications‬‭for‬‭this‬‭in‬‭the‬‭healthcare‬ ‭realm.‬ ‭Primarily,‬ ‭it‬ ‭is‬ ‭crucial‬ ‭for‬ ‭healthcare‬ ‭professionals‬ ‭to‬ ‭efficiently‬ ‭comprehend‬ ‭the‬ ‭information‬ ‭contained‬ ‭in‬ ‭clinical‬ ‭notes.‬‭It‬‭can‬‭also‬‭be‬‭used‬‭for‬‭the‬‭automation‬‭of‬‭clinical‬‭decision‬‭support,‬‭medical‬ ‭risk evaluation, and much more.‬ ‭ ur‬‭primary‬‭aim‬‭is‬‭to‬‭utilize‬‭a‬‭clinical‬‭note‬‭along‬‭with‬‭a‬‭medical‬‭entity‬‭and‬‭its‬‭context,‬‭as‬‭provided‬‭by‬ O ‭the‬ ‭2010‬ ‭i2b2‬ ‭Assertion‬ ‭Task‬ ‭dataset,‬ ‭to‬ ‭enable‬ ‭the‬ ‭model‬ ‭to‬ ‭determine‬ ‭whether‬ ‭the‬‭entity‬‭is‬‭present,‬ ‭absent,‬ ‭or‬ ‭possible‬ ‭in‬‭a‬‭patient.‬‭The‬‭generative‬‭LLM‬‭that‬‭we‬‭used‬‭is‬‭called‬‭Mistral‬‭and‬‭it‬‭has‬‭7‬‭billion‬ ‭parameters.‬‭Through‬‭a‬‭task‬‭called‬‭finetuning,‬‭we‬‭can‬‭change‬‭or‬‭“finetune”‬‭these‬‭parameters‬‭so‬‭that‬‭the‬ ‭model‬‭is‬‭better‬‭suited‬‭to‬‭our‬‭specific‬‭task‬‭of‬‭clinical‬‭assertion‬‭detection.‬‭Using‬‭the‬‭i2b2‬‭dataset‬‭and‬‭the‬ ‭comprehensive‬‭training‬‭dataset‬‭that‬‭we‬‭generated‬‭through‬‭an‬‭iterative‬‭process,‬‭we‬‭observed‬‭tremendous‬ ‭progress‬‭in‬‭the‬‭finetune‬‭generated‬‭predictions.‬‭Improving‬‭from‬‭a‬‭pre-finetuned‬‭micro‬‭F1-score‬‭of‬‭0.50‬‭to‬ ‭a‬ ‭post-fine‬ ‭tuned‬ ‭micro‬ ‭F1-score‬ ‭of‬ ‭0.88,‬ ‭underscoring‬‭the‬‭capability‬‭of‬‭our‬‭fine-tuned‬‭model‬‭to‬‭learn‬ ‭essential patterns and excel in assertion detction.‬ ‭ his‬ ‭project‬ ‭also‬ ‭demonstrates‬ ‭the‬ ‭importance‬ ‭of‬ ‭prompting‬ ‭formats‬ ‭and‬ ‭their‬ ‭impact‬ ‭on‬ ‭the‬ ‭model’s‬ T ‭generated‬ ‭responses.‬ ‭We‬ ‭observed‬ ‭that‬ ‭utilizing‬ ‭one-shot‬ ‭prompting,‬ ‭or‬ ‭prompting‬ ‭that‬ ‭provides‬ ‭an‬ ‭example‬‭of‬‭the‬‭desired‬‭response,‬‭improved‬‭the‬‭model’s‬‭predictions‬‭a‬‭considerable‬‭amount‬‭over‬‭zero-shot‬ ‭prompting,‬ ‭or‬ ‭prompting‬ ‭without‬ ‭any‬ ‭examples‬ ‭in‬ ‭the‬ ‭prompt,‬ ‭without‬ ‭any‬ ‭additional‬ ‭finetuning.‬ ‭Combining‬‭one-shot‬‭prompting‬‭with‬‭other‬‭methods‬‭like‬‭multiple‬‭choice‬‭prompting‬‭and‬‭chain‬‭of‬‭thought‬ ‭prompting‬‭enabled‬‭us‬‭to‬‭develop‬‭a‬‭prompting‬‭format‬‭that‬‭enhanced‬‭response‬‭generation,‬‭surpassing‬‭the‬ ‭performance of a basic zero-shot prompt.‬ ‭ hile‬‭our‬‭results‬‭do‬‭not‬‭outperform‬‭those‬‭of‬‭the‬‭state-of-the-art‬‭BERT‬‭model,‬‭our‬‭results‬‭are‬‭promising‬ W ‭and‬ ‭show‬ ‭the‬ ‭potential‬ ‭of‬ ‭the‬ ‭application‬ ‭of‬ ‭generative‬ ‭models‬ ‭in‬ ‭the‬ ‭healthcare‬ ‭field.‬ ‭Some‬ ‭areas‬‭of‬ ‭further‬ ‭research‬ ‭include‬ ‭exploration‬ ‭into‬ ‭medical‬ ‭question‬ ‭and‬ ‭answer‬‭prompt‬‭engineering‬‭to‬‭generate‬ ‭optimal‬ ‭prompts‬ ‭as‬ ‭well‬ ‭as‬ ‭finetuning‬ ‭Mistral‬ ‭with‬ ‭other‬ ‭clinical‬ ‭datasets‬ ‭and‬ ‭fine‬ ‭tuning‬ ‭other‬ ‭generative LLMs on medical clinical notes.‬

Turn static files into dynamic content formats.

Create a flipbook