Automated Clinical Coding: What, Why, and Where We Are?

AI-generated keywords: Automated Clinical Coding; Artificial Intelligence; Natural Language Processing; Transformer-based pre-trained language models; Note Bloat

AI-generated Key Points

  • Automated clinical coding is a promising task for AI and NLP to improve the efficiency and accuracy of transforming medical information into structured codes.
  • Challenges need to be addressed to develop an AI-based automated system that is human-centered, explainable, intelligent, and robust to complex real-world scenarios.
  • One of the main challenges is handling long documents: clinical notes can exceed 10 thousand tokens, while Transformer-based pre-trained language models have a limited input length.
  • Text redundancy, also called the "Note Bloat" problem, may impede the performance of deep learning models for code prediction.
  • Another challenge is achieving better performance than traditional methods, such as CNN-based approaches, for multi-label classification applied to clinical coding; the limitation may stem from inefficiency in modelling concept-level information and long documents.
  • Future deep learning-based systems need to integrate knowledge reasoning with rules and ontologies for improved and more explainable results.
  • Organizational challenges also need addressing before deploying an AI-based coding tool into the clinical coding environment.
  • Coders need to be involved in model development and deployment stages despite being occupied with their coding work.
  • Papers on this topic should also state the availability of their data.

Authors: Hang Dong, Matúš Falis, William Whiteley, Beatrice Alex, Joshua Matterson, Shaoxiong Ji, Jiaoyan Chen, Honghan Wu

accepted for npj Digital Medicine
License: CC BY 4.0

Abstract: Clinical coding is the task of transforming medical information in a patient's health records into structured codes so that they can be used for statistical analysis. This is a cognitive and time-consuming task that follows a standard process in order to achieve a high level of consistency. Clinical coding could potentially be supported by an automated system to improve the efficiency and accuracy of the process. We introduce the idea of automated clinical coding and summarise its challenges from the perspective of Artificial Intelligence (AI) and Natural Language Processing (NLP), based on the literature, our project experience over the past two and a half years (late 2019 - early 2022), and discussions with clinical coding experts in Scotland and the UK. Our research reveals the gaps between the current deep learning-based approach applied to clinical coding and the need for explainability and consistency in real-world practice. Knowledge-based methods that represent and reason about the standard, explainable process of a task may need to be incorporated into deep learning-based methods for clinical coding. Automated clinical coding is a promising task for AI, despite the technical and organisational challenges. Coders need to be involved in the development process. There is much to achieve to develop and deploy an AI-based automated system to support coding in the next five years and beyond.

Submitted to arXiv on 21 Mar. 2022

Results of the summarizing process for the arXiv paper: 2203.11092v3

Automated clinical coding is a promising task for Artificial Intelligence (AI) and Natural Language Processing (NLP), as it has the potential to improve the efficiency and accuracy of transforming medical information in a patient's health records into structured codes for statistical analysis. However, several challenges need to be addressed to develop an AI-based automated system that is human-centered, explainable, intelligent, and robust to complex real-world scenarios.

One of the main challenges is handling long documents. Recent Transformer-based pre-trained language models have a limited input length because of their memory-demanding self-attention mechanism, while clinical notes can exceed 10 thousand tokens. Additionally, text redundancy, also called the "Note Bloat" problem, may impede the performance of deep learning models for code prediction. Another challenge is achieving better performance than traditional methods, such as CNN-based approaches, for multi-label classification applied to clinical coding. This limitation may be due to inefficiency in modelling concept-level information and long documents. Moreover, manual coding is largely based on a standard process with rules applied to the healthcare system. Future deep learning-based systems need to integrate knowledge reasoning with rules and ontologies to achieve improved and more explainable results.

Organizational challenges also need addressing before an AI-based coding tool is deployed into the clinical coding environment. Coders need to be involved in the model development and deployment stages despite being occupied with their coding work. Papers on this topic should also state the availability of their data.

In conclusion, while technical and organizational challenges need addressing before an AI-based automated system for clinical coding can be deployed, there is a clearer path forward thanks to the growing number of studies and projects in academia and industry. With further research support for projects in medical informatics and computer science, we look forward to seeing more advances in AI-assisted clinical coding in the next five years and beyond.
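
The long-document problem described in the summary can be illustrated with a minimal sketch. It is not the method of the paper: it shows one common workaround, assumed here for illustration, in which a note is split into overlapping windows that fit a model's input limit, each window is scored separately, and the per-code scores are merged by taking the maximum. The function names, the 512-token limit, the stride, the threshold, and the example scores are all hypothetical.

```python
from typing import Dict, List


def split_into_chunks(tokens: List[str], max_len: int = 512,
                      stride: int = 256) -> List[List[str]]:
    """Split a token list into windows of at most max_len tokens.

    Consecutive windows start `stride` tokens apart, so they overlap
    whenever stride < max_len (assumed here; a larger stride leaves gaps).
    """
    if max_len <= 0 or stride <= 0:
        raise ValueError("max_len and stride must be positive")
    chunks = []
    start = 0
    while True:
        chunks.append(tokens[start:start + max_len])
        if start + max_len >= len(tokens):
            break  # this window already reaches the end of the note
        start += stride
    return chunks


def aggregate_max(chunk_scores: List[Dict[str, float]]) -> Dict[str, float]:
    """For each code, keep the highest score seen in any chunk."""
    merged: Dict[str, float] = {}
    for scores in chunk_scores:
        for code, score in scores.items():
            merged[code] = max(score, merged.get(code, score))
    return merged


def predict_codes(doc_scores: Dict[str, float],
                  threshold: float = 0.5) -> List[str]:
    """Multi-label decision: return every code at or above the threshold."""
    return sorted(c for c, s in doc_scores.items() if s >= threshold)


if __name__ == "__main__":
    # A stand-in for a 10,000-token clinical note.
    note = [f"tok{i}" for i in range(10_000)]
    print(len(split_into_chunks(note)), "chunks")

    # Hypothetical per-chunk scores that a classifier might produce.
    scores = aggregate_max([{"I10": 0.2, "E11": 0.7}, {"I10": 0.9}])
    print(predict_codes(scores))
```

Max-pooling across chunks is only one possible aggregation choice; it favours recall, since a code mentioned in a single window can still be predicted, and it does nothing to address the redundancy ("Note Bloat") issue noted above.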
Created on 27 May. 2023
