TY - JOUR
T1 - Text summarization in the biomedical domain
T2 - A systematic review of recent research
AU - Mishra, Rashmi
AU - Bian, Jiantao
AU - Fiszman, Marcelo
AU - Weir, Charlene R.
AU - Jonnalagadda, Siddhartha
AU - Mostafa, Javed
AU - Del Fiol, Guilherme
N1 - Funding Information:
The authors would like to acknowledge Alice Weber for providing insights on the search strategy of this systematic review. This project was supported by Grant Number 1R01LM011416-01 from the National Library of Medicine .
Publisher Copyright:
© 2014 Elsevier Inc.
PY - 2014/12/1
Y1 - 2014/12/1
N2 - Objective: The amount of information for clinicians and clinical researchers is growing exponentially. Text summarization reduces information as an attempt to enable users to find and understand relevant source texts more quickly and effortlessly. In recent years, substantial research has been conducted to develop and evaluate various summarization techniques in the biomedical domain. The goal of this study was to systematically review recent published research on summarization of textual documents in the biomedical domain. Materials and methods: MEDLINE (2000 to October 2013), IEEE Digital Library, and the ACM digital library were searched. Investigators independently screened and abstracted studies that examined text summarization techniques in the biomedical domain. Information is derived from selected articles on five dimensions: input, purpose, output, method and evaluation. Results: Of 10,786 studies retrieved, 34 (0.3%) met the inclusion criteria. Natural language processing (17; 50%) and a hybrid technique comprising of statistical, Natural language processing and machine learning (15; 44%) were the most common summarization approaches. Most studies (28; 82%) conducted an intrinsic evaluation. Discussion: This is the first systematic review of text summarization in the biomedical domain. The study identified research gaps and provides recommendations for guiding future research on biomedical text summarization. Conclusion: Recent research has focused on a hybrid technique comprising statistical, language processing and machine learning techniques. Further research is needed on the application and evaluation of text summarization in real research or patient care settings.
AB - Objective: The amount of information for clinicians and clinical researchers is growing exponentially. Text summarization reduces information as an attempt to enable users to find and understand relevant source texts more quickly and effortlessly. In recent years, substantial research has been conducted to develop and evaluate various summarization techniques in the biomedical domain. The goal of this study was to systematically review recent published research on summarization of textual documents in the biomedical domain. Materials and methods: MEDLINE (2000 to October 2013), IEEE Digital Library, and the ACM digital library were searched. Investigators independently screened and abstracted studies that examined text summarization techniques in the biomedical domain. Information is derived from selected articles on five dimensions: input, purpose, output, method and evaluation. Results: Of 10,786 studies retrieved, 34 (0.3%) met the inclusion criteria. Natural language processing (17; 50%) and a hybrid technique comprising of statistical, Natural language processing and machine learning (15; 44%) were the most common summarization approaches. Most studies (28; 82%) conducted an intrinsic evaluation. Discussion: This is the first systematic review of text summarization in the biomedical domain. The study identified research gaps and provides recommendations for guiding future research on biomedical text summarization. Conclusion: Recent research has focused on a hybrid technique comprising statistical, language processing and machine learning techniques. Further research is needed on the application and evaluation of text summarization in real research or patient care settings.
KW - Biomedical domain
KW - Intrinsic evaluation
KW - Language processing
KW - Machine learning
KW - Text summarization
UR - http://www.scopus.com/inward/record.url?scp=84919846379&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84919846379&partnerID=8YFLogxK
U2 - 10.1016/j.jbi.2014.06.009
DO - 10.1016/j.jbi.2014.06.009
M3 - Review article
C2 - 25016293
AN - SCOPUS:84919846379
SN - 1532-0464
VL - 52
SP - 457
EP - 467
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
ER -