Medical informatics methods for the clinical evidence extraction

Article information

J Korean Med Assoc. 2012;55(8):741-747
Publication date (electronic) : 2012 August 16
doi : https://doi.org/10.5124/jkma.2012.55.8.741
1U-Healthcare Institute, Gachon University, Incheon, Korea.
2U-Healthcare Center, Gachon University Gil Hospital, Incheon, Korea.
3IT Department, Gachon University, Incheon, Korea.
Corresponding author: Young Ho Lee, lyh@gachon.ac.kr
Received 2012 June 29; Accepted 2012 July 12.

Abstract

Clinical professionals gain new information to assist in patient care when they read the medical literature. Similarly, in clinical preventive medicine, medical science documents that have previously published can be searched and evaluated in order to confirm the scientific support for the clinical preventive medical service offered in order to prevent chronic disease. This paper introduces the medical informatics techniques for knowledge extraction that can become the basis for clinical practice. Particularly, it discusses the clinical document retrieval and knowledge discovery tools that can search for extracting the knowledge which the medical expert desires with data mining techniques. For example, Clinical medical personnel and medical researchers can locate the information from the latest literature rapidly or find and evaluate the scientific basis for the treatment and prevention of infection. This study can be used when they analyze the correlation between accumulated and different type of data and contributes to the detection of new knowledge. Recently, the concern about the visualization of massive data and information is high as the importance of big data has received greater attention. Contributions to this technique and decision support tools will increase gradually due to the way support for decision-making through scientific evidence for the pattern changing disease is evaluated or as one of the clinical practice guidelines is accepted.

Acknowledgement

This research was supported by grant no. 10037283 from the Industrial Strategic Technology Development Program funded by the Ministry of Knowledge Economy.

References

1. Woolf SH, Grol R, Hutchinson A, Eccles M, Grimshaw J. Clinical guidelines: potential benefits, limitations, and harms of clinical guidelines. BMJ 1999. 318527–530.
2. Buchanan BG, Shortliffe EH. Rule based expert systems: the MYCIN experiments of the Stanford Heuristic Programming Project 1984. Boston: Addison-Wesley Longman Publishing;
3. Vapnik VN. Statistical learning theory 1998. New York: Wiley-Interscience;
4. Tan PN, Steinbach M, Kumar V. Introduction to data mining 2007. Boston: Addison Wesley Longman;
5. Heckerman D. Bayesian networks for data mining. Data Min Knowl Discov 1997. 179–119.
6. Lam W, Low KF, Ho CY. In : Pollack M, ed. Using a Bayesian network induction approach for text categorization. Proceedings of the 15th International Joint Conference on Artificial Intelligence 1997. 1997 Aug 23-29; Nagoya, Japan. San Francisco: Morgan Kauffman; 745–750.
7. Cho SB, Won HH. In : Chen YP, ed. Machine learning in DNA microarray analysis for cancer classification. Proceedings of First Asia-Pacific Bioinformatics Conference 2003 Feb 4-7; Adelaide, Australia. Darlinghurst: Australian Computer Society; 189–198.
8. Orr MJ. Introduction to radial basis function networks 1996. Edinburgh: Centre for Cognitive Science;
9. Domingos P, Pazzani M. On the optimality of the simple Bayesian classifier under zero-one loss. Mach Learn 1997. 29103–130.
10. Cristianini N, Shawe-Taylor J. An introduction to support vector machines and other kernel-based learning methods 2000. Cambridge: Cambridge University Press;
11. Grishman R, Sundheim B. In : Tsujii J, ed. Message understanding conference-6: a brief history. Proceedings of the 16th Conference on Computational Linguistics 1996. 1996 Aug 5-7; Copenhagen, Denmark. Stroudsburg: Association for Computational Linguistics; 466–471.
12. Kim SN, Martinez D, Cavedon L, Yencken L. Automatic classification of sentences to support evidence based medicine. BMC Bioinformatics 2011. 12Suppl 2. S5.
13. Nawaz R, Thompson P, McNaught J, Ananiadou S. In : Nicoletta C, Khalid C, Bente M, Joseph M, Jan O, Stelios P, Mike R, Daniel T, eds. Meta-knowledge annotation of bio-events. Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC-2010) 2010. 2010 May 17-23; Valletta, Malta. [place unknown]: European Language Resources Association; 2498–2505.
14. Pan F. Multi-dimensional fragment classification in biomedical text [dissertation] 2006. Ontario: Queen's University;
15. Song MH, Kim SH, Park DK, Lee YH. A multi-classifier based guideline sentence classification system. Healthc Inform Res 2011. 17224–231.
16. Duda RO, Hart PE, Stork DG. Pattern classification 2000. 2nd edth ed. New York: Wiley-Interscience;
17. Liu H, Yu L. Toward integrating feature selection algorithms for classification and clustering. IEEE Trans Knowl Data Eng 2005. 17491–502.
18. Baharudin B, Lee LH, Khan K. A review of machine learning algorithms for text-documents classification. J Adv Inf Technol 2010. 14–20.
19. Yang Y, Pedersen JO. In : Fisher DH, ed. A comparative study on feature selection in text categorization. ICML 97: proceedings of the 14th International Conference on Machine Learning 1997. 1997 Jul 8-12; Nashville, USA. San Francisco: Morgan Kaufmann; 412–420.
20. Silla CN, Pappa GL, Freitas AA, Kaestner CA. Automatic text summarization with genetic algorithm-based attribute selection. Adv Artif Intell 2004. 3315305–314.
21. Anbarasi M, Anupriya E, Iyengar NC. Enhanced prediction of heart disease with feature subset selection using genetic algorithm. Int J Eng Sci Technol 2010. 25370–5376.
22. Nikitin A, Egorov S, Daraselia N, Mazo I. Pathway studio: the analysis and navigation of molecular networks. Bioinformatics 2003. 192155–2157.
23. Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp 2001. 17–21.
24. Pub anatomy: integrated exploration of biomedical literature and data [Internet] cited 2012 Jul 23. Ann Arbor: Microarray Lab; Available from: http://brainarray.mbni.med.umich.edu/Brainarray/prototype/PubAnatomy.
25. Plake C, Schiemann T, Pankalla M, Hakenberg J, Leser U. AliBaba: PubMed as a graph. Bioinformatics 2006. 222444–2445.
26. Stoyanovich J, Mee W, Ross KA. In : Li F, Moro MM, Ghandeharizadeh S, Haritsa JR, Weikum G, Carey MJ, Casati F, Chang EY, Manolescu I, Mehrotra S, Dayal U, Tsotras VJ, eds. Semantic ranking and result visualization for life sciences publications. Proceedings of the 26th International Conference on Data Engineering, ICDE 2010 2010. 2010 Mar 1-6; Long Beach, USA. [place unknown]: IEEE; 860–871.
27. Yu H, Kim T, Oh J, Ko I, Kim S, Han WS. Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS. BMC Bioinformatics 2010. 11Suppl 2. S6.

Article information Continued

Figure 1

PathwayAssist graphical interface. Pathway assist enables researchers to create their own pathways and produce publication quality pathway diagrams. Pathway-Assist uses a proprietary graph visualization engine to allow for the visual features (From Nikitin A, et al. Bioinformatics 2003;19:2155-2157, with permission from Oxford University Press) [22].

Figure 2

AliBaba Graphical Interface. (Parts of the) graph resulting from five PubMed ab-stracts for the query 'FADD'. Information on the selected protein 'caspase-8' is given in the right panel, for instance, association partners and evidence texts (From Plake C, et al. Bioinformatics 2006;22:2444-2445, with permission from Oxford University Press) [25].