Ⅰ. INTRODUCTION
Corresponding to the fourth industrial revolution, accuracy of image analysis using Artificial Intelligence (AI), Deep Learning, is increasing fast in medical image analysis and diagnosis.[1-3] Deep Learning is a technology that combines Artificial Neural Network (ANN), which is an algorithm based on a mechanism of neural networking, and Big data. ANN can interpret the input data using perceptron, classification and clustering and can recognize specific pattern of data using this interpretation. Deep Learning uses Convolution Neural Network (CNN), one kind of ANN.[4,5] CNN has advantages that inputting two-dimensional data and training is easy and there is less parameter. It shows good performance in every part of image and audio.[6] Chest X-ray was basic examination for pneumonia and follow up for improvement, and was reported a sensitivity of more than 50% when diagnosed with pneumonia.[7,8] However, the diagnosis of pneumonia may be misdiagnosed by the doctor, and it takes a long time to Chest X-ray read.[9] Increasing accuracy of pneumonia diagnosis and reducing reading time would help imaging diagnosis a lot. Therefore, this study aims to evaluate pneumonia classification and accuracy of deep learning using chest X-ray data.
Ⅱ. MATERIAL AND METHODS
1. Chest X-ray image data
This research used 5,872 chest X-ray images provided by Labeled Optical Coherence Tomography (OCT) and Chest X-ray Images for Classification from Mendeley LTD. 1,583 images were Normal and 4,289 images were Pneumonia as shown in Fig. 1.
Fig. 1. Image of Chest PA.
2. Data preprocessing and modeling
2.1. Data classification
Collected chest X-ray image was classified into Training Data about 88.8% (Normal 25.7%, Pneumonia 74.3%), Test Data about 11% (Normal 37.5%, Pneumonia 62.5%) and Validation Data about 0.2% (Normal 50%, Pneumonia 50%). Size of all image was reduced to 150×150.
2.2. Data modeling
The research used Anaconda (Anaconda Inc, USA, Texas, Ver. 4.7.10), Python (Python Software Foundation, Ver. 3.7.3), Jupyter Notebook (Project Jupyter, Ver. 6.0.0), Keras (Ver. 2.2.4.)
2.2.1 Convolution layer
Among Convolution layer, Conv2D layer was used. At three layers, filter number was set as 64-128-128 and filter size was set as 2×2 and 3×3. At four layers, filter number was set as 64-128-128-128 and filter size as 3×3.
2.2.2 Max Pooling layer
For Max Pooling layer, pool size was set as 2×2 in order to obtain feature through extracting maximum value from Pixel of output Image through Convolution and resizing.
2.2.3 Flatten layer
To transform two - dimensional data into one - dimensional image in order to convey the characteristics, which is extracted into Convolution layer and Max Pooling layer to Fully-connected layer, Flatten layer was used.
2.2.4 Image Data Generator
It was done to increase performance by reflecting character of Test Dataset fully through overstating Training Dataset which is not good enough with Image Data Generator as shown in Fig. 2.
Fig. 2. Data modeling diagram.
2.3. Data learning
For data learning, an experiment was done under the setting that parameters are Filter number, Filter size, drop out, epoch, batch size and Loss Function when Convolution layer is 3. Secondly, an experiment was done under the setting that parameters are Filter number, Filter size, drop out, epoch, batch size and Loss Function when Convolution layer is 4.
Ⅲ. RESULT
When Convolution layer was 3 and Filter number was 64-128-128 and set as Filter size 2×2, Drop out 0.25, Epoch 3, Batch size 16, the accuracy was 83.75%. When set as Filter size 3×3, Drop out 0.25, Epoch 3, Batch size 16, the accuracy was 87.5%. When set as Filter size 3×3, Drop out 0.5, Epoch 3, Batch size 16, the accuracy was 66.25%. When set as Filter size 3×3, Drop out 0.25, Epoch 5, Batch size 16, the accuracy was 90%, as shown in Table 1. When Convolution layer was 4, filter number was 64-128-128-128 and filter size is 3×3, the accuracy was 82.5% under the setting that Drop out was 0.25, Epoch was 3, Batch size was 16 and loss function was RMSprop. When set as Drop out 0.25, Epoch 5, Batch size 15, loss function as adam, the accuracy was 90.67%, as shown in Table 2. The accuracy was highest as 94.67% when set as Drop out 0.25, Epoch 5, Batch size 15, Loss function rmsprop, as shown in Fig. 3. The progress of learning rate of this model, as shown in Fig. 4.
Table 1. Set value of Convolution layer 3
Table 2. Set value of Convolution layer 4
Fig. 3. Diagram of convolution layer.
Fig. 4. Learning progress of the highest model.
Ⅳ. DISCUSSION
This research classified into Normal and Pneumonia through Deep Learning. Among the results, minimum was 66.25% and maximum was 94.67%. For the development of program supporting diagnosis of occupational lung disease (pneumoconiosis)[10], the research classifying Normal and Pneumoconiosis using CNN showed accuracy up to 95%. This research also showed high results with accuracy up to 94.67%. A research about detection of crack in granite X-ray CT image using Deep Learning applied a method increasing image learning data but the accuracy was only 67.91%.[11] A research classifying lung pattern among interstitial lung disease showed 85.5% accuracy.[12] This research showed satisfactory prediction accuracy but there was a few restraints when classifying Chest image using Deep Learning. Deep Learning is a machine-learning that can obtain more accurate results as data number increases. For medical image, however, it was difficult to obtain enough data for learning due to personal information protection.[13] Cooperation with database about clinical disorders along with a supporting system for web service or Electronic Medical Record (EMR) system would solve this and make possible to share large volumes of data. Learning with more various and large amount of lung image would show higher accuracy than this research. Also, a phenomenon that perception rate failed to meet expectation would occur if classify high-resolution medical image with existing algorithm.[14] Despite these problems, this research developed a method to enable learning at network by dividing large size of image into small size of lattice.[15] This method would show high accuracy if algorithm for high-resolution image like medical image is developed.
Ⅴ. CONCLUSION
Chest X-ray classification and pneumonia diagnosis using deep learning were high accuracy and it is considered to be a useful basic data for pneumonia diagnosis.
References
- M. Rastgarpour, J. Shanbehzadeh, "Application of AI Techniques in Medical Image Segmentation and Novel Categorization of Available Methods and Tools," Lecture Notes in Engineering and Computer Science, Vol. 2188, No. 1, pp. 519-523, 2011.
- Sarraf, Saman, Ghassem, Tofighi, "Deep learning-based pipeline to recognize Alzheimer's disease using fMRI data," Future Technologies Conference (FTC), IEEE, pp. 816-820, 2016.
- U. Niyaz, Abhishek Singh Sambyal, Devanand, "Advances in Deep Learning Techniques for Medical Image Analysis," 2018 Fifth International Conference on Parallel, Distributed and Grid Computing (PDGC), IEEE, pp. 271-277, 2018.
- R. Meng, S. G, Rice, J. Wang, X. Sun, "A Fusion Steganographic Algorithm Based on Faster R-CNN," Computers, Materials & Continua, Vol. 55, No. 1, pp. 1-16, 2018.
- Schmidhuber, Jurgen. "Deep learning in neural networks: An overview," Neural networks, Vol. 61, pp. 85-117, 2015. https://doi.org/10.1016/j.neunet.2014.09.003
- L. Amelio, A. Amelio. "Classification Methods in Image Analysis with a Special Focus on Medical Analytics," Machine Learning Paradigms: Advances in Data Analytics, Vol. 149, pp. 31-69, 2018. https://doi.org/10.1007/978-3-319-94030-4_3
- E. J. Park, J. H. Park, M. J. Hong, W. D. Kim, K. Y. Lee, S. J. Kim, H. J. Kim, K. W. Ha, G. R. Chon, H. A. Kim, K. H. Yoo. "Usefulness of Vibration Response Imaging(VRI) for Pneumonia Patients," Tuberculosis and Respiratory Diseases, Vol. 71, No. 1, pp. 30-36, 2011. https://doi.org/10.4046/trd.2011.71.1.30
- J. P. Matlay, M. J. Fine, "Testing strategies in the initial management of patients with community-acquired pneumonia," Annals of Internal Medicine, Vol. 138, No. 2, pp. 109-118, 2003. https://doi.org/10.7326/0003-4819-138-2-200301210-00012
- M. Tsoumakidou, G. Chrysofakis, I. Tsiligianni, G. Maltezakis, N. M. Siafakas, N. Tzanakis, "A Prospective Analysis of 184 Hemoptysis Cases - Daignositc Impact of Chest X-Ray, Computed Tomography, Bronchoscopy," Respiration, Vol. 79, No. 6, pp. 808-814, 2006.
- J. P. Myung, J. S. Song, S. H. Kang, S. W. Han, "An artificial intelligence deep learning based diagnostic assistance program for pneumoconiosis," Journal of the Korean Academy of Tuberculosis and Respiratory Diseases in the Fall Conference of The Korean Society of Visualization, Vol. 126, pp. 258-258, 2018.
- S. H. Hyun, J. S. Lee, S. H. Jeon, Y. J. Kim, K. Y. Kim, T. S. Yoon, "Pixel-level Crack Detection in X-ray Computed Tomography Image of Granite using Deep Learning," Korean Society for Rock Mechanics and Rock Enginnering in the tunnels and underground spaces, Vol. 29, No. 3, pp. 184-196, 2019.
- M. Anthimopoulos, S. Christodoulidis, L. Ebner, A. Christe, S. Mougiakakou, "Lung pattern classification for interstitial lung diseases using a deep convolutional neural network," IEEE transactions on medical imaging, Vol. 35, No. 5, pp. 1207-1216, 2016. https://doi.org/10.1109/TMI.2016.2535865
- Shen, Dinggang, Guorong Wu, Heung-Il Suk, "Deep learning in medical image analysis," Annual review of biomedical engineering, Vol. 19 No. 1, pp. 221-248, 2017.
- Y. M. Seo, J. K. Han. "Deep Learning Algorithm to Identify Cancer Pictures," Journal of Broadcast Engineering, Vol. 23, No. 5, pp. 669-681, 2018. https://doi.org/10.5909/JBE.2018.23.5.669
- Erickson, Bradley J., et al. "Deep learning in radiology: does one size fit all?," Journal of the American College of Radiology, Vol. 15, No. 3, pp. 521-526, 2018. https://doi.org/10.1016/j.jacr.2017.12.027
Cited by
- 심장비대증 환자의 흉부 X선 영상에 대한 Inception V3 알고리즘의 분류 성능평가 vol.15, pp.4, 2020, https://doi.org/10.7742/jksr.2021.15.4.455