Search | Korea Science

DP-LinkNet: A convolutional network for historical document image binarization

Xiong, Wei;Jia, Xiuhong;Yang, Dichun;Ai, Meihui;Li, Lirong;Wang, Song
- KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.5 / pp.1778-1797 / 2021
Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.
https://doi.org/10.3837/tiis.2021.05.011
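
Limitation (1) in the abstract above concerns resolution lost to striding, which dilated convolutions avoid while still enlarging the receptive field. The following is a minimal sketch of that arithmetic only, not the authors' implementation: the 3x3 kernel size and the dilation rates 1, 2, 4, 8 are illustrative assumptions that do not appear in the abstract, and `receptive_field` is a hypothetical helper name.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in pixels, along one axis) of stacked stride-1 convolutions."""
    rf = 1
    for d in dilations:
        # A dilated kernel spans d * (kernel_size - 1) + 1 pixels, so each
        # stride-1 layer widens the receptive field by d * (kernel_size - 1).
        rf += d * (kernel_size - 1)
    return rf


# Assumed, illustrative settings: four 3x3 layers, with and without dilation.
plain = receptive_field(3, [1, 1, 1, 1])
dilated = receptive_field(3, [1, 2, 4, 8])
print(plain, dilated)  # prints "9 31"
```

With stride 1 in both cases the feature map keeps its resolution, yet the dilated stack covers 31 pixels per axis instead of 9.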

Evaluation of Deep-Learning Feature Based COVID-19 Classifier in Various Neural Network (코로나바이러스 감염증19 데이터베이스에 기반을 둔 인공신경망 모델의 특성 평가)

Hong, Jun-Yong;Jung, Young-Jin
- Journal of radiological science and technology / v.43 no.5 / pp.397-404 / 2020
Coronavirus disease (COVID-19) is a highly infectious disease that directly affects the lungs. Chest radiography (CXR) can be used to observe clinical findings in the lungs quickly. However, the diagnostic performance of CXR needs to be improved, since identifying these findings is highly time-consuming and prone to human error. Therefore, an artificial intelligence (AI) based tool may be useful to aid the diagnosis of COVID-19 via CXR. In this study, we explored various deep learning (DL) approaches to classify CXR images as COVID-19, other viral pneumonia, or normal. For the original dataset and the lung-segmented dataset, pre-trained AlexNet, SqueezeNet, ResNet18, and DenseNet201 networks were transfer-trained and validated on three classes: COVID-19, viral pneumonia, and normal. AlexNet showed the highest mean accuracy of 99.15±2.69% and the fastest training time of 1.61±0.56 min among the four pre-trained neural networks. In this study, we demonstrated the performance of four pre-trained neural networks in COVID-19 diagnosis with CXR images. Further, we plotted the class activation map (CAM) of each network and demonstrated that lung-segmentation pre-processing improves the performance of the COVID-19 classifier on CXR images by excluding background features.
https://doi.org/10.17946/JRST.2020.43.5.397

Band Selection Algorithm based on Expected Value for Pixel Classification (픽셀 분류를 위한 기댓값 기반 밴드 선택 알고리즘)

Chang, Duhyeuk;Jung, Byeonghyeon;Heo, Junyoung
- The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.6 / pp.107-112 / 2022
In an embedded system such as a drone, it is difficult to store, transfer, and analyze an entire hyper-spectral image on a server in real time because doing so takes a lot of power and time. Therefore, the hyper-spectral image data is transmitted to the server after dimension reduction or compression pre-processing. Feature selection methods are used to send only the bands needed for analysis, but these algorithms usually take a long processing time depending on the image size, even though their efficiency is high. In this paper, by improving the time cost of the band selection algorithm, the processing time was reduced from 24 hours to around 60-180 seconds for 8GB of data at an image resolution of 40000*682, and RAM usage was reduced from 7.6GB to 2.3GB by using 45 out of 150 bands. In terms of pixel classification performance, however, more than 98% of the analysis results were similar to those of the previous method.
https://doi.org/10.7236/JIIBC.2022.22.6.107
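
The memory figures in the abstract above can be sanity-checked with simple arithmetic. This is a rough sketch under one loud assumption that the abstract does not state: each sample is stored in 2 bytes (16 bits). The resolution and band counts are the ones quoted; `cube_bytes` is a hypothetical helper name, and the paper's actual memory accounting may differ.

```python
# Figures quoted in the abstract.
rows, cols = 40000, 682   # image resolution
total_bands = 150
selected_bands = 45

# Assumption (not stated in the abstract): 2 bytes (16 bits) per sample.
BYTES_PER_SAMPLE = 2


def cube_bytes(bands):
    """Size in bytes of a rows x cols x bands cube at the assumed sample width."""
    return rows * cols * bands * BYTES_PER_SAMPLE


full = cube_bytes(total_bands)
reduced = cube_bytes(selected_bands)

print(f"full cube:    {full / 1e9:.1f} GB, {full / 2**30:.1f} GiB")
print(f"45-band cube: {reduced / 2**30:.1f} GiB")
```

Under that assumption the full cube comes to about 8.2 GB (7.6 GiB) and the 45-band subset to about 2.3 GiB, in line with the 8GB, 7.6GB, and 2.3GB figures reported.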

Analysis of Feature Variables for Breast Cancer Diagnosis

Jung, Yong Gyu;Kim, Jang Il;Sihn, Sung Chul;Heo, Jun
- International journal of advanced smart convergence / v.2 no.2 / pp.36-39 / 2013
Diagnosis is becoming more important as health information grows and the number of cancer patients gradually increases over time. Among the various types of cancer, we focus on breast cancer diagnosis. The accuracy of breast cancer diagnosis increases when the diagnosis is based on evidence and statistics. To this end, we use the Weka data mining tool and analyze the algorithms most significantly associated with decision tree rules. In addition, data pre-processing and cross-validation are used to increase the reliability of the results. The number and causes of disease cases become important for strengthening evidence-based medicine. In evidence-based medicine, data obtained from past patients are used to calculate probabilities so that disease can be diagnosed and predicted, and treatment planned, for future patients. This can play an important role in improving the survival rate.
https://doi.org/10.7236/IJASC2013.2.2.8

Beamforming Optimization Using Filterbank-based Frost Algorithm (필터뱅크 기반 프로스트 알고리즘을 이용한 빔포밍 최적화)

Park, Ji-Hoon;Lee, Sung-Joo;Hong, Jeong-Pyo;Jeong, Sang-Bae;Hahn, Min-Soo
- MALSORI / no.66 / pp.73-86 / 2008
Beamforming is a spatial filtering technique that extracts only the desired signals from noisy environments using microphone arrays. Fixed beamforming is simple in concept and easy to implement, but it does not perform well in real noisy conditions. As an adaptive beamformer, the Frost algorithm is a good candidate. It uses the concept of the linearly constrained minimum variance (LCMV) algorithm. The difference between the Frost and LCMV algorithms is the error correction scheme, which is a very effective feature in terms of performance. In this paper, a quadrature mirror filter (QMF)-based filterbank is used as pre-processing for Frost beamforming, and the filter length and learning rate of each band are optimized to improve performance. Performance is measured by the signal-to-noise ratio (SNR) and the Bark-scale spectral distortion (BSD).
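
For orientation, the relation between LCMV and the Frost algorithm mentioned above can be written in the commonly cited textbook form below. This is not taken from the paper: the symbols (weight vector w, input covariance R_xx, constraint matrix C, response vector f, step size mu) are assumptions introduced here for illustration.

```latex
% LCMV problem: minimize output power subject to linear constraints
\min_{\mathbf{w}} \; \mathbf{w}^{T}\mathbf{R}_{xx}\mathbf{w}
\quad \text{subject to} \quad \mathbf{C}^{T}\mathbf{w} = \mathbf{f}

% Frost's adaptive update, with beamformer output y(k) = w(k)^T x(k)
\mathbf{w}(k+1) = \mathbf{P}\,\bigl[\mathbf{w}(k) - \mu\, y(k)\,\mathbf{x}(k)\bigr] + \mathbf{F},
\qquad
\mathbf{P} = \mathbf{I} - \mathbf{C}(\mathbf{C}^{T}\mathbf{C})^{-1}\mathbf{C}^{T},
\quad
\mathbf{F} = \mathbf{C}(\mathbf{C}^{T}\mathbf{C})^{-1}\mathbf{f}
```

Since C^T P = 0 and C^T F = f, the constraint C^T w(k+1) = f holds after every update even if w(k) has drifted through accumulated numerical error, which is one way to read the "error correction scheme" in the abstract.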

Object Recognition Using the Edge Orientation Histogram and Improved Multi-Layer Neural Network

Kang, Myung-A
- International Journal of Advanced Culture Technology / v.6 no.3 / pp.142-150 / 2018
This paper describes an algorithm that lowers the dimensionality, maintains object recognition performance, and significantly reduces the eigenspace construction time by combining the edge orientation histogram with principal component analysis. Using the detected object region as the recognition input image, the paper proposes an object recognition method that combines principal component analysis with a multi-layer neural network, one of the intelligent classification methods, and evaluates its performance. As pre-processing of the input object image, the method computes the eigenspace through principal component analysis and expresses the training images in terms of its basis vectors. Each image takes its set of weights on the basis vectors as a feature vector, which also reduces the image dimension, and object recognition is then performed by feeding the feature vector into the multi-layer neural network.
https://doi.org/10.17703//IJACT2018.6.3.142
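
To make the edge orientation histogram in the abstract above concrete, here is a minimal sketch in plain Python. It is one common formulation, a magnitude-weighted histogram of gradient orientations computed with central differences, and not necessarily the edge detector or bin count used in the paper. The function name, the 8-bin default, and the toy image are all assumptions.

```python
import math


def edge_orientation_histogram(image, bins=8):
    """Magnitude-weighted histogram of gradient orientations in [0, pi)."""
    height, width = len(image), len(image[0])
    hist = [0.0] * bins
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central differences approximate the horizontal/vertical gradient.
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            # Fold the angle into [0, pi) so opposite directions share a bin.
            angle = math.atan2(gy, gx) % math.pi
            index = min(int(angle / math.pi * bins), bins - 1)
            hist[index] += magnitude
    total = sum(hist)
    # Normalize so the descriptor does not depend on overall contrast.
    return [h / total for h in hist] if total else hist


# Toy 6x6 image with a vertical step edge: dark left half, bright right half.
image = [[0, 0, 0, 255, 255, 255] for _ in range(6)]
hist = edge_orientation_histogram(image)
print(hist)
```

For this toy image every non-zero gradient is horizontal, so all of the weight falls into the first bin. A fixed-length descriptor like this is far smaller than the raw image, which is consistent with the reduced eigenspace construction time the abstract reports.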

Estimation of gender and age using CNN-based face recognition algorithm

Lim, Sooyeon
- International journal of advanced smart convergence / v.9 no.2 / pp.203-211 / 2020
This study proposes a deep learning-based method for estimating gender and age that is robust to various changes in the external environment. To improve the accuracy of the proposed algorithm, an improved CNN structure and learning method are described, and the performance of the algorithm is evaluated. To improve on a learning method based on a CNN with six hidden layers, a network using GoogLeNet's inception module was constructed. In the experiment, the age estimation accuracy on the 5,328 images used for performance testing was about 85%, and the gender estimation accuracy was about 98%. Real-time age recognition, beyond feature extraction from face images, is expected to become possible if further studies are made on constructing larger data sets, pre-processing methods, and various network structures and activation functions for classifying more finely subdivided age classes.
https://doi.org/10.7236/IJASC.2020.9.2.203

Flexible Jet Point Setting In Gabor Filter Based Face Recognition (가보필터기반 얼굴인식에서의 유동적 Jet Point Setting)

신하송;김병우;이정안;김민기
- Proceedings of the IEEK Conference / 2003.07e / pp.2032-2035 / 2003
This paper focuses on the feasibility of face recognition using the Flexible Jet Point Setting method in Gabor filter based face recognition. The Gabor filter is very sensitive to texture variation. Therefore, even a small change in facial expression or rotation of posture lowers the recognition rate significantly. A suggested solution to this problem is Flexible Jet Point Setting. A significant effect of this method is that the number of jet points is reduced from over 150 to under 30, while the change in recognition rate between the two methods is negligible. Furthermore, the set of feature values resulting from Gabor filtering becomes insensitive to face variations such as expression, rotation, and lighting. The Retinex algorithm, which was developed by NASA, is used as pre-processing.
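
As background for the abstract above, the sketch below builds the real part of a standard 2-D Gabor kernel and collects the responses of several kernels at one image point, which is what is usually meant by a "jet". It does not reproduce the paper's Flexible Jet Point Setting. All parameter values, the function names, and the synthetic image are assumptions, and `size` is assumed to be odd.

```python
import math


def gabor_kernel(size, sigma, theta, wavelength, gamma=1.0, psi=0.0):
    """Real part of a 2-D Gabor filter sampled on a size x size grid."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the sinusoid varies along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            carrier = math.cos(2 * math.pi * xr / wavelength + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def jet(image, cx, cy, kernels):
    """Response of every kernel centred on the point (cx, cy)."""
    responses = []
    for kernel in kernels:
        half = len(kernel) // 2
        acc = 0.0
        for j, row in enumerate(kernel):
            for i, weight in enumerate(row):
                acc += weight * image[cy + j - half][cx + i - half]
        responses.append(acc)
    return responses


# Four orientations at one scale (assumed values), applied to a synthetic image.
kernels = [gabor_kernel(5, 2.0, k * math.pi / 4, 4.0) for k in range(4)]
image = [[(x * 37 + y * 11) % 256 for x in range(9)] for y in range(9)]
features = jet(image, 4, 4, kernels)
print(len(features))  # prints "4"
```

Each jet point contributes one response per kernel, so cutting the number of points from over 150 to under 30, as the abstract reports, shrinks the feature set proportionally.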

Multi-Scale Modelling of a Phase Mixture Model and the Finite Element Method for Nanocrystalline Materials (나노결정 재료의 상혼합모델과 유한요소법을 결합한 멀티스케일 모델링)

윤승채;서민홍;김형섭
- Transactions of Materials Processing / v.13 no.2 / pp.174-179 / 2004
The effect of grain refinement on the plastic deformation behaviour of nanocrystalline metallic materials is investigated. A phase mixture model in which a single phase material is considered as an effectively two-phase one is discussed. A distinctive feature of the model is that grain boundaries are treated as a separate phase deforming by a diffusion mechanism. For the grain interior phase two concurrent mechanisms are considered: dislocation glide and mass transfer by diffusion. The proposed constitutive model was implemented into a finite element code (DEFORM) using a semicoupled approach. The finite element method was applied to simulating room temperature tensile deformation of Cu down to the nanoscale grain size in order to investigate the pre- and post-necking behaviour.
https://doi.org/10.5228/KSPP.2004.13.2.174

Facial Expression Classification Using Deep Convolutional Neural Network

Choi, In-kyu;Ahn, Ha-eun;Yoo, Jisang
- Journal of Electrical Engineering and Technology / v.13 no.1 / pp.485-492 / 2018
In this paper, we propose a facial expression recognition method using a CNN (convolutional neural network), one of the deep learning technologies. The proposed structure offers general classification performance regardless of environment or subject. For this purpose, we collect a variety of databases and organize them into six expression classes: 'expressionless', 'happy', 'sad', 'angry', 'surprised' and 'disgusted'. Pre-processing and data augmentation techniques are applied to improve training efficiency and classification performance. Starting from an existing CNN structure, the structure that best expresses the features of the six facial expressions is found by adjusting the number of feature maps in the convolutional layers and the number of nodes in the fully-connected layer. The experimental results show good classification performance compared to the state of the art in cross-validation and cross-database experiments. Compared to other conventional models, the proposed structure is also confirmed to be superior in classification performance with less execution time.
https://doi.org/10.5370/JEET.2018.13.1.485

