• Title/Summary/Keyword: convolution model

Search Result 400, Processing Time 0.026 seconds

SATURATION-VALUE TOTAL VARIATION BASED COLOR IMAGE DENOISING UNDER MIXED MULTIPLICATIVE AND GAUSSIAN NOISE

  • JUNG, MIYOUN
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.26 no.3
    • /
    • pp.156-184
    • /
    • 2022
  • In this article, we propose a novel variational model for restoring color images corrupted by mixed multiplicative Gamma noise and additive Gaussian noise. The model involves a data-fidelity term that characterizes the mixed noise as an infimal convolution of two noise distributions and the saturation-value total variation (SVTV) regularization. The data-fidelity term facilitates suitable separation of the multiplicative Gamma and Gaussian noise components, promoting simultaneous elimination of the mixed noise. Furthermore, the SVTV regularization enables adequate denoising of homogeneous regions, while maintaining edges and details and diminishing the color artifacts induced by noise. To solve the proposed nonconvex model, we exploit an alternating minimization approach, and then the alternating direction method of multipliers is adopted for solving subproblems. This contributes to an efficient iterative algorithm. The experimental results demonstrate the superior performance of the proposed model compared to other existing or related models, with regard to visual inspection and image quality measurements.

Speakers' Intention Analysis Based on Partial Learning of a Shared Layer in a Convolutional Neural Network (Convolutional Neural Network에서 공유 계층의 부분 학습에 기반 한 화자 의도 분석)

  • Kim, Minkyoung;Kim, Harksoo
    • Journal of KIISE
    • /
    • v.44 no.12
    • /
    • pp.1252-1257
    • /
    • 2017
  • In dialogues, speakers' intentions can be represented by sets of an emotion, a speech act, and a predicator. Therefore, dialogue systems should capture and process these implied characteristics of utterances. Many previous studies have considered such determination as independent classification problems, but others have showed them to be associated with each other. In this paper, we propose an integrated model that simultaneously determines emotions, speech acts, and predicators using a convolution neural network. The proposed model consists of a particular abstraction layer, mutually independent informations of these characteristics are abstracted. In the shared abstraction layer, combinations of the independent information is abstracted. During training, errors of emotions, errors of speech acts, and errors of predicators are partially back-propagated through the layers. In the experiments, the proposed integrated model showed better performances (2%p in emotion determination, 11%p in speech act determination, and 3%p in predicator determination) than independent determination models.

Impulsive Noise Mitigation Scheme Based on Deep Learning (딥 러닝 기반의 임펄스 잡음 완화 기법)

  • Sun, Young Ghyu;Hwang, Yu Min;Sim, Issac;Kim, Jin Young
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.17 no.4
    • /
    • pp.138-149
    • /
    • 2018
  • In this paper, we propose a system model which effectively mitigates impulsive noise that degrades the performance of power line communication. Recently, deep learning have shown effective performance improvement in various fields. In order to mitigate effective impulsive noise, we applied a convolution neural network which is one of deep learning algorithm to conventional system. Also, we used a successive interference cancellation scheme to mitigate impulsive noise generated from multi-users. We simulate the proposed model which can be applied to the power line communication in the Section V. The performance of the proposed system model is verified through bit error probability versus SNR graph. In addition, we compare ZF and MMSE successive interference cancellation scheme, successive interference cancellation with optimal ordering, and successive interference cancellation without optimal ordering. Then we confirm which schemes have better performance.

Tongue Segmentation Using the Receptive Field Diversification of U-net

  • Li, Yu-Jie;Jung, Sung-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.9
    • /
    • pp.37-47
    • /
    • 2021
  • In this paper, we propose a new deep learning model for tongue segmentation with improved accuracy compared to the existing model by diversifying the receptive field in the U-net. Methods such as parallel convolution, dilated convolution, and constant channel increase were used to diversify the receptive field. For the proposed deep learning model, a tongue region segmentation experiment was performed on two test datasets. The training image and the test image are similar in TestSet1 and they are not in TestSet2. Experimental results show that segmentation performance improved as the receptive field was diversified. The mIoU value of the proposed method was 98.14% for TestSet1 and 91.90% for TestSet2 which was higher than the result of existing models such as U-net, DeepTongue, and TongueNet.

Resource-Efficient Object Detector for Low-Power Devices (저전력 장치를 위한 자원 효율적 객체 검출기)

  • Akshay Kumar Sharma;Kyung Ki Kim
    • Transactions on Semiconductor Engineering
    • /
    • v.2 no.1
    • /
    • pp.17-20
    • /
    • 2024
  • This paper presents a novel lightweight object detection model tailored for low-powered edge devices, addressing the limitations of traditional resource-intensive computer vision models. Our proposed detector, inspired by the Single Shot Detector (SSD), employs a compact yet robust network design. Crucially, it integrates an 'enhancer block' that significantly boosts its efficiency in detecting smaller objects. The model comprises two primary components: the Light_Block for efficient feature extraction using Depth-wise and Pointwise Convolution layers, and the Enhancer_Block for enhanced detection of tiny objects. Trained from scratch on the Udacity Annotated Dataset with image dimensions of 300x480, our model eschews the need for pre-trained classification weights. Weighing only 5.5MB with approximately 0.43M parameters, our detector achieved a mean average precision (mAP) of 27.7% and processed at 140 FPS, outperforming conventional models in both precision and efficiency. This research underscores the potential of lightweight designs in advancing object detection for edge devices without compromising accuracy.

A Study on the Improvement of Digital Periapical Images using Image Interpolation Methods (영상보간법을 이용한 디지털 치근단 방사선영상의 개선에 관한 연구)

  • Song Nam-Kyu;Koh Kawng-Joon
    • Journal of Korean Academy of Oral and Maxillofacial Radiology
    • /
    • v.28 no.2
    • /
    • pp.387-413
    • /
    • 1998
  • Image resampling is of particular interest in digital radiology. When resampling an image to a new set of coordinate, there appears blocking artifacts and image changes. To enhance image quality, interpolation algorithms have been used. Resampling is used to increase the number of points in an image to improve its appearance for display. The process of interpolation is fitting a continuous function to the discrete points in the digital image. The purpose of this study was to determine the effects of the seven interpolation functions when image resampling in digital periapical images. The images were obtained by Digora, CDR and scanning of Ektaspeed plus periapical radiograms on the dry skull and human subject. The subjects were exposed to intraoral X-ray machine at 60kVp and 70 kVp with exposure time varying between 0.01 and 0.50 second. To determine which interpolation method would provide the better image, seven functions were compared; (1) nearest neighbor (2) linear (3) non-linear (4) facet model (5) cubic convolution (6) cubic spline (7) gray segment expansion. And resampled images were compared in terms of SNR(Signal to Noise Ratio) and MTF(Modulation Transfer Function) coefficient value. The obtained results were as follows ; 1. The highest SNR value(75.96dB) was obtained with cubic convolution method and the lowest SNR value(72.44dB) was obtained with facet model method among seven interpolation methods. 2. There were significant differences of SNR values among CDR, Digora and film scan(P<0.05). 3. There were significant differences of SNR values between 60kVp and 70kVp in seven interpolation methods. There were significant differences of SNR values between facet model method and those of the other methods at 60kVp(P<0.05), but there were not significant differences of SNR values among seven interpolation methods at 70kVp(P>0.05). 4. There were significant differences of MTF coefficient values between linear interpolation method and the other six interpolation methods (P< 0.05). 5. The speed of computation time was the fastest with nearest -neighbor method and the slowest with non-linear method. 6. The better image was obtained with cubic convolution, cubic spline and gray segment method in ROC analysis. 7. The better sharpness of edge was obtained with gray segment expansion method among seven interpolation methods.

  • PDF

Determination of Nitrogen in Fresh and Dry Leaf of Apple by Near Infrared Technology (근적외 분석법을 응용한 사과의 생잎과 건조잎의 질소분석)

  • Zhang, Guang-Cai;Seo, Sang-Hyun;Kang, Yeon-Bok;Han, Xiao-Ri;Park, Woo-Churl
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.37 no.4
    • /
    • pp.259-265
    • /
    • 2004
  • A quicker method was developed for foliar analysis in diagnosis of nitrogen in apple trees based on multivariate calibration procedure using partial least squares regression (PLSR) and principal component regression (PCR) to establish the relationship between reflectance spectra in the near infrared region and nitrogen content of fresh- and dry-leaf. Several spectral pre-processing methods such as smoothing, mean normalization, multiplicative scatter correction (MSC) and derivatives were used to improve the robustness and performance of the calibration models. Norris first derivative with a seven point segment and a gap of six points on MSC gave the best result of partial least squares-1 PLS-1) model for dry-leaf samples with root mean square error of prediction (RMSEP) equal to $0.699g\;kg^{-1}$, and that the Savitzky-Golay first derivate with a seven point convolution and a quadratic polynomial on MSC gave the best results of PLS-1 model for fresh-samples with RMSEP of $1.202g\;kg^{-1}$. The best PCR model was obtained with Savitzky-Golay first derivative using a seven point convolution and a quadratic polynomial on mean normalization for dry leaf samples with RMSEP of $0.553g\;kg^{-1}$, and obtained with the Savitzky-Golay first derivate using a seven point convolution and a quadratic polynomial for fresh samples with RMSEP of $1.047g\;kg^{-1}$. The results indicate that nitrogen can be determined by the near infrared reflectance (NIR) technology for fresh- and dry-leaf of apple.

A Research on Network Intrusion Detection based on Discrete Preprocessing Method and Convolution Neural Network (이산화 전처리 방식 및 컨볼루션 신경망을 활용한 네트워크 침입 탐지에 대한 연구)

  • Yoo, JiHoon;Min, Byeongjun;Kim, Sangsoo;Shin, Dongil;Shin, Dongkyoo
    • Journal of Internet Computing and Services
    • /
    • v.22 no.2
    • /
    • pp.29-39
    • /
    • 2021
  • As damages to individuals, private sectors, and businesses increase due to newly occurring cyber attacks, the underlying network security problem has emerged as a major problem in computer systems. Therefore, NIDS using machine learning and deep learning is being studied to improve the limitations that occur in the existing Network Intrusion Detection System. In this study, a deep learning-based NIDS model study is conducted using the Convolution Neural Network (CNN) algorithm. For the image classification-based CNN algorithm learning, a discrete algorithm for continuity variables was added in the preprocessing stage used previously, and the predicted variables were expressed in a linear relationship and converted into easy-to-interpret data. Finally, the network packet processed through the above process is mapped to a square matrix structure and converted into a pixel image. For the performance evaluation of the proposed model, NSL-KDD, a representative network packet data, was used, and accuracy, precision, recall, and f1-score were used as performance indicators. As a result of the experiment, the proposed model showed the highest performance with an accuracy of 85%, and the harmonic mean (F1-Score) of the R2L class with a small number of training samples was 71%, showing very good performance compared to other models.

Intrusion Detection Method Using Unsupervised Learning-Based Embedding and Autoencoder (비지도 학습 기반의 임베딩과 오토인코더를 사용한 침입 탐지 방법)

  • Junwoo Lee;Kangseok Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.8
    • /
    • pp.355-364
    • /
    • 2023
  • As advanced cyber threats continue to increase in recent years, it is difficult to detect new types of cyber attacks with existing pattern or signature-based intrusion detection method. Therefore, research on anomaly detection methods using data learning-based artificial intelligence technology is increasing. In addition, supervised learning-based anomaly detection methods are difficult to use in real environments because they require sufficient labeled data for learning. Research on an unsupervised learning-based method that learns from normal data and detects an anomaly by finding a pattern in the data itself has been actively conducted. Therefore, this study aims to extract a latent vector that preserves useful sequence information from sequence log data and develop an anomaly detection learning model using the extracted latent vector. Word2Vec was used to create a dense vector representation corresponding to the characteristics of each sequence, and an unsupervised autoencoder was developed to extract latent vectors from sequence data expressed as dense vectors. The developed autoencoder model is a recurrent neural network GRU (Gated Recurrent Unit) based denoising autoencoder suitable for sequence data, a one-dimensional convolutional neural network-based autoencoder to solve the limited short-term memory problem that GRU can have, and an autoencoder combining GRU and one-dimensional convolution was used. The data used in the experiment is time-series-based NGIDS (Next Generation IDS Dataset) data, and as a result of the experiment, an autoencoder that combines GRU and one-dimensional convolution is better than a model using a GRU-based autoencoder or a one-dimensional convolution-based autoencoder. It was efficient in terms of learning time for extracting useful latent patterns from training data, and showed stable performance with smaller fluctuations in anomaly detection performance.

A Car Plate Area Detection System Using Deep Convolution Neural Network (딥 컨볼루션 신경망을 이용한 자동차 번호판 영역 검출 시스템)

  • Jeong, Yunju;Ansari, Israfil;Shim, Jaechang;Lee, Jeonghwan
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.8
    • /
    • pp.1166-1174
    • /
    • 2017
  • In general, the detection of the vehicle license plate is a previous step of license plate recognition and has been actively studied for several decades. In this paper, we propose an algorithm to detect a license plate area of a moving vehicle from a video captured by a fixed camera installed on the road using the Convolution Neural Network (CNN) technology. First, license plate images and non-license plate images are applied to a previously learned CNN model (AlexNet) to extract and classify features. Then, after detecting the moving vehicle in the video, CNN detects the license plate area by comparing the features of the license plate region with the features of the license plate area. Experimental result shows relatively good performance in various environments such as incomplete lighting, noise due to rain, and low resolution. In addition, to protect personal information this proposed system can also be used independently to detect the license plate area and hide that area to secure the public's personal information.