• Title/Summary/Keyword: Deep Features

Search Result 1,096, Processing Time 0.027 seconds

Single Image Super Resolution Based on Residual Dense Channel Attention Block-RecursiveSRNet (잔여 밀집 및 채널 집중 기법을 갖는 재귀적 경량 네트워크 기반의 단일 이미지 초해상도 기법)

  • Woo, Hee-Jo;Sim, Ji-Woo;Kim, Eung-Tae
    • Journal of Broadcast Engineering
    • /
    • v.26 no.4
    • /
    • pp.429-440
    • /
    • 2021
  • With the recent development of deep convolutional neural network learning, deep learning techniques applied to single image super-resolution are showing good results. One of the existing deep learning-based super-resolution techniques is RDN(Residual Dense Network), in which the initial feature information is transmitted to the last layer using residual dense blocks, and subsequent layers are restored using input information of previous layers. However, if all hierarchical features are connected and learned and a large number of residual dense blocks are stacked, despite good performance, a large number of parameters and huge computational load are needed, so it takes a lot of time to learn a network and a slow processing speed, and it is not applicable to a mobile system. In this paper, we use the residual dense structure, which is a continuous memory structure that reuses previous information, and the residual dense channel attention block using the channel attention method that determines the importance according to the feature map of the image. We propose a method that can increase the depth to obtain a large receptive field and maintain a concise model at the same time. As a result of the experiment, the proposed network obtained PSNR as low as 0.205dB on average at 4× magnification compared to RDN, but about 1.8 times faster processing speed, about 10 times less number of parameters and about 1.74 times less computation.

Deep Learning Approach for Automatic Discontinuity Mapping on 3D Model of Tunnel Face (터널 막장 3차원 지형모델 상에서의 불연속면 자동 매핑을 위한 딥러닝 기법 적용 방안)

  • Chuyen Pham;Hyu-Soung Shin
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.508-518
    • /
    • 2023
  • This paper presents a new approach for the automatic mapping of discontinuities in a tunnel face based on its 3D digital model reconstructed by LiDAR scan or photogrammetry techniques. The main idea revolves around the identification of discontinuity areas in the 3D digital model of a tunnel face by segmenting its 2D projected images using a deep-learning semantic segmentation model called U-Net. The proposed deep learning model integrates various features including the projected RGB image, depth map image, and local surface properties-based images i.e., normal vector and curvature images to effectively segment areas of discontinuity in the images. Subsequently, the segmentation results are projected back onto the 3D model using depth maps and projection matrices to obtain an accurate representation of the location and extent of discontinuities within the 3D space. The performance of the segmentation model is evaluated by comparing the segmented results with their corresponding ground truths, which demonstrates the high accuracy of segmentation results with the intersection-over-union metric of approximately 0.8. Despite still being limited in training data, this method exhibits promising potential to address the limitations of conventional approaches, which only rely on normal vectors and unsupervised machine learning algorithms for grouping points in the 3D model into distinct sets of discontinuities.

Quantitative Evaluation of Super-resolution Drone Images Generated Using Deep Learning (딥러닝을 이용하여 생성한 초해상화 드론 영상의 정량적 평가)

  • Seo, Hong-Deok;So, Hyeong-Yoon;Kim, Eui-Myoung
    • Journal of Cadastre & Land InformatiX
    • /
    • v.53 no.2
    • /
    • pp.5-18
    • /
    • 2023
  • As the development of drones and sensors accelerates, new services and values are created by fusing data acquired from various sensors mounted on drone. However, the construction of spatial information through data fusion is mainly constructed depending on the image, and the quality of data is determined according to the specification and performance of the hardware. In addition, it is difficult to utilize it in the actual field because expensive equipment is required to construct spatial information of high-quality. In this study, super-resolution was performed by applying deep learning to low-resolution images acquired through RGB and THM cameras mounted on a drone, and quantitative evaluation and feature point extraction were performed on the generated high-resolution images. As a result of the experiment, the high-resolution image generated by super-resolution was maintained the characteristics of the original image, and as the resolution was improved, more features could be extracted compared to the original image. Therefore, when generating a high-resolution image by applying a low-resolution image to an super-resolution deep learning model, it is judged to be a new method to construct spatial information of high-quality without being restricted by hardware.

Dynamic Characteristics of Water Column Properties based on the Behavior of Water Mass and Inorganic Nutrients in the Western Pacific Seamount Area (서태평양 해저산 해역에서 수괴와 무기영양염 거동에 기초한 동적 수층환경 특성)

  • Son, Juwon;Shin, Hong-Ryeol;Mo, Ahra;Son, Seung-Kyu;Moon, Jai-Woon;Kim, Kyeong-Hong
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.18 no.3
    • /
    • pp.143-156
    • /
    • 2015
  • In order to understand the dynamic characteristics of water column environments in the Western Pacific seamount area (approximately $150.2^{\circ}E$, $20^{\circ}N$), we investigated the water mass and the behavior of water column parameters such as dissolved oxygen, inorganic nutrients (N, P), and chlorophyll-a. Physico-chemical properties of water column were obtained by CTD system at the nine stations which were selected along the east-west and south-north direction around the seamount (OSM14-2) in October 2014. From the temperature-salinity diagram, the main water masses were separated into North Pacific Tropical Water and Thermocline Water in the surface layer, North Pacific Intermediate Water in the intermediate layer, and North Pacific Deep Water in the bottom layer, respectively. Oxygen minimum zone (OMZ, mean $O_2$ $73.26{\mu}M$), known as dysoxic condition ($O_2<90{\mu}M$), was distributed in the depth range of 700~1,200 m throughout the study area. Inorganic nutrients typified by nitrite + nitrate and phosphate showed the lowest concentration in the surface mixed layer and then gradually increased downward with representing the maximum concentration in the OMZ, with lower N:P ratio (13.7), indicating that the nitrogen is regarded as limiting factor for primary production. Vertical distribution of water column parameters along the east-west and south-north station line around the seamount showed the effect of bottom water inflowing at around 500 m deep in the western and southern region, and concentrations of water column parameters in the bottom layer (below 2,500 m deep) of the western and southern region were differently distributed comparing to those of the other side regions (eastern and northern). The value of Excess N calculated from Redfield ratio (N:P=16:1) represented the negative value throughout the study area, which indicated the nitrogen sink dominant environments, and relative higher value of Excess N observed in the bottom layer of western and southern region. These observations suggest that the topographic features of a seamount influence the circulation of bottom current and its effects play a significant role in determining the behavior of water column environmental parameters.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

Bi-directional LSTM-CNN-CRF for Korean Named Entity Recognition System with Feature Augmentation (자질 보강과 양방향 LSTM-CNN-CRF 기반의 한국어 개체명 인식 모델)

  • Lee, DongYub;Yu, Wonhee;Lim, HeuiSeok
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.12
    • /
    • pp.55-62
    • /
    • 2017
  • The Named Entity Recognition system is a system that recognizes words or phrases with object names such as personal name (PS), place name (LC), and group name (OG) in the document as corresponding object names. Traditional approaches to named entity recognition include statistical-based models that learn models based on hand-crafted features. Recently, it has been proposed to construct the qualities expressing the sentence using models such as deep-learning based Recurrent Neural Networks (RNN) and long-short term memory (LSTM) to solve the problem of sequence labeling. In this research, to improve the performance of the Korean named entity recognition system, we used a hand-crafted feature, part-of-speech tagging information, and pre-built lexicon information to augment features for representing sentence. Experimental results show that the proposed method improves the performance of Korean named entity recognition system. The results of this study are presented through github for future collaborative research with researchers studying Korean Natural Language Processing (NLP) and named entity recognition system.

Diagnosis and Endovascular Treatment of May-Thurner Syndrome (May-Thurner 증후군의 진단과 혈관내 치료)

  • 허균;이재욱;신화균;원용순
    • Journal of Chest Surgery
    • /
    • v.37 no.11
    • /
    • pp.911-917
    • /
    • 2004
  • Background: There are limited number of reports on May-Thurner syndrome (Iliac vein compression syndrome) in Korea, We analysed the clinical features, diagnostic modalities and endovascular treatment of May-Thurner syndrome. Material and Method: We reviewed 12 cases of May-Thurner syndrome between March 2001 and June 2003. Mean age was $57.6\pm2$ years. We were used in venography, color doppler and computed tomographic angiography as diagnostic modalities and in thrombolysis, thrombectomy, angioplasty and stent insertion as endovascular treatment. Result: Clinical features showed edema of lower extremities in 4 patients, pain of lower extremities in 1 patient, edema with pain in 5 patients, and all in 1 patient. In one patient, he did not have any pain and any edema of lower extremities but was diagnosed as May-Thurner syndrome using venography due to varicose veins on lower extremities. Diagnostic modalities included venography, computed tomographic angiography in all patients with clinical presentation except in one patient and color doppler was only performed only in 4 patients. Four kinds of endovascular treatment were performed for May-Thurner syndrome, angioplasty in 11 patients, stent insertion in 10 patients, thrombectomy in 9 patients and thrombolysis for 7 patients. Nine patients were followed up and we can show good blood flow in Left iliac vein for 7 of 9 patients. Conclusion: it is necessary to recognize the possibility of May-Thurner syndrome in Deep vein thrombosis patients and we should use a variety of modalities to diagnose May-Thurner syndrome. Finally, endovascular treatment is a safe and effective therapy for May-Thurner syndrome.

Fabrication of UV imprint stamp using diamond-like carbon coating technology (Diamond-like carbon 코팅기술을 사용한 UV-임프린트 스탬프 제작)

  • JEONG JUN-HO;KIM KI-DON;SIM YOUNG-SUK;CHOI DAE-GEUN;CHOI JUNHYUK;LEE EUNG-SUG;LIM TAE-WOO;PARK SANG-HU;YANG DONG-YOL;CHA NAM-GOO;PARK JIN-GOO
    • Proceedings of the Korean Society for Technology of Plasticity Conference
    • /
    • 2005.10a
    • /
    • pp.167-170
    • /
    • 2005
  • The two-dimensional (2D) and three-dimensional (3D) diamond-like carbon (DLC) stamps for ultraviolet nanoimprint lithography (UV-NIL) were fabricated using two kinds of methods, which were a DLC coating process followed by the focused ion beam (FIB) lithography and the two-photon polymerization (TPP) patterning followed by nano-scale thick DLC coating. We fabricated 70 nm deep lines with a width of 100 nm and 70 nm deep lines with a width of 150 nm on 100 nm thick DLC layers coated on quartz substrates using the FIB lithography. 200 nm wide lines, 3D rings with a diameter of $1.35\;{\mu}m$ and a height of $1.97\;{\mu}m$, and a 3D cone with a bottom diameter of $2.88\;{\mu}m$ and a height of $1.97\;{\mu}m$ were successfully fabricated using the TPP patterning and DLC coating process. The wafers were successfully printed on an UV-NIL using the DLC stamp. We could see the excellent correlation between the dimensions of features of stamp and the corresponding imprinted features.

  • PDF

Adversarial Learning-Based Image Correction Methodology for Deep Learning Analysis of Heterogeneous Images (이질적 이미지의 딥러닝 분석을 위한 적대적 학습기반 이미지 보정 방법론)

  • Kim, Junwoo;Kim, Namgyu
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.11
    • /
    • pp.457-464
    • /
    • 2021
  • The advent of the big data era has enabled the rapid development of deep learning that learns rules by itself from data. In particular, the performance of CNN algorithms has reached the level of self-adjusting the source data itself. However, the existing image processing method only deals with the image data itself, and does not sufficiently consider the heterogeneous environment in which the image is generated. Images generated in a heterogeneous environment may have the same information, but their features may be expressed differently depending on the photographing environment. This means that not only the different environmental information of each image but also the same information are represented by different features, which may degrade the performance of the image analysis model. Therefore, in this paper, we propose a method to improve the performance of the image color constancy model based on Adversarial Learning that uses image data generated in a heterogeneous environment simultaneously. Specifically, the proposed methodology operates with the interaction of the 'Domain Discriminator' that predicts the environment in which the image was taken and the 'Illumination Estimator' that predicts the lighting value. As a result of conducting an experiment on 7,022 images taken in heterogeneous environments to evaluate the performance of the proposed methodology, the proposed methodology showed superior performance in terms of Angular Error compared to the existing methods.

Effects of Spatio-temporal Features of Dynamic Hand Gestures on Learning Accuracy in 3D-CNN (3D-CNN에서 동적 손 제스처의 시공간적 특징이 학습 정확성에 미치는 영향)

  • Yeongjee Chung
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.3
    • /
    • pp.145-151
    • /
    • 2023
  • 3D-CNN is one of the deep learning techniques for learning time series data. Such three-dimensional learning can generate many parameters, so that high-performance machine learning is required or can have a large impact on the learning rate. When learning dynamic hand-gestures in spatiotemporal domain, it is necessary for the improvement of the efficiency of dynamic hand-gesture learning with 3D-CNN to find the optimal conditions of input video data by analyzing the learning accuracy according to the spatiotemporal change of input video data without structural change of the 3D-CNN model. First, the time ratio between dynamic hand-gesture actions is adjusted by setting the learning interval of image frames in the dynamic hand-gesture video data. Second, through 2D cross-correlation analysis between classes, similarity between image frames of input video data is measured and normalized to obtain an average value between frames and analyze learning accuracy. Based on this analysis, this work proposed two methods to effectively select input video data for 3D-CNN deep learning of dynamic hand-gestures. Experimental results showed that the learning interval of image data frames and the similarity of image frames between classes can affect the accuracy of the learning model.