• Title/Summary/Keyword: hidden image

Search Result 340, Processing Time 0.027 seconds

PathGAN: Local path planning with attentive generative adversarial networks

  • Dooseop Choi;Seung-Jun Han;Kyoung-Wook Min;Jeongdan Choi
    • ETRI Journal
    • /
    • v.44 no.6
    • /
    • pp.1004-1019
    • /
    • 2022
  • For autonomous driving without high-definition maps, we present a model capable of generating multiple plausible paths from egocentric images for autonomous vehicles. Our generative model comprises two neural networks: feature extraction network (FEN) and path generation network (PGN). The FEN extracts meaningful features from an egocentric image, whereas the PGN generates multiple paths from the features, given a driving intention and speed. To ensure that the paths generated are plausible and consistent with the intention, we introduce an attentive discriminator and train it with the PGN under a generative adversarial network framework. Furthermore, we devise an interaction model between the positions in the paths and the intentions hidden in the positions and design a novel PGN architecture that reflects the interaction model for improving the accuracy and diversity of the generated paths. Finally, we introduce ETRIDriving, a dataset for autonomous driving, in which the recorded sensor data are labeled with discrete high-level driving actions, and demonstrate the state-of-the-art performance of the proposed model on ETRIDriving in terms of accuracy and diversity.

Scanning acoustic microscopy for material evaluation

  • Hyunung Yu
    • Applied Microscopy
    • /
    • v.50
    • /
    • pp.25.1-25.11
    • /
    • 2020
  • Scanning acoustic microscopy (SAM) or Acoustic Micro Imaging (AMI) is a powerful, non-destructive technique that can detect hidden defects in elastic and biological samples as well as non-transparent hard materials. By monitoring the internal features of a sample in three-dimensional integration, this technique can efficiently find physical defects such as cracks, voids, and delamination with high sensitivity. In recent years, advanced techniques such as ultrasound impedance microscopy, ultrasound speed microscopy, and scanning acoustic gigahertz microscopy have been developed for applications in industries and in the medical field to provide additional information on the internal stress, viscoelastic, and anisotropic, or nonlinear properties. X-ray, magnetic resonance, and infrared techniques are the other competitive and widely used methods. However, they have their own advantages and limitations owing to their inherent properties such as different light sources and sensors. This paper provides an overview of the principle of SAM and presents a few results to demonstrate the applications of modern acoustic imaging technology. A variety of inspection modes, such as vertical, horizontal, and diagonal cross-sections have been presented by employing the focus pathway and image reconstruction algorithm. Images have been reconstructed from the reflected echoes resulting from the change in the acoustic impedance at the interface of the material layers or defects. The results described in this paper indicate that the novel acoustic technology can expand the scope of SAM as a versatile diagnostic tool requiring less time and having a high efficiency.

A preliminary study for development of an automatic incident detection system on CCTV in tunnels based on a machine learning algorithm (기계학습(machine learning) 기반 터널 영상유고 자동 감지 시스템 개발을 위한 사전검토 연구)

  • Shin, Hyu-Soung;Kim, Dong-Gyou;Yim, Min-Jin;Lee, Kyu-Beom;Oh, Young-Sup
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.19 no.1
    • /
    • pp.95-107
    • /
    • 2017
  • In this study, a preliminary study was undertaken for development of a tunnel incident automatic detection system based on a machine learning algorithm which is to detect a number of incidents taking place in tunnel in real time and also to be able to identify the type of incident. Two road sites where CCTVs are operating have been selected and a part of CCTV images are treated to produce sets of training data. The data sets are composed of position and time information of moving objects on CCTV screen which are extracted by initially detecting and tracking of incoming objects into CCTV screen by using a conventional image processing technique available in this study. And the data sets are matched with 6 categories of events such as lane change, stoping, etc which are also involved in the training data sets. The training data are learnt by a resilience neural network where two hidden layers are applied and 9 architectural models are set up for parametric studies, from which the architectural model, 300(first hidden layer)-150(second hidden layer) is found to be optimum in highest accuracy with respect to training data as well as testing data not used for training. From this study, it was shown that the highly variable and complex traffic and incident features could be well identified without any definition of feature regulation by using a concept of machine learning. In addition, detection capability and accuracy of the machine learning based system will be automatically enhanced as much as big data of CCTV images in tunnel becomes rich.

Research on Classification of Sitting Posture with a IMU (하나의 IMU를 이용한 앉은 자세 분류 연구)

  • Kim, Yeon-Wook;Cho, Woo-Hyeong;Jeon, Yu-Yong;Lee, Sangmin
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.11 no.3
    • /
    • pp.261-270
    • /
    • 2017
  • Bad sitting postures are known to cause for a variety of diseases or physical deformation. However, it is not easy to fit right sitting posture for long periods of time. Therefore, methods of distinguishing and inducing good sitting posture have been constantly proposed. Proposed methods were image processing, using pressure sensor attached to the chair, and using the IMU (Internal Measurement Unit). The method of using IMU has advantages of simple hardware configuration and free of various constraints in measurement. In this paper, we researched on distinguishing sitting postures with a small amount of data using just one IMU. Feature extraction method was used to find data which contribution is the least for classification. Machine learning algorithms were used to find the best position to classify and we found best machine learning algorithm. Used feature extraction method was PCA(Principal Component Analysis). Used Machine learning models were five : SVM(Support Vector Machine), KNN(K Nearest Neighbor), K-means (K-means Algorithm) GMM (Gaussian Mixture Model), and HMM (Hidden Marcov Model). As a result of research, back neck is suitable position for classification because classification rate of it was highest in every model. It was confirmed that Yaw data which is one of the IMU data has the smallest contribution to classification rate using PCA and there was no changes in classification rate after removal it. SVM, KNN are suitable for classification because their classification rate are higher than the others.

Visual Classification of Wood Knots Using k-Nearest Neighbor and Convolutional Neural Network (k-Nearest Neighbor와 Convolutional Neural Network에 의한 제재목 표면 옹이 종류의 화상 분류)

  • Kim, Hyunbin;Kim, Mingyu;Park, Yonggun;Yang, Sang-Yun;Chung, Hyunwoo;Kwon, Ohkyung;Yeo, Hwanmyeong
    • Journal of the Korean Wood Science and Technology
    • /
    • v.47 no.2
    • /
    • pp.229-238
    • /
    • 2019
  • Various wood defects occur during tree growing or wood processing. Thus, to use wood practically, it is necessary to objectively assess their quality based on the usage requirement by accurately classifying their defects. However, manual visual grading and species classification may result in differences due to subjective decisions; therefore, computer-vision-based image analysis is required for the objective evaluation of wood quality and the speeding up of wood production. In this study, the SIFT+k-NN and CNN models were used to implement a model that automatically classifies knots and analyze its accuracy. Toward this end, a total of 1,172 knot images in various shapes from five domestic conifers were used for learning and validation. For the SIFT+k-NN model, SIFT technology was used to extract properties from the knot images and k-NN was used for the classification, resulting in the classification with an accuracy of up to 60.53% when k-index was 17. The CNN model comprised 8 convolution layers and 3 hidden layers, and its maximum accuracy was 88.09% after 1205 epoch, which was higher than that of the SIFT+k-NN model. Moreover, if there is a large difference in the number of images by knot types, the SIFT+k-NN tended to show a learning biased toward the knot type with a higher number of images, whereas the CNN model did not show a drastic bias regardless of the difference in the number of images. Therefore, the CNN model showed better performance in knot classification. It is determined that the wood knot classification by the CNN model will show a sufficient accuracy in its practical applicability.

A Pilot Study on Environmental Understanding and Estimation of the Nak-Dong River Basin Using Fuyo-1 OPS Data (Fuyo-1 OPS 자료를 이용한 낙동강 하류지역의 환경계측 시고)

  • Kim, Cheon
    • Korean Journal of Remote Sensing
    • /
    • v.12 no.2
    • /
    • pp.169-198
    • /
    • 1996
  • The objectives of this investigation are : 1. To analyze spectral signature and the associated vegetation index for geometric illumination conditions inf1uenced by low solar elevation and high slope orientations in mountainous forest. 2. To assess the accuracy of the spectral angle mapper classification for the a winter land cover in comparison with the maximum likelihood classification. 3. To produce the image of water quality and water properties that could be used to estimate the water pollution sources and the tide-included by turbid water in estuarine and coastal areas. These objectives are to characterize environmental and ecological monitoring applications of the Nak-Dong River Basin by using Fuyo-1 OPS VNIR data acquired on December 26, 1992. The results of this paper are as follows : 1. The spectral digital numbers and vegetation indexes (NDVI and TVI) of mountainous forest are higher on the slope facing the sun than on the slope hidden the sun under low sun elevation condition. 2. The spectral angle mapper algorithm produces a more accurate land cover classification of areas with steep slope, various aspects and low solar elevation than the maximum likelihood classifier. 3. The maximum likelihood classification images can be used for identifying the location and movement of both freshwater and salt water, regardless of geometric illumination conditions. 4. The color-coded density sliced image of selected water bodies by using the near-infrared band 3 can provide distribution of the water quality of the Lower Nak-Dong River. 5. The color-coded normalized difference vegetation index image of the selected mountain forest is suitable to classify winter vegetation cover types, i.e., forest canopy densities for slope orientations.

Trends in the Use of Artificial Intelligence in Medical Image Analysis (의료영상 분석에서 인공지능 이용 동향)

  • Lee, Gil-Jae;Lee, Tae-Soo
    • Journal of the Korean Society of Radiology
    • /
    • v.16 no.4
    • /
    • pp.453-462
    • /
    • 2022
  • In this paper, the artificial intelligence (AI) technology used in the medical image analysis field was analyzed through a literature review. Literature searches were conducted on PubMed, ResearchGate, Google and Cochrane Review using the key word. Through literature search, 114 abstracts were searched, and 98 abstracts were reviewed, excluding 16 duplicates. In the reviewed literature, AI is applied in classification, localization, disease detection, disease segmentation, and fit degree of registration images. In machine learning (ML), prior feature extraction and inputting the extracted feature values into the neural network have disappeared. Instead, it appears that the neural network is changing to a deep learning (DL) method with multiple hidden layers. The reason is thought to be that feature extraction is processed in the DL process due to the increase in the amount of memory of the computer, the improvement of the calculation speed, and the construction of big data. In order to apply the analysis of medical images using AI to medical care, the role of physicians is important. Physicians must be able to interpret and analyze the predictions of AI algorithms. Additional medical education and professional development for existing physicians is needed to understand AI. Also, it seems that a revised curriculum for learners in medical school is needed.

Anomaly Detection for User Action with Generative Adversarial Networks (적대적 생성 모델을 활용한 사용자 행위 이상 탐지 방법)

  • Choi, Nam woong;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.43-62
    • /
    • 2019
  • At one time, the anomaly detection sector dominated the method of determining whether there was an abnormality based on the statistics derived from specific data. This methodology was possible because the dimension of the data was simple in the past, so the classical statistical method could work effectively. However, as the characteristics of data have changed complexly in the era of big data, it has become more difficult to accurately analyze and predict the data that occurs throughout the industry in the conventional way. Therefore, SVM and Decision Tree based supervised learning algorithms were used. However, there is peculiarity that supervised learning based model can only accurately predict the test data, when the number of classes is equal to the number of normal classes and most of the data generated in the industry has unbalanced data class. Therefore, the predicted results are not always valid when supervised learning model is applied. In order to overcome these drawbacks, many studies now use the unsupervised learning-based model that is not influenced by class distribution, such as autoencoder or generative adversarial networks. In this paper, we propose a method to detect anomalies using generative adversarial networks. AnoGAN, introduced in the study of Thomas et al (2017), is a classification model that performs abnormal detection of medical images. It was composed of a Convolution Neural Net and was used in the field of detection. On the other hand, sequencing data abnormality detection using generative adversarial network is a lack of research papers compared to image data. Of course, in Li et al (2018), a study by Li et al (LSTM), a type of recurrent neural network, has proposed a model to classify the abnormities of numerical sequence data, but it has not been used for categorical sequence data, as well as feature matching method applied by salans et al.(2016). So it suggests that there are a number of studies to be tried on in the ideal classification of sequence data through a generative adversarial Network. In order to learn the sequence data, the structure of the generative adversarial networks is composed of LSTM, and the 2 stacked-LSTM of the generator is composed of 32-dim hidden unit layers and 64-dim hidden unit layers. The LSTM of the discriminator consists of 64-dim hidden unit layer were used. In the process of deriving abnormal scores from existing paper of Anomaly Detection for Sequence data, entropy values of probability of actual data are used in the process of deriving abnormal scores. but in this paper, as mentioned earlier, abnormal scores have been derived by using feature matching techniques. In addition, the process of optimizing latent variables was designed with LSTM to improve model performance. The modified form of generative adversarial model was more accurate in all experiments than the autoencoder in terms of precision and was approximately 7% higher in accuracy. In terms of Robustness, Generative adversarial networks also performed better than autoencoder. Because generative adversarial networks can learn data distribution from real categorical sequence data, Unaffected by a single normal data. But autoencoder is not. Result of Robustness test showed that he accuracy of the autocoder was 92%, the accuracy of the hostile neural network was 96%, and in terms of sensitivity, the autocoder was 40% and the hostile neural network was 51%. In this paper, experiments have also been conducted to show how much performance changes due to differences in the optimization structure of potential variables. As a result, the level of 1% was improved in terms of sensitivity. These results suggest that it presented a new perspective on optimizing latent variable that were relatively insignificant.

Analysis of Mosaic Image of Animation (애니메이션 <플랫 라이프>의 모자이크 이미지 분석)

  • Lee, Ji-Hyun
    • Cartoon and Animation Studies
    • /
    • s.49
    • /
    • pp.465-491
    • /
    • 2017
  • This paper analyzes the short animation film which faithfully follows the external dimension of cartoons, and studies the hidden subject and the way of narrative in the back of the figure. In this process, we analyze the theme of using the analysis method of mosaic image. The narrative of cartoons is usually the majority in discussing the differences in the aspectual part. However, this animation uses cartoons to reach the linear narrative of ordinary narrative movies. As Janet Murray explains, if a 'mosaic image' approaches a theme through a mosaic approach, we can limit the 'mosaic film' that are introduced in the film format among them. First of all, conceptually uses the characteristics of 'mosaic image', at the same time, it utilizes a work utilizing the narrative features of 'mosaic film'. By analyzing this animation film by bifurcation, the first half reveals the characteristics of open-minded mosaic video platform, and the second half introduces the linear narrative method of film narrative. This paper divides the narrative method of 'multi plot Film' into three types: mosaic narrative film, network narrative film, and multi-draft film. Thus, we can analyze the ending of as a narrative method of 'network narrative film' which is composed of parallel or juxtaposed stories. In other words, if the early part of the animation follows the 'mosaic narrative' as an 'extension of ensemble film', the latter part faithfully follows 'network narrative'. Even in the way of talking about the subject, this animated film uses the way of speaking the mosaic image. Considering the aspectual tendency of cartoons, it can be said that this film derives the meaning of 'humor' or 'satire' in an open way. If the first half refers to 'the ambiguous routine of modern man', the latter half draws a profound theme called 'the reality of human selfishness in modern society'. is a film for a wide range of social criticism designed for adults who can interpret meaning.

A Secure Digital Watermarking Scheme based on RSA Function (RSA 함수에 기반한 안전한 워터마킹 기법)

  • Lee, Jean-Ho;Kim, Tai-Yun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.3
    • /
    • pp.220-228
    • /
    • 2001
  • Digital watermarking is a technique for the purpose of protecting the ownership of the image by embedding invisible watermarks in a digital imnge. To guarantee the security of the digital watermarking scheme for copyright protection, it is required to satisfy some requirements robustness and perceptual invisibility which provided by the location of embedded bits, the public watermarking algorithm, and the hidden use of the key, which can protect unauthorized accesses from illegal users. For this, in this paper we propose a new copyright watermarking scheme, which is based on one-way hash functions using RSA functions and modular operations. RSA functions are widely used in cryptographic systems. Our watermarking scheme is robust against LSB(Jeast significant bit) attacks and gamma corresction attack, and is also perceptually invisible. We demonstrate the characteristics of our proposed watermarking scheme through experiments.

  • PDF