• Title/Summary/Keyword: extraction of feature

Search Result 2,579, Processing Time 0.027 seconds

Face Recognition using Modified Local Directional Pattern Image (Modified Local Directional Pattern 영상을 이용한 얼굴인식)

  • Kim, Dong-Ju;Lee, Sang-Heon;Sohn, Myoung-Kyu
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.3
    • /
    • pp.205-208
    • /
    • 2013
  • Generally, binary pattern transforms have been used in the field of the face recognition and facial expression, since they are robust to illumination. Thus, this paper proposes an illumination-robust face recognition system combining an MLDP, which improves the texture component of the LDP, and a 2D-PCA algorithm. Unlike that binary pattern transforms such as LBP and LDP were used to extract histogram features, the proposed method directly uses the MLDP image for feature extraction by 2D-PCA. The performance evaluation of proposed method was carried out using various algorithms such as PCA, 2D-PCA and Gabor wavelets-based LBP on Yale B and CMU-PIE databases which were constructed under varying lighting condition. From the experimental results, we confirmed that the proposed method showed the best recognition accuracy.

Hierarchical Gabor Feature and Bayesian Network for Handwritten Digit Recognition (계층적인 가버 특징들과 베이지안 망을 이용한 필기체 숫자인식)

  • 성재모;방승양
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.1
    • /
    • pp.1-7
    • /
    • 2004
  • For the handwritten digit recognition, this paper Proposes a hierarchical Gator features extraction method and a Bayesian network for them. Proposed Gator features are able to represent hierarchically different level information and Bayesian network is constructed to represent hierarchically structured dependencies among these Gator features. In order to extract such features, we define Gabor filters level by level and choose optimal Gabor filters by using Fisher's Linear Discriminant measure. Hierarchical Gator features are extracted by optimal Gabor filters and represent more localized information in the lower level. Proposed methods were successfully applied to handwritten digit recognition with well-known naive Bayesian classifier, k-nearest neighbor classifier. and backpropagation neural network and showed good performance.

The Analysis of Spectral characteristics of Water Quality Factors Uisng Airborne MSS Data (Airborne MSS 자료를 이용한 수질인자의 분광특성 분석)

  • Dong-Ho Jang;Gi-Ho Jo;Kwang-Hoon Chi
    • Korean Journal of Remote Sensing
    • /
    • v.14 no.3
    • /
    • pp.296-306
    • /
    • 1998
  • Airborne MSS data is regarded as a potentially effective data source for the measurement of water quality and for the environmental change of water bodies. In this study, we measured the radiance reflectance by using multi-spectral image of low resolution camera(LRC) which will be reached in the multi-purpose satellite(KOMPSAT) to use the data in analyzing water pollution. We also investigated the possibility of extraction of water quality factors in water bodies by using high resolution remote sensing data such as Airborne MSS. Especially, we tried to extract environmental factors related with eutrophication such as chlorophyll-a, suspended sediments and turbidity, and also tried to develop the process technique and the radiance feature of reflectance related with eutrophication. Although it was difficult to explicitly correlate Airborne MSS data with water quality factors due to the insufficient number of ground truth data. The results were summarized as follows: First, the spectrum of sun's rays which reaches the surface of the earth was consistent with visible bands of 0.4${\mu}{\textrm}{m}$~0.7${\mu}{\textrm}{m}$ and about 50% of total quantity of radiation could be found. The spectrum was reached highest at around 0.5${\mu}{\textrm}{m}$ of green spectral band in visible bands. Second, as a result of the radiance reflectance Chlorophyll-a represented high mainly around 0.52${\mu}{\textrm}{m}$ of green spectral band, and suspended sediments and turbidity represented high at 0.8${\mu}{\textrm}{m}$ and at 0.57${\mu}{\textrm}{m}$, respectively. Finally, as a result of the water quality analysis by using Airborne MSS, Chlorophyll-a could have a distribution image after carrying out ratio of B3 and B5 to B7. Band 7 was useful for making the distribution image of suspended sediments. When we carried out PCA, suspended sediments and turbidity had distributions at PC 1 and PC 4 which are similar to the ground data. Above results can be changed according to the change of season and time. Therefore, in order to analyze the environmental factors of water quality by using LRC data more exactly, we need to investigate the ground data and the radiance feature of reflectance of water bodies constantly. For further studies, we will constantly analyze the radiance feature of the surface of water in wafter bodies by measuring the on-the-spot radiance reflectance and using low resolution satellite image(SeaWiFS). We will also gather the data of water quality analysis in water bodies and analyze the pattern of water pollution.

Development of Multimedia Annotation and Retrieval System using MPEG-7 based Semantic Metadata Model (MPEG-7 기반 의미적 메타데이터 모델을 이용한 멀티미디어 주석 및 검색 시스템의 개발)

  • An, Hyoung-Geun;Koh, Jae-Jin
    • The KIPS Transactions:PartD
    • /
    • v.14D no.6
    • /
    • pp.573-584
    • /
    • 2007
  • As multimedia information recently increases fast, various types of retrieval of multimedia data are becoming issues of great importance. For the efficient multimedia data processing, semantics based retrieval techniques are required that can extract the meaning contents of multimedia data. Existing retrieval methods of multimedia data are annotation-based retrieval, feature-based retrieval and annotation and feature integration based retrieval. These systems take annotator a lot of efforts and time and we should perform complicated calculation for feature extraction. In addition. created data have shortcomings that we should go through static search that do not change. Also, user-friendly and semantic searching techniques are not supported. This paper proposes to develop S-MARS(Semantic Metadata-based Multimedia Annotation and Retrieval System) which can represent and extract multimedia data efficiently using MPEG-7. The system provides a graphical user interface for annotating, searching, and browsing multimedia data. It is implemented on the basis of the semantic metadata model to represent multimedia information. The semantic metadata about multimedia data is organized on the basis of multimedia description schema using XML schema that basically comply with the MPEG-7 standard. In conclusion. the proposed scheme can be easily implemented on any multimedia platforms supporting XML technology. It can be utilized to enable efficient semantic metadata sharing between systems, and it will contribute to improving the retrieval correctness and the user's satisfaction on embedding based multimedia retrieval algorithm method.

Scanning Determination & Observation Features by Sex shown in the Process of Acquiring Visual Information - With the Object of Subway Station Hall Space - (시각정보획득과정에 나타난 주사판정과 성별 주시특성 - 지하철 홀 공간을 대상으로 -)

  • Kim, Jong-Ha;Choi, Gae-Young
    • Korean Institute of Interior Design Journal
    • /
    • v.23 no.6
    • /
    • pp.115-124
    • /
    • 2014
  • This study has carried out scanning tests in order to figure out the features of scanning search by sex of space users, with the result of which the validity of data has been estimated. In this research, the scanning patterns were set up for verifying the typology of scanning paths and then the reason for determining scanning paths and the validity of estimation method were reviewed. Since the observation features depends on sex, the analysis of visual activities for acquiring any information in a space will reveal the intention and purpose of space users. The findings by analyzing the features of scanning pattern by sex which were found at the determination of scanning patterns can be defined as the followings. First, for estimating the process of space-information search, the movement distance at each point of continuative-observation data from the angle of eye-movement has been extracted, on the ground of which the fixation and movement of eye have been defined for the establishment of scanning-cut characteristics. Second, the scanning times were estimated for the extraction of effective observation data that would be used for comparative analysis, which showed that men had more data (3,398.2/64.4%) than women (2,998.2/55.6%). This enables the acknowledgment that the scanning cut of men was relatively less, which indicates that men will acquire more information on space than women in the process of observing any space. Third, men's scanning times (58.0 times/2.02 seconds) were less than those of women (71.9 times/1.39 seconds) while the scanning time of the former was longer than that of the latter, which shows the feature that it takes longer for men than women in scanning while the scanning times of the former is less than those of the latter. Fourth, the observation features can be determined that the combination of this result with the predominance character by sex for a general viewpoint to be employed indicates that while men employ mixed-scanning for observation activities to acquire space-information spending for longer time, women, by concentrated-scanning, focus on a single point for shorter time or stay at one location for a considerably long time for space-information acquirement.

Design of ASM-based Face Recognition System Using (2D)2 Hybird Preprocessing Algorithm (ASM기반 (2D)2 하이브리드 전처리 알고리즘을 이용한 얼굴인식 시스템 설계)

  • Kim, Hyun-Ki;Jin, Yong-Tak;Oh, Sung-Kwun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.2
    • /
    • pp.173-178
    • /
    • 2014
  • In this study, we introduce ASM-based face recognition classifier and its design methodology with the aid of 2-dimensional 2-directional hybird preprocessing algorithm. Since the image of face recognition is easily affected by external environments, ASM(active shape model) as image preprocessing algorithm is used to resolve such problem. In particular, ASM is used widely for the purpose of feature extraction for human face. After extracting face image area by using ASM, the dimensionality of the extracted face image data is reduced by using $(2D)^2$hybrid preprocessing algorithm based on LDA and PCA. Face image data through preprocessing algorithm is used as input data for the design of the proposed polynomials based radial basis function neural network. Unlike as the case in existing neural networks, the proposed pattern classifier has the characteristics of a robust neural network and it is also superior from the view point of predictive ability as well as ability to resolve the problem of multi-dimensionality. The essential design parameters (the number of row eigenvectors, column eigenvectors, and clusters, and fuzzification coefficient) of the classifier are optimized by means of ABC(artificial bee colony) algorithm. The performance of the proposed classifier is quantified through yale and AT&T dataset widely used in the face recognition.

Development of On-line Quality Sorting System for Dried Oak Mushroom - 3rd Prototype-

  • 김철수;김기동;조기현;이정택;김진현
    • Agricultural and Biosystems Engineering
    • /
    • v.4 no.1
    • /
    • pp.8-15
    • /
    • 2003
  • In Korea, quality evaluation of dried oak mushrooms are done first by classifying them into more than 10 different categories based on the state of opening of the cap, surface pattern, and colors. And mushrooms of each category are further classified into 3 or 4 groups based on its shape and size, resulting into total 30 to 40 different grades. Quality evaluation and sorting based on the external visual features are usually done manually. Since visual features of mushroom affecting quality grades are distributed over the entire surface of the mushroom, both front (cap) and back (stem and gill) surfaces should be inspected thoroughly. In fact, it is almost impossible for human to inspect every mushroom, especially when they are fed continuously via conveyor. In this paper, considering real time on-line system implementation, image processing algorithms utilizing artificial neural network have been developed for the quality grading of a mushroom. The neural network based image processing utilized the raw gray value image of fed mushrooms captured by the camera without any complex image processing such as feature enhancement and extraction to identify the feeding state and to grade the quality of a mushroom. Developed algorithms were implemented to the prototype on-line grading and sorting system. The prototype was developed to simplify the system requirement and the overall mechanism. The system was composed of automatic devices for mushroom feeding and handling, a set of computer vision system with lighting chamber, one chip microprocessor based controller, and pneumatic actuators. The proposed grading scheme was tested using the prototype. Network training for the feeding state recognition and grading was done using static images. 200 samples (20 grade levels and 10 per each grade) were used for training. 300 samples (20 grade levels and 15 per each grade) were used to validate the trained network. By changing orientation of each sample, 600 data sets were made for the test and the trained network showed around 91 % of the grading accuracy. Though image processing itself required approximately less than 0.3 second depending on a mushroom, because of the actuating device and control response, average 0.6 to 0.7 second was required for grading and sorting of a mushroom resulting into the processing capability of 5,000/hr to 6,000/hr.

  • PDF

Evaluation of Classification Performance of Inception V3 Algorithm for Chest X-ray Images of Patients with Cardiomegaly (심장비대증 환자의 흉부 X선 영상에 대한 Inception V3 알고리즘의 분류 성능평가)

  • Jeong, Woo-Yeon;Kim, Jung-Hun;Park, Ji-Eun;Kim, Min-Jeong;Lee, Jong-Min
    • Journal of the Korean Society of Radiology
    • /
    • v.15 no.4
    • /
    • pp.455-461
    • /
    • 2021
  • Cardiomegaly is one of the most common diseases seen on chest X-rays, but if it is not detected early, it can cause serious complications. In view of this, in recent years, many researches on image analysis in which deep learning algorithms using artificial intelligence are applied to medical care have been conducted with the development of various science and technology fields. In this paper, we would like to evaluate whether the Inception V3 deep learning model is a useful model for the classification of Cardiomegaly using chest X-ray images. For the images used, a total of 1026 chest X-ray images of patients diagnosed with normal heart and those diagnosed with Cardiomegaly in Kyungpook National University Hospital were used. As a result of the experiment, the classification accuracy and loss of the Inception V3 deep learning model according to the presence or absence of Cardiomegaly were 96.0% and 0.22%, respectively. From the research results, it was found that the Inception V3 deep learning model is an excellent deep learning model for feature extraction and classification of chest image data. The Inception V3 deep learning model is considered to be a useful deep learning model for classification of chest diseases, and if such excellent research results are obtained by conducting research using a little more variety of medical image data, I think it will be great help for doctor's diagnosis in future.

The Development of Software Teaching-Learning Model based on Machine Learning Platform (머신러닝 플랫폼을 활용한 소프트웨어 교수-학습 모형 개발)

  • Park, Daeryoon;Ahn, Joongmin;Jang, Junhyeok;Yu, Wonjin;Kim, Wooyeol;Bae, Youngkwon;Yoo, Inhwan
    • Journal of The Korean Association of Information Education
    • /
    • v.24 no.1
    • /
    • pp.49-57
    • /
    • 2020
  • The society we are living in has being changed to the age of the intelligent information society after passing through the knowledge-based information society in the early 21st century. In this study, we have developed the instructional model for software education based on the machine learning which is a field of artificial intelligence(AI) to enhance the core competencies of learners required in the intelligent information society. This model is focusing on enhancing the core competencies through the process of problem-solving as well as reducing the burden of learning about AI itself. The specific stages of the developed model are consisted of seven levels which are 'Problem Recognition and Analysis', 'Data Collection', 'Data Processing and Feature Extraction', 'ML Model Training and Evaluation', 'ML Programming', 'Application and Problem Solving', and 'Share and Feedback'. As a result of applying the developed model in this study, we were able to observe the positive response about learning from the students and parents. We hope that this research could suggest the future direction of not only the instructional design but also operation of software education program based on machine learning.

Accelerated Loarning of Latent Topic Models by Incremental EM Algorithm (점진적 EM 알고리즘에 의한 잠재토픽모델의 학습 속도 향상)

  • Chang, Jeong-Ho;Lee, Jong-Woo;Eom, Jae-Hong
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.12
    • /
    • pp.1045-1055
    • /
    • 2007
  • Latent topic models are statistical models which automatically captures salient patterns or correlation among features underlying a data collection in a probabilistic way. They are gaining an increased popularity as an effective tool in the application of automatic semantic feature extraction from text corpus, multimedia data analysis including image data, and bioinformatics. Among the important issues for the effectiveness in the application of latent topic models to the massive data set is the efficient learning of the model. The paper proposes an accelerated learning technique for PLSA model, one of the popular latent topic models, by an incremental EM algorithm instead of conventional EM algorithm. The incremental EM algorithm can be characterized by the employment of a series of partial E-steps that are performed on the corresponding subsets of the entire data collection, unlike in the conventional EM algorithm where one batch E-step is done for the whole data set. By the replacement of a single batch E-M step with a series of partial E-steps and M-steps, the inference result for the previous data subset can be directly reflected to the next inference process, which can enhance the learning speed for the entire data set. The algorithm is advantageous also in that it is guaranteed to converge to a local maximum solution and can be easily implemented just with slight modification of the existing algorithm based on the conventional EM. We present the basic application of the incremental EM algorithm to the learning of PLSA and empirically evaluate the acceleration performance with several possible data partitioning methods for the practical application. The experimental results on a real-world news data set show that the proposed approach can accomplish a meaningful enhancement of the convergence rate in the learning of latent topic model. Additionally, we present an interesting result which supports a possible synergistic effect of the combination of incremental EM algorithm with parallel computing.