• Title/Summary/Keyword: Multimodal Information

Search Result 255, Processing Time 0.031 seconds

Design and Implementation of MIML using XML (XML을 이용한 MIML(Multimodal Information Markup Language)의 설계 및 구현)

  • 김주리;이지근;김희숙;정석태;정성태
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10b
    • /
    • pp.289-291
    • /
    • 2001
  • www의 등장으로 디지털 정보의 표현이 단순한 텍스트 위주의 프리젠테이션에서 이제는 멀티미디어 내용의 증가와 함께 멀티모달 정보 프리젠테이션을 요구하는 변화를 가져오고 있다. 그러나 대다수의 사람들이 멀티모달 정보를 표현하기란 쉽지 않다. 본 논문에서는 이러한 사람들이 보다 쉽고 재미있는 멀티모달 정보 프리젠테이션을 쉽게 사용할 수 있도록 구두 대화 능력에 상호 작용하는 캐릭터 에이전트를 응용하여 MIML을 개발하였다. MIML은 XML 규격에 준거한 Markup Language로써 구두 발표 및 캐릭터 에이전트 행동을 통제하기 위한 기능을 지원한다. 본 논문에서는 다양한 캐릭터 에이전트의 감정 표현 기능과 멀티모달 정보 프리젠테이션을 구성하는 DTD에 대하여 기술하였다.

  • PDF

A Multimodal Emotion Recognition Using the Facial Image and Speech Signal

  • Go, Hyoun-Joo;Kim, Yong-Tae;Chun, Myung-Geun
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.5 no.1
    • /
    • pp.1-6
    • /
    • 2005
  • In this paper, we propose an emotion recognition method using the facial images and speech signals. Six basic emotions including happiness, sadness, anger, surprise, fear and dislike are investigated. Facia] expression recognition is performed by using the multi-resolution analysis based on the discrete wavelet. Here, we obtain the feature vectors through the ICA(Independent Component Analysis). On the other hand, the emotion recognition from the speech signal method has a structure of performing the recognition algorithm independently for each wavelet subband and the final recognition is obtained from the multi-decision making scheme. After merging the facial and speech emotion recognition results, we obtained better performance than previous ones.

Fuzzy Bayesian Network for Fusion of Multimodal Context Information (다양한 형태의 상황 정보 합성을 위한 퍼지 베이지안 네트워크)

  • Yoo Ji-Oh;Cho Sung-Bae
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.631-633
    • /
    • 2005
  • 다양한 형태의 상황 정보를 결합하여 추론하기 위해 베이지안 네트워크를 많이 사용한다. 그러나 일반 베이지안 네트워크는 각 노드의 상태가 이산적이기 때문에, 연속적이거나 여러 상태가 동시에 존재할 수 있는 현실의 상황 정보를 처리하기 어렵다. 본 논문에서는 이와 같은 베이지안 네트워크의 단점을 보완하기 위해 다양한 형태의 상황 정보를 퍼지를 통해 전처리하여 베이지안 네트워크를 통해 추론하는 퍼지 베이지안 네트워크를 제안한다. 유용성을 보이기 위해 음악 추천 에이전트를 설계하여 일반 베이지안 네트워크와 비교 실험한 결과, 제안한 방법으로 다양한 상황 정보에 대해 유연한 처리가 가능함을 확인하였다.

  • PDF

Enhancement of Mobile Authentication System Performance based on Multimodal Biometrics (다중 생체인식 기반의 모바일 인증 시스템 성능 개선)

  • Jeong, Kanghun;Kim, Sanghoon;Moon, Hyeonjoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2013.05a
    • /
    • pp.342-345
    • /
    • 2013
  • 본 논문은 모바일 환경에서의 다중생체인식을 통한 개인인증 시스템을 제안한다. 다중생체인식을 위하여 얼굴인식과 화자인식을 선택하였으며, 시스템의 인식 시나리오는 다음을 따른다. 얼굴인식을 위하여 Modified census transform (MCT) 기반의 얼굴검출과 k-means 클러스터 분석 (cluster analysis) 알고리즘 기반의 눈 검출을 통해 얼굴영역 전처리를 수행하고, principal component analysis (PCA) 기반의 얼굴인증 시스템을 구현한다. 화자인식을 위하여 음성의 끝점 추출과 Mel frequency cepstral coefficient(MFCC) 특징을 추출하고, dynamic time warping (DTW) 기반의 화자 인증 시스템을 구현한다. 그리고 각각의 생체인식을 본 논문에서 제안된 방법을 기반으로 융합하여 인식률을 향상시킨다.

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.57-73
    • /
    • 2021
  • Maintenance and prevention of failure through anomaly detection of ICT infrastructure is becoming important. System monitoring data is multidimensional time series data. When we deal with multidimensional time series data, we have difficulty in considering both characteristics of multidimensional data and characteristics of time series data. When dealing with multidimensional data, correlation between variables should be considered. Existing methods such as probability and linear base, distance base, etc. are degraded due to limitations called the curse of dimensions. In addition, time series data is preprocessed by applying sliding window technique and time series decomposition for self-correlation analysis. These techniques are the cause of increasing the dimension of data, so it is necessary to supplement them. The anomaly detection field is an old research field, and statistical methods and regression analysis were used in the early days. Currently, there are active studies to apply machine learning and artificial neural network technology to this field. Statistically based methods are difficult to apply when data is non-homogeneous, and do not detect local outliers well. The regression analysis method compares the predictive value and the actual value after learning the regression formula based on the parametric statistics and it detects abnormality. Anomaly detection using regression analysis has the disadvantage that the performance is lowered when the model is not solid and the noise or outliers of the data are included. There is a restriction that learning data with noise or outliers should be used. The autoencoder using artificial neural networks is learned to output as similar as possible to input data. It has many advantages compared to existing probability and linear model, cluster analysis, and map learning. It can be applied to data that does not satisfy probability distribution or linear assumption. In addition, it is possible to learn non-mapping without label data for teaching. However, there is a limitation of local outlier identification of multidimensional data in anomaly detection, and there is a problem that the dimension of data is greatly increased due to the characteristics of time series data. In this study, we propose a CMAE (Conditional Multimodal Autoencoder) that enhances the performance of anomaly detection by considering local outliers and time series characteristics. First, we applied Multimodal Autoencoder (MAE) to improve the limitations of local outlier identification of multidimensional data. Multimodals are commonly used to learn different types of inputs, such as voice and image. The different modal shares the bottleneck effect of Autoencoder and it learns correlation. In addition, CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of data. In general, conditional input mainly uses category variables, but in this study, time was used as a condition to learn periodicity. The CMAE model proposed in this paper was verified by comparing with the Unimodal Autoencoder (UAE) and Multi-modal Autoencoder (MAE). The restoration performance of Autoencoder for 41 variables was confirmed in the proposed model and the comparison model. The restoration performance is different by variables, and the restoration is normally well operated because the loss value is small for Memory, Disk, and Network modals in all three Autoencoder models. The process modal did not show a significant difference in all three models, and the CPU modal showed excellent performance in CMAE. ROC curve was prepared for the evaluation of anomaly detection performance in the proposed model and the comparison model, and AUC, accuracy, precision, recall, and F1-score were compared. In all indicators, the performance was shown in the order of CMAE, MAE, and AE. Especially, the reproduction rate was 0.9828 for CMAE, which can be confirmed to detect almost most of the abnormalities. The accuracy of the model was also improved and 87.12%, and the F1-score was 0.8883, which is considered to be suitable for anomaly detection. In practical aspect, the proposed model has an additional advantage in addition to performance improvement. The use of techniques such as time series decomposition and sliding windows has the disadvantage of managing unnecessary procedures; and their dimensional increase can cause a decrease in the computational speed in inference.The proposed model has characteristics that are easy to apply to practical tasks such as inference speed and model management.

The Effects of Multimodal Cognitive Intervention Focused on Instrumental Activities of Daily Living(IADL) for the elderly with High-risk of Dementia : a Pilot Study (도구적 일상생활에 초점을 둔 복합인지중재 프로그램이 치매고위험군 노인에게 미치는 영향 : 예비연구)

  • Park, Kyoung-Young;Shin, Su-Jung
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.5
    • /
    • pp.210-216
    • /
    • 2019
  • The purpose of this study is to investigate the effect of the multimodal cognitive intervention focusing on instrumental daily life on the cognitive function, depression and quality of life of the elderly with high-risk of dementia. This study was conducted on 24 elderly people with high-risk of dementia who participated in cognitive rehabilitation program from March to June, 2018 in Chungbuk A region. The intervention was applied to cognitive training and creative activities related to instrumental daily life. MMSE-DS, Subjective Memory Complaints Questionnaire, Short Geriatric Depression Scale-Korean version and Geriatric quality of life - Dementia were performed before and after the intervention. We confirmed that the subjects showed significant improvement in Subjective Memory Complaints and Quality of Life, but showed no significant changes in cognitive function and depression after the intervention program. Through this study, it was confirmed that this program which can affect the real life of the elderly can be usefully applied in the community. In the future, it will be necessary to develop a program that utilizes more diverse instrumental activities of daily living.

Secure Hiding of Multimodal Biometric Information Using Watermarking Method (워터마킹 기법을 이용한 다중생체정보의 안전한 은닉)

  • Lee, Uk-Jae;Lee, Dae-Jong;Jeon, Myeong-Geun
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.04a
    • /
    • pp.103-106
    • /
    • 2007
  • 본 논문에서는 얼굴, 홍채 등의 생채정보를 안전하게 은닉하고 효과적으로 은닉정보를 추출할 수 있는 웨이블렛 기반 워터마킹 기법을 제안한다. 얼굴과 홍채의 특징데이터는 Fuzzy-LDA(Fuzzy-Based Linear Discriminant Analysis)를 이용하여 추출하였다. 워터마킹알고리즘은 Wavelet을 이용하여 생체이미지에 생체특징 삽입 이전의 생체 인식율과 워터마킹알고리즘을 거쳐 생체특징을 추출한 후의 인식률 비교를 통해 성능을 평가하였다. 또한 단일생체특징 삽입과 다중생체특징삽입을 통해 단일생체보안과 다중생체보안의 실험을 수행, 평가하였다.

  • PDF

Implementation of Pen-Gesture Recognition System for Multimodal User Interface (멀티모달 사용자 인터페이스를 위한 펜 제스처인식기의 구현)

  • 오준택;이우범;김욱현
    • Proceedings of the IEEK Conference
    • /
    • 2000.11c
    • /
    • pp.121-124
    • /
    • 2000
  • In this paper, we propose a pen gesture recognition system for user interface in multimedia terminal which requires fast processing time and high recognition rate. It is realtime and interaction system between graphic and text module. Text editing in recognition system is performed by pen gesture in graphic module or direct editing in text module, and has all 14 editing functions. The pen gesture recognition is performed by searching classification features that extracted from input strokes at pen gesture model. The pen gesture model has been constructed by classification features, ie, cross number, direction change, direction code number, position relation, distance ratio information about defined 15 types. The proposed recognition system has obtained 98% correct recognition rate and 30msec average processing time in a recognition experiment.

  • PDF

PSA: A Photon Search Algorithm

  • Liu, Yongli;Li, Renjie
    • Journal of Information Processing Systems
    • /
    • v.16 no.2
    • /
    • pp.478-493
    • /
    • 2020
  • We designed a new meta-heuristic algorithm named Photon Search Algorithm (PSA) in this paper, which is motivated by photon properties in the field of physics. The physical knowledge involved in this paper includes three main concepts: Principle of Constancy of Light Velocity, Uncertainty Principle and Pauli Exclusion Principle. Based on these physical knowledges, we developed mathematical formulations and models of the proposed algorithm. Moreover, in order to confirm the convergence capability of the algorithm proposed, we compared it with 7 unimodal benchmark functions and 23 multimodal benchmark functions. Experimental results indicate that PSA has better global convergence and higher searching efficiency. Although the performance of the algorithm in solving the optimal solution of certain functions is slightly inferior to that of the existing heuristic algorithm, it is better than the existing algorithm in solving most functions. On balance, PSA has relatively better convergence performance than the existing metaheuristic algorithms.

Global Search Strategy using Enhanced Bacteria Chemotaxis algorithm (개선된 Bacteria Chemotaxis 알고리즘을 이용한 전역적 탐색 기법)

  • Park Jong Won;Park J.E.;Oh K.W.
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.790-792
    • /
    • 2005
  • 함수 최적화는 주어진 자원의 한도 내에서 최대의 이익 흑은 최소의 손실을 내는 최선의 결정을 내리는 것을목표로 한다. 본 논문은 $M{\ddot{u}}ller$의 연구를 바탕으로 박테리아의 주화성을 형상화한 'Chemical Sensing Bacteria Chemotaxis'라는 알고리즘을 제안한다. 이 알고리즘은 multimodal 환경에서의 전역 탐색을 목표로 한다. 또한 실험을 통해, 제안 알고리즘의 타당성을 분석하고, 결과적으로 제안 알고리즘이 기존의 자연계 기반의 알고리즘에 비해 경쟁력이 있음을 입증하였다.

  • PDF