• Title/Summary/Keyword: Subjective clustering

Search Result 45, Processing Time 0.021 seconds

Unit Generation Based on Phrase Break Strength and Pruning for Corpus-Based Text-to-Speech

  • Kim, Sang-Hun;Lee, Young-Jik;Hirose, Keikichi
    • ETRI Journal
    • /
    • v.23 no.4
    • /
    • pp.168-176
    • /
    • 2001
  • This paper discusses two important issues of corpus-based synthesis: synthesis unit generation based on phrase break strength information and pruning redundant synthesis unit instances. First, the new sentence set for recording was designed to make an efficient synthesis database, reflecting the characteristics of the Korean language. To obtain prosodic context sensitive units, we graded major prosodic phrases into 5 distinctive levels according to pause length and then discriminated intra-word triphones using the levels. Using the synthesis unit with phrase break strength information, synthetic speech was generated and evaluated subjectively. Second, a new pruning method based on weighted vector quantization (WVQ) was proposed to eliminate redundant synthesis unit instances from the synthesis database. WVQ takes the relative importance of each instance into account when clustering similar instances using vector quantization (VQ) technique. The proposed method was compared with two conventional pruning methods through objective and subjective evaluations of synthetic speech quality: one to simply limit the maximum number of instances, and the other based on normal VQ-based clustering. For the same reduction rate of instance number, the proposed method showed the best performance. The synthetic speech with reduction rate 45% had almost no perceptible degradation as compared to the synthetic speech without instance reduction.

  • PDF

A methodology for evaluating human operator's fitness for duty in nuclear power plants

  • Choi, Moon Kyoung;Seong, Poong Hyun
    • Nuclear Engineering and Technology
    • /
    • v.52 no.5
    • /
    • pp.984-994
    • /
    • 2020
  • It is reported that about 20% of accidents at nuclear power plants in Korea and abroad are caused by human error. One of the main factors contributing to human error is fatigue, so it is necessary to prevent human errors that may occur when the task is performed in an improper state by grasping the status of the operator in advance. In this study, we propose a method of evaluating operator's fitness-for-duty (FFD) using various parameters including eye movement data, subjective fatigue ratings, and operator's performance. Parameters for evaluating FFD were selected through a literature survey. We performed experiments that test subjects who felt various levels of fatigue monitor information of indicators and diagnose a system malfunction. In order to find meaningful characteristics in measured data consisting of various parameters, hierarchical clustering analysis, an unsupervised machine-learning technique, is used. The characteristics of each cluster were analyzed; fitness-for-duty of each cluster was evaluated. The appropriateness of the number of clusters obtained through clustering analysis was evaluated using both the Elbow and Silhouette methods. Finally, it was statistically shown that the suggested methodology for evaluating FFD does not generate additional fatigue in subjects. Relevance to industry: The methodology for evaluating an operator's fitness for duty in advance is proposed, and it can prevent human errors that might be caused by inappropriate condition in nuclear industries.

Colored Object Extraction using Fuzzy Neural Network (퍼지 신경회로망을 이용한 칼라 물체 추출)

  • Kim, Yong-Soo;Chung, Seung-Won
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.2
    • /
    • pp.226-231
    • /
    • 2007
  • This paper presents a method of colored object extraction from an image using the fuzzy neural network. Fuzzy neural network divides an image into two clusters. It extracts the prototypes of Cb and Cr of object and background by controlling the vigilance parameter. The proposed method extracted object regardless of the position, the size, and the intensity of object. We compared the performance of the proposed method with that of the method of using subjective threshold value. And, we compared the performance of the proposed method with that of the method of using subjective threshold value by using several images with added noises.

A Study of the Effective Method for Collecting and Analyzing Human Sensibility Applied Fuzzy Set Theory (퍼지이론을 응용한 효율적 감성 수집과 분석에 관한 연구)

  • Baek, Seung-Ryeol;Park, Beom
    • Journal of the Ergonomics Society of Korea
    • /
    • v.17 no.1
    • /
    • pp.47-54
    • /
    • 1998
  • Product design and development is very important process in enterprise activities. Reducing development time and reflecting consumer's needs is required to product design and development for increasing benefit and decreasing cost. Human sensibility ergonomics is one of the important technology of R&D in product development. However, the subjective method of human sensibility ergonomics has several problems to analyze and to Quantify experimental data and objective method of human sensibility ergonomics is still in process on study. In this research, new analyzing method is proposed for the subjective human sensibility ergonomics applied with fuzzy set theory. What is the useful theory for controlling uncertain type of information like human mind? This approach is more effective method for analyzing consumer's needs for product design and development process. At collecting needs, certainty scale is added for adapting hedge of fuzzy function. Using a kind of union operator, synthesize each item to analyze identification of each item with fuzzy hamming distance. Identification of analysis is classified with the relational weight using Relationship Chart Method, and is drawn the relationship diagram for clustering each item. A case study with sample test is conducted and demonstrated with this suggested method for more effective way.

  • PDF

Determination of Optimal Cluster Size Using Bootstrap and Genetic Algorithm (붓스트랩 기법과 유전자 알고리즘을 이용한 최적 군집 수 결정)

  • Park, Min-Jae;Jun, Sung-Hae;Oh, Kyung-Whan
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.1
    • /
    • pp.12-17
    • /
    • 2003
  • Optimal determination of cluster size has an effect on the result of clustering. In K-means algorithm, the difference of clustering performance is large by initial K. But the initial cluster size is determined by prior knowledge or subjectivity in most clustering process. This subjective determination may not be optimal. In this Paper, the genetic algorithm based optimal determination approach of cluster size is proposed for automatic determination of cluster size and performance upgrading of its result. The initial population based on attribution is generated for searching optimal cluster size. The fitness value is defined the inverse of dissimilarity summation. So this is converged to upgraded total performance. The mutation operation is used for local minima problem. Finally, the re-sampling of bootstrapping is used for computational time cost.

Recovery of Missing Motion Vectors Using Modified ALA Clustering Algorithm (수정된 ALA 클러스터링 알고리즘을 이용한 손실된 움직임 벡터 복원 방법)

  • Son, Nam-Rye;Lee, Guee-Sang
    • The KIPS Transactions:PartB
    • /
    • v.12B no.7 s.103
    • /
    • pp.755-760
    • /
    • 2005
  • To transmit a video bit stream over low bandwith, such as mobile, channels, encoding algorithms for high bit rate like H.263+ are used. In transmitting video bit-streams, packet losses cause severe degradation in image quality. This paper proposes a new algorithm for the recovery of missing or erroneous motion vectors when H.263+ bit-stream is transmitted. Considering that the missing or erroneous motion vectors are closely related with those of neighboring blocks, this paper proposes a temporal-spatial error concealment algorithm. The proposed approach is that missing or erroneous Motion Vectors(MVs) are recovered by clustering the movements of neighboring blocks by their homogeneity. MVs of neighboring blocks we clustered according to ALA(Average Linkage Algorithm) clustering and a representative value for each cluster is determined to obtain the candidate MV set. By computing the distortion of the candidates, a MV with the minimum distortion is selected. Experimental results show that the proposed algorithm exhibits better performance in subjective and objective evaluation than existing methods.

Optimal Arrangement of Patrol Ships based on k-Means Clustering for Quick Response of Marine Accidents (해양사고 신속대응을 위한 k-평균 군집화 기반 경비함정 최적배치)

  • Yoo, Sang-Lok;Jung, Cho-Young
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.23 no.7
    • /
    • pp.775-782
    • /
    • 2017
  • The position of existing patrol ships has been decided according to subjective judgments, not purely by any reasonable or scientific criteria, because of a lack of access to marine accident positions. In this study, the optimal location of patrol ships is quantitatively determined based on historical marine accident data. The study area used included the coastal sea of Pohang in South Korea. In this study, a k-means clustering algorithm was used to derive the location of patrol ships, and then a Voronoi diagram was used to divide the region around each patrol ship. As a result, the average navigation distance for patrol ships was improved by 4.4 nautical miles, and the average arrival time was improved by 13.2 minutes per marine accident. Moreover, if the locations of patrol ships need to be changed flexibly, it will be possible to optimally arrange limited resources using the technique developed in this study to ensure a fast rescue.

Wideband Speech Reconstruction Using Modular Neural Networks (모듈화한 신경 회로망을 이용한 광대역 음성 복원)

  • Woo Dong Hun;Ko Charm Han;Kang Hyun Min;Jeong Jin Hee;Kim Yoo Shin;Kim Hyung Soon
    • MALSORI
    • /
    • no.48
    • /
    • pp.93-105
    • /
    • 2003
  • Since telephone channel has bandlimited frequency characteristics, speech signal over the telephone channel shows degraded speech quality. In this paper, we propose an algorithm using neural network to reconstruct wideband speech from its narrowband version. Although single neural network is a good tool for direct mapping, it has difficulty in training for vast and complicated data. To alleviate this problem, we modularize the neural networks based on appropriate clustering of the acoustic space. We also introduce fuzzy computing to compensate for probable misclassification at the cluster boundaries. According to our simulation, the proposed algorithm showed improved performance over the single neural network and conventional codebook mapping method in both objective and subjective evaluations.

  • PDF

COMMUNITY-GENERATED ONLINE IMAGE DICTORNARY

  • Li, Guangda;Li, Haojie;Tang, Jinhui;Chua, Tat-Seng
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.178-183
    • /
    • 2009
  • Online image dictionary has become more and more popular in concepts cognition. However, for existing online systems, only very few images are manually picked to demonstrate the concepts. Currently, there is very little research found on automatically choosing large scale online images with the help of semantic analysis. In this paper, we propose a novel framework to utilize community-generated online multimedia content to visually illustrate certain concepts. Our proposed framework adapts various techniques, including the correlation analysis, semantic and visual clustering to produce sets of high quality, precise, diverse and representative images to visually translate a given concept. To make the best use of our results, a user interface is deployed, which displays the representative images according the latent semantic coherence. The objective and subjective evaluations show the feasibility and effectiveness of our approach.

  • PDF

Voice conversion using low dimensional vector mapping (낮은 차원의 벡터 변환을 통한 음성 변환)

  • Lee, Kee-Seung;Doh, Won;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.4
    • /
    • pp.118-127
    • /
    • 1998
  • In this paper, we propose a voice personality transformation method which makes one person's voice sound like another person's voice. In order to transform the voice personality, vocal tract transfer function is used as a transformation parameter. Comparing with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, mean LPC cepstrum coefficients and mean removed LPC cepstrum modeled by the low dimensional vector are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male and one female speakers. These rules are then applied to 9 sentences uttered by the same persons, and objective evaluation and subjective listening tests for the transformed speech are performed.

  • PDF