Search | Korea Science

Speech Query Recognition for Tamil Language Using Wavelet and Wavelet Packets

Iswarya, P.;Radha, V.
- Journal of Information Processing Systems
- /
- v.13 no.5
- /
- pp.1135-1148
- /
- 2017
Speech recognition is one of the fascinating fields in the area of Computer science. Accuracy of speech recognition system may reduce due to the presence of noise present in speech signal. Therefore noise removal is an essential step in Automatic Speech Recognition (ASR) system and this paper proposes a new technique called combined thresholding for noise removal. Feature extraction is process of converting acoustic signal into most valuable set of parameters. This paper also concentrates on improving Mel Frequency Cepstral Coefficients (MFCC) features by introducing Discrete Wavelet Packet Transform (DWPT) in the place of Discrete Fourier Transformation (DFT) block to provide an efficient signal analysis. The feature vector is varied in size, for choosing the correct length of feature vector Self Organizing Map (SOM) is used. As a single classifier does not provide enough accuracy, so this research proposes an Ensemble Support Vector Machine (ESVM) classifier where the fixed length feature vector from SOM is given as input, termed as ESVM_SOM. The experimental results showed that the proposed methods provide better results than the existing methods.
https://doi.org/10.3745/JIPS.02.0033 인용 PDF KSCI

A Statistical Study of CMP Process in Various Scales (CMP 프로세스의 통계적인 다규모 모델링 연구)

석종원
- Transactions of the Korean Society of Mechanical Engineers A
- /
- v.27 no.12
- /
- pp.2110-2117
- /
- 2003
A physics-based material removal model in various scales is described and a feature scale simulation for a chemical mechanical polishing (CMP) process is performed in this work. Three different scales are considered in this model, i.e., abrasive particle scale, asperity scale and wafer scale. The abrasive particle and the asperity scales are combined together and then homogenized to result in force balance conditions to be satisfied in the wafer scale using an extended Greenwood-Williamson and Whitehouse-Archard statistical model that takes into consideration the joint distribution of asperity heights and asperity tip radii. The final computation is made to evaluate the material removal rate in wafer scale and a computer simulation is performed for detailed surface profile variations on a representative feature. The results show the dependence of the material removal rate on the joint distribution, applied external pressure, relative velocity, and other operating conditions and design parameters.
https://doi.org/10.3795/KSME-A.2003.27.12.2110 인용 PDF KSCI

Karyotype Classification of The Chromosome Image using Hierarchical Neural Network (계층형 신경회로망을 이용한 염색체 영상의 핵형 분류)

장용훈
- Journal of the Korea Computer Industry Society
- /
- v.2 no.8
- /
- pp.1045-1054
- /
- 2001
To improve classification accuracy in this paper, we proposed an algorithm for the chromosome image reconstruction in the image preprocessing part and also proposed the pattern classification method using the hierarchical multilayer neural network(HMNN) to classify the chromosome karyotype. It reconstructed chromosome images for twenty normal human chromosome by the image reconstruction algorithm. The four morphological and ten density feature parameters were extracted from the 920 reconstructed chromosome images. The each combined feature parameters of ten human chromosome images were used to learn HMNN and the rest of them were used to classify the chromosome images. The experimental results in this paper were composed to optimized HMNN and also obtained about 98.26％ to recognition ratio.
PDF

Non-linear incidental dynamics of frame structures

Radoicic, Goran N.;Jovanovic, Miomir Lj.;Marinkovic, Dragan Z.
- Structural Engineering and Mechanics
- /
- v.52 no.6
- /
- pp.1193-1208
- /
- 2014
A simulation of failures on responsible elements is only one form of the extreme structural behavior analysis. By understanding the dynamic behavior in incidental situations, it is possible to make a special structural design from the point of the largest axial force, stress and redundancy. The numerical realization of one such simulation analysis was performed using FEM in this paper. The boundary parameters of transient analysis, such as overall structural damping coefficient, load accelerations, time of load fall and internal forces in the responsible structural elements, were determined on the basis of the dynamic experimental parameters. The structure eigenfrequencies were determined in modal analysis. In the study, the basic incidental models were set. The models were identified by many years of monitoring incidental situations and the most frequent human errors in work with heavy structures. The combined load models of structure are defined in the paper since the incidents simply arise as consequences of cumulative errors and failures. A feature of a combined model is that the single incident causes the next incident (consecutive timing) as well as that other simple dynamic actions are simultaneous. The structure was observed in three typical load positions taken from the crane passport (range-load). The obtained dynamic responses indicate the degree of structural sensitivity depending on the character of incident. The dynamic coefficient KD was adopted as a parameter for the evaluation of structural sensitivity.
https://doi.org/10.12989/sem.2014.52.6.1193 인용 KSCI

Space Charge Behavior of Oil-Impregnated Paper Insulation Aging at AC-DC Combined Voltages

Li, Jian;Wang, Yan;Bao, Lianwei
- Journal of Electrical Engineering and Technology
- /
- v.9 no.2
- /
- pp.635-642
- /
- 2014
The space charge behaviors of oil-paper insulation affect the stability and security of oil-filled converter transformers of traditional and new energies. This paper presents the results of the electrical aging of oil-impregnated paper under AC-DC combined voltages by the pulsed electro-acoustic technique. Data mining and feature extractions were performed on the influence of electrical aging on charge dynamics based on the experiment results in the first stage. Characteristic parameters such as total charge injection and apparent charge mobility were calculated. The influences of electrical aging on the trap energy distribution of an oil-paper insulation system were analyzed and discussed. Longer electrical aging time would increase the depth and energy density of charge trap, which decelerates the apparent charge mobility and increases the probability of hot electron formation. This mechanism would accelerate damage to the cellulose and the formation of discharge channels, enhance the acceleration of the electric field distortion, and shorten insulation lifetime under AC-DC combined voltages.
https://doi.org/10.5370/JEET.2014.9.2.635 인용 PDF KSCI KPUBS HTML

Ensemble Deep Learning Model using Random Forest for Patient Shock Detection

Minsu Jeong;Namhwa Lee;Byuk Sung Ko;Inwhee Joe
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.17 no.4
- /
- pp.1080-1099
- /
- 2023
Digital healthcare combined with telemedicine services in the form of convergence with digital technology and AI is developing rapidly. Digital healthcare research is being conducted on many conditions including shock. However, the causes of shock are diverse, and the treatment is very complicated, requiring a high level of medical knowledge. In this paper, we propose a shock detection method based on the correlation between shock and data extracted from hemodynamic monitoring equipment. From the various parameters expressed by this equipment, four parameters closely related to patient shock were used as the input data for a machine learning model in order to detect the shock. Using the four parameters as input data, that is, feature values, a random forest-based ensemble machine learning model was constructed. The value of the mean arterial pressure was used as the correct answer value, the so called label value, to detect the patient's shock state. The performance was then compared with the decision tree and logistic regression model using a confusion matrix. The average accuracy of the random forest model was 92.80%, which shows superior performance compared to other models. We look forward to our work playing a role in helping medical staff by making recommendations for the diagnosis and treatment of complex and difficult cases of shock.
https://doi.org/10.3837/tiis.2023.04.003 인용 PDF HTML

Development of Facial Expression Recognition System based on Bayesian Network using FACS and AAM (FACS와 AAM을 이용한 Bayesian Network 기반 얼굴 표정 인식 시스템 개발)

Ko, Kwang-Eun;Sim, Kwee-Bo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.19 no.4
- /
- pp.562-567
- /
- 2009
As a key mechanism of the human emotion interaction, Facial Expression is a powerful tools in HRI(Human Robot Interface) such as Human Computer Interface. By using a facial expression, we can bring out various reaction correspond to emotional state of user in HCI(Human Computer Interaction). Also it can infer that suitable services to supply user from service agents such as intelligent robot. In this article, We addresses the issue of expressive face modeling using an advanced active appearance model for facial emotion recognition. We consider the six universal emotional categories that are defined by Ekman. In human face, emotions are most widely represented with eyes and mouth expression. If we want to recognize the human's emotion from this facial image, we need to extract feature points such as Action Unit(AU) of Ekman. Active Appearance Model (AAM) is one of the commonly used methods for facial feature extraction and it can be applied to construct AU. Regarding the traditional AAM depends on the setting of the initial parameters of the model and this paper introduces a facial emotion recognizing method based on which is combined Advanced AAM with Bayesian Network. Firstly, we obtain the reconstructive parameters of the new gray-scale image by sample-based learning and use them to reconstruct the shape and texture of the new image and calculate the initial parameters of the AAM by the reconstructed facial model. Then reduce the distance error between the model and the target contour by adjusting the parameters of the model. Finally get the model which is matched with the facial feature outline after several iterations and use them to recognize the facial emotion by using Bayesian Network.
https://doi.org/10.5391/JKIIS.2009.19.4.562 인용 PDF KSCI

Emotion Recognition using Robust Speech Recognition System (강인한 음성 인식 시스템을 사용한 감정 인식)

Kim, Weon-Goo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.18 no.5
- /
- pp.586-591
- /
- 2008
This paper studied the emotion recognition system combined with robust speech recognition system in order to improve the performance of emotion recognition system. For this purpose, the effect of emotional variation on the speech recognition system and robust feature parameters of speech recognition system were studied using speech database containing various emotions. Final emotion recognition is processed using the input utterance and its emotional model according to the result of speech recognition. In the experiment, robust speech recognition system is HMM based speaker independent word recognizer using RASTA mel-cepstral coefficient and its derivatives and cepstral mean subtraction(CMS) as a signal bias removal. Experimental results showed that emotion recognizer combined with speech recognition system showed better performance than emotion recognizer alone.
https://doi.org/10.5391/JKIIS.2008.18.5.586 인용 PDF KSCI

Visual servoing of robot manipulators using the neural network with optimal structure (최적화된 신경회로망을 이용한 동적물체의 비주얼 서보잉)

김대준;전효병;심귀보
- 제어로봇시스템학회:학술대회논문집
- /
- 1996.10b
- /
- pp.302-305
- /
- 1996
This paper presents a visual servoing combined by Neural Network with optimal structure and predictive control for robotic manipulators to tracking or grasping of the moving object. Using the four feature image information from CCD camera attached to end-effector of RV-M2 robot manipulator having 5 dof, we want to predict the updated position of the object. The Kalman filter is used to estimate the motion parameters, namely the state vector of the moving object in successive image frames, and using the multi layer feedforward neural network that permits the connection of other layers, evolutionary programming(EP) that search the structure and weight of the neural network, and evolution strategies(ES) which training the weight of neuron, we optimized the net structure of control scheme. The validity and effectiveness of the proposed control scheme and predictive control of moving object will be verified by computer simulation.
PDF

Speech/Music Signal Classification Based on Spectrum Flux and MFCC For Audio Coder (오디오 부호화기를 위한 스펙트럼 변화 및 MFCC 기반 음성/음악 신호 분류)

Sangkil Lee;In-Sung Lee
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.16 no.5
- /
- pp.239-246
- /
- 2023
In this paper, we propose an open-loop algorithm to classify speech and music signals using the spectral flux parameters and Mel Frequency Cepstral Coefficients(MFCC) parameters for the audio coder. To increase responsiveness, the MFCC was used as a short-term feature parameter and spectral fluxes were used as a long-term feature parameters to improve accuracy. The overall voice/music signal classification decision is made by combining the short-term classification method and the long-term classification method. The Gaussian Mixed Model (GMM) was used for pattern recognition and the optimal GMM parameters were extracted using the Expectation Maximization (EM) algorithm. The proposed long-term and short-term combined speech/music signal classification method showed an average classification error rate of 1.5% on various audio sound sources, and improved the classification error rate by 0.9% compared to the short-term single classification method and 0.6% compared to the long-term single classification method. The proposed speech/music signal classification method was able to improve the classification error rate performance by 9.1% in percussion music signals with attacks and 5.8% in voice signals compared to the Unified Speech Audio Coding (USAC) audio classification method.
https://doi.org/10.17661/jkiiect.2023.16.5.239 인용 PDF HTML

Search Result 27, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)