• Title/Summary/Keyword: Prominence detection


Prominence Detection Using Feature Differences of Neighboring Syllables for English Speech Clinics (영어 강세 교정을 위한 주변 음 특징 차를 고려한 강조점 검출)

  • Shim, Sung-Geon;You, Ki-Sun;Sung, Won-Yong
    • Phonetics and Speech Sciences / v.1 no.2 / pp.15-22 / 2009
  • Prominence of speech, which is often called 'accent,' greatly affects fluency in spoken American English. In this paper, we present an accurate prominence detection method that can be utilized in computer-aided language learning (CALL) systems. We employed pitch movement, overall syllable energy, 300-2200 Hz band energy, syllable duration, and spectral and temporal correlation as features to model the prominence of speech. After the features for vowel syllables of speech were extracted, prominent syllables were classified by an SVM (Support Vector Machine). To further improve accuracy, the differences in characteristics between neighboring syllables were added as additional features. We also applied a speech recognizer to extract more precise syllable boundaries. The performance of our prominence detector was measured on the Intonational Variation in English (IViE) speech corpus. We obtained 84.9% accuracy, which is about 10% higher than previous research.
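
The neighboring-syllable idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `add_neighbor_differences`, the two-value feature vectors, and the choice to treat a missing neighbor as a zero difference are all assumptions made for illustration. Each syllable's feature vector is extended with its differences from the previous and the next syllable, and the extended vectors could then be passed to a classifier such as an SVM.

```python
from typing import List


def add_neighbor_differences(features: List[List[float]]) -> List[List[float]]:
    """Append each syllable's feature differences from its left and right neighbors.

    Assumption: an edge syllable has no neighbor on one side, so the
    difference on that side is set to zero.
    """
    n = len(features)
    augmented = []
    for i, current in enumerate(features):
        # Fall back to the syllable itself at the edges, which yields zero differences.
        prev = features[i - 1] if i > 0 else current
        nxt = features[i + 1] if i < n - 1 else current
        diff_prev = [c - p for c, p in zip(current, prev)]
        diff_next = [c - q for c, q in zip(current, nxt)]
        augmented.append(list(current) + diff_prev + diff_next)
    return augmented


# Hypothetical per-syllable features: [energy, duration in seconds].
syllables = [[0.2, 0.10], [0.9, 0.25], [0.3, 0.12]]
augmented = add_neighbor_differences(syllables)
```

With two base features per syllable, each augmented vector has six values: the two original features, two differences from the previous syllable, and two differences from the next one.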

Lane Recognition Using Lane Prominence Algorithm for Unmanned Vehicles (무인차량 적용을 위한 차선강조기법 기반의 차선 인식)

  • Baek, Jun-Young;Lee, Min-Cheol
    • Journal of Institute of Control, Robotics and Systems / v.16 no.7 / pp.625-631 / 2010
  • This paper proposes a lane recognition algorithm that uses a lane prominence technique to extract lane candidates. The lane prominence technique combines an embossing effect, a lane thickness check, and mask-based lane extraction. The proposed algorithm consists of three stages: preprocessing, lane candidate extraction, and lane recognition. First, preprocessing is performed, which includes grayscale image acquisition, inverse perspective transform, and Gaussian blur. Second, lane candidates are extracted using the lane prominence technique. Finally, lanes are recognized using the Hough transform and the least squares method. To evaluate the proposed algorithm, it was applied to lane detection in rainy and nighttime conditions. The experimental results showed that the proposed algorithm can recognize lanes in various environments, which suggests that it can be applied to lane recognition for driving unmanned vehicles.
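
The final least squares step can be sketched in Python as follows. This is an illustrative sketch only, not the paper's code: the function name `fit_lane_line`, the sample points, and the decision to fit x as a function of y (a common choice for near-vertical lanes in a bird's-eye view after inverse perspective transform) are assumptions.

```python
from typing import List, Tuple


def fit_lane_line(points: List[Tuple[float, float]]) -> Tuple[float, float]:
    """Least-squares fit of x = a * y + b to (x, y) pixel coordinates.

    Fitting x against y avoids an infinite slope when the lane is
    close to vertical in the image.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_y = sum((y - mean_y) ** 2 for _, y in points)
    if var_y == 0:
        raise ValueError("points must span more than one image row")
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    a = cov_xy / var_y
    b = mean_x - a * mean_y
    return a, b


# Hypothetical lane candidate pixels as (x, y) pairs.
points = [(100.0, 0.0), (102.0, 10.0), (104.0, 20.0), (106.0, 30.0)]
a, b = fit_lane_line(points)
```

For these collinear sample points, the fit recovers a slope of about 0.2 pixels of x per row and an intercept of about 100.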

The Effect of Focus Representation and Intonational Manipulation in Phoneme Detecting (초점 실현과 운율 조작에 대한 음소지각)

  • Kim, Hee-Seung;Shin, Ji-Young;Kim, Kee-Ho
    • MALSORI / no.60 / pp.97-108 / 2006
  • The purpose of this study is to observe how Korean listeners detect a target phoneme with 'Focus' represented by prosodic prominence and question-induced semantic emphasis, and with intonational manipulation. In an automated phoneme detection task using E-Prime, Korean listeners detected phoneme targets more rapidly when the target-bearing words were in prominence position and in question-induced position. However, the presence of question-induced semantic emphasis reduced the prominence effect, so the two effects interacted: when question-induced emphasis was given as the primary cue, prominence given as a secondary cue contributed less to finding the new information. In addition, listeners responded faster to intonation with manipulation than to intonation without it.

The Perceptual effect of 'Prosodic vs. Semantic' Focus Representation in Phoneme Detecting (음소 지각에 대한 초점의 운율적 실현과 의미적 실현의 효과(I))

  • Kim Hee-Sung;Jo Min-Ha;Kim Kee-Ho
    • Proceedings of the KSPS conference / 2006.05a / pp.71-74 / 2006
  • The purpose of this study is to observe how Korean listeners detect a target phoneme with 'Focus' represented by prosodic prominence and question-induced semantic emphasis. In an automated phoneme detection task using E-Prime, Korean listeners detected phoneme targets more rapidly when the target-bearing words were in prominence position and in question-induced position. However, response time was much faster when phoneme targets were in prominence position than when they were in question-induced position. The results suggest that prosodic prominence, an explicit method of focus representation, is more effective in phoneme detection than question-induced emphasis, an implicit method.

Anomaly Intrusion Detection Based on Hyper-ellipsoid in the Kernel Feature Space

  • Lee, Hansung;Moon, Daesung;Kim, Ikkyun;Jung, Hoseok;Park, Daihee
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.3 / pp.1173-1192 / 2015
  • The Support Vector Data Description (SVDD) has achieved great success in anomaly detection by directly finding the optimal ball, defined by a center and a minimal radius, that contains most of the target data. However, the SVDD has limited classification capability because the hyper-sphere, even in feature space, can express only a limited region of the target class. This paper presents an anomaly detection algorithm that mitigates the limitations of the conventional SVDD by finding the minimum volume enclosing ellipsoid in the feature space. To evaluate the performance of the proposed approach, we tested it with intrusion detection applications. Experimental results show the prominence of the proposed approach for anomaly detection compared with the standard SVDD.

Development of machine learning model for automatic ELM-burst detection without hyperparameter adjustment in KSTAR tokamak

  • Jiheon Song;Semin Joung;Young-Chul Ghim;Sang-hee Hahn;Juhyeok Jang;Jungpyo Lee
    • Nuclear Engineering and Technology / v.55 no.1 / pp.100-108 / 2023
  • In this study, a neural network model inspired by a one-dimensional convolution U-net is developed to automatically accelerate edge localized mode (ELM) detection from big diagnostic data of fusion devices and increase the detection accuracy regardless of the hyperparameter setting. This model recognizes the input signal patterns and overcomes the problems of existing detection algorithms, such as the prominence algorithm and differential methods, which are highly sensitive to the threshold and signal intensity. To train the model, 10 sets of discharge radiation data from the KSTAR are used and sliced into 11091 inputs of length 12 ms, of which 20% are used for validation. According to the receiver operating characteristic curves, our model shows a positive prediction rate and a true prediction rate of approximately 90% each, which is comparable to the best detection performance afforded by other algorithms using their optimized hyperparameters. The accurate and automatic ELM-burst detection methodology used in our model can be beneficial for determining plasma properties, such as the ELM frequency, from big data measured in multiple experiments using machines such as the KSTAR device and ITER. Additionally, it is applicable to feature detection in the time-series data of other engineering fields.
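
For context on the baseline this abstract compares against, a prominence-based peak detector can be sketched in a few lines of Python. This is a generic sketch of the usual topographic-prominence definition for 1-D signals, not the KSTAR implementation: the function names, the sample signal, and the threshold value are hypothetical. The `min_prominence` argument plays the role of the threshold that such detectors are described as being sensitive to.

```python
from typing import List, Tuple


def peak_prominences(signal: List[float]) -> List[Tuple[int, float]]:
    """Return (index, prominence) for every strict local maximum.

    Prominence is the peak height minus the higher of the two lowest
    points found between the peak and the nearest higher sample (or the
    signal edge) on each side. Flat-topped peaks are ignored here.
    """
    results = []
    for i in range(1, len(signal) - 1):
        if not (signal[i - 1] < signal[i] > signal[i + 1]):
            continue
        peak = signal[i]
        # Walk left until a higher sample or the start of the signal.
        left_min = peak
        j = i - 1
        while j >= 0 and signal[j] <= peak:
            left_min = min(left_min, signal[j])
            j -= 1
        # Walk right until a higher sample or the end of the signal.
        right_min = peak
        k = i + 1
        while k < len(signal) and signal[k] <= peak:
            right_min = min(right_min, signal[k])
            k += 1
        results.append((i, peak - max(left_min, right_min)))
    return results


def detect_bursts(signal: List[float], min_prominence: float) -> List[int]:
    """Keep only the peaks whose prominence reaches the chosen threshold."""
    return [i for i, p in peak_prominences(signal) if p >= min_prominence]


# Hypothetical signal with one small peak and two larger ones.
signal = [0.0, 1.0, 0.5, 3.0, 0.2, 1.5, 1.0, 0.0]
prominences = peak_prominences(signal)
bursts = detect_bursts(signal, min_prominence=1.0)
```

Changing `min_prominence` changes which peaks count as bursts, which is the kind of hyperparameter adjustment the abstract's model is meant to avoid.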

Hot plasmas in coronal mass ejection observed by Hinode/XRT

  • Lee, Jin-Yi;Raymond, John C.;Reeves, Katharine K.
    • The Bulletin of The Korean Astronomical Society / v.37 no.1 / pp.97-97 / 2012
  • Hinode/XRT has observed coronal mass ejections (CMEs) since its launch in September 2006. Observing programs of Hinode/XRT, called 'CME watch', perform several binned observations to obtain large-FOV observations with long exposure times that allow the detection of faint CME plasmas at high temperatures. Using those observations, we determine the upper limit to the mass of hot CME plasma from the emission measure by assuming the observed plasma structure. In some events, an associated prominence eruption and CME plasma were observed in EUV observations as absorption or emission features. The absorption feature provides the lower limit to the cold mass, while the emission feature provides the upper limit to the mass of observed CME plasma in X-ray and EUV passbands. In addition, some events were observed by coronagraphs (SOHO/LASCO, STEREO/COR1), which allow the determination of the total CME mass. However, some events were not observed by the coronagraphs, possibly because of the low density of the CME plasma. We present the mass constraints of CME plasma and the associated prominence as determined by emission and absorption in EUV and X-ray passbands, then compare this mass to the total CME mass as derived from coronagraphs.

A Lightweight Deep Learning Model for Text Detection in Fashion Design Sketch Images for Digital Transformation

  • Ju-Seok Shin;Hyun-Woo Kang
    • Journal of the Korea Society of Computer and Information / v.28 no.10 / pp.17-25 / 2023
  • In this paper, we propose a lightweight deep learning architecture tailored for efficient text detection in fashion design sketch images. Given the increasing prominence of Digital Transformation in the fashion industry, there is a growing emphasis on harnessing digital tools for creating fashion design sketches. As digitization becomes more pervasive in the fashion design process, the initial stages of text detection and recognition take on pivotal roles. In this study, a lightweight network was designed by building upon existing text detection deep learning models, taking into consideration the unique characteristics of apparel design drawings. Additionally, a separately collected dataset of apparel design drawings was added to train the deep learning model. Experimental results underscore the superior performance of our proposed deep learning model, outperforming existing text detection models by approximately 20% when applied to fashion design sketch images. As a result, this paper is expected to contribute to the Digital Transformation in the field of clothing design by means of research on optimizing deep learning models and detecting specialized text information.

The advantage of topographic prominence-adopted filter for the detection of short-latency spikes of retinal ganglion cells

  • Ahn, Jungryul;Choi, Myoung-Hwan;Kim, Kwangsoo;Senok, Solomon S.;Cho, Dong-il Dan;Koo, Kyo-in;Goo, Yongsook
    • The Korean Journal of Physiology and Pharmacology / v.21 no.5 / pp.555-563 / 2017
  • Electrical stimulation through a retinal prosthesis elicits both short- and long-latency retinal ganglion cell (RGC) spikes. Because the short-latency RGC spike is usually obscured by the electrical stimulus artifact, it is very important to isolate the spike from the stimulus artifact. Previously, we showed that a topographic prominence (TP) discriminator-based algorithm is valid and useful for artifact subtraction. In this study, we compared the performance of a forward-backward (FB) filter alone vs. a TP-adopted FB filter for artifact subtraction. From the extracted retinae of rd1 mice, we recorded RGC spikes with an 8×8 multielectrode array (MEA). The recorded signals were classified into four groups by the distance between the stimulation and recording electrodes on the MEA (200-400, 400-600, 600-800, and 800-1000 μm). Fifty cathodic-phase-first biphasic current pulses (duration 500 μs; intensity 5, 10, 20, 30, 40, 50, or 60 μA) were applied every 1 sec. We compared the false positive and false negative errors of the FB filter and the TP-adopted FB filter. With the TP-adopted FB filter, short-latency spikes were detected with better sensitivity and specificity, regardless of the stimulus strength and the distance between the stimulation and recording electrodes.

Hybrid Tensor Flow DNN and Modified Residual Network Approach for Cyber Security Threats Detection in Internet of Things

  • Alshehri, Abdulrahman Mohammed;Fenais, Mohammed Saeed
    • International Journal of Computer Science & Network Security / v.22 no.10 / pp.237-245 / 2022
  • The prominence of the IoT (Internet of Things) and the exponential advancement of computer networks have resulted in a massive number of essential applications. Recognizing various cyber-attacks or anomalies in networks and establishing effective intrusion recognition systems are becoming increasingly vital to current security. MLTs (Machine Learning Techniques) can be developed for such data-driven intelligent recognition systems. Researchers have employed TFDNNs (Tensor Flow Deep Neural Networks) and DCNNs (Deep Convolution Neural Networks) to recognize pirated software and malware efficiently. However, tuning the number of neurons in multiple layers with activation functions leads to learning errors, degrading the classifier's reliability. HTFDNNs (Hybrid Tensor Flow DNNs) and MRNs (Modified Residual Networks), or ResNet CNNs, were presented to recognize software piracy and malware. This study proposes HTFDNNs to identify stolen software, starting with plagiarized source code. This work uses tokens and weights to filter noise while focusing on tokens to identify source code theft. DLTs (Deep Learning Techniques) are then used to detect plagiarized sources. Data from Google Code Jam is used for finding software piracy. MRNs visualize colour images to identify harm in networks using the IoT. Malware samples from the Maling dataset are used for tests in this work.