• Title/Summary/Keyword: speech analysis

Search Result 1,592, Processing Time 0.034 seconds

Contextual In-Video Advertising Using Situation Information (상황 정보를 활용한 동영상 문맥 광고)

  • Yi, Bong-Jun;Woo, Hyun-Wook;Lee, Jung-Tae;Rim, Hae-Chang
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.8
    • /
    • pp.3036-3044
    • /
    • 2010
  • With the rapid growth of video data service, demand to provide advertisements or additional information with regard to a particular video scene is increasing. However, the direct use of automated visual analysis or speech recognition on videos virtually has limitations with current level of technology; the metadata of video such as title, category information, or summary does not reflect the content of continuously changing scenes. This work presents a new video contextual advertising system that serves relevant advertisements on a given scene by leveraging the scene's situation information inferred from video scripts. Experimental results show that the use of situation information extracted from scripts leads to better performance and display of more relevant advertisements to the user.

VOC Summarization and Classification based on Sentence Understanding (구문 의미 이해 기반의 VOC 요약 및 분류)

  • Kim, Moonjong;Lee, Jaean;Han, Kyouyeol;Ahn, Youngmin
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.1
    • /
    • pp.50-55
    • /
    • 2016
  • To attain an understanding of customers' opinions or demands regarding a companies' products or service, it is important to consider VOC (Voice of Customer) data; however, it is difficult to understand contexts from VOC because segmented and duplicate sentences and a variety of dialog contexts. In this article, POS (part of speech) and morphemes were selected as language resources due to their semantic importance regarding documents, and based on these, we defined an LSP (Lexico-Semantic-Pattern) to understand the structure and semantics of the sentences and extracted summary by key sentences; furthermore the LSP was introduced to connect the segmented sentences and remove any contextual repetition. We also defined the LSP by categories and classified the documents based on those categories that comprise the main sentences matched by LSP. In the experiment, we classified the VOC-data documents for the creation of a summarization before comparing the result with the previous methodologies.

A Study on Performance Improvement of FIR Digital Filter using Modified Window Function (변형된 창함수를 이용한 FIR 디지털 필터의 성능 향상에 관한 연구)

  • Kim, Nam-Ho;Ku, Bon-Seok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.06a
    • /
    • pp.758-761
    • /
    • 2007
  • Digital signal processing technique is applied in wide fields such as speech processing, image processing and spectrum analysis. Therefore, in order to do frequency selective operation digital filter is used in stead of analog filter and sharp filter characteristics can be implemented. Since finite impulse response (FIR) digital filter as nonrecursive type represents linear phase response characteristics and is always stable and is used in fields regarding wave information importantly such as data transmission. And due to frequency characteristics, in order to remove the Gibbs phenomenon generating around a discontinuous point, filter is designed through window function method. Therefore, in this paper to improve performance of FIR digital filter, a modified window function was applied. And the proposed method was compared with conventional methods using peak side-lobe and transition properties in simulations.

  • PDF

Exploring Influence Factors for Peer Attachment in Korean Youth Based on Multi-Layer Perceptron Artificial Neural Networks (인공신경망을 이용한 청소년의 또래 애착 영향 요인 탐색)

  • Byeon, Haewon
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.10
    • /
    • pp.209-214
    • /
    • 2017
  • The aim of the present study was to analyze the factors that affects the peer attachment in Korean youth. Subjects were 419 middle school students (210 male, 209 female). Dependent variable was defined as peer attachment. Explanatory variables were included as gender, academic achievement satisfaction, subjective household economy level, parent - child dialogue frequency, subjective health status, depression symptom, self - esteem, subjective life satisfaction, and mobile phone dependency. In the multi-layer perceptron artificial neural network algorithm analysis, depression symptoms, gender, parent-child dialogue level for school life, subjective household economy level, subjective health status were significantly associated with peer attachment in Korean youth. Based on this result, systematic programs are required in order to prevention of peer attachment in Korean youth.

Radial Basis Function Neural Network Modeling of Depression Experience in Elementary School Students of Multi-cultural Families (방사기저함수 인공 신경망을 이용한 다문화가정 초등학생의 우울증상 경험 예측 모델링)

  • Byeon, Haewon
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.11
    • /
    • pp.293-298
    • /
    • 2017
  • The purpose of this study was to analyze the risk factors of depression in elementary school students in Korea. The subjects of the study were 23,291 elementary school students (12,016 male, 11,275 female) aged 9 to 12 years. Dependent variable was defined as experience of depression. Explanatory variables were included as sex, residential areas, social discrimination experience, experience of school violence for the past year, experience of Korean language education, experience of using multicultural family support center, reading to Korean, speaking to Korean, and writing to Korean, listening to Korean. In the RBF neural network analysis, experience of Korean education, experience of school violence, experience of Korean social discrimination, level of Korean reading were significantly associated with depression in elementary school students. In order to prevent depression in multicultural children, priority attention and counseling are needed for the group whose level of Korean reading is low.

Design of Programmable SC Filter (프로그램 가능한 SC Filter의 설계)

  • 이병수;이종악
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.11 no.3
    • /
    • pp.172-178
    • /
    • 1986
  • The recent interest in the design of filters is motivatied by the fact that such filter can be fully integrated using standard metal-oxide-semiconductor processing technology. This is due to replacing all the resistors in the active RC filter network by the switched capacitors. The voltage gain of a SC filter depends only on the rations of capacitance and these ratios can be obtained and maintained to high accuracy. Therefore, it is known that a switched capacitor is much better than a resistor in temperature and linearity characteristics. This paper proposed a programmable SC filter and proved the fact that ${omega}_0$ Q and G of this circuit can be controlled by digital signal. Experiments show that SC filter remains the low sensitivities but it can't avoid little influence of parasitic capacitance. As the transfer characteristic of the SC filter is varied with sampling frequency and resistor array, SC filtering technigue can be applied for digital processing, speech analysis and synthesis and so on.

  • PDF

Analysis of Phonatory Aerodynamic & E.G.G. during Passaggio of the Trained Male Singers (남성성악가의 Vocal Register Transition(Passaggio)시 공기역학적 변화와 EGG의 변화 연구)

  • Nam, Do-Hyun;Choi, Seong-Hee;Choi, Jae-Nam;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.21-26
    • /
    • 2004
  • Vocal Register Transition(Passaggio) is one of the most important vocal technique for classically trined male singers(tenor). Passaggio is that it bridges the chest register to head register without a noticeable voice break. Vocalist gest the feeling that voice is not locked a particular register. The purpose of this study was to clarify the difference between easy($B_3$) tone and non passaggio(F#_4$) & passaggio(F#_4$). We selected 6 trained singers(tenor), who had more than 12.6 years of experience and were well trained in passaggio technique. Simulataneous measurement was performed frequency(F0), mean flow rate(MFR), intensity(I), and subglottal pressure(Psub) using a phonatory function analyzer(Nagashima) and Closed Quotient(CQ), Jitter, Shimmer, NHR a Electro-glottography(EGG) of Lx. Speech Studio(Laryngogrph Lt, London, UK) and vocal efficiency was calculated by Carroll's method. For the tenor, target tone/a/was measured in three conditions : 1) easy phonation : $B_3$, 2) high tone without passaggio : F#_4$, 3) high tone with passaggio : F#_4$). The results revealed that F0 of the target tones between non-passaggio group and passaggio group were not significantly different though higher is F0, higher is subglottal pressure. And also CQ, MFR, Psub were increased in passagio than nonpssagio but these values were not statistically different. This study concluded that passaggio is the vocal technique to make the same quality of tone between chest register and head register in tenor.

  • PDF

Development of Neck-Type Electrolarynx Blueton and Acoustic Characteristic Analysis (경부형 전기인공후두 Blueton의 개발과 음향학적 성능 분석)

  • Choi, Seong-Hee;Park, Young-Jae;Park, Young-Kwan;Kim, Tae-Jung;Nam, Do-Hyun;Lim, Sung-Eun;Lee, Sung-Eun;Kim, Han-Soo;Choi, Hong-Shik;Kim, Kwang-Moon
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.37-42
    • /
    • 2004
  • Electrolarynx(EL), battery operated vibrators which are held against the neck by on-off button, has been widely used as a verbal communication method among post-laryngectomized patients. EL speech can produce easily without need of any additional surgery or special training and be used with any other methods. This institute developed a neck-typed EL named "Blueton" in commperation with EL Company Linkus, which consists of 3 parts : Vibrator part, Control part, Battery part. In this study we evaluated the acoustic characteristics of the produced voices by Blueton compared with Servox-inton using MDVP. Three EL users (2 full time users, 1 part time user) were participated. The results revelaed that NHR higher in Servox than Blueton and intensity is higher in Blueton than Servox. The spectra for vowels produced by EL speakers are mixed signals combined with talkers' vocal output and electrolarynx noise. The spectra pattern is similar with two ELs. High, SPI index and vowel spectra from MDVP demonstrated characteristics of both electrolarynxes related to noise signal. This finding suggests that Blueton helps to provide one of useful rehabilitation options in the post laryngectomy patients.

  • PDF

Coarticulation and vowel reduction in the neutral tone of Beijing Mandarin

  • Lin Maocan
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.207-207
    • /
    • 1996
  • The neutral tone is one of the most important distinguishing features in Beijing Mandarin, but there are two completely different views on its linguistic function: a special tone(Xu, 1980) versus weak stress(Chao, 1968). In this paper, the acoustic manifestation of the neutral tone will be explored to show that it is closely related to weak stress. 122 disyllabic words in which the second syllable carries the neutral tone, including 22 stress pairs, were uttered by a native male speaker of Beijing dialect and analysed by Kay Digital Sonagraph 5500-1. The results of the acoustic analysis are presented as follows: 1) The first two formants of the medial and the syllabic vowel moves towards that of central vowel with a greater magnitude in the syllable with the neutral tone than in the syllable with any of the four normal tones. Also the vowel ending, and nasal coda /n/ and / / in the syllable with the neutral tone tends to be deleted. 2) In the syllables with the neutral tone, there are strong carryover coarticulations between the medial and syllabic vowel and the preceding unvoiced consonant. In general, the vowel is affected to move towards the position of the central vowel with more greater magnitude by coronal consonant than by labial or velar consonant. 3) In the syllable with the neutral tone, when and only when it precedes a syllable with tone-4, the high vowel following [f], [ts'], [s], [ts'], [s], [tc'] or [c] tends to be voiceless. 4) It can be seen from the acoustical results of 22 stress pairs that the duration of the syllable with the neutral tone is on the average reduced to 55% of that of the syllable with the four normal tones, and the duration of the final in the syllable with neutral tone is on the average reduced to 45% of that of the final in the syllable with the four normal tones(Lin & Yan 1980). 5) The FO contour of the neutral tone is highly dependent on the preceding normal tone(Lin & Yan 1993). For a number of languages it has been found that the vowel space is reduced as the level of stress placed upon the vowel is reduced(Nord 1986). Therefore we reach the conclusion that the syllable with neutral tone is related to weak stress(Lin & Yan 1990). The neutral tone is not a special tone because the preceding normal tone.

  • PDF

Matching Pursuit Sinusoidal Modeling with Damping Factor (Damping 요소를 첨가한 매칭 퍼슈잇 정현파 모델링)

  • Jeong, Gyu-Hyeok;Kim, Jong-Hark;Lim, Joung-Woo;Joo, Gi-Ho;Lee, In-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.1
    • /
    • pp.105-113
    • /
    • 2007
  • In this paper, we propose the matching pursuit with damping factors, a new sinusoidal model improving the matching pursuit, for the codecs based on sinusoidal model. The proposed model defines damping factors by using a correlativity of parameters between the current and adjacent frame, and estimates sinusoidal parameters more accurately in analysis frame by using the matching pursuit according to damping factor, and synthesizes the final signal. Then it is possible to model efficiently without interpolation schemes. The proposed sinusoidal model shows a better speech quality without an additional delay than the conventional sinusoidal model with interpolation methods. Through the SNR(signal to noise ratio), the MOS(Mean Opinion Score), LR(Itakura-Saito likelihood ratio), and CD(cepstral distance), we compare the performance of our model with that of matching pursuit using interpolation methods.