• Title/Summary/Keyword: Voice function

Search Result 434, Processing Time 0.036 seconds

A transmit function implementation of wireless LAN MAC with QoS using single transmit FIFO (단일 송신 피포를 이용한 QoS 기능의 무선랜 MAC의 송신 기능 구현)

  • Park, Chan-Won;Kim, Jung-Sik;Kim, Bo-Kwan
    • Proceedings of the KIEE Conference
    • /
    • 2004.11c
    • /
    • pp.237-239
    • /
    • 2004
  • Wireless LAN Voice over IP(VoIP) equipment needs Quality-of-Service(QoS) with priority for processing real-time traffic. This paper shows transmit function implementation of wireless LAN(WLANs) media access control(MAC) support VoIP, and it has an advantage of guarantee of QoS and is adaptable to VoIP or mobile wireless equipment. The IEEE 802.11e standard in progress has four queues according to four access categories(AC) for transmit and the MAC transmits the data based on EDCA. The value of AC is from AC0 to AC3 and AC3 has the highest priority. The transmit method implemented at this paper ensure QoS using one transmit FIFO in hardware since real-time traffic data and non real-time traffic data has the different priority. The device driver classifies real-time data and non real-time data and transmit data to hardware with information about data type. The hardware conducts shorter backoff and selects faster AIFS slot for real-time data than it for non real-time data. Therefor It make give the real-time traffic data faster channel access chance than non real-time data and enhances QoS.

  • PDF

Development of Half-Mirror Interface System and Its Application for Ubiquitous Environment (유비쿼터스 환경을 위한 하프미러형 인터페이스 시스템 개발과 응용)

  • Kwon Young-Joon;Kim Dae-Jin;Lee Sang-Wan;Bien Zeungnam
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.12
    • /
    • pp.1020-1026
    • /
    • 2005
  • In the era of ubiquitous computing, human-friendly man-machine interface is getting more attention due to its possibility to offer convenient services. For this, in this paper, we introduce a 'Half-Mirror Interface System (HMIS)' as a novel type of human-friendly man-machine interfaces. Basically, HMIS consists of half-mirror, USB-Webcam, microphone, 2ch-speaker, and high-speed processing unit. In our HMIS, two principal operation modes are selected by the existence of the user in front of it. The first one, 'mirror-mode', is activated when the user's face is detected via USB-Webcam. In this mode, HMIS provides three basic functions such as 1) make-up assistance by magnifying an interested facial component and TTS (Text-To-Speech) guide for appropriate make-up, 2) Daily weather information provider via WWW service, 3) Health monitoring/diagnosis service using Chinese medicine knowledge. The second one, 'display-mode' is designed to show decorative pictures, family photos, art paintings and so on. This mode is activated when the user's face is not detected for a time being. In display-mode, we also added a 'healing-window' function and 'healing-music player' function for user's psychological comfort and/or relaxation. All these functions are accessible by commercially available voice synthesis/recognition package.

Voice Activity Detection Algorithm using Wavelet Band Entropy Ensemble Analysis in Car Noisy Environments (프로세싱에서 삼각함수 공식을 응용한 장식적 타입페이스 제안)

  • Chun, Christine Hyeyeon
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.12
    • /
    • pp.1992-1999
    • /
    • 2017
  • This study proposes a decorative typeface which is produced through the concept of trigonometric functions in an open-source programming language known as Processing. First, the theoretical background of Processing and trigonometric functions as well as previous research in this area are analyzed. Second, basic modules of 'V', 'I', 'O', and 'M' were created for use as the final alphabet typeface with the concept of a trigonometric function. Third, a decorative parabolic curve that encircles the base module was created. Finally, the modules created on Processing were edited in Adobe Illustrator to create a typeface set with characters from A to Z. Various artworks using Programming can produce an infinite number of different versions by modifying only some of the variables and codes, and this method can include multimedia features such as text, images, videos, interactive art and various forms of content and media. Therefore, with regard to expression, the possibilities are endless. In this study, I attempt to expand the field of visual culture using programming and computational methodologies. In contrast to the digital typeface production method, which relies on existing graphic tools, this study is meaningful because it expands the range of use of decorative typefaces.

Audio Guidance Application For Commodity Prices Using Public Data And AI Chatbot (공공데이터와 AI챗봇을 이용한 물가 음성안내 앱 서비스)

  • Lee, Jae-Seon;Kang, Kyeong-Don;Park, Tae-Yok;Jung, Deok-Gil
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.251-253
    • /
    • 2018
  • As the prices of agricultural, fishery, and dairy products have been fluctuating due to recent instability on commodity prices, so consumers have been more inclined to make purchase without specific criteria by relying on marketing or their personal experiences and senses of market. The core function of this application is precisely and conveniently telling the consumption index to consumers who are waved by unstable commodity prices by helping users to easily understand the price index of agricultural, fishery, and dairy products in real time using public data. And, it also includes the AI Chatbot and voice recognition function, and meets the convenience of natural language processing and hands-free etc..

  • PDF

Efficient Iris Recognition using Deep-Learning Convolution Neural Network (딥러닝 합성곱 신경망을 이용한 효율적인 홍채인식)

  • Choi, Gwang-Mi;Jeong, Yu-Jeong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.15 no.3
    • /
    • pp.521-526
    • /
    • 2020
  • This paper presents an improved HOLP neural network that adds 25 average values to a typical HOLP neural network using 25 feature vector values as input values by applying high-order local autocorrelation function, which is excellent for extracting immutable feature values of iris images. Compared with deep learning structures with different types, we compared the recognition rate of iris recognition using Back-Propagation neural network, which shows excellent performance in voice and image field, and synthetic product neural network that integrates feature extractor and classifier.

Development of Customer-Oriented Quality Design Elements of Shoes based on QFD (QFD 기반에 의한 제화류의 감성지향적 품질설계 요소도출에 관한 실증적 연구)

  • 김진호;황인극
    • Journal of Korean Society for Quality Management
    • /
    • v.32 no.1
    • /
    • pp.130-143
    • /
    • 2004
  • Although consumer needs for better products force manufactures to put emphasis on design, often development of a product has been done without the formal process to consider consumer needs. In order to identify the implicit needs of customers and the areas of potential demand on a product, several analysis scheme such as QFD (Quality Function Deployment) has been developed. QFD, also known as the House of Quality, is the efficient tool ever created to tie product and service design decisions directly to customer wants and needs, i.e. VoC (Voice of Customer) To utilize this tool on a product design, first of all, the consumers attributes and the engineering characteristics must be exactly investigated. However there were only few studies about them on shoe design. Hence in this paper we developed an innovative framework for shoes design based on QFD. As a result, we uncovered 29 dominant human satisfaction dimensions as the consumers attributes for customer-oriented quality evaluation of a comfortable shoes. Here, 29 human satisfaction dimensions for a shoe design were identified as the dimensions that represent the human sensitivity and psychological feeling on comfortable shoes. Also, we proposed 60 human interface elements as the engineering characteristics. The relationships between human satisfaction dimensions and human interface elements were investigated. This study will help the designers and manufacturers clarify the conceptual and abstract aspect of the design evaluation by proposing a more systematic and process-oriented method.

Aerodynamic Characteristics of Korean Bilabial Stop Consonant as a Function of Phonemic Position in a Syllable (음절내 음소 출현 위치에 따른 한국어 양순 파열음의 공기역학적인 특징)

  • Park, Sang-Hee;Jeong, Haeng-Im;Jeong, Ok-Ran;Seok, Dong-Il
    • Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.59-75
    • /
    • 2002
  • Aerodynamic analysis study was performed on 14 normal subjects (2 males, 12 females) with nonsense syllables composed of Korean bilabial stops (/p, p', $p^{h}$) and their preceding and/or following vowels, /i, a, u/. That is, [pi, p'i, $p^{h}i$, pa, p'a, $p^{h}a$, pu, p'u, $p^{h}u$, ipi, apa, upu, $ip^{h}i$, $ap^{h}a$, $up^{h}u$, ip'i, ap'a, up'u]. All measures were taken and analysed using Aerophone II voice function analyzer and included peak air pressure, mean air pressure, maximum flow rate, volume, mean SPL and phonatory SPL. A t-test and one-way ANOVA were employed for analysis. A post-hoc analysis was performed with Scheffe and Bonferroni. The results were as follows: First, MSPL. and MAP of /p, p', $p^{h}$/ were significantly different in different positions (initial and medial position). In addition, different vowel environment also produced significantly different aerodynamic characteristics those consonants. Especially the lax consonant /p/ was significantly different /i, a, u/ vowel environments. The tense consonant /p'/ was significantly different only /i/ vowel environment.

  • PDF

Subglottic Air Pressure in Different Phonetic Context (음성학적 문맥에 따른 성문하압의 차이에 관한 연구)

  • 박상희;정옥란;석동일
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.13 no.1
    • /
    • pp.23-27
    • /
    • 2002
  • The purpose of the study is to examine differences in subglottic air pressure as a function of phonetic context. The phonetic contexts consisted of $/i:{p^h}i:{p^h}i:/,/{p^h}i:{p^h}i:/, and /{p^h}{p^h}/$. The aerodynamic and phonatory parameters are investigated in 20 female normal adults. All measurements are taken and analysed using Aerophone II voice function analyzer. The aerodynamic parameters are Peak Air Pressure(PAP) and Mean Air Pressure(MAP), and the phonatory parameters are Phonatory Flow Rate(PFR) Maximum SPL(MSPL), Phonatory SPL(PSPL), Phonatory Power (PP), Phonatory Efficiency(PE), and Phonatory $Resistance^*$ 10-5(PR). A one-way ANOVA revealed the following results. First, the aerodynamic parameters are not significantly different. Second, Peak Air Pressure(PAP) and Mean Air Pressure(MAP), as well as the phonatory parameters such as Phonatory Flow Rate(PFR) Maximum SPL(MSPL), Phonatory SPL(PSPL), and Phonatory Efficiency(PE) were significantly different. Therefore, it is advised that clinicians use only aerodynamic parameters but phonatory parameters when using Aerophone II.

  • PDF

Extraction of CTQ for the Improvement of the Education Quality Using QFD in College (QFD를 이용한 전문대학 공학부 교육내실화 품질요소 도출)

  • Park, Byoung-Tae;Kim, Bok-Key;Kwak, Moon-Su;Lee, Eun-Soo
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.1
    • /
    • pp.231-239
    • /
    • 2013
  • This intensity is now on a global scale with countless universities across the globe competing for better services, programs and diplomas. For to counteract such a considerable change, in this paper CTQ(Critical to Quality) is extracted for the improvement of the education quality using QFD(Quality Function Development) in college. QFD is a structured approach to seek out voice of customers, understanding their needs, and ensure that their needs are met. First of all, the requirements of the customer are surveyed and analyzed, and then with these results the strategic alternatives are decided. In sequence, the importance and assessment ratings on the requirement of customers are surveyed. Finally, from the relation between the requirement of customers and the strategic alternatives the CTQ is extracted. The derived CTQ is reviewed and analyzed in detail. It'll have major positive effects on the competitiveness of college as well as the education quality of departments.

Speaker Verification Model Using Short-Time Fourier Transform and Recurrent Neural Network (STFT와 RNN을 활용한 화자 인증 모델)

  • Kim, Min-seo;Moon, Jong-sub
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.29 no.6
    • /
    • pp.1393-1401
    • /
    • 2019
  • Recently as voice authentication function is installed in the system, it is becoming more important to accurately authenticate speakers. Accordingly, a model for verifying speakers in various ways has been suggested. In this paper, we propose a new method for verifying speaker verification using a Short-time Fourier Transform(STFT). Unlike the existing Mel-Frequency Cepstrum Coefficients(MFCC) extraction method, we used window function with overlap parameter of around 66.1%. In this case, the speech characteristics of the speaker with the temporal characteristics are studied using a deep running model called RNN (Recurrent Neural Network) with LSTM cell. The accuracy of proposed model is around 92.8% and approximately 5.5% higher than that of the existing speaker certification model.