• Title/Summary/Keyword: Voice function


State Visualization Design of AI Speakers using Color Field Painting (색면추상 기법을 통한 AI 스피커의 상태 시각화 디자인 연구)

  • Hong, Seung Yoon;Choe, Jong-Hoon
    • The Journal of the Korea Contents Association / v.20 no.2 / pp.572-580 / 2020
  • Recently released AI speakers interact with the user mainly by voice while displaying simple, formal visual feedback through a status LED. This stems from the limitations of the speaker as a product, which make varied interaction difficult; moreover, such visual feedback is not standardized across products and therefore does not give a consistent user experience. By maximizing the visual elements that can be expressed through color and abstract movement to assist voice feedback, a product can provide the user with an extended experience that includes not only functional but also emotional satisfaction. In this study, after analyzing the interaction methods of existing AI speakers, we examined the theory of color communication in order to expand the visual feedback effect, and examined the meaning and expressive techniques of Color Field Painting, an art genre that maximizes emotional experience by using only color. On this basis, we expanded the AI speaker's visual communication function by designing a way to give feedback on communication status using LED light.

Analysis of Voice Parameters Variation during Passaggio of the Trained Male Singers (남성 성악가의 Passaggio시 음성변화연구)

  • Nam, Do-Hyun;Ahn, Chul-Min;Choi, Sung-Hee;Hong, Jin-Hee;Lee, Sung-Eun;Choi, Hong-Shik
    • Speech Sciences / v.9 no.4 / pp.15-25 / 2002
  • Producing very high tones while singing is difficult not only for untrained people but even for trained singers. To reach high tones from low tones, some trained singers use a distinctive singing technique, passaggio (vocal register transition). The purpose of this study is to compare the variation of several voice parameters between singing with and without the passaggio technique. We selected 18 male singers (8 tenors, 10 baritones) who had more than 7 years of experience and were well trained in the passaggio technique. Simultaneous measurements of fundamental frequency (F0), mean flow rate (MFR), intensity (I), and subglottal pressure (Psub) were performed using a phonatory function analyzer (Nagashima). For the tenors, the target tones on /a/ were: 1) easy phonation: B$_{2}$, 2) high tone without passaggio: F$\#_{3}$, 3) high tone with passaggio: F$\#_{3}$. For the baritones, the target tones on /a/ were: 1) easy phonation: G$\#_{3}$, 2) high tone without passaggio: D$\#_{3}$, 3) high tone with passaggio: D$\#_{3}$. F0 of the target tones was almost the same with and without passaggio in both the tenor and baritone groups. Both non-passaggio and passaggio vocalization were much louder than easy phonation, and passaggio was louder than non-passaggio vocalization (statistically significant particularly in the baritone singers). MFR of the passaggio vocalization was greater than that of the non-passaggio vocalization in both the tenor and baritone groups, but the difference was statistically significant only in the baritones. Psub of the passaggio vocalization was greater than that of the non-passaggio vocalization in both groups, but the difference was not statistically significant in the tenors.


Acoustic Analysis of Classically Trained Western Singers (서양 음악을 전공으로 하는 성악인의 음향학적 분석)

  • 정성민
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.10 no.2 / pp.124-129 / 1999
  • Background and Objectives: Classical singers are capable of masking abnormalities owing to their high level of training and may present with apparent technical deficits rather than obvious dysfunction. Therefore, some variations from expected normal laryngeal behavior may be present in trained classical singers. Consequently, it is important for otolaryngologists to obtain a baseline assessment of their laryngeal function. Materials and Methods: Acoustic measurement, including strobovideolaryngoscopy, was performed on 50 classically trained singers and compared with data from 20 untrained adults. Results and Conclusion: Among the 50 healthy, asymptomatic classical singers, 50% had abnormal strobovideolaryngoscopic findings, yet their acoustic data were within normal limits despite the abnormal laryngeal findings. Therefore, the author recommends that classical singers undergo objective voice analysis and that their baseline data be used for accurate diagnosis of the cause of voice dysfunction in classical singers, whose baseline laryngeal behavior may be unusual.


Comparison of Pre and Post-operational Phonatory Aerodynamic Parameters in Vocal Polyp and Vocal Cord Palsy Patients (성대마비 및 성대용종 환자의 수술 전과 후의 공기역학적 변수 비교)

  • Lee, Dahye;Kim, Jaeock;Oh, JaeKoon;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.26 no.2 / pp.112-116 / 2015
  • Background and Objectives: Aerodynamic analysis is an examination that provides information on various vocalization measures indicating laryngeal efficiency. Voice evaluation using such an examination must be capable of distinguishing normal from abnormal voice. This study observes aerodynamic variables by gender in patients with voice disorders, specifically vocal cord paralysis and vocal polyp, and compares their condition before and after surgery. It thereby seeks to build a framework for establishing standard levels of aerodynamic characteristics in voice disorders. Subjects and Methods: The study included a total of 20 patients with vocal polyp or unilateral vocal cord paralysis. Patients with vocal polyp underwent laryngomicroscopic surgery, and those with vocal cord paralysis underwent vocal fold injection using Restylane. Aerodynamic analysis covered Maximum Sustained Phonation (MXPH) and Voicing Efficiency (VOEF), measured with the PAS Model 6600 (KayPENTAX, USA). Results: In MXPH, an increase in PHOT was evident in patients with vocal polyp after surgery. In patients with vocal cord paralysis, MAXDB, MEADB, DHODB, and PHOT all increased, and MEAP, PEF, and MEAF decreased after surgery. In VOEF, patients with vocal cord paralysis who underwent surgery showed increases in MAXDB, MEADB, DHODB, FET100, and ARES, but decreases in PEF and TARF. Conclusion: Overall, patients with vocal polyp and vocal cord paralysis moved closer to normal values after surgery than before in the majority of measures. This confirms that their vocal cord function improved to near-normal levels through the operations.


A "GAP-Model" based Framework for Online VVoIP QoE Measurement

  • Calyam, Prasad;Ekici, Eylem;Lee, Chang-Gun;Haffner, Mark;Howes, Nathan
    • Journal of Communications and Networks / v.9 no.4 / pp.446-456 / 2007
  • Increased access to broadband networks has led to a fast-growing demand for voice and video over IP (VVoIP) applications such as Internet telephony (VoIP), videoconferencing, and IP television (IPTV). For proactive troubleshooting of VVoIP performance bottlenecks that manifest to end-users as performance impairments such as video frame freezing and voice dropouts, network operators cannot rely on actual end-users to report their subjective quality of experience (QoE). Hence, automated and objective techniques that provide real-time or online VVoIP QoE estimates are vital. Objective techniques developed to date estimate VVoIP QoE by performing frame-to-frame peak signal-to-noise ratio (PSNR) comparisons of the original video sequence and the reconstructed video sequence obtained from the sender side and receiver side, respectively. Since processing such video sequences is time-consuming and computationally intensive, existing objective techniques cannot provide online VVoIP QoE. In this paper, we present a novel framework that can provide online estimates of VVoIP QoE on network paths without end-user involvement and without requiring any video sequences. The framework features the "GAP-model", which is an offline model of QoE expressed as a function of measurable network factors such as bandwidth, delay, jitter, and loss. Using the GAP-model, our online framework can produce VVoIP QoE estimates in terms of "Good", "Acceptable", or "Poor" (GAP) grades of perceptual quality solely from the online measured network conditions.

A Study on the Public Safety Long Term Evolution Disaster Communication Network (PS-LTE 재난안전통신망에 관한 정책 연구)

  • Kang, Heau-Jo
    • Journal of Digital Contents Society / v.16 no.1 / pp.43-51 / 2015
  • South Korea has decided to use the public safety long term evolution (PS-LTE) method to build a wireless communications network for national disasters. From 2017, the move will allow all disaster management agencies, including police, firefighters, and the military, to use multimedia information, including video, in addition to voice when carrying out rescue operations. The Ministry of Science, ICT and Future Planning said Thursday that it adopted PS-LTE as the technology for the national disaster management system following a national policy coordination meeting last month. PS-LTE integrates group communication, direct communication between terminals, and an independent base station function with existing LTE technology. Until now, disaster management agencies have used technology that allowed only voice communication; with PS-LTE, transmission of multimedia information becomes possible.

Educational Voice Storybook Application using Mobile NFC (모바일 NFC를 이용한 교육용 음성동화책 어플리케이션)

  • Jo, Seung-Geun;Kim, Tae Yong;Jang, Won-Tae
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2015.10a / pp.541-544 / 2015
  • Mobile services have evolved from voice-call-centered use, through data communication services (SMS, e-mail, web surfing, etc.), into smartphone-based everyday convenience services. According to recent world book publishing market statistics, the share of e-books is increasing while the share of paper books is shrinking. Although many e-books are being developed, combining paper books with an NFC-based mobile environment provides functions that complement the disadvantages of e-books and can meet the requirements of more diverse applications. Therefore, in this paper we design and implement an NFC-based mobile educational voice storybook application that takes development efficiency and maintainability into account by storing the managed information on a server rather than on the device.


Case Study of a Dog Vocalizing Human's Words (사람의 말을 발성하는 개의 사례 연구)

  • Kyon, Doo-Heon;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea / v.31 no.4 / pp.235-243 / 2012
  • This paper studies the characteristics and causes of the sounds in cases of a dog vocalizing human words, distinguishing passive from active vocalization. In previously reported cases, the dog was able to grasp the characteristics of its owner's voice and imitate the sound using its own vocal organs. These are cases of passive vocalization, involving temporary voice imitation without a communicative function. By contrast, a recently reported case in which a dog vocalizes words such as "Um-ma" and "Nu-na-ya" shows a vocalization pattern clearly distinguished from the prior cases. This dog repeatedly and actively vocalizes the relevant words according to circumstances, and the vocalization serves as basic communication and interaction with its owner. The reasons the dog can actively vocalize human words are judged to include its high level of intelligence and intimacy with its owner and people's active reactions to its pronunciation of those words. These results can be used in studies that investigate animal sounds with respect to vocalization potential and the feasibility of language learning.

Transmission System of TDM signal based on E-PON (E-PON 기반 TDM 신호 전송 시스템)

  • Kwon, Jeong-Gook;Jin, Geol;Park, Chun-Kwan;Song, Han-Young;Jeon, Byung-Chun;Lee, Sang-Ho
    • Journal of the Institute of Electronics Engineers of Korea TC / v.44 no.12 / pp.63-72 / 2007
  • This paper addresses the development of a system that can transfer data and TDM signals simultaneously by adding TDM pseudowire functions to an E-PON system. E-PON, one of many technologies that can realize FTTH, has emerged as a next-generation access network solution combining low-cost Ethernet technology with optical infrastructure. The TDM pseudowire service is a new voice/data conversion technology that can carry existing TDM circuit-switched voice and data over a packet-switched network. The system presented in this paper provides both data and TDM services without deteriorating QoS by adding a TDM pseudowire service module to E-PON and implementing a QoS control function. As a result, a competitive system that can transfer both data and TDM signals can be deployed.

Electromyographic evidence for a gestural-overlap analysis of vowel devoicing in Korean

  • Jun, Sun-A;Beckman, M.;Niimi, Seiji;Tiede, Mark
    • Speech Sciences / v.1 / pp.153-200 / 1997
  • In languages such as Japanese, it is very common to observe that short peripheral vowels are completely voiceless when surrounded by voiceless consonants. This phenomenon has also been reported in Montreal French, Shanghai Chinese, Greek, and Korean. Traditionally it has been described as a phonological rule that either categorically deletes the vowel or changes the [+voice] feature of the vowel to [-voice]. This analysis was supported by the observation of Sawashima (1971) and Hirose (1971) that there are two distinct EMG patterns for voiced and devoiced vowels in Japanese. Close examination of the phonetic evidence based on acoustic data, however, shows that these phonological characterizations are not tenable (Jun & Beckman 1993, 1994). In this paper, we examined the vowel devoicing phenomenon in Korean using data from EMG, fiberscopic, and acoustic recordings of 100 sentences produced by one Korean speaker. The results show that there is variability in the 'degree of devoicing' in both acoustic and EMG signals, and in the patterns of glottal closing and opening across different devoiced tokens. There seems to be no categorical difference between devoiced and voiced tokens, for either EMG activity events or glottal patterns. All of these observations support the notion that vowel devoicing in Korean cannot be described as the result of the application of a phonological rule. Rather, devoicing seems to be a highly variable 'phonetic' process, a more or less subtle variation in the specification of such phonetic metrics as degree and timing of glottal opening, or of associated subglottal pressure or intra-oral airflow associated with concurrent tone and stricture specifications. Some of the token-pair comparisons are amenable to an explanation in terms of gestural overlap and undershoot. However, the effect of gestural timing on vocal fold state seems to be a highly nonlinear function of the interaction among specifications for the relative timing of glottal adduction and abduction gestures, of the amplitudes of the overlapped gestures, of aerodynamic conditions created by concurrent oral tonal gestures, and so on. In summary, to understand devoicing, it will be necessary to examine its effect on the phonetic representation of events in many parts of the vocal tract, and at many stages of the speech chain between the motor intent and the acoustic signal that reaches the hearer's ear.
