• Title/Summary/Keyword: voice training (음성 훈련)

Search Result 280, Processing Time 0.026 seconds

An Intelligent Electronic Performance Support System for Semiconductor Testing Equipment (반도체 검사 장비를 위한 지능형 전자 성능 지원 시스템)

  • 이상용
    • Korean Journal of Cognitive Science / v.9 no.1 / pp.31-39 / 1998
  • This paper describes an electronic performance support system called HELPS (Handler Electronic Learning Performance Support) for semiconductor testing equipment. The purpose of this system is to improve operator productivity by providing just-in-time, on-the-job, multimedia-based system information for operational support, training, and knowledge-based troubleshooting and repair. HELPS is composed of an operation module and a troubleshooting module. The operation module uses multimedia and hypermedia to give users detailed and easily accessible information about the equipment. The multimedia incorporates multiple media forms, including still and video images, animations, text, graphics, and audio. The hypermedia is provided through a hierarchical information structure that offers experienced operators the specific information needed to perform a task and offers novice operators detailed system guidance and information. The troubleshooting module is composed of an integrated multimedia-supported expert system that assists operators in troubleshooting and equipment repair. After diagnosis by the expert system, multimedia advice is presented to the user as either still images with text or motion sequences with sound. HELPS was evaluated in terms of training time and troubleshooting and repair time. It improved productivity by saving more than 30% of the total time used without the system. The system has the potential to improve productivity when it is used with ICAI (Intelligent Computer Aided Instruction) and virtual reality.

Artificial Intelligence for Assistance of Facial Expression Practice Using Emotion Classification (감정 분류를 이용한 표정 연습 보조 인공지능)

  • Dong-Kyu, Kim;So Hwa, Lee;Jae Hwan, Bong
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.17 no.6 / pp.1137-1144 / 2022
  • In this study, an artificial intelligence (AI) was developed to assist with practicing facial expressions that convey emotion. The AI feeds multimodal inputs, consisting of sentences and facial images, to deep neural networks (DNNs). The DNNs calculate the similarity between the emotion predicted from a sentence and the emotion predicted from a facial image. The user practices facial expressions based on the situation described by a sentence, and the AI gives the user numerical feedback based on that similarity. A ResNet34 structure was trained on the public FER2013 data to predict emotions from facial images. To predict emotions from sentences, a KoBERT model was trained by transfer learning on the conversational speech dataset for emotion classification released publicly by AIHub. The DNN that predicts emotions from facial images reached 65% accuracy, which is comparable to human emotion classification ability. The DNN that predicts emotions from sentences achieved 90% accuracy. The performance of the developed AI was evaluated through experiments on changing facial expressions in which an ordinary person participated.
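The similarity-based feedback described in this abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure the authors used, so the cosine similarity below is an assumption, as are the function names and the premise that both networks output scores over the same seven FER2013 emotion categories.

```python
import math

# Assumption: both models score the same seven FER2013 emotion categories.
EMOTIONS = ["angry", "disgust", "fear", "happy", "sad", "surprise", "neutral"]


def softmax(logits):
    """Convert raw network scores into a probability distribution."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cosine_similarity(p, q):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q)


def feedback_score(text_logits, face_logits):
    """Hypothetical 0-100 feedback: how closely the facial emotion
    distribution matches the distribution predicted from the sentence."""
    return 100.0 * cosine_similarity(softmax(text_logits), softmax(face_logits))


if __name__ == "__main__":
    sentence = [0.1, 0.0, 0.2, 3.0, 0.1, 0.5, 0.4]   # mostly "happy"
    face = [0.2, 0.1, 0.1, 2.2, 0.3, 0.9, 0.6]       # also mostly "happy"
    print(f"feedback: {feedback_score(sentence, face):.1f} / 100")
```

Because softmax outputs are strictly positive, the score always falls between 0 and 100, which suits a practice tool that reports a single number to the user.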

Changes in Visual Function After Viewing an Anaglyph 3D Image (Anaglyph 3D입체 영상 시청 후의 시기능 변화)

  • Lee, Wook-Jin;Kwak, Ho-Won;Son, Jeong-Sik;Kim, In-Su;Yu, Dong-Sik
    • Journal of Korean Ophthalmic Optics Society / v.16 no.2 / pp.179-186 / 2011
  • Purpose: This study aimed to compare and assess changes in visual function after viewing an anaglyph 3D image. Methods: Visual functions were examined in seventy college students (mean age = 22.29 $\pm$ 2.19 years) before and after viewing a 2D image and an anaglyph 3D image with red-green glasses. The visual function tests were the von Graefe phoria test, the accommodative amplitude test by (-) lens addition, the negative relative accommodation (NRA) and positive relative accommodation (PRA) tests, the negative relative convergence (NRC) and positive relative convergence (PRC) tests, and the accommodative facility and vergence facility tests. Results: Near exophoria and accommodative amplitude were reduced after viewing the 3D image, and, although the changes were small, NRC and PRC tended to increase and decrease at near, respectively. There were no significant changes in NRA and PRA, and accommodative and vergence facility improved. Conclusions: Changes in visual function were greater after the 3D image than after the 2D image, and greater at near than at distance. In particular, the improvement in accommodative and vergence facility could be related to the accommodation and vergence shifts needed to obtain stereopsis in the 3D image. These results indicate that viewing an anaglyph 3D image may, to some extent, have an effect similar to vision training with anaglyphs.

Characteristics of Phonatory and Respiratory Control on Pitch, Loudness, Register Change in Untrained and Trained Singers (성악가와 훈련 받지 않은 일반인의 음도, 강도, 성구 변화 시 발성 및 호흡조절 특성)

  • Choi, Seong-Hee;Nam, Do-Hyun;Kim, Deak-Won;Kim, Young-Ho;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.17 no.2 / pp.115-126 / 2006
  • Background and Objectives: Training of breath support and laryngeal muscle control are important components in the development of the singing voice. The purpose of this study is to compare the characteristics of respiratory and phonatory control during pitch, loudness, and register changes in untrained males and trained male singers. Materials and Methods: Eleven untrained males and 11 trained male singers participated. Closed quotient (CQ), fundamental frequency (fo), the relative volume contribution of the rib cage (percentage rib cage, % RC), and the relative volume contribution of the abdomen (percentage abdomen, % AB) were measured during various pitch, loudness, and register tasks using /a/ vowel phonation: legato and staccato on the notes C3-D3-E3-F3-G3, crescendo and decrescendo on C3, and modal register on C3 and falsetto register on C4, using an integrated analysis system of respiration, EGG, and voice. Results: (1) When pitch increased in the legato task, loudness also increased in the untrained male group but was maintained in the trained male singers. CQ also increased in both untrained males and trained male singers, but not significantly ($p>.05$). The abdominal contribution to lung volume was significantly predominant in both inhalation and exhalation in trained male singers ($p<.05$). (2) When pitch increased in the staccato task, CQ was not significantly different in untrained males but was significantly different in trained male singers. The respiratory function of male singers was characterized by a significantly predominant abdominal contribution to lung volume in exhalation but not in inhalation ($p<.05$). (3) When loudness increased with crescendo, fo increased significantly with increasing CQ in untrained males, but fo remained relatively constant with increasing CQ in trained male singers. The respiratory function of male singers was characterized by a significantly predominant abdominal contribution to lung volume in exhalation but not in inhalation ($p<.05$). (4) Most male singers were able to change register from modal to falsetto, but untrained males were not. Thus, CQ was significantly different between modal and falsetto register in trained male singers ($p<.05$). The respiratory function of male singers was characterized by a significantly predominant abdominal contribution to lung volume in exhalation but not in inhalation ($p<.05$). Conclusion: Male singers were superior to untrained males in coordinating respiratory and phonatory control during pitch, loudness, and register changes. Implications are offered regarding how the results might be applied to voice therapy as well as singing training.

Effect of Leadership Style on the Satisfaction of Organizational Members : Focusing on the Transformational Leadership (리더십 유형이 조직구성원의 만족도에 미치는 영향에 관한 연구 - 변혁적 리더십을 중심으로 -)

  • 이광재
    • KSCI Review / v.1 no.2 / pp.47-72 / 1995
  • Most leadership theories to date were developed in the West, a predominantly Christian cultural sphere, and the prevailing research trend has been to apply them to Korea, which belongs to a different cultural sphere. At the present transitional moment, when Korea's long-standing organizational culture, particularly its bureaucratic and authoritarian aspects, is undergoing change, the primary purpose of this study was to identify leadership types suited to Korean companies, using B. M. Bass's recently proposed transformational leadership theory as a model, and then to examine how each identified type affects members' satisfaction with their superiors. Accordingly, the paper proposes six leadership types, three transformational (supportive consideration, charisma, and task-motivation stimulation) and three transactional (contingent reward, management by exception, and passive management), and confirms their reliability and validity. The empirical results show that in Korea all three transactional types (contingent reward, management by exception, and passive management) affect members' satisfaction with their superiors, which differs somewhat from Bass's claims. Among the transformational types, however, only charisma affected satisfaction; supportive consideration had only a weak influence, and task-motivation stimulation had almost none. Thus, in the way leaders command and guide subordinates, transactional elements remain strong in Korea. Although leaders will likely rely heavily on transactional leadership for the time being, a shift toward transformational leadership is expected as companies mature and employees gain more opportunities for education and training.

A study on end-to-end speaker diarization system using single-label classification (단일 레이블 분류를 이용한 종단 간 화자 분할 시스템 성능 향상에 관한 연구)

  • Jaehee Jung;Wooil Kim
    • The Journal of the Acoustical Society of Korea / v.42 no.6 / pp.536-543 / 2023
  • Speaker diarization, which labels "who spoke when?" in multi-speaker speech, has been studied with deep neural network-based end-to-end methods for labeling overlapped speech and optimizing diarization models. Most deep neural network-based end-to-end speaker diarization systems solve a multi-label classification problem that predicts the labels of all speakers active in each frame of speech. However, the performance of a multi-label model varies greatly with the threshold setting. This paper studies a speaker diarization system using single-label classification so that diarization can be performed without thresholds. The proposed model converts the speaker labels into a single label and estimates labels directly from the model output. To account for speaker label permutations during training, the proposed model uses a combination of Permutation Invariant Training (PIT) loss and cross-entropy loss. In addition, ways of adding residual connection structures to the model are studied for effective training of deep diarization models. The experiments used simulated noisy two-speaker data generated from the LibriSpeech database. Compared with the baseline model in terms of Diarization Error Rate (DER), the proposed method labels without a threshold and improves performance by about 20.7%.
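The single-label idea and the permutation-invariant loss in this abstract can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the abstract does not give the label encoding or the exact way PIT and cross-entropy are combined, so the power-set encoding and all function names here are hypothetical. Each frame's set of active speakers is mapped to one class index, and the cross-entropy is evaluated under every speaker ordering, keeping the smallest value.

```python
import math
from itertools import permutations


def to_single_label(activity):
    """Map per-speaker 0/1 activity, e.g. (1, 0), to one class index.
    With 2 speakers: 0 = silence, 1 = speaker A, 2 = speaker B, 3 = overlap."""
    return sum(bit << k for k, bit in enumerate(activity))


def from_single_label(index, n_speakers):
    """Inverse mapping: decode a class index back to per-speaker activity."""
    return tuple((index >> k) & 1 for k in range(n_speakers))


def cross_entropy(probs, labels):
    """Mean negative log-likelihood of the target class over all frames."""
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)


def pit_cross_entropy(probs, reference):
    """Cross-entropy minimized over speaker permutations.
    probs: per-frame probabilities over 2**S classes.
    reference: per-frame activity tuples of length S.
    Returns (loss, best_permutation)."""
    n_speakers = len(reference[0])
    best = None
    for perm in permutations(range(n_speakers)):
        labels = [to_single_label(tuple(frame[i] for i in perm))
                  for frame in reference]
        loss = cross_entropy(probs, labels)
        if best is None or loss < best[0]:
            best = (loss, perm)
    return best


def peaked(target, n_classes=4, confidence=0.9):
    """Toy model output: most probability mass on one class."""
    rest = (1.0 - confidence) / (n_classes - 1)
    return [confidence if k == target else rest for k in range(n_classes)]


if __name__ == "__main__":
    reference = [(1, 0), (1, 0), (1, 1), (0, 0)]
    # The toy model labels the speakers in the opposite order to the reference.
    probs = [peaked(2), peaked(2), peaked(3), peaked(0)]
    loss, perm = pit_cross_entropy(probs, reference)
    print(loss, perm)
    # Decoding needs only an argmax per frame, so no threshold is involved.
    decoded = [from_single_label(max(range(4), key=p.__getitem__), 2) for p in probs]
    print(decoded)
```

Enumerating permutations costs S! loss evaluations, which is trivial for the two-speaker setting mentioned in the abstract but grows quickly with more speakers, as does the 2**S class count.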

The Study on the improvement plan for Military combat power by the future computer (미래형컴퓨터를 이용한 군전투력 발전방안 연구)

  • Heo, Yeong Dae
    • Convergence Security Journal / v.13 no.5 / pp.57-66 / 2013
  • Predicting the pattern of future combat helps ensure success in war. The shape of future combat can be anticipated from the fighting method of the US Army in the Iraq War. That method, a sequence running from real-time information to pinpoint strikes with GPS-guided weapons, intelligence satellites, and unmanned surveillance vehicles (USVs), shows that real-time unified combat power is a key element in deciding the outcome of a war. NCW (network-centric warfare) is a paradigm in which the elements of an operation are organically connected through networking. This paper studies a plan for improving combat power with future computers such as portable computers, voice-recognition computers, and keyboardless computers. In addition, it attempts to establish a comprehensive intelligence network for the Korea Marine Corps and to apply it to combat and training.

Use of Voice Script For Speech Characterization (화법에 의한 성격표현에 활용할 소리대본 작성법)

  • Lee, Ki-Ho
    • The Journal of the Korea Contents Association / v.11 no.12 / pp.976-985 / 2011
  • The purpose of this research is to investigate the use of voice scripting for speech characterization. The ultimate goal of acting is for an actor to create a character in both the physical and the vocal sense and to present it on stage. Toward this goal, actors train themselves with various methods and techniques as well as character analysis. Most of their effort goes into improving physical and vocal expression. Vocal characterization on stage depends heavily on proper speech grounded in effective respiration and voice. This paper shows how to use sound scoring for effective vocal characterization on stage.

A Study on Classification of Waveforms Using Manifold Embedding Based on Commute Time (컴뮤트 타임 기반의 다양체 임베딩을 이용한 파형 신호 인식에 관한 연구)

  • Hahn, Hee-Il
    • Journal of the Institute of Electronics and Information Engineers / v.51 no.2 / pp.148-155 / 2014
  • In this paper, a commute time embedding is implemented by organizing patches according to a graph-based metric, and its properties are investigated by varying the number of nodes in the graph. It is shown that manifold embedding methods reveal intrinsic geometric structures when waveforms such as speech or musical instrument sound signals are embedded in a low-dimensional Euclidean space. Manifold embedding algorithms basically project only the training samples on the graph into an embedding subspace and cannot generalize the learned results to test samples. They are very effective for data clustering but are not appropriate for classification or recognition. In this paper, a commute time guided transform is adopted to enhance the generalization ability, and its performance is analyzed by applying it to the classification of six kinds of musical instrument sounds.
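For readers unfamiliar with commute time embedding, a minimal NumPy sketch of the standard spectral construction follows. It is not taken from the paper: the function name is hypothetical, and the weight matrix `W` is assumed to be a symmetric affinity graph over waveform patches that has already been built. Each node is placed so that the squared Euclidean distance between two nodes equals the expected commute time of a random walk between them.

```python
import numpy as np


def commute_time_embedding(W, dim=None):
    """Embed the nodes of a connected, undirected weighted graph.

    W   : symmetric non-negative weight (affinity) matrix, shape (n, n).
    dim : optional number of leading coordinates to keep.
    Returns an (n, n-1) array (or (n, dim)); row i is the embedding of node i.
    """
    W = np.asarray(W, dtype=float)
    degrees = W.sum(axis=1)
    volume = degrees.sum()                      # vol(G) = sum of all degrees
    laplacian = np.diag(degrees) - W            # unnormalized graph Laplacian
    eigvals, eigvecs = np.linalg.eigh(laplacian)  # eigenvalues in ascending order
    # Drop the trivial pair (eigenvalue 0, constant eigenvector); this step
    # assumes the graph is connected, so exactly one eigenvalue is zero.
    eigvals, eigvecs = eigvals[1:], eigvecs[:, 1:]
    coords = np.sqrt(volume) * eigvecs / np.sqrt(eigvals)
    return coords if dim is None else coords[:, :dim]


if __name__ == "__main__":
    # Path graph 0 - 1 - 2 with unit weights.
    W = np.array([[0.0, 1.0, 0.0],
                  [1.0, 0.0, 1.0],
                  [0.0, 1.0, 0.0]])
    X = commute_time_embedding(W)
    print(np.sum((X[0] - X[1]) ** 2))   # commute time between nodes 0 and 1
    print(np.sum((X[0] - X[2]) ** 2))   # commute time between nodes 0 and 2
```

Keeping only the first `dim` columns retains the directions with the smallest Laplacian eigenvalues, which contribute most to the commute time. The embedding is defined only for nodes already in the graph, which is the generalization limitation the abstract describes.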

Development of Web3D-based Virtual Reality System for Hydrogen Station (웹 3D 기술을 사용한 수소충전소 가상체험교육시스템 제작)

  • Yoon, Jong-Chul;Kwon, Ji-Yong;Lee, In-Kwon
    • Journal of the Korea Computer Graphics Society / v.15 no.2 / pp.35-40 / 2009
  • In this paper, we present a Web3D-based virtual reality (VR) system for safety education about hydrogen stations. Hydrogen is currently considered a next-generation energy source, and the hydrogen station is a core piece of infrastructure in the hydrogen industry. However, non-experts have little opportunity to experience the safety equipment in a hydrogen station, because it is generally a restricted area. Therefore, using an event-driven method, we develop a VR system that conveys information about hydrogen stations to non-experts. With our system, users experience the safety concerns of a hydrogen station and also learn about hydrogen energy.
