• Title/Summary/Keyword: Text-to-Speech

Search Result 505, Processing Time 0.028 seconds

Development of 3D Power Transformer Maintenance Application (3차원 그래픽을 이용한 전력용 변압기 유지보수 프로그램 개발)

  • Lee, Yil-Hwa;Park, Chang-Hyun;Jang, Gil-Soo;Cho, Kyung-Rae
    • Proceedings of the KIEE Conference
    • /
    • 2005.07a
    • /
    • pp.114-116
    • /
    • 2005
  • This paper presents a maintenance application for power transformer. High quality maintenance and accurate diagnosis are essential for all transformers, especially for older ones. The developed application provides maintenance guides for dry type transformer and oil-filled transformer to prevent any malfunctions and to lengthen the lifetime of transformers. Based on windows application, TTS (text to speech) and 3D graphics technologies have been used to enhance the user friendly interface. Developed application is helpful for both expert and novice operators at substation.

  • PDF

A Neglected Factor of French Prosody: The peak variation at the end of rhythmic groups

  • Claude Roberge;Noriko Hoki
    • MALSORI
    • /
    • no.31_32
    • /
    • pp.207-221
    • /
    • 1996
  • The aim of this research is to study the functioning of the peak variations at the end of the rhythmic groups in spoken french. For this purpose, the text '60 Voix, 60 Exercices', published by Hachette in 1988, was selected. This textbook is based on interviews with 60 persons who briefly speak in a monolog from on a subject of their choice. 500 hundred different groups were selected and submitted to the auditory judgment of six informants, three French natives and three Japanese natives who had studied French for at least three years. It was found, first, that there exists a tendency to a change of either rising or tolling intonation compared with the flat one, and second, that the rising intonation obtains a flirty good score of frequency compared with the two other, ones even if the examined sentences do not pertain to the strict classical types of interrogative or exclamative sentences or dialogs, where affectivity is so often an important factor.

  • PDF

Ubiquitous Car Maintenance Services Using Augmented Reality and Context Awareness (증강현실을 활용한 상황인지기반의 편재형 자동차 정비 서비스)

  • Rhee, Gue-Won;Seo, Dong-Woo;Lee, Jae-Yeol
    • Korean Journal of Computational Design and Engineering
    • /
    • v.12 no.3
    • /
    • pp.171-181
    • /
    • 2007
  • Ubiquitous computing is a vision of our future computing lifestyle in which computer systems seamlessly integrate into our everyday lives, providing services and information in anywhere and anytime fashion. Augmented reality (AR) can naturally complement ubiquitous computing by providing an intuitive and collaborative visualization and simulation interface to a three-dimensional information space embedded within physical reality. This paper presents a service framework and its applications for providing context-aware u-car maintenance services using augmented reality, which can support a rich set of ubiquitous services and collaboration. It realizes bi-augmentation between physical and virtual spaces using augmented reality. It also offers a context processing module to acquire, interpret and disseminate context information. In particular, the context processing module considers user's preferences and security profile for providing private and customer-oriented services. The prototype system has been implemented to support 3D animation, TTS (Text-to-Speech), augmented manual, annotation, and pre- and post-augmentation services in ubiquitous car service environments.

Subtitle Automatic Generation System using Speech to Text (음성인식을 이용한 자막 자동생성 시스템)

  • Son, Won-Seob;Kim, Eung-Kon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.1
    • /
    • pp.81-88
    • /
    • 2021
  • Recently, many videos such as online lecture videos caused by COVID-19 have been generated. However, due to the limitation of working hours and lack of cost, they are only a part of the videos with subtitles. It is emerging as an obstructive factor in the acquisition of information by deaf. In this paper, we try to develop a system that automatically generates subtitles using voice recognition and generates subtitles by separating sentences using the ending and time to reduce the time and labor required for subtitle generation.

A Study on the Integration of Similar Sentences in Atomatic Summarizing of Document (자동초록 작성시에 발생하는 유사의미 문장요소들의 통합에 관한 연구)

  • Lee, Tae-Young
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.34 no.2
    • /
    • pp.87-115
    • /
    • 2000
  • The effects of the Case, Part of Speech, Word and Clause Location, Word Frequency etc. were studied in discriminating the similar sentences of the Korean text. Word Frequency was much related to the discrimination of similarity and Tilte word and Functional Clause were little, but the others were not. The cosine coefficient and Salton'similarity measurement are used to measure the similarity between sentences. The change of clauses between each sentence is also used to unify the similar sentences into a represenative sentence.

  • PDF

Issues in Chinese prosody: conceptual foundations of a linguistically-motivated text-to-speech system for Mandarin

  • Lavin, Richard S.
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2002.02a
    • /
    • pp.259-270
    • /
    • 2002
  • I examine various controversial aspects of Chinese prosody-tone structure, syllable structure, stress, and intonation-and stress the need to view all of these as interacting systems, aspects of a hierarchical prosodic structure. 1 examine various proposals at these various levels of the hierarchy and suggest which are most appropriate. Specifically, 1 suggest the adoption of Bao's version of syllable and tone, and Chen's account of stress. As for intonation, it is still not possible to make any definitive claims regarding an optimal model, but I examine work done by Kratochvil, Shih, and Carding et al, and suggest promising directions for future work.

  • PDF

Development of a 3-D Visualization Application for Management of Substation Equipment

  • Park, Chang-Hyun
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.23 no.3
    • /
    • pp.38-44
    • /
    • 2009
  • This paper presents a new windows application based on 3-D graphics and Text-To-Speech (TTS) for effective management of substation equipment. When problems in a power system occur, inexperienced power system operators may have difficulty in understanding the situation as well as finding suitable countermeasures quickly. This paper addresses an effective scheme to visualizing power system equipment under normal and abnormal conditions using 3-D graphics and animations. In addition, the state variations and the order of maintenance priority of substation equipment are represented by TTS and intuitive methods. The proposed system can help power system operators to more quickly understand the state of power system equipment, and it can provide operators with the suitable countermeasures for minimizing damage caused by equipment problems.

Improving the Performance of Korean Text Chunking by Machine learning Approaches based on Feature Set Selection (자질집합선택 기반의 기계학습을 통한 한국어 기본구 인식의 성능향상)

  • Hwang, Young-Sook;Chung, Hoo-jung;Park, So-Young;Kwak, Young-Jae;Rim, Hae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.9
    • /
    • pp.654-668
    • /
    • 2002
  • In this paper, we present an empirical study for improving the Korean text chunking based on machine learning and feature set selection approaches. We focus on two issues: the problem of selecting feature set for Korean chunking, and the problem of alleviating the data sparseness. To select a proper feature set, we use a heuristic method of searching through the space of feature sets using the estimated performance from a machine learning algorithm as a measure of "incremental usefulness" of a particular feature set. Besides, for smoothing the data sparseness, we suggest a method of using a general part-of-speech tag set and selective lexical information under the consideration of Korean language characteristics. Experimental results showed that chunk tags and lexical information within a given context window are important features and spacing unit information is less important than others, which are independent on the machine teaming techniques. Furthermore, using the selective lexical information gives not only a smoothing effect but also the reduction of the feature space than using all of lexical information. Korean text chunking based on the memory-based learning and the decision tree learning with the selected feature space showed the performance of precision/recall of 90.99%/92.52%, and 93.39%/93.41% respectively.

A Study on Verification of Back TranScription(BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)

  • Park, Chanjun;Seo, Jaehyung;Lee, Seolhwa;Moon, Hyeonseok;Eo, Sugyeong;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.11
    • /
    • pp.109-117
    • /
    • 2021
  • Recently, the use of speech-based interfaces is increasing as a means for human-computer interaction (HCI). Accordingly, interest in post-processors for correcting errors in speech recognition results is also increasing. However, a lot of human-labor is required for data construction. in order to manufacture a sequence to sequence (S2S) based speech recognition post-processor. To this end, to alleviate the limitations of the existing construction methodology, a new data construction method called Back TranScription (BTS) was proposed. BTS refers to a technology that combines TTS and STT technology to create a pseudo parallel corpus. This methodology eliminates the role of a phonetic transcriptor and can automatically generate vast amounts of training data, saving the cost. This paper verified through experiments that data should be constructed in consideration of text style and domain rather than constructing data without any criteria by extending the existing BTS research.

Prototype Design and Development of Online Recruitment System Based on Social Media and Video Interview Analysis (소셜미디어 및 면접 영상 분석 기반 온라인 채용지원시스템 프로토타입 설계 및 구현)

  • Cho, Jinhyung;Kang, Hwansoo;Yoo, Woochang;Park, Kyutae
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.203-209
    • /
    • 2021
  • In this study, a prototype design model was proposed for developing an online recruitment system through multi-dimensional data crawling and social media analysis, and validates text information and video interview in job application process. This study includes a comparative analysis process through text mining to verify the authenticity of job application paperwork and to effectively hire and allocate workers based on the potential job capability. Based on the prototype system, we conducted performance tests and analyzed the result for key performance indicators such as text mining accuracy and interview STT(speech to text) function recognition rate. If commercialized based on design specifications and prototype development results derived from this study, it may be expected to be utilized as the intelligent online recruitment system technology required in the public and private recruitment markets in the future.