• Title/Summary/Keyword: 시스템 합성

Search Result 2,358, Processing Time 0.033 seconds

User-based Relevance and Irrelevance Criteria during the Task Pursuing of Middle School Students (중학생 학습과제 수행을 위한 정보탐색과정에서 적합성 및 비적합성에 관한 연구 - 에듀넷 사이트를 중심으로 -)

  • Kim, Yang-Woo;Park, Sung Jae
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.48 no.3
    • /
    • pp.55-70
    • /
    • 2014
  • Although a significant number of studies have been conducted in user-based relevance criteria, a need for further research still remains. The rational is associated with the following inadequacies: (1) research on young user groups, (2) research on the Web environment with multimedia resources, (3) research on the irrelevance criteria and implications to improve related systems and services. Accordingly, this study identified user - based relevance and irrelevance criteria, examining 40 middle school third grader students who use KERIS Edunet site. The results identified 16 relevance criteria and 8 irrelevance criteria. Major implications related to information system and service improvements.

Development of Text-to-Speech System for PC (PC용 Text-to-Speech 시스템 개발)

  • Choi Muyeol;Hwang Cholgyu;Kim Soontae;Kim Junggon;Yi Sopae;Jang Seokbok;Pyo Kyungnan;Ahn Hyesun;Kim Hyung Soon
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.41-44
    • /
    • 1999
  • 본 논문에서는 PC 응용을 위한 고음질의 한국어 text-to-speech(TTS) 합성 시스템을 개발하였다. 개발된 시스템의 합성방식으로는 음의 고저 조절, 인접음 사이의 연결 처리 및 음색제어 등에서 기존의 PSOLA 방식에 비해 장점을 가지는 정현파 모델 기반의 방식을 채택하였고, 자연스러운 운율 모델링을 위하여 통계적 기법중의 하나인 Classification and regression tree(CART) 방법을 사용하였다. 또한 음소 경계의 불연속성 문제를 줄이기 위한 합성단위로 초성-중성 및 종성 단위를 사용하였고, 다양한 음색표현이 가능하도록 음색제어 기능을 갖추었다. 그리고, 표준 Speech Application Program Interface(SAPI)를 준용한 TTS engine 형태로 구현함으로써 PC 상에서의 응용 프로그램 개발 편의성을 높였다. 합성음의 청취평가 결과 음질의 우수성 및 음색제어 기능의 유효성을 확인할 수 있었다.

  • PDF

Realistic Avatar Face Generation Using Shading Mechanism (음영합성 기법을 이용한 실사형 아바타 얼굴 생성)

  • Park Yeon-Chool
    • Journal of Internet Computing and Services
    • /
    • v.5 no.5
    • /
    • pp.79-91
    • /
    • 2004
  • This paper proposes avatar face generation system that uses shading mechanism and facial features extraction method of facial recognition. Proposed system generates avatar face similar to human face automatically using facial features that extracted from a photo. And proposed system is an approach which compose shade and facial features. Thus, it has advantages that can make more realistic avatar face similar to human face. This paper proposes new eye localization method, facial features extraction method, classification method for minimizing retrieval time, image retrieval method by similarity measure, and realistic avatar face generation method by mapping facial features with shaded face pane.

  • PDF

Probabilistic Reservoir Inflow Forecast Using Nonparametric Methods (비모수적 기법에 의한 확률론적 저수지 유입량 예측)

  • Lee, Han-Goo;Kim, Sun-Gi;Cho, Yong-Hyon;Chong, Koo-Yol
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2008.05a
    • /
    • pp.184-188
    • /
    • 2008
  • 추계학적 시계열 분석은 크게 수문자료의 장기간 합성과 실시간 예측으로 구분해 볼 수 있다. 장기간 합성은 주로 수문자료의 추계적 특성을 반영한 수자원 시스템의 운영율 개발에 이용되어 왔다. 반면에 실시간 예측은 수자원 시스템의 순응적(adaptive) 관리에 적용되고 있다. 두 개념의 차이로 전자는 시계열 자료를 합성하여 발생 가능한 모든 수문조합을 얻고자 하는 것이라면 후자는 전 시간의 수문량을 조건으로 하는 다음 시간의 값을 순응적으로 예측하는 것이라 할 수 있다. 수문자료의 합성과 예측에는 크게 결정론적, 확률론적 방법의 두 가지 대별될 수 있다. 결정론적 모델링 방법에는 인공신경망이나 Fuzzy 기법 등을 이용할 수 있으며, 확률론적 방법에는 ARMAX 등의 모수적 기법과 k-NN(k-nearest neighbor bootstrap resampling), KDE(kernel density estimates), 추계학적 인공신경망 등의 비모수적 기법으로 분류할 수 있다. 본 연구에서는 대표적 비모수적 기법인 k-NN를 이용하여 충주댐을 대상으로 월 및 일 유입량 자료의 예측 정도를 살펴보았다. 전 시간 관측치를 조건으로 하는 다음 시간의 조건부 확률분포를 구하여 평균값을 계산한 후 관측치와 비교함으로써 모형의 정도를 살펴보았다. 그리고 실시간 저수지 운영에 이 기법의 활용성과 장단점도 살펴보았다. 모형개발 절차로 모형의 보정을 거쳐 검증을 실시하였다. 결론적으로 월 및 일 유입량 예측에 k-NN 기법이 실무적으로 적용될 수 있었으며, 장점으로는 k-NN 기법이 다른 기법보다 모델링 절차가 비교적 쉬워 저수지 운영 최적화 등 타 시스템과의 연계에 수월함이 인식되었다.

  • PDF

Applications of Triple Controlled Type DDFS-driven PLL Frequency Synthesizer to Broadband Wireless Systems (3중조절 DDFS 구동 PLL 주파수 합성기의 광대역 무선 통신시스템에 응용)

  • Heung-Gyoon Ryu;Byeong-Rok An
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.13 no.6
    • /
    • pp.546-551
    • /
    • 2002
  • In this paper, a triple controlled type DDFS-driven PLL frequency synthesizer with reduced complexity is used to show its applications for broadband wireless communication systems by frequency synthesis control. Since the proposed DDFS-driven PLL synthesizer is very simplified to use only phase accumulator in DDFS, it improves the switching speed and power consumption than the conventional DDFS-driven PLL frequency synthesizer. It is appropriate for applications with requirements of broadband, low-power consumption and high switching speed, since the proposed synthesizer can cover a wide range of frequency bands by the triple frequency control parameters. Method and results of frequency control parameters assignment are shown for the several frequency bands applications such as GSM, IMT-2000, Bluetooth and PCS system.

저궤도 중형급 위성의 전자파 설계

  • Kim, Tae-Yun;Jang, Jae-Ung;Jang, Gyeong-Deok;Mun, Gwi-Won
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.37 no.2
    • /
    • pp.133.1-133.1
    • /
    • 2012
  • 전자파설계는 위성의 전력시스템, 통신시스템 뿐만 아니라 구조체 등 위성시스템 전반에 걸쳐서 종합적으로 고려가 되어야 하며, 이를 위해서는 개발 초기단계에서부터 시스템 설계에 반영되어야 한다. 위성시스템의 상세 설계가 끝난 후에는 시스템에 구현된 전자파 설계의 적합성을 검증하여야 하며 이는 해석 및 시험을 통해 이루어진다. 본 논문에서는 저궤도 중형급 위성이 우주환경에서 전자파적합성을 이루기 위한 설계 기법 및 전자파환경에 대한 적합성 검증과정에 대해서 다루고 있다. 저궤도 중형급 위성시스템에 대하여 구조물의 전자기적 특성을 정의하는 것부터 우주환경에서 위성의 RF호환성에 이르기까지 부품단위에서부터 시스템 수준까지의 전자파 설계 기준과 각 단계별로 전자파적합성을 검증하기 위한 방법 및 절차에 대해서 기술한다.

  • PDF

Personalized Service Composition and Provision System Based on User-centered Scenarios (사용자 중심의 시나리오에 기반한 개인화 서비스 합성 및 제공 시스템)

  • Jung, Jong-Yun;Ryu, Ki-Yeol;Roh, Byeong-Hee
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.9
    • /
    • pp.649-660
    • /
    • 2009
  • To deliver services suitable to user's situation in the ubiquitous environment, the researches on realizing new services by combining existing ones have been continuously increased. But, it is difficult to provide the personalized services to each user located in the ubiquitous service space where multiple users coexist. In this paper, we propose a service composition model based on user-centered service scenarios and a system for providing personalized services through finding services suitable to user's situation and combining them. The proposed system supports a simple service discovery protocol for finding services from heterogeneous smart objects with limited computing power in the ubiquitous environment. The system aggregates and stores various service scenarios and data derived from users and executes the appropriate services for users. We design and implement a prototype system for the mobile personal device.

Implementation of an Optimal SIMD-based Many-core Processor for Sound Synthesis of Guitar (기타 음 합성을 위한 최적의 SIMD기반 매니코어 프로세서 구현)

  • Choi, Ji-Won;Kang, Myeong-Su;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.1
    • /
    • pp.1-10
    • /
    • 2012
  • Improving operating frequency of processors is no longer today's issues; a multiprocessor technique which integrates many processors has received increasing attention. Currently, high-performance processors that integrate 64 or 128 cores are developing for large data processing over 2, 4, or 8 processor cores. This paper proposes an optimal many-core processor for synthesizing guitar sounds. Unlike the previous research in which a processing element (PE) was assigned to support one of guitar strings, this paper evaluates the impacts of mapping different numbers of PEs to one guitar string in terms of performance and both area and energy efficiencies using architectural and workload simulations. Experimental results show that the maximum area energy efficiencies were achieved at PEs=24 and 96, respectively, for synthesizing guitar sounds with sampling rate of 44.1kHz and 16-bit quantization. The synthesized sounds were very similar to original guitar sounds in their spectra. In addition, the proposed many-core processor was 1,235 and 22 times better than TI TMS320C6416 in area and energy efficiencies, respectively.

One-shot multi-speaker text-to-speech using RawNet3 speaker representation (RawNet3를 통해 추출한 화자 특성 기반 원샷 다화자 음성합성 시스템)

  • Sohee Han;Jisub Um;Hoirin Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.67-76
    • /
    • 2024
  • Recent advances in text-to-speech (TTS) technology have significantly improved the quality of synthesized speech, reaching a level where it can closely imitate natural human speech. Especially, TTS models offering various voice characteristics and personalized speech, are widely utilized in fields such as artificial intelligence (AI) tutors, advertising, and video dubbing. Accordingly, in this paper, we propose a one-shot multi-speaker TTS system that can ensure acoustic diversity and synthesize personalized voice by generating speech using unseen target speakers' utterances. The proposed model integrates a speaker encoder into a TTS model consisting of the FastSpeech2 acoustic model and the HiFi-GAN vocoder. The speaker encoder, based on the pre-trained RawNet3, extracts speaker-specific voice features. Furthermore, the proposed approach not only includes an English one-shot multi-speaker TTS but also introduces a Korean one-shot multi-speaker TTS. We evaluate naturalness and speaker similarity of the generated speech using objective and subjective metrics. In the subjective evaluation, the proposed Korean one-shot multi-speaker TTS obtained naturalness mean opinion score (NMOS) of 3.36 and similarity MOS (SMOS) of 3.16. The objective evaluation of the proposed English and Korean one-shot multi-speaker TTS showed a prediction MOS (P-MOS) of 2.54 and 3.74, respectively. These results indicate that the performance of our proposed model is improved over the baseline models in terms of both naturalness and speaker similarity.

Spectral Shape Invariant Real-time Voice Change System (스펙트럼 형태 불변 실시간 음성 변환 시스템)

  • Kim Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.1
    • /
    • pp.48-52
    • /
    • 2005
  • In this paper, the spectral shape invariant real-time voice change method is proposed to change one's voice to mechanical voice. For this purpose, LPC analysis and synthesis is used to maintain the spectraum of voice and the pitch of synthesis speech can be changed freely. In the proposed method, gain matching method is applied to excitation signal generator to make the changed voice natural to hear. In order to evaluate the performance of the proposed method, voice change experiments were conducted. Experimental results showed that original speech signal is changed to the mechanical voice signal in which context of the speaker's voice is conveyed correctly in spite of drastic change of pitch. The system is implemented using TI TMS320C6711DSK board to verify the system runs in real time.