• Title/Summary/Keyword: 언어평가

Search Result 1,675, Processing Time 0.035 seconds

UA Tree-based Reduction of Speech DB in a Large Corpus-based Korean TTS (대용량 한국어 TTS의 결정트리기반 음성 DB 감축 방안)

  • Lee, Jung-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.7
    • /
    • pp.91-98
    • /
    • 2010
  • Large corpus-based concatenating Text-to-Speech (TTS) systems can generate natural synthetic speech without additional signal processing. Because the improvements in the natualness, personality, speaking style, emotions of synthetic speech need the increase of the size of speech DB, it is necessary to prune the redundant speech segments in a large speech segment DB. In this paper, we propose a new method to construct a segmental speech DB for the Korean TTS system based on a clustering algorithm to downsize the segmental speech DB. For the performance test, the synthetic speech was generated using the Korean TTS system which consists of the language processing module, prosody processing module, segment selection module, speech concatenation module, and segmental speech DB. And MOS test was executed with the a set of synthetic speech generated with 4 different segmental speech DBs. We constructed 4 different segmental speech DB by combining CM1(or CM2) tree clustering method and full DB (or reduced DB). Experimental results show that the proposed method can reduce the size of speech DB by 23% and get high MOS in the perception test. Therefore the proposed method can be applied to make a small sized TTS.

Location Inference of Twitter Users using Timeline Data (타임라인데이터를 이용한 트위터 사용자의 거주 지역 유추방법)

  • Kang, Ae Tti;Kang, Young Ok
    • Spatial Information Research
    • /
    • v.23 no.2
    • /
    • pp.69-81
    • /
    • 2015
  • If one can infer the residential area of SNS users by analyzing the SNS big data, it can be an alternative by replacing the spatial big data researches which result from the location sparsity and ecological error. In this study, we developed the way of utilizing the daily life activity pattern, which can be found from timeline data of tweet users, to infer the residential areas of tweet users. We recognized the daily life activity pattern of tweet users from user's movement pattern and the regional cognition words that users text in tweet. The models based on user's movement and text are named as the daily movement pattern model and the daily activity field model, respectively. And then we selected the variables which are going to be utilized in each model. We defined the dependent variables as 0, if the residential areas that users tweet mainly are their home location(HL) and as 1, vice versa. According to our results, performed by the discriminant analysis, the hit ratio of the two models was 67.5%, 57.5% respectively. We tested both models by using the timeline data of the stress-related tweets. As a result, we inferred the residential areas of 5,301 users out of 48,235 users and could obtain 9,606 stress-related tweets with residential area. The results shows about 44 times increase by comparing to the geo-tagged tweets counts. We think that the methodology we have used in this study can be used not only to secure more location data in the study of SNS big data, but also to link the SNS big data with regional statistics in order to analyze the regional phenomenon.

The Impact of the Argument-based Modeling Strategy using Scientific Writing implemented in Middle School Science (중학교 과학수업에 적용한 글쓰기를 활용한 논의-기반 모델링 전략의 효과)

  • Cho, Hey Sook;Nam, Jeonghee
    • Journal of The Korean Association For Science Education
    • /
    • v.34 no.6
    • /
    • pp.583-592
    • /
    • 2014
  • The purpose of this study is to investigate the impact of argument-based modeling strategy using scientific writing on student's modeling ability. For this study, 66 students (three classes) from the 7th grade were selected and of these, 43 students (two classes) were assigned to two experimental groups while the other 23 students (one class) were assigned to comparative group. In the experimental groups, one group (22 students) was Argument-based multimodal Representation and Modeling (AbRM), and the other group (21 students) was Argument-based Modeling (AbM). Modeling ability consisted of identifying the problem, structuring of scientific concepts, adequacy of claim and evidence and index of multimodal representation. As for the modeling ability, AbRM group scored significantly higher than the other groups, AbM group was significantly higher than comparative group. The four sub-elements of modeling ability in the AbRM group was significantly higher than the other groups statistically and AbM group scored significantly higher than comparative group. From these results, the argument-based modeling strategy using scientific writing was effective on students' modeling ability. Students organized or expressed the model and evaluated or modified it through the process of argument-based modeling using scientific writing and the exchange of opinions with others by scientific language as argument and writing.

An Investigation into the Equivalence of Three Pictures for Creative Story Writing: 'Dog Owners', 'Lost Dog', and 'Overslept' (창의적 이야기 작문용 세 그림의 동형 조사: 'Dog Owners,' 'Lost Dog,' 'Overslept')

  • Suh, Heejung;Bae, Jungok
    • Journal of Gifted/Talented Education
    • /
    • v.26 no.4
    • /
    • pp.699-719
    • /
    • 2016
  • Alternate pictures that are proven to be equivalent are in high demand to assess creative thinking and language skills. This study aimed to investigate the equivalence of three pictures ('Dog owners,' 'Lost Dog,' and 'Overslept') recently developed for use in a creative writing task. Middle school students (N=183) wrote a story in English based on one of the three prompts distributed randomly. Four writing features (fluency, syntactic complexity, lexical diversity, and temporality) were analyzed with Coh-Metrix and MANCOVA. The three prompts were largely equivalent in their capacity to detect differences among writers in all the features of writing. The difficulty levels of the three prompts, however, were not necessarily the same. Two prompts, Dog Owners and Lost Dog, were verified as equivalent prompts, and therefore, they are recommended as alternate forms to assess creative language skills in repeated measurements. The Overslept prompt had greater facility in eliciting diverse words and more temporal connectives in composing stories. The differential difficulty shown among the prompts suggests that the validity of using different picture versions in repeated assessment remains questionable unless those versions undergo equivalence verification.

A Study on Material Expression and Symbolism of Carlo Scarpa's Garden Details (카를로 스카르파(Carlo Scarpa)의 정원 디테일에 나타난 재료 표현기법 및 상징성 연구)

  • Lee, Hyung-Sook
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.36 no.2
    • /
    • pp.54-60
    • /
    • 2018
  • The purpose of this study is to analyze the garden details of Carla Scarpa in order to understand his selection and composition of materials, detailing style and symbolism of the spaces. Literature review and a field trip were conducted for the study and the results are as follows. First, Scarpa used vernacular materials such as Murano glass and Istrian limestone, and juxtaposed various materials using contrast of color and texture. His mixed uses of traditional and modern materials shows the passage of time. Second, he create his own detail style such as ziggurat and geometric motif, which make the garden space to look more interesting and rich. Scarpa respected local craftsmanship like glass design and used textile design style such as overlaying. Third, symbolic uses of water features help make narrative and poetic gardens. Scarpa's unique detail style and respects for traditional craftsmanship provide lessons on how to interpretate traditional design style in modern garden.

Research on Oral Status of Hearing Impaired Youth by Using QLF-D (QLF-D를 이용한 청각장애 청소년의 구강상태에 관한 조사)

  • Kim, Chang-Suk
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.9
    • /
    • pp.305-311
    • /
    • 2013
  • This study analyzed the oral status after recording the images by using QLF-D with targets of 38 youth people with hearing impairment and hearing language impairment. In order to investigate the state of oral hygiene, plaque index (O'Leary index) and contents of investigation of the state of the teeth included the number of sound teeth, the number of caries teeth, dental caries experience and the number of filling teeth. The following results were obtained. First, women lacked the management on plaque and had more caries teeth compared with men. In terms of impairment classification, subjects with both hearing and language impairment lacked the management on plaque and had more caries teeth. Second, subjects who did not get an oral exam for one year had more caries teeth. Oral hygiene score was the highest with the brushing time for 3-4 minutes. The number of sound teeth was increased as the brushing time was increased. In addition, the oral hygiene management time was the highest when cleaning the teeth, gums and tongue at the same time. Third, it was shown that the satisfaction of oral health education by using the new equipment was high. As a result of this study, in order to improve the oral health level of impaired students, they shall be trained to manage their teeth by themselves and educated to increase their motivation and practice. Thus, it is thought that various approaches which are differentiated from existing methods are required to be tried.

A Study on Rhetorical Expression of Public Information Design -Focus on Information Design Case for Seoul Public Transportation- (공공정보디자인의 수사학적 표현에 관한 연구 - 서울시 대중교통 정보디자인 사례를 중심으로 -)

  • Yang, Seung-Ju
    • Archives of design research
    • /
    • v.18 no.3 s.61
    • /
    • pp.95-104
    • /
    • 2005
  • Although the volume and complexity of available information have increased, our ability to process such volume of complex information has not been met with corresponding development. Information designers have been given the responsibility to address such unbalanced progress by developing effective visual systems to deliver and communicate such information to the masses in a manner that is quick and easy to process and understand. This study originated in recognition of these issues. This study seeks to find a solution to these issues in rhetorics in order to proliferate visual communications in recognition of the increasing importance of information and visual communication. Rhetorics, a field of study with a long history of analyzing the delivery of communication, provides numerous possibilities for the re-establishment of importance placed on visual information communication. Included in this study are (i) a thorough analysis of the principals of expression and logic offered by rhetorics, as applicable to information design (ii) a proposal to the solution to the above-mentioned issues encompassing the rhetoric process and methods of expression of information design and (iii) the practical application of these design principals to social activities. In order to provide an example of the practical use of the rhetoric methodology Presented in this study, we applied the rhetoric methodology to the 'Information Design for Public Transportation of Seoul.' and developed a new map and a guidebook. The raw data necessary for the foregoing were obtained through the analysis of the information designs that are currently in use in connection with mass transportation in Seoul and the survey evaluation conducted among Seoul residents. We modulated the infrastructure of Seoul by using 48 TAZs, computed the routes that are most likely to be used, and proposed the predictable information analysis process. The design proposed on this study encompasses color coding and use of combined information, and application of style and sequential information analysis process.

  • PDF

The Design and Development of Online System to Improve Undergraduate Students' Competency (대학생의 역량개발을 위한 온라인 시스템 설계 및 개발)

  • Moon, Yun-Kyoung;Lee, Kyoung-Jae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.6
    • /
    • pp.3807-3818
    • /
    • 2015
  • The objective of this study is to develop an on-line system for improving undergraduate students' competency development. After drawing elements necessary for the competency development such as assessment and planning, competency development, analysis of competency assessment, portfolio, analysis of job ability and community, based on the literature research related to competency and the analysis of the existing system, the direction of the system design was set up. The system was developed by using Microsoft Windows operating system in Windows server, ORACLE ver.10 as its database management system, and JSP and JAVA as its programing language. Reviewing errors and improvements of the system, it was modified and complemented. In order to examine the content functional utilization of the final competency development system, the utilization was verified. The competency development system for undergraduate students can be used as on-line space filled with the internalization of knowledge, self-directed competency development, convenience of record management and interactions between students-professors-alumna, owing to its functions such as boosting competency activities, cultivating career-pioneering ability and introspecting. When it is rare to find researches on the competency development system for undergraduate students, it is expected to be helpful to the development of competency education and the career education for undergraduate students as a new alternative for the competency development.

Implementation and Validation of EtherCAT Support in Integrated Development Environment for Synchronized Motion Control Application (동기 모션 제어 응용을 위한 통합개발환경의 EtherCAT 지원 기능 구현 및 검증)

  • Lee, Jongbo;Kim, Chaerin;Kim, Ikhwan;Kim, Youngdong;Kim, Taehyoun
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.38 no.2
    • /
    • pp.211-218
    • /
    • 2014
  • Recently, software-based programmable logic controller (PLC) systems, which are implemented in standard PLC languages on general hardware, are gaining popularity because they overcome the limitations of classical hardware PLC systems. Another noticeable trend is that the use of integrated development environment (IDE) is becoming important. IDEs can help developers to easily manage the growing complexity of modern control systems. Furthermore, industrial Ethernet, e.g. EtherCAT, is becoming widely accepted as a replacement for conventional fieldbuses in the distributed control domain because it offers favorable features such as short transmission delay, high bandwidth, and low cost. In this paper, we implemented the extension of open source IDE, called Beremiz, for developing EtherCAT-based real-time, synchronized motion control applications. We validated the EtherCAT system management features and the real-time responsiveness of the control function by using commercial EtherCAT drives and evaluation boards.

Detecting Errors in POS-Tagged Corpus on XGBoost and Cross Validation (XGBoost와 교차검증을 이용한 품사부착말뭉치에서의 오류 탐지)

  • Choi, Min-Seok;Kim, Chang-Hyun;Park, Ho-Min;Cheon, Min-Ah;Yoon, Ho;Namgoong, Young;Kim, Jae-Kyun;Kim, Jae-Hoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.7
    • /
    • pp.221-228
    • /
    • 2020
  • Part-of-Speech (POS) tagged corpus is a collection of electronic text in which each word is annotated with a tag as the corresponding POS and is widely used for various training data for natural language processing. The training data generally assumes that there are no errors, but in reality they include various types of errors, which cause performance degradation of systems trained using the data. To alleviate this problem, we propose a novel method for detecting errors in the existing POS tagged corpus using the classifier of XGBoost and cross-validation as evaluation techniques. We first train a classifier of a POS tagger using the POS-tagged corpus with some errors and then detect errors from the POS-tagged corpus using cross-validation, but the classifier cannot detect errors because there is no training data for detecting POS tagged errors. We thus detect errors by comparing the outputs (probabilities of POS) of the classifier, adjusting hyperparameters. The hyperparameters is estimated by a small scale error-tagged corpus, in which text is sampled from a POS-tagged corpus and which is marked up POS errors by experts. In this paper, we use recall and precision as evaluation metrics which are widely used in information retrieval. We have shown that the proposed method is valid by comparing two distributions of the sample (the error-tagged corpus) and the population (the POS-tagged corpus) because all detected errors cannot be checked. In the near future, we will apply the proposed method to a dependency tree-tagged corpus and a semantic role tagged corpus.