• Title/Summary/Keyword: Vocabulary System

Search Result 289, Processing Time 0.029 seconds

Visualization of movie recommendation system using the sentimental vocabulary distribution map

  • Ha, Hyoji;Han, Hyunwoo;Mun, Seongmin;Bae, Sungyun;Lee, Jihye;Lee, Kyungwon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.5
    • /
    • pp.19-29
    • /
    • 2016
  • This paper suggests a method to refine a massive collective intelligence data, and visualize with multilevel sentiment network, in order to understand information in an intuitive and semantic way. For this study, we first calculated a frequency of sentiment words from each movie review. Second, we designed a Heatmap visualization to effectively discover the main emotions on each online movie review. Third, we formed a Sentiment-Movie Network combining the MDS Map and Social Network in order to fix the movie network topology, while creating a network graph to enable the clustering of similar nodes. Finally, we evaluated our progress to verify if it is actually helpful to improve user cognition for multilevel analysis experience compared to the existing network system, thus concluded that our method provides improved user experience in terms of cognition, being appropriate as an alternative method for semantic understanding.

A Strategy Study on Sensitive Information Filtering for Personal Information Protect in Big Data Analyze

  • Koo, Gun-Seo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.12
    • /
    • pp.101-108
    • /
    • 2017
  • The study proposed a system that filters the data that is entered when analyzing big data such as SNS and BLOG. Personal information includes impersonal personal information, but there is also personal information that distinguishes it from personal information, such as religious institution, personal feelings, thoughts, or beliefs. Define these personally identifiable information as sensitive information. In order to prevent this, Article 23 of the Privacy Act has clauses on the collection and utilization of the information. The proposed system structure is divided into two stages, including Big Data Processing Processes and Sensitive Information Filtering Processes, and Big Data processing is analyzed and applied in Big Data collection in four stages. Big Data Processing Processes include data collection and storage, vocabulary analysis and parsing and semantics. Sensitive Information Filtering Processes includes sensitive information questionnaires, establishing sensitive information DB, qualifying information, filtering sensitive information, and reliability analysis. As a result, the number of Big Data performed in the experiment was carried out at 84.13%, until 7553 of 8978 was produced to create the Ontology Generation. There is considerable significan ce to the point that Performing a sensitive information cut phase was carried out by 98%.

Speech Interactive Agent on Car Navigation System Using Embedded ASR/DSR/TTS

  • Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
    • Speech Sciences
    • /
    • v.11 no.2
    • /
    • pp.181-192
    • /
    • 2004
  • This paper presents an efficient speech interactive agent rendering smooth car navigation and Telematics services, by employing embedded automatic speech recognition (ASR), distributed speech recognition (DSR) and text-to-speech (ITS) modules, all while enabling safe driving. A speech interactive agent is essentially a conversational tool providing command and control functions to drivers such' as enabling navigation task, audio/video manipulation, and E-commerce services through natural voice/response interactions between user and interface. While the benefits of automatic speech recognition and speech synthesizer have become well known, involved hardware resources are often limited and internal communication protocols are complex to achieve real time responses. As a result, performance degradation always exists in the embedded H/W system. To implement the speech interactive agent to accommodate the demands of user commands in real time, we propose to optimize the hardware dependent architectural codes for speed-up. In particular, we propose to provide a composite solution through memory reconfiguration and efficient arithmetic operation conversion, as well as invoking an effective out-of-vocabulary rejection algorithm, all made suitable for system operation under limited resources.

  • PDF

An Arabic Script Recognition System

  • Alginahi, Yasser M.;Mudassar, Mohammed;Nomani Kabir, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.9
    • /
    • pp.3701-3720
    • /
    • 2015
  • A system for the recognition of machine printed Arabic script is proposed. The Arabic script is shared by three languages i.e., Arabic, Urdu and Farsi. The three languages have a descent amount of vocabulary in common, thus compounding the problems for identification. Therefore, in an ideal scenario not only the script has to be differentiated from other scripts but also the language of the script has to be recognized. The recognition process involves the segregation of Arabic scripted documents from Latin, Han and other scripted documents using horizontal and vertical projection profiles, and the identification of the language. Identification mainly involves extracting connected components, which are subjected to Principle Component Analysis (PCA) transformation for extracting uncorrelated features. Later the traditional K-Nearest Neighbours (KNN) algorithm is used for recognition. Experiments were carried out by varying the number of principal components and connected components to be extracted per document to find a combination of both that would give the optimal accuracy. An accuracy of 100% is achieved for connected components >=18 and Principal components equals to 15. This proposed system would play a vital role in automatic archiving of multilingual documents and the selection of the appropriate Arabic script in multi lingual Optical Character Recognition (OCR) systems.

Comparative Analysis of Current Science Textbooks on Category (중학교 과학 교과서의 범주별 분석 비교)

  • Koo, Soo-Jeong;Choi, Don-Hyung
    • Journal of The Korean Association For Science Education
    • /
    • v.12 no.2
    • /
    • pp.97-107
    • /
    • 1992
  • ln this study, we analyzed 5 science textbooks currently used for the 7th graders quantitatively by using the science textbook rating system of Collette and Chiappetta(1986), making meta-analysis of the results of 17 graduate school students of Seoul National University. The rating system consists of 11 categories with detailed items respectively : content, organization, reading level, instruction approach, illustrations, end-chapter teaching aids, laboratory activities in text and/or accompanying manual, teacher aids, indices and glossaries and mechanical makeup of text. Each item in the checklist is to be given between one and five points and the total number of possible points in this rating system is 290. It was shown that 5 science textbooks currently used for 7th-year-students were all "poor" in terms of total points and had, at large, uniformed results especially in 10 items; 7 items concerning moral and ethical implications of science, vocabulary lists, accompanying laboratory manual, annotated editions for test, supply list for laboratory program, student workbook and glossary with low points, while 3 items concerning facilities needed for laboratory activities, activities relevant to the content and textbook size with high points. A Science teachers could get a broad view with a correct impression of the books usefulness in making an evaluation of available textbooks.

  • PDF

Comparative Study of Emotion Evaluation Based on Lighting Scenario of Office, Meeting Room, Lounge, and OA Room (사무실, 회의실, 휴게실, OA실의 조명 시나리오에 따른 감성평가 비교 연구 - 동·서양인을 대상으로 -)

  • Lee, Min-Jin;Cho, Mee-Ryoung;Ko, Jae-Kyu;Kim, Ju-Hyun
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.27 no.9
    • /
    • pp.23-35
    • /
    • 2013
  • In this study, we collected the emotion vocabulary novel related to the LED system light emotion and the existing terms through previous research, targeting professionals and KJ method, we selected emotion term of 35 kinds. Targeting this east Foreigner, we compared the emotion evaluation by changing the lighting elements in accordance office, meeting room, lounge, the OA room different behavior patterns. The results, showing the difference between the results of emotion existing research generally derived factors three axes "future, functionality" See, "stability", "activity". As a result of the comparison of the emotion of the East and the West, the Oriental, functional aspects of the LED system light of space office, meeting room, lounge is drawn most, On the other hand, Westerners, come up, stable surface shows the difference in accidentally. Based on these results, the future, and tries to utilized to evaluate the emotion reaction of the illumination elements each Test-Bed actual.

Interpretation and Prediction of Situations on the Korean Peninsula by Peace Index Analysis from Unstructured Data (비정형자료로부터의 평화지수 분석을 통한 한반도 정세 파악 방법)

  • Kwon, Ohbyung;Park, Dasol;Choi, Jihye;Lee, Jaeyoon
    • Journal of Information Technology Services
    • /
    • v.12 no.4
    • /
    • pp.423-434
    • /
    • 2013
  • Since acquiring intelligence about political situations around the Korea Peninsular in a direct manner is nearly impossible, it is inevitable for the individuals or companies to rely on open and indirect data such as newspapers. However, since the contents in the newspapers are substantially unstructured and very large, conventional content analysis is time-consuming and hence very costly. Hence, this paper aims to propose a sentimental analysis method which computes daily 'peace index' from unstructured data in the newspapers. From the content analysis, words and phrases which represent the sentiment of a nation are carefully identified. To show the feasibility of the idea proposed in this paper, a prototype system with vocabulary repository about political situations was developed for estimating peace index automatically.

The Basic Study on making biphone for Korean Speech Recognition (한국어 음성 인식용 biphone 구성을 위한 기초 연구)

  • Hwang YoungSoo;Song Minsuck
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.99-102
    • /
    • 2000
  • In the case of making large vocabulary speech recognition system, it is better to use the segment than the syllable or the word as the recognition unit. In this paper, we study on the basis of making biphone for Korean speech recognition. For experiments, we use the speech toolkit of OGI in U.S.A. The result shows that the recognition rate of the case in which the diphthong is established as a single unit is superior to that of the case in which the diphthong Is established as two units, i.e. a glide plus a vowel. And also, the recognition rate of the case in which the biphone is used as the recognition unit is better than that of the case in which the mono-phoneme is used.

  • PDF

A Study of Marketing Strategy for Business (기업의 마케팅 전략에 관한 소고)

  • 이종철
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.4 no.5
    • /
    • pp.65-71
    • /
    • 1981
  • The Marketing strategy is basic to the marketing plan for a firm. It encompasses the art and science of employing the means for achieving established marketing goals. The use of the word "strategy" in reference to the performance of marketing operations has been accepted in general usage during the last twenty years by both business practitioners and educators. It has been borrowed from the military vocabulary where it refers to the art and science of employing the armed strength of a belligerent force to secure the objectives of war. A marketing strategy consists of two distinct and yet interrelated parts: 1. A target market ${\cdots}{\cdots}$a fairly homogeneous group of customers to whom a company wishes to appeal. 2. A marketing mix ${\cdots}{\cdots}$ the controllable variables which the company combines to satisfy this target group.get group.

  • PDF

Implementation of Korean TTS System based on Natural Language Processing (자연어 처리 기반 한국어 TTS 시스템 구현)

  • Kim Byeongchang;Lee Gary Geunbae
    • MALSORI
    • /
    • no.46
    • /
    • pp.51-64
    • /
    • 2003
  • In order to produce high quality synthesized speech, it is very important to get an accurate grapheme-to-phoneme conversion and prosody model from texts using natural language processing. Robust preprocessing for non-Korean characters should also be required. In this paper, we analyzed Korean texts using a morphological analyzer, part-of-speech tagger and syntactic chunker. We present a new grapheme-to-phoneme conversion method for Korean using a hybrid method with a phonetic pattern dictionary and CCV (consonant vowel) LTS (letter to sound) rules, for unlimited vocabulary Korean TTS. We constructed a prosody model using a probabilistic method and decision tree-based method. The probabilistic method atone usually suffers from performance degradation due to inherent data sparseness problems. So we adopted tree-based error correction to overcome these training data limitations.

  • PDF