• Title/Summary/Keyword: Semantic Score

Search Result 106, Processing Time 0.025 seconds

A Sentence Theme Allocation Scheme based on Head Driven Patterns in Encyclopedia Domain (백과사전 영역에서 중심어주도패턴에 기반한 문장주제 할당 기법)

  • Kang Bo-Young;Myaeng Sung-Hyon
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.5
    • /
    • pp.396-405
    • /
    • 2005
  • Since sentences are the basic propositional units of text, their themes would be helpful for various tasks that require knowledge about the semantic content of text. Despite the importance of determining the theme of a sentence, however, few studies have investigated the problem of automatically assigning the theme to a sentence. Therefore, we propose a sentence theme allocation scheme based on the head-driven patterns of sentences in encyclopedia. In a serious of experiments using Dusan Dong-A encyclopedia, the proposed method outperformed the baseline of the theme allocation performance. The head-driven pattern 4, which is reconfigured based on the predicate, showed superior performance in the theme allocation with the average F-score of $98.96\%$ for the training data, and $88.57\%$ for the test data.

Recognition of Answer Type for WiseQA (WiseQA를 위한 정답유형 인식)

  • Heo, Jeong;Ryu, Pum Mo;Kim, Hyun Ki;Ock, Cheol Young
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.7
    • /
    • pp.283-290
    • /
    • 2015
  • In this paper, we propose a hybrid method for the recognition of answer types in the WiseQA system. The answer types are classified into two categories: the lexical answer type (LAT) and the semantic answer type (SAT). This paper proposes two models for the LAT detection. One is a rule-based model using question focuses. The other is a machine learning model based on sequence labeling. We also propose two models for the SAT classification. They are a machine learning model based on multiclass classification and a filtering-rule model based on the lexical answer type. The performance of the LAT detection and the SAT classification shows F1-score of 82.47% and precision of 77.13%, respectively. Compared with IBM Watson for the performance of the LAT, the precision is 1.0% lower and the recall is 7.4% higher.

A Study on Analyzing the Interest of the Youth to the Sea (靑少年의 海洋에 관한 關心度의 分析的 硏究)

  • Choi, Jong-Moon;Park, Chang-Ho;Lee, Cheol-Yeong
    • Journal of the Korean Institute of Navigation
    • /
    • v.16 no.1
    • /
    • pp.77-91
    • /
    • 1992
  • How does the youth feel the sea affairs\ulcorner Concerning this question, this paper aims to measure the images of the youth toward the sea affairs - the sea, the ship and the seafarer and to examined the above subject. As sample 3, 250 students of middle and high school were selected by considering geographical environment. The data obtained using Semantic Differential Method were analyzed by principal component analysis, and the obtained factor scores were examined the significance of difference between sex, age and geographical environment. By introducing the principal component analysis, the authors extracted from each of the images, that is, factors of dynamics and affection to the image on these a and the former the factors and pleasure on the ship, and also the former two factors and factor of professional evaluation on the seafarer, The following results are obtained. 1) In the image of the sea, dynamic image of the student in high school were higher than the of the student in middle school in spite of geographic environment and affective image were opposite. 2) In the images of the ship, affective image of the student in middle school and high school in inland were high than the of the male and female student in near the sea. And also, male female students in middle school and male student s in high school of inland showed the highest score to the pleasure image. 3) In the image of the seafare, professional evaluation of the female student in middle school were higher than the others, but the students in high school showed the highest score to dynamic image. Especially, in the case of the majority of students in high school living in the city or town near the, their images of the seafarer were not so good in spite of their explorative experiences about the sea affairs.

  • PDF

Quantitative and Qualitative Considerations to Apply Methods for Identifying Content Relevance between Knowledge Into Managing Knowledge Service (지식 간 내용적 연관성 파악 기법의 지식 서비스 관리 접목을 위한 정량적/정성적 고려사항 검토)

  • Yoo, Keedong
    • The Journal of Society for e-Business Studies
    • /
    • v.26 no.3
    • /
    • pp.119-132
    • /
    • 2021
  • Identification of associated knowledge based on content relevance is a fundamental functionality in managing service and security of core knowledge. This study compares the performance of methods to identify associated knowledge based on content relevance, i.e., the associated document network composition performance of keyword-based and word-embedding approach, to examine which method exhibits superior performance in terms of quantitative and qualitative perspectives. As a result, the keyword-based approach showed superior performance in core document identification and semantic information representation, while the word embedding approach showed superior performance in F1-Score and Accuracy, association intensity representation, and large-volume document processing. This study can be utilized for more realistic associated knowledge service management, reflecting the needs of companies and users.

Detection Algorithm of Road Surface Damage Using Adversarial Learning (적대적 학습을 이용한 도로 노면 파손 탐지 알고리즘)

  • Shim, Seungbo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.4
    • /
    • pp.95-105
    • /
    • 2021
  • Road surface damage detection is essential for a comfortable driving environment and the prevention of safety accidents. Road management institutes are using automated technology-based inspection equipment and systems. As one of these automation technologies, a sensor to detect road surface damage plays an important role. For this purpose, several studies on sensors using deep learning have been conducted in recent years. Road images and label images are needed to develop such deep learning algorithms. On the other hand, considerable time and labor will be needed to secure label images. In this paper, the adversarial learning method, one of the semi-supervised learning techniques, was proposed to solve this problem. For its implementation, a lightweight deep neural network model was trained using 5,327 road images and 1,327 label images. After experimenting with 400 road images, a model with a mean intersection over a union of 80.54% and an F1 score of 77.85% was developed. Through this, a technology that can improve recognition performance by adding only road images was developed to learning without label images and is expected to be used as a technology for road surface management in the future.

A Study on the Walkability Scores in Jeonju City Using Multiple Regression Models (다중 회귀 모델을 이용한 전주시 보행 환경 점수 예측에 관한 연구)

  • Lee, KiChun;Nam, KwangWoo;Lee, ChangWoo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.4
    • /
    • pp.1-10
    • /
    • 2022
  • Attempts to interpret human perspectives using computer vision have been developed in various fields. In this paper, we propose a method for evaluating the walking environment through semantic segmentation results of images from road images. First, the Kakao Map API was used to collect road images, and four-way images were collected from about 50,000 points in JeonJu. 20% of the collected images build datasets through crowdsourcing-based paired comparisons, and train various regression models using paired comparison data. In order to derive the walkability score of the image data, the ranking score is calculated using the Trueskill algorithm, which is a ranking algorithm, and the walkability and analysis using various regression models are performed using the constructed data. Through this study, it is shown that the walkability of Jeonju can be evaluated and scores can be derived through the correlation between pixel distribution classification information rather than human vision.

A Method to Solve the Entity Linking Ambiguity and NIL Entity Recognition for efficient Entity Linking based on Wikipedia (위키피디아 기반의 효과적인 개체 링킹을 위한 NIL 개체 인식과 개체 연결 중의성 해소 방법)

  • Lee, Hokyung;An, Jaehyun;Yoon, Jeongmin;Bae, Kyoungman;Ko, Youngjoong
    • Journal of KIISE
    • /
    • v.44 no.8
    • /
    • pp.813-821
    • /
    • 2017
  • Entity Linking find the meaning of an entity mention, which indicate the entity using different expressions, in a user's query by linking the entity mention and the entity in the knowledge base. This task has four challenges, including the difficult knowledge base construction problem, multiple presentation of the entity mention, ambiguity of entity linking, and NIL entity recognition. In this paper, we first construct the entity name dictionary based on Wikipedia to build a knowledge base and solve the multiple presentation problem. We then propose various methods for NIL entity recognition and solve the ambiguity of entity linking by training the support vector machine based on several features, including the similarity of the context, semantic relevance, clue word score, named entity type similarity of the mansion, entity name matching score, and object popularity score. We sequentially use the proposed two methods based on the constructed knowledge base, to obtain the good performance in the entity linking. In the result of the experiment, our system achieved 83.66% and 90.81% F1 score, which is the performance of the NIL entity recognition to solve the ambiguity of the entity linking.

Semantic Visualization of Dynamic Topic Modeling (다이내믹 토픽 모델링의 의미적 시각화 방법론)

  • Yeon, Jinwook;Boo, Hyunkyung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.131-154
    • /
    • 2022
  • Recently, researches on unstructured data analysis have been actively conducted with the development of information and communication technology. In particular, topic modeling is a representative technique for discovering core topics from massive text data. In the early stages of topic modeling, most studies focused only on topic discovery. As the topic modeling field matured, studies on the change of the topic according to the change of time began to be carried out. Accordingly, interest in dynamic topic modeling that handle changes in keywords constituting the topic is also increasing. Dynamic topic modeling identifies major topics from the data of the initial period and manages the change and flow of topics in a way that utilizes topic information of the previous period to derive further topics in subsequent periods. However, it is very difficult to understand and interpret the results of dynamic topic modeling. The results of traditional dynamic topic modeling simply reveal changes in keywords and their rankings. However, this information is insufficient to represent how the meaning of the topic has changed. Therefore, in this study, we propose a method to visualize topics by period by reflecting the meaning of keywords in each topic. In addition, we propose a method that can intuitively interpret changes in topics and relationships between or among topics. The detailed method of visualizing topics by period is as follows. In the first step, dynamic topic modeling is implemented to derive the top keywords of each period and their weight from text data. In the second step, we derive vectors of top keywords of each topic from the pre-trained word embedding model. Then, we perform dimension reduction for the extracted vectors. Then, we formulate a semantic vector of each topic by calculating weight sum of keywords in each vector using topic weight of each keyword. In the third step, we visualize the semantic vector of each topic using matplotlib, and analyze the relationship between or among the topics based on the visualized result. The change of topic can be interpreted in the following manners. From the result of dynamic topic modeling, we identify rising top 5 keywords and descending top 5 keywords for each period to show the change of the topic. Existing many topic visualization studies usually visualize keywords of each topic, but our approach proposed in this study differs from previous studies in that it attempts to visualize each topic itself. To evaluate the practical applicability of the proposed methodology, we performed an experiment on 1,847 abstracts of artificial intelligence-related papers. The experiment was performed by dividing abstracts of artificial intelligence-related papers into three periods (2016-2017, 2018-2019, 2020-2021). We selected seven topics based on the consistency score, and utilized the pre-trained word embedding model of Word2vec trained with 'Wikipedia', an Internet encyclopedia. Based on the proposed methodology, we generated a semantic vector for each topic. Through this, by reflecting the meaning of keywords, we visualized and interpreted the themes by period. Through these experiments, we confirmed that the rising and descending of the topic weight of a keyword can be usefully used to interpret the semantic change of the corresponding topic and to grasp the relationship among topics. In this study, to overcome the limitations of dynamic topic modeling results, we used word embedding and dimension reduction techniques to visualize topics by era. The results of this study are meaningful in that they broadened the scope of topic understanding through the visualization of dynamic topic modeling results. In addition, the academic contribution can be acknowledged in that it laid the foundation for follow-up studies using various word embeddings and dimensionality reduction techniques to improve the performance of the proposed methodology.

A Study on Visual Comfort for Compound Lighting Control Method of Applied Daylighting (자연채광의 응용에 의한 합성조명방식의 시각적 쾌적성에 관한 연구)

  • Han, Sang-Pil;Jeon, Yong-Han;Han, Sang-Chul
    • Journal of the Korean Solar Energy Society
    • /
    • v.32 no.spc3
    • /
    • pp.199-206
    • /
    • 2012
  • The purpose of this study is to understand the change of impression by comparing the uniformity lighting with the compound lighting. In previous study, we proposed a light controlling method to harmonize daylight from a window and artificial lights from a ceiling and obtained the results to support our method. We referred this method as the Adjusted Compound-Lighting Model (AC Model). The experiment is carried out with the scaled-models and mock-up spaces that were supposed to be an office space. One is lit by the uniform lighting and the other by the compound lighting in each experimental space. In order to present varying illuminance distributions, the two variables were used in this study. Subjects were asked to evaluate the point of difference by semantic differential rating on their overall impression after comparing with two rooms. The results showed that the impressions of compound lighting were more positive score than that of uniformity lighting on the items of 'dim-bright', 'dislike-like', 'artificial-natural' and 'closed-open', and that there was no significant difference in impressions between two spaces on other items.

Assessing Landscape Impacts of Apartment Complex on Suburban Hilly Openspace; Multilateral Approach by Analysis of Physical Landscape Variables and Eye Fixation Movements (도시주변 능선녹지를 배경으로 하는 아파트 경관의 시각적 영향 - 물리적 경관변수 및 와시점분석에 의한 다각적 접근-)

  • Choi, Yun;Cho, Tong-Buhm
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.22 no.2
    • /
    • pp.81-103
    • /
    • 1994
  • In recent years, the visual characteristics of natural open space and greenbelt surrounding the urban landscapes have been changed with sprawling of residential areas and highrised residential buildings. Since these natural areas being the background element of residential areas are topographically sloped mountains in many cities. It is easy to be seen in the distance and it is important to preserve these areas as a visual infrastructure of the urban landscape. The purposes of this study are to extract the factors of landscape impact evaluation for these areas and to clarify the physical landscape variables representing these factors, and to infer the visual-perceptional relationships between image and landscape variables. As results, conceptional three factors were extracted with semantic differential evaluation to classified 18 landscape slide, and three regression models were established with factor score of landscapes and physical variables measured in photographs. On the basis of these relationships, visual-perceptional characteristics were discussed by analyzing the data form eye-movement recording to each of landscapes. The factors of "spatial unfolding of backdropped hilly greenspace", "horizontal quence of residential buildings", and "landscape complexity" prove to be important. And it prove important variables of "skyline of mountainous ridge" and "visual edge of building structure" in regression models and eye fixation movements.

  • PDF