Search | Korea Science

Interactive Morphological Analysis to Improve Accuracy of Keyword Extraction Based on Cohesion Scoring

Yu, Yang Woo;Kim, Hyeon Gyu
- Journal of the Korea Society of Computer and Information
- /
- v.25 no.12
- /
- pp.145-153
- /
- 2020
Recently, keyword extraction from social big data has been widely used for the purpose of extracting opinions or complaints from the user's perspective. Regarding this, our previous work suggested a method to improve accuracy of keyword extraction based on the notion of cohesion scoring, but its accuracy can be degraded when the number of input reviews is relatively small. This paper presents a method to resolve this issue by applying simplified morphological analysis as a postprocessing step to extracted keywords generated from the algorithm discussed in the previous work. The proposed method enables to add analysis rules necessary to process input data incrementally whenever new data arrives, which leads to reduction of a dictionary size and improvement of analysis efficiency. In addition, an interactive rule adder is provided to minimize efforts to add new rules. To verify performance of the proposed method, experiments were conducted based on real social reviews collected from online, where the results showed that error ratio was reduced from 10% to 1% by applying our method and it took 450 milliseconds to process 5,000 reviews, which means that keyword extraction can be performed in a timely manner in the proposed method.
https://doi.org/10.9708/jksci.2020.25.12.145 인용 PDF KSCI

Head Pose Estimation by using Morphological Property of Disparity Map

Jun, Se-Woong;Park, Sung-Kee;Lee, Moon-Key
- 제어로봇시스템학회:학술대회논문집
- /
- 2005.06a
- /
- pp.735-739
- /
- 2005
This paper presents a new system to estimate the head pose of human in interactive indoor environment that has dynamic illumination change and large working space. The main idea of this system is to suggest a new morphological feature for estimating head angle from stereo disparity map. When a disparity map is obtained from stereo camera, the matching confidence value can be derived by measurements of correlation of the stereo images. Applying a threshold to the confidence value, we also obtain the specific morphology of the disparity map. Therefore, we can obtain the morphological shape of disparity map. Through the analysis of this morphological property, the head pose can be estimated. It is simple and fast algorithm in comparison with other algorithm which apply facial template, 2D, 3D models and optical flow method. Our system can automatically segment and estimate head pose in a wide range of head motion without manual initialization like other optical flow system. As the result of experiments, we obtained the reliable head orientation data under the real-time performance.
PDF

Analysis of Plants Shape by Image Processing (영상처리에 의한 식물체의 형상분석)

이종환;노상하;류관희
- Journal of Biosystems Engineering
- /
- v.21 no.3
- /
- pp.315-324
- /
- 1996
This study was one of a series of studies on application of machine vision and image processing to extract the geometrical features of plants and to analyze plant growth. Several algorithms were developed to measure morphological properties of plants and describing the growth development of in-situ lettuce(Lactuca sativa L.). Canopy, centroid, leaf density and fractal dimension of plant were measured from a top viewed binary image. It was capable of identifying plants by a thinning top viewed image. Overlapping the thinning side viewed image with a side viewed binary image of plant was very effective to auto-detect meaningful nodes associated with canopy components such as stem, branch, petiole and leaf. And, plant height, stem diameter, number and angle of branches, and internode length and so on were analyzed by using meaningful nodes extracted from overlapped side viewed images. Canopy, leaf density and fractal dimension showed high relation with fresh weight or growth pattern of in-situ lettuces. It was concluded that machine vision system and image processing techniques are very useful in extracting geometrical features and monitoring plant growth, although interactive methods, for some applications, were required.
PDF

Erk AND RETINOIC ACID SIGNALING PARTICIPATE IN THE SEGREGATION AND PATTERNING OF FIRST ARCH DERIVED MAXILLA AND MANDIBLE (Erk와 retinoic acid의 제1인구둥 패터닝 조절)

Park, Eun-Ju;Tak, Hye-Jin;Park, Eun-Ha;Baik, Jeong-Mi;Zhengguo, Piao;Lee, Sang-Hwy
- Maxillofacial Plastic and Reconstructive Surgery
- /
- v.31 no.2
- /
- pp.103-115
- /
- 2009
In vertebrates, the face is mainly formed with neural crest derived neural crest cells by the inherent programs and the interactive environmental factors. Extracellular signaling-regulated kinase (Erk) is one of such programs to regulate the various cellular functions. And retinoic acid (RA) also plays an important role as a regulator in differentiation process at various stages of vertebrate embryogenesis. We wanted to know that the segregation as well as the patterning of maxillary and mandibular structure is greatly influenced by the maxillomandibular cleft (MMC) and the failure of this development may result in the maxillomandibular fusion (syngnathia) or other patterning related disorder. It has been well documented that the epithelium at this cleft region has significant expression of Fibroblast growth factor (Fgf) 8, and it is essential for the patterning of the first arch derived structures. By the morphological, skeletal, cell proliferation and apoptotic, and hybridization analysis, we checked the effects of Erk inhibition and/or RA activation onto MMC and could observe that Erk and RA signaling is individually and synergically involved in the facial patterning in terms of FGF signaling pathway via Barx-l. So RA and Erk signaling work together for the MMC patterning and the segregation of maxilla-mandible by controlling the Fgf-related signaling pathways. And the abnormality in MMC brought by aberrant Fgf signaling may result in the disturbances of maxillary-mandibular segregation.
PDF KSCI

Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

Choi, Hyunseung;Kim, Mintae;Kim, Wooju;Shin, Dongwook;Lee, Yong Hun
- Journal of Intelligence and Information Systems
- /
- v.24 no.4
- /
- pp.111-136
- /
- 2018
In this paper, we propose a methodology to extract answer information about queries from various types of unstructured documents collected from multi-sources existing on web in order to expand knowledge base. The proposed methodology is divided into the following steps. 1) Collect relevant documents from Wikipedia, Naver encyclopedia, and Naver news sources for "subject-predicate" separated queries and classify the proper documents. 2) Determine whether the sentence is suitable for extracting information and derive the confidence. 3) Based on the predicate feature, extract the information in the proper sentence and derive the overall confidence of the information extraction result. In order to evaluate the performance of the information extraction system, we selected 400 queries from the artificial intelligence speaker of SK-Telecom. Compared with the baseline model, it is confirmed that it shows higher performance index than the existing model. The contribution of this study is that we develop a sequence tagging model based on bi-directional LSTM-CRF using the predicate feature of the query, with this we developed a robust model that can maintain high recall performance even in various types of unstructured documents collected from multiple sources. The problem of information extraction for knowledge base extension should take into account heterogeneous characteristics of source-specific document types. The proposed methodology proved to extract information effectively from various types of unstructured documents compared to the baseline model. There is a limitation in previous research that the performance is poor when extracting information about the document type that is different from the training data. In addition, this study can prevent unnecessary information extraction attempts from the documents that do not include the answer information through the process for predicting the suitability of information extraction of documents and sentences before the information extraction step. It is meaningful that we provided a method that precision performance can be maintained even in actual web environment. The information extraction problem for the knowledge base expansion has the characteristic that it can not guarantee whether the document includes the correct answer because it is aimed at the unstructured document existing in the real web. When the question answering is performed on a real web, previous machine reading comprehension studies has a limitation that it shows a low level of precision because it frequently attempts to extract an answer even in a document in which there is no correct answer. The policy that predicts the suitability of document and sentence information extraction is meaningful in that it contributes to maintaining the performance of information extraction even in real web environment. The limitations of this study and future research directions are as follows. First, it is a problem related to data preprocessing. In this study, the unit of knowledge extraction is classified through the morphological analysis based on the open source Konlpy python package, and the information extraction result can be improperly performed because morphological analysis is not performed properly. To enhance the performance of information extraction results, it is necessary to develop an advanced morpheme analyzer. Second, it is a problem of entity ambiguity. The information extraction system of this study can not distinguish the same name that has different intention. If several people with the same name appear in the news, the system may not extract information about the intended query. In future research, it is necessary to take measures to identify the person with the same name. Third, it is a problem of evaluation query data. In this study, we selected 400 of user queries collected from SK Telecom 's interactive artificial intelligent speaker to evaluate the performance of the information extraction system. n this study, we developed evaluation data set using 800 documents (400 questions * 7 articles per question (1 Wikipedia, 3 Naver encyclopedia, 3 Naver news) by judging whether a correct answer is included or not. To ensure the external validity of the study, it is desirable to use more queries to determine the performance of the system. This is a costly activity that must be done manually. Future research needs to evaluate the system for more queries. It is also necessary to develop a Korean benchmark data set of information extraction system for queries from multi-source web documents to build an environment that can evaluate the results more objectively.
https://doi.org/10.13088/jiis.2018.24.4.111 인용 PDF KSCI HTML

Search Result 5, Processing Time 0.02 seconds

Interactive Morphological Analysis to Improve Accuracy of Keyword Extraction Based on Cohesion Scoring

Head Pose Estimation by using Morphological Property of Disparity Map

Analysis of Plants Shape by Image Processing (영상처리에 의한 식물체의 형상분석)

Erk AND RETINOIC ACID SIGNALING PARTICIPATE IN THE SEGREGATION AND PATTERNING OF FIRST ARCH DERIVED MAXILLA AND MANDIBLE (Erk와 retinoic acid의 제1인구둥 패터닝 조절)

Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)