• Title/Summary/Keyword: 表現

Search Result 21,782, Processing Time 0.054 seconds

A Study on the Construction and Landscape Characteristics of Munam Pavilion in Changnyeong(聞巖亭) (창녕 문암정(聞巖亭)의 조영 및 경관특성에 관한 연구)

  • Lee, Won-Ho;Kim, Dong-Hyun;Kim, Jae-Ung;Ahn, Gye-Bog
    • Journal of the Korean Institute of Traditional Landscape Architecture, v.32 no.2, pp.27-41, 2014
  • This study investigates the history and cultural value of Munam Pavilion (聞巖亭) in Changnyeong through literature analysis, and examines its construction, location, spatial structure, and landscape characteristics using ArcGIS. The results are as follows. First, the builder of Munam Pavilion was Shin-cho (辛礎, 1549~1618), whose view of nature was one of returning to nature. Records of his retirement from politics and of building a pavilion indicate that it was constructed between 1608 and 1618. Second, Munam Pavilion is surrounded by mountains and stands at the top of a steep slope. It was known as a scenic site of the area, but the historical landscape has been damaged by the nearby bridge, agricultural facilities, the town, and the masonry and beams of Kye-sung stream. Third, the site is divided into the main space, where the pavilion stands; the spaces to the east and west of the pavilion; and the orient space, where Youngjeonggak stands. The original form was a simple composition of the pavilion and Munam rock, so at the time of construction direct entry seems to have been possible, unlike through the current entrance. Fourth, the spatial components of Munam Pavilion are divided into vegetation, such as the Lagerstroemia indica trees of Sa-ri, Changnyeong; ornaments, such as the letters carved on the rocks; and structures, including the pavilion. The vegetation around the building is classified as inside or outside the precincts. Planting inside the precincts was limited. Outside, a Lagerstroemia indica forest covers the front of the pavilion and a Pinus densiflora forest lies behind it. Of these, the large Lagerstroemia indica forest qualifies as natural heritage, as a historical record of a rare species resource associated with the builder. The letters carved on the rocks mark the boundary of the space; they lie close to the pavilion and are ornaments associated with the builder. The letters carved on the rocks in front of the pavilion are a rare case of inscriptions made sequentially, with a consistent direction and rules, as an act of record by which the family honored achievements. Fifth, 'The eight famous spots of Munam' divide into four landscape elements unrelated to bearing and four related to bearing. The elements unrelated to bearing are the Lagerstroemia indica trees of Sa-ri, Changnyeong, the Pinus densiflora forest behind the pavilion, Okcheon valley, Gwanryongsa temple, and Daeheungsa temple. The elements tied to absolute bearing are Daeheungsa temple, Hwawangsan mountain, Kye-sung stream, and Yeongchwisan mountain; those tied to relative bearing are Gwanryongsa temple, Yeongchwisan mountain, and Kye-sung stream Gongjigi hill. Of these, the Lagerstroemia indica trees of Sa-ri, the Pinus densiflora forest behind the pavilion, Kye-sung stream, and Okcheon valley still exist. The remaining landscape elements are currently difficult to confirm, because they are generic elements whose referents and locations cannot be reliably estimated. Munam Pavilion was built by Shin-cho (辛礎) as a return to nature, and records relating to its construction survive in Munamzip (聞巖集) and Munamchungueirok (聞巖忠義錄). The pavilion's distinctive siting, the archival culture expressed in the rock inscriptions, the large Lagerstroemia indica forest, and the eight famous spots suggest that its cultural landscape elements remain.

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems, v.21 no.2, pp.69-92, 2015
  • The explosion of social media data has led researchers to apply text-mining techniques to analyze big social media data more rigorously. Although social media text analysis algorithms have improved, previous approaches still have limitations. In sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common; some studies have added grammatical factors to the feature sets used to train classification models. The other applies semantic analysis to sentiment analysis, but it has mainly been applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm, an extension of neural network algorithms, to handle richer semantic features that were underestimated in existing sentiment analysis. The result from Word2Vec is compared with the result of co-occurrence analysis to identify the difference between the two approaches. The results show that, among the related words extracted, words expressing some emotion about the keyword were three times more numerous with Word2Vec than with co-occurrence analysis. The difference arises from Word2Vec's vectorization of semantic features. It is therefore possible to say that Word2Vec can capture hidden related words that traditional analysis has not found. In addition, part-of-speech (POS) tagging for Korean is used to detect adjectives as "emotional words". The emotion words extracted from the text are then converted into word vectors by Word2Vec to find related words. Among these related words, nouns are selected because each of them may have a causal relationship with the "emotional word" in the sentence. The process of extracting these trigger factors of emotional words is named "Emotion Trigger" in this study. As a case study, the datasets were collected by searching with three keywords (professor, prosecutor, and doctor), chosen because they carry rich public emotion and opinion. Preliminary data collection was conducted to select secondary keywords for data gathering. The secondary keywords used to gather the data for the actual analysis were as follows: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate), Prosecutor (lewd behavior, sponsor). The size of the text data is about 100,000 (Professor: 25720, Doctor: 35110, Prosecutor: 43225), and the data were gathered from news, blogs, and Twitter to reflect various levels of public emotion in the text data analysis. Gephi (http://gephi.github.io) was used for visualization, and every program used in text processing and analysis was written in Java. The contributions of this study are as follows. First, different approaches to sentiment analysis are integrated to overcome the limitations of existing approaches. Second, finding Emotion Triggers can detect hidden connections to public emotion that existing methods cannot detect. Finally, the approach used in this study could be generalized regardless of the type of text data. The limitation of this study is that it is hard to say that a word extracted by Emotion Trigger processing has a significant causal relationship with the emotional word in a sentence. Future work will clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing them with manually tagged relationships. Furthermore, the text data used in Emotion Trigger are from Twitter, so the data have a number of distinct features that we did not deal with in this study. These features will be considered in further study.
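
The related-word step described in this abstract (take an emotion adjective, look up its word vector, keep the nearest nouns) can be sketched as follows. This is a minimal illustration, not the authors' Java implementation: the vectors in `TOY_VECTORS`, the tags in `TOY_POS`, and the function name `emotion_triggers` are all hypothetical, and a real run would use vectors trained by Word2Vec on the collected corpus.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def emotion_triggers(emotion_word, vectors, pos_tags, top_n=2):
    """Return the top_n nouns whose vectors are closest to the emotion word."""
    target = vectors[emotion_word]
    scored = [
        (cosine(target, vec), word)
        for word, vec in vectors.items()
        if word != emotion_word and pos_tags.get(word) == "NOUN"
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [word for _, word in scored[:top_n]]

# Hypothetical 3-dimensional vectors; real Word2Vec vectors have far more dimensions.
TOY_VECTORS = {
    "angry": [0.9, 0.1, 0.0],
    "bribe": [0.8, 0.2, 0.1],
    "scandal": [0.7, 0.3, 0.0],
    "weather": [0.0, 0.1, 0.9],
    "sad": [0.6, 0.4, 0.1],
}
# Hypothetical POS tags; the study uses a Korean POS tagger.
TOY_POS = {"angry": "ADJ", "bribe": "NOUN", "scandal": "NOUN",
           "weather": "NOUN", "sad": "ADJ"}

print(emotion_triggers("angry", TOY_VECTORS, TOY_POS))
```

Filtering to nouns mirrors the abstract's assumption that nouns near an emotion word are candidate causes of that emotion; as the abstract itself notes, vector proximity alone does not establish causality.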

Methods for Integration of Documents using Hierarchical Structure based on the Formal Concept Analysis (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

  • Kim, Tae-Hwan;Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems, v.17 no.3, pp.63-77, 2011
  • The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources such as personal home pages, online digital libraries, and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte-size precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not necessarily appear at the top of the query output order. Also, current search tools cannot retrieve documents related to a retrieved document from the gigantic number of documents available. The most important problem for many current search systems is to increase the quality of search, that is, to provide related documents and to keep the number of unrelated documents in the search results as low as possible. For this problem, CiteSeer proposed ACI (Autonomous Citation Indexing) of the articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. In detail, references contained in academic articles are used to give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the article with the cited works. Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language and of the words in the title, keywords, or document. A citation index allows navigation backward in time (the list of cited articles) and forward in time (which subsequent articles cite the current article?). But CiteSeer cannot index links between articles that researchers do not make, because it indexes only the links that researchers make when they cite other articles. For the same reason, CiteSeer does not scale easily. These problems motivate the design of a more effective search system. This paper presents a method that extracts the subject and predicate of each sentence in a document. Each document is converted into a table in which each extracted predicate is checked against its possible subjects and objects. We build a hierarchical graph of each document from the table and then integrate the graphs of the documents. From the graph of the entire document set, we calculate the area of each document relative to the integrated documents, and we mark the relations among documents by comparing their areas. The paper also proposes a method for structural integration of documents that retrieves documents from the graph, which makes it easier for the user to find information. We compared the performance of the proposed approaches with the Lucene search engine using its ranking formulas. As a result, the F-measure is about 60%, which is about 15% better.
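
The hierarchical structure named in the title comes from Formal Concept Analysis (FCA), which turns a cross-table of objects and attributes into a set of "formal concepts" ordered by inclusion. The sketch below shows only that generic FCA step on a made-up table; the abstract does not give the paper's actual table layout, graph integration, or area calculation, so the context `TOY_CONTEXT` and all function names are assumptions for illustration. The enumeration is exponential in the number of objects and is only meant for toy inputs.

```python
from itertools import combinations

def common_attributes(objects, context):
    """Attributes shared by every object in `objects` (the FCA intent)."""
    shared = set().union(*context.values())
    for obj in objects:
        shared &= context[obj]
    return shared

def objects_having(attributes, context):
    """Objects that carry every attribute in `attributes` (the FCA extent)."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}

def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every object subset."""
    concepts = set()
    names = list(context)
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = common_attributes(subset, context)
            extent = objects_having(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts

# Hypothetical cross-table: rows are subjects, columns are predicates.
TOY_CONTEXT = {
    "doc1": {"search", "index"},
    "doc2": {"search"},
    "doc3": {"index", "cite"},
}

for extent, intent in sorted(formal_concepts(TOY_CONTEXT),
                             key=lambda c: (len(c[0]), sorted(c[0]))):
    print(sorted(extent), sorted(intent))
```

Ordering the concepts by extent size, as the loop does, gives the levels of the hierarchy: concepts with larger extents sit above those with smaller ones.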

Improved Social Network Analysis Method in SNS (SNS에서의 개선된 소셜 네트워크 분석 방법)

  • Sohn, Jong-Soo;Cho, Soo-Whan;Kwon, Kyung-Lag;Chung, In-Jeong
    • Journal of Intelligence and Information Systems, v.18 no.4, pp.117-127, 2012
  • With the recent expansion of Web 2.0-based services and the spread of smartphones, online social network services have become popular among users. Online social network services are online community services that enable users to communicate with each other, share information, and expand human relationships. In social network services, each relation between users is represented by a graph consisting of nodes and links. As the number of users grows rapidly, SNS are actively used in enterprise marketing, analysis of social phenomena, and so on. Social Network Analysis (SNA) is a systematic way to analyze social relationships among the members of a social network using network theory. In general, a social network consists of nodes and arcs and is often depicted in a social network diagram, in which nodes represent individual actors within the network and arcs represent relationships between the nodes. With SNA, we can measure relationships among people, such as degree of intimacy, intensity of connection, and classification of groups. Since Social Networking Services (SNS) began drawing attention from millions of users, numerous studies have analyzed their user relationships and messages. The representative SNA methods are degree centrality, betweenness centrality, and closeness centrality. Degree centrality analysis does not consider the shortest path between nodes, but the shortest path is a crucial factor in betweenness centrality, closeness centrality, and other SNA methods. In previous SNA research, computation time was not too expensive because the social networks were small. Unfortunately, most SNA methods require significant time to process the relevant data, which makes it difficult to apply them to ever-increasing SNS data. For instance, if the number of nodes in an online social network is n, the maximum number of links is n(n-1)/2. Analyzing the network is therefore very expensive: with 10,000 nodes, the number of links can reach 49,995,000. Therefore, we propose a heuristic-based method for finding the shortest path among users in the SNS user graph. Using this shortest-path method, we show the efficiency of the proposed approach by conducting betweenness centrality analysis and closeness centrality analysis, both of which are widely used in social network studies. Moreover, we devised an enhanced method that adds best-first search and a preprocessing step to reduce computation time and to find shortest paths rapidly in a very large online social network. Best-first search finds the shortest path heuristically, generalizing from human experience. Because a large number of links is shared by only a few nodes in online social networks, most nodes have relatively few connections, and a node with many connections functions as a hub node. When searching for a particular node, looking first at users with numerous links, instead of searching all users indiscriminately, has a better chance of finding the desired node quickly. In this paper, we employ the degree of user node vn as the heuristic evaluation function in a graph G = (N, E), where N is a set of vertices and E is a set of links between two different nodes. With this heuristic evaluation function, the worst case occurs when the target node lies at the bottom of a skewed tree. A preprocessing step is conducted to remove such target nodes. Next, we efficiently find the shortest path between two nodes in the social network and then analyze the network. To verify the proposed method, we crawled data on 160,000 people online and constructed a social network. We then compared the search and analysis time with previous methods, namely best-first search and breadth-first search. The suggested method takes 240 seconds to search nodes, whereas the breadth-first-search-based method takes 1,781 seconds (7.4 times faster). Moreover, for social network analysis, the suggested method is 6.8 times faster in betweenness centrality analysis and 1.8 times faster in closeness centrality analysis. The proposed method shows that a large social network can be analyzed with better time performance. As a result, our method would improve the efficiency of social network analysis, making it particularly useful in studying social trends or phenomena.
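
Two points in this abstract lend themselves to a short sketch: the n(n-1)/2 bound on the number of links, and the idea of expanding high-degree (hub) nodes first. The code below is an illustration under assumptions, not the authors' algorithm: the function names and the toy graph are hypothetical, the preprocessing step is omitted, and a plain greedy best-first search like this one returns a path quickly but does not guarantee that the path is the shortest.

```python
import heapq

def max_links(n):
    """Maximum number of undirected links among n nodes: n(n-1)/2."""
    return n * (n - 1) // 2

def degree_best_first_path(graph, start, goal):
    """Greedy best-first search that expands the highest-degree node first.

    `graph` maps each node to a list of its neighbours. Returns a path from
    start to goal as a list of nodes, or None if the goal is unreachable.
    """
    frontier = [(-len(graph[start]), start)]  # negate degree for a max-heap
    parent = {start: None}
    while frontier:
        _, node = heapq.heappop(frontier)
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for neighbour in graph[node]:
            if neighbour not in parent:
                parent[neighbour] = node
                heapq.heappush(frontier, (-len(graph[neighbour]), neighbour))
    return None

# Hypothetical star-shaped network: one hub, four low-degree users.
TOY_GRAPH = {
    "a": ["hub"],
    "hub": ["a", "b", "c", "d"],
    "b": ["hub"],
    "c": ["hub"],
    "d": ["hub"],
}

print(max_links(10_000))
print(degree_best_first_path(TOY_GRAPH, "a", "d"))
```

The first call reproduces the 49,995,000 figure quoted for 10,000 nodes; the second shows the search reaching the target through the hub.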

Smart Store in Smart City: The Development of Smart Trade Area Analysis System Based on Consumer Sentiments (Smart Store in Smart City: 소비자 감성기반 상권분석 시스템 개발)

  • Yoo, In-Jin;Seo, Bong-Goon;Park, Do-Hyung
    • Journal of Intelligence and Information Systems, v.24 no.1, pp.25-52, 2018
  • This study performs social network analysis based on consumer sentiment related to locations in Seoul, using data reflecting consumers' web search activities and emotional evaluations associated with commerce. The study focuses on large commercial districts in Seoul. In addition, to consider their various aspects, social network indexes were combined with the trading areas' public data to verify the factors affecting each area's sales. Judging by the change in R-squared, the model has a fairly high R-squared value even when it includes only the districts' public data, which are static data. However, the R-squared improved considerably further when the model was combined with the network indexes derived from the social network analysis. A regression analysis of the trading areas' public data showed that, among twenty-two variables, five factors had a significant influence on the sales of the trading area: 'number of market district,' 'residential area per person,' 'satisfaction of residential environment,' 'rate of change of trade,' and 'survival rate over 3 years.' Of these, 'residential area per person' has the highest standardized beta value and therefore the strongest influence on commercial sales. In addition, 'residential area per person,' 'number of market district,' and 'survival rate over 3 years' were found to have positive effects on the sales of all trading areas. Thus, sales increase as the number of market districts in the trading area increases, as residential area per person increases, and as the three-year survival rate of stores in the trading area increases. On the other hand, 'satisfaction of residential environment' and 'rate of change of trade' were found to have a negative effect on sales. In the case of 'satisfaction of residential environment,' sales increase when the satisfaction level is low; that is, as consumer dissatisfaction with the residential environment increases, sales increase. The 'rate of change of trade' shows that sales increase with the decreasing acceleration of transaction frequency. According to the social network analysis, of the 25 regional trading areas in Seoul, Yangcheon-gu has the highest degree of connection. In other words, it has common sentiments with many other trading areas. On the other hand, Nowon-gu and Jungrang-gu have the lowest degree of connection. In other words, they have relatively distinct sentiments from other trading areas. The social network indexes used in the combination model are 'density of ego network,' 'degree centrality,' 'closeness centrality,' 'betweenness centrality,' and 'eigenvector centrality.' The combined model analysis confirmed that, among the social network indexes, degree centrality and eigenvector centrality have a significant influence on sales and the highest influence in the model. 'Degree centrality' has a negative effect on the sales of the districts. This implies that sales decrease when a trading area shares the sentiments of many other trading areas, which conflicts with commonly held beliefs. However, this result can be interpreted to mean that a trading area with low 'degree centrality' delivers unique and special sentiments to consumers. The findings can also be interpreted to mean that sales can be increased if the trading area raises consumer recognition by forming a unique sentiment and city atmosphere that distinguish it from other trading areas. On the other hand, 'eigenvector centrality' has the greatest effect on sales in the combined model, and the effect is positive. This finding shows that sales increase more when a trading area is connected to others with stronger centrality than when it merely has common sentiments with others. This study can be used as an empirical basis for establishing and implementing city and trading area strategy plans that consider consumers' desired sentiments. In addition, we expect it to inform entrepreneurs and potential entrepreneurs entering a trading area about the sentiments held by those in the area and about how to enter it given the district-sentiment structure.
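
The "degree of connection" discussed in this abstract corresponds to degree centrality: the share of other nodes a node is directly linked to. A minimal sketch follows, assuming an undirected network given as a list of edges; the area names and edges are invented placeholders, not the study's data, and nodes that appear in no edge are simply absent from the result.

```python
def degree_centrality(edges):
    """Normalized degree centrality: degree / (n - 1) for each node.

    `edges` is an iterable of (a, b) pairs describing undirected links.
    """
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    return {node: len(linked) / (n - 1) for node, linked in neighbours.items()}

# Hypothetical links between trading areas that share a sentiment.
TOY_EDGES = [
    ("area_A", "area_B"),
    ("area_A", "area_C"),
    ("area_A", "area_D"),
    ("area_B", "area_C"),
]

centrality = degree_centrality(TOY_EDGES)
for area in sorted(centrality, key=centrality.get, reverse=True):
    print(area, round(centrality[area], 2))
```

In this toy network `area_A` shares sentiments with every other area, while `area_D` is the most distinct; under the abstract's finding, a low value of this index is associated with higher sales.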

Product Evaluation Criteria Extraction through Online Review Analysis: Using LDA and k-Nearest Neighbor Approach (온라인 리뷰 분석을 통한 상품 평가 기준 추출: LDA 및 k-최근접 이웃 접근법을 활용하여)

  • Lee, Ji Hyeon;Jung, Sang Hyung;Kim, Jun Ho;Min, Eun Joo;Yeo, Un Yeong;Kim, Jong Woo
    • Journal of Intelligence and Information Systems, v.26 no.1, pp.97-117, 2020
  • Product evaluation criteria are indicators describing the attributes or values of products, which enable users or manufacturers to measure and understand the products. When companies analyze their products or compare them with competitors' products, appropriate criteria must be selected for objective evaluation. The criteria should reflect the product features that consumers considered when they purchased, used, and evaluated the products. However, current evaluation criteria do not reflect how consumers' opinions differ from product to product. Previous studies tried to use online reviews from e-commerce sites, which reflect consumer opinions, to extract the features and topics of products and use them as evaluation criteria. However, these studies still produce criteria irrelevant to the products because extracted or improper words are not refined. To overcome this limitation, this research proposes the LDA-k-NN model, which extracts possible criteria words from online reviews using LDA and refines them with k-nearest neighbor. The proposed approach starts with a preparation phase consisting of six steps. First, it collects review data from e-commerce websites. Most e-commerce websites classify their items into high-level, middle-level, and low-level categories. Review data for the preparation phase are gathered from each middle-level category and later merged to represent a single high-level category. Next, nouns, adjectives, adverbs, and verbs are extracted from the reviews using part-of-speech information from a morpheme analysis module. After preprocessing, LDA yields the words for each topic in the reviews, and only the nouns among the topic words are chosen as potential criteria words. Then, words are tagged according to whether they could serve as criteria for each middle-level category. Next, every tagged word is vectorized with a pre-trained word embedding model. Finally, a k-nearest neighbor case-based approach is used to classify each word with tags. After the preparation phase, the criteria extraction phase is conducted on low-level categories. This phase starts with crawling reviews in the corresponding low-level category. The same preprocessing as in the preparation phase is conducted using the morpheme analysis module and LDA. Possible criteria words are extracted by taking nouns from the data and are vectorized with the pre-trained word embedding model. Finally, evaluation criteria are extracted by refining the possible criteria words using the k-nearest neighbor approach and the reference proportion of each word in the word set. To evaluate the performance of the proposed model, an experiment was conducted with reviews on '11st', one of the biggest e-commerce companies in Korea. Review data came from the 'Electronics/Digital' section, one of the high-level categories in 11st. The suggested model was compared with three others: the actual criteria of 11st, a model that extracts nouns with the morpheme analysis module and refines them by word frequency, and a model that extracts nouns from LDA topics and refines them by word frequency. The task was to predict the evaluation criteria of 10 low-level categories with the suggested model and the three models above. The criteria words extracted by each model were combined into a single word set, which was used for survey questionnaires. In the survey, respondents chose every item they considered an appropriate criterion for each category. Each model scored when the chosen words had been extracted by that model. The suggested model had higher scores than the other models in 8 out of 10 low-level categories. Paired t-tests on the scores of each model confirmed that the suggested model performs better in 26 tests out of 30. In addition, the suggested model was the best model in terms of accuracy. This research proposes an evaluation criteria extraction method that combines topic extraction using LDA with refinement by the k-nearest neighbor approach. The method overcomes the limits of previous dictionary-based models and frequency-based refinement models. This study can contribute to improving review analysis for deriving business insights in the e-commerce market.
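
The refinement step in this abstract (classify each candidate word by the tags of its nearest tagged neighbours in embedding space) can be sketched with a plain k-nearest-neighbor vote. This is an illustration under assumptions rather than the authors' model: the two-dimensional vectors, the words in the comments, the labels `"criterion"` and `"not_criterion"`, and the function name `knn_label` are all hypothetical, and the LDA and reference-proportion steps are left out.

```python
import math
from collections import Counter

def knn_label(query, labeled_vectors, k=3):
    """Label `query` by majority vote among its k nearest labeled vectors.

    `labeled_vectors` is a list of (vector, label) pairs; distance is Euclidean.
    """
    by_distance = sorted(
        (math.dist(query, vector), label) for vector, label in labeled_vectors
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical tagged words from the preparation phase, as toy 2-D embeddings.
TAGGED_WORDS = [
    ((1.0, 0.0), "criterion"),      # e.g. "battery"
    ((0.9, 0.1), "criterion"),      # e.g. "screen"
    ((0.8, 0.2), "criterion"),      # e.g. "weight"
    ((0.0, 1.0), "not_criterion"),  # e.g. "yesterday"
    ((0.1, 0.9), "not_criterion"),  # e.g. "friend"
]

print(knn_label((0.85, 0.15), TAGGED_WORDS))
print(knn_label((0.05, 0.95), TAGGED_WORDS))
```

An odd `k` with two labels avoids tied votes; with real embeddings, cosine distance is a common alternative to the Euclidean distance used here.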

A Study on Chinese Traditional Auspicious Fish Pattern Application in Corporate Identity Design (중국 전통 길상 어(魚)문양을 응용한 중국 기업의 아이덴티티 디자인 동향)

  • ZHANG, JINGQIU
    • Cartoon and Animation Studies, s.50, pp.349-382, 2018
  • China is a great civilization formed by the combination of various ethnic groups over a long history. As an important component of its traditional culture, the auspicious pattern has passed through the ideological upheavals of Chinese history, and it has become an important element that stirs the emotions of the Chinese nation. Over that long history, the auspicious pattern has become the basis of a rich traditional culture, and it can even be called the centre of this traditional cultural resource. The auspicious pattern is a way of expressing Chinese history and national emotion. It is an important part of people's living habits, emotions, and cultural background. It also carries the belief value of surname totems and serves to pass on information: the auspicious pattern became a symbol of information that indicates its conceptual content. There are many kinds of auspicious patterns, and studying all of them in depth has its limits, so this research examines the auspicious pattern of the fish. The fish pattern is the earliest auspicious pattern created by the Chinese nation, dating back about 6,000 years. Its special shape and auspicious meaning embody the inherent culture and connotation of the Chinese nation, and it is an important component of Chinese traditional culture. The traditional fish pattern has been treated mainly in terms of historical continuity, pattern recognition, and the like, and its meaning has seldom been carried into modern design. Therefore, by examining the auspicious meaning and forms of the fish pattern, this research aims to analyze the value of the fish pattern in modern corporate identity design. The research method traces the historical development, evolution, and auspicious meaning of the traditional fish pattern to analyze its symbolic and cultural meaning at the levels of nation, culture, art, and life. It also uses numerous real examples of corporate identity designs based on the traditional fish pattern to analyze how the pattern works positively for enterprises whose good image rests on trust. In modern Chinese corporate identity design, the auspicious image is reinterpreted in a modern way. This is demonstrated through consumers' national perceptual knowledge and through the way the pattern enlarges the goodwill of the corporate image. The conclusion is that the traditional fish pattern is an important core of modern design. This research therefore uses cases of corporate image design based on the traditional fish pattern to analyze how most Chinese people view traditional auspiciousness and how it influences corporations built on trust and credibility. In the modern image design of Chinese corporations, the auspicious sign reappears. A questionnaire survey was conducted on consumers' perceptual knowledge and their cognition of the enterprise image. From the results, one can infer an improvement in consumer recognition and the possibility of developing the traditional concept.

A Sasang Theoretical Study about the Morph & Image of Sasang Constitutional Medicine (사상의학(四象醫學) 형상관(形象觀)에 대한 사심신물적(事心身物的) 고찰(考察))

  • Kim, Jeong-ho;Song, Jeong-mo
    • Journal of Sasang Constitutional Medicine, v.11 no.1, pp.295-310, 1999
  • Nowadays there are many attempts and approaches in the study of Oriental Medicine. The Morph & Image is one of them, and its importance is steadily increasing. Likewise, in Sasang Constitutional Medicine, the Morph & Image is an important part, and it is presented in 《Dorgyi Soose Bowon (東醫壽世保元)》. But that discourse shows only the concept and conclusion of Morph & Image, based on the classification of Sasang Constitution, without explaining how it is derived. So the author studied the basic theory parts of 《Dorgyi Soose Bowon》 -those are the , , , and - to find the mechanism of the Morph & Image concept in Sasang Constitutional Medicine. The results were as follows. 1. Every part of the human body that can be considered as Morph & Image in 《Dorgyi Soose Bowon》 can be explained in line with the Sasang theory. Morph & Image in 《Dorgyi Soose Bowon》 covers not only the shape itself but also image, operation, mind condition, nature, emotion, and so on. 2. Traditional Oriental Medicine categorizes Morph & Image by the Five elements (五行) and uses it for Oriental medical diagnosis, whereas in the Sasang Constitution, Morph & Image is used for Sasang Constitutional classification. 3. The Morph & Image in Sasang can be classified into four groups: the Affairs (事) group (ears, eyes, nose, mouth (耳目鼻口), and so on), the Object (物) group (lung, spleen, liver, kidney (肺脾肝腎), and so on), the Mind (心) group (jaw, chest, navel, abdomen, and so on), and the Body (身) group (head, shoulders, waist, hips (頭肩腰臀), and so on). The Affairs and Object groups reflect the congenital conditions of the Sasang-classified human body, and the Mind and Body groups reflect mind state, nature, emotion, etc.

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems, v.24 no.2, pp.125-148, 2018
  • Recently, as the demand for big data analysis has grown, cases of analyzing unstructured data and using the results have also increased. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. Many analysts are also interested in text because the amount of data is very large and it is relatively easy to collect compared with other unstructured and structured data. Among the various text analysis applications, the following have been actively studied: document classification, which classifies documents into predetermined categories; topic modeling, which extracts major topics from a large number of documents; sentiment analysis or opinion mining, which identifies emotions or opinions contained in texts; and text summarization, which summarizes the main contents of one or several documents. Text summarization in particular is actively applied in business through news summary services, privacy policy summary services, etc. In academia, much research has followed both the extraction approach, which selectively provides the main elements of the document, and the abstraction approach, which extracts elements of the document and composes new sentences by combining them. However, techniques for evaluating the quality of automatically summarized documents have not progressed as much as techniques for automatic summarization itself. Most existing studies on the quality evaluation of summarization manually summarized documents, used these summaries as reference documents, and measured the similarity between the automatic summary and the reference document. Specifically, an automatic summary is produced from the full text by various techniques, and its quality is measured by comparing it with the reference document, which serves as an ideal summary.
Reference documents are provided in two major ways; the most common is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention, writing the summary takes a lot of time and cost, and the evaluation result may differ depending on the subjectivity of the summarizer. To overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. As a representative attempt, a method has recently been devised that reduces the size of the full text and measures the similarity between the reduced full text and the automatic summary. In this method, the more of the frequent terms in the full text appear in the summary, the better the quality of the summary is judged to be. However, since summarization essentially means reducing a large amount of content while minimizing omissions, it is unreasonable to say that a "good summary" based only on frequency is always a good summary in this essential sense. To overcome the limitations of these previous studies, this study proposes an automatic quality evaluation method for text summarization based on the essential meaning of summarization. Specifically, succinctness is defined as an element indicating how little content is duplicated among the sentences of the summary, and completeness is defined as an element indicating how little of the content is left out of the summary. In this paper, we propose a method for automatic quality evaluation of text summarization based on these two concepts.
To evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor's hotel reviews, the reviews were summarized for each hotel, and the quality of the summaries was evaluated according to the proposed methodology. The paper also provides a way to integrate completeness and succinctness, which are in a trade-off relationship, into an F-score, and proposes a method for performing optimal summarization by changing the threshold of sentence similarity.
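
The abstract above does not give formulas, so the following is only a minimal sketch of how completeness, succinctness, and their F-score might be computed. Everything concrete here is an assumption for illustration: the whitespace tokenizer, the use of Jaccard similarity between sentences, the helper names (`tokens`, `jaccard`, `completeness`, `succinctness`, `f_score`), and the example threshold of 0.5. It is not the authors' implementation.

```python
from typing import List, Set


def tokens(sentence: str) -> Set[str]:
    """Lowercased word set of a sentence (a deliberately crude tokenizer)."""
    return set(sentence.lower().split())


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two token sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def completeness(source: List[str], summary: List[str], threshold: float) -> float:
    """Assumed definition: share of source sentences that are matched by
    at least one summary sentence at or above the similarity threshold."""
    if not source:
        return 0.0
    summary_sets = [tokens(s) for s in summary]
    covered = sum(
        1 for s in source
        if any(jaccard(tokens(s), t) >= threshold for t in summary_sets)
    )
    return covered / len(source)


def succinctness(summary: List[str], threshold: float) -> float:
    """Assumed definition: share of summary sentences that do not
    duplicate an earlier summary sentence."""
    if not summary:
        return 0.0
    sets = [tokens(s) for s in summary]
    unique = sum(
        1 for i, t in enumerate(sets)
        if not any(jaccard(t, sets[j]) >= threshold for j in range(i))
    )
    return unique / len(summary)


def f_score(c: float, s: float) -> float:
    """Harmonic mean of completeness and succinctness."""
    return 2 * c * s / (c + s) if (c + s) > 0 else 0.0


# Made-up example sentences, not data from the study.
source = ["the room was clean", "the staff was friendly", "breakfast was great"]
summary = ["the room was clean", "the room was clean", "the staff was friendly"]
c = completeness(source, summary, threshold=0.5)
s = succinctness(summary, threshold=0.5)
print(c, s, f_score(c, s))
```

In this sketch the trade-off is visible directly: adding sentences to the summary can only raise completeness but risks lowering succinctness, and the similarity threshold controls how strictly "covered" and "duplicated" are judged.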

The Magnetic Relaxation Properties of DTPA-bis(4-carboxycyclohexyl) amide Paramagnetic Gd-chelates (DTPA-bis(4-carboxycyclohexyl)amide 상자성 복합체의 자기이완특성에 관한 연구)

  • Kim, In-Sung;Lee, Young-Ju;Lee, Jae-Jun;Kim, Ju-Hyun;Kim, Yoo-Kyung;Sujit, Dutta;Kim, Suk-Kyung;Kim, Tae-Jeong;Kang, Duk-Sik;Chang, Yong-Min
    • Investigative Magnetic Resonance Imaging
    • /
    • v.10 no.1
    • /
    • pp.20-25
    • /
    • 2006
  • Purpose: To evaluate the NMR relaxation properties of newly developed high-performance paramagnetic complexes. Materials and methods: 4-aminomethylcyclohexane carboxylic acid (0.63 g, 4 mmol) was mixed with a suspension of DTPA-bis-anhydride (0.71 g, 2 mmol) in DMF (15 mL) to synthesize the ligand. The ligand was then mixed with Gd2O3 (0.18 g, 0.5 mmol) to synthesize the Gd-chelate. To measure the magnetic relaxivity of the paramagnetic compounds, the compounds were diluted to 1 mM and the relaxation times were measured at 1.5 T (64 MHz). An inversion-recovery pulse sequence was used for T1 relaxation measurement and a CPMG (Carr-Purcell-Meiboom-Gill) pulse sequence for T2 relaxation measurement. Using MATLAB (Version 7.1), T1 and T2 magnetic relaxation maps and R1 and R2 maps were generated to represent magnetic relaxation time and magnetic relaxivity as images. Results: Compared with $R1=4.9mM^{-1}sec^{-1}$ and $R2=4.8mM^{-1}sec^{-1}$ for Omniscan (Gadodiamide), a commercially available paramagnetic MR agent, R1 of SUK090 (Gd-C32H74N5O24) was $12.46mM^{-1}sec^{-1}$ and R1 of SUK091 (Gd-C34H78N5O24) was $12.77mM^{-1}sec^{-1}$. However, R1 of SUK092 (Gd-C30H56N5O17) decreased to $2.09mM^{-1}sec^{-1}$. For R2, SUK090 (Gd-C32H74N5O24) was $8.76mM^{-1}sec^{-1}$ and SUK091 (Gd-C34H78N5O24) was $7.60mM^{-1}sec^{-1}$, whereas SUK092 (Gd-C30H56N5O17) decreased to $1.82mM^{-1}sec^{-1}$. Conclusion: Among the three new paramagnetic complexes, SUK090 (Gd-C32H74N5O24) and SUK091 (Gd-C34H78N5O24) showed higher T1 and T2 magnetic relaxation rates than the commercially available paramagnetic MR agent and are thus expected to provide greater contrast enhancement.
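
As a rough illustration of how a relaxation time measured at a known concentration can be turned into a relaxivity in $mM^{-1}sec^{-1}$, the sketch below fits T2 from a CPMG-style echo train and then normalizes by concentration. It rests on assumptions that the abstract does not state: a mono-exponential decay $S(TE)=S_0 e^{-TE/T2}$, a log-linear least-squares fit, and subtraction of a solvent baseline rate. The function names `fit_t2` and `relaxivity` and all numbers are hypothetical; the study itself used MATLAB, and this Python sketch is not its code.

```python
import math
from typing import Sequence


def fit_t2(echo_times_s: Sequence[float], signals: Sequence[float]) -> float:
    """Estimate T2 (seconds) by least-squares fitting ln(S) against TE,
    assuming S(TE) = S0 * exp(-TE / T2) with strictly positive signals."""
    n = len(echo_times_s)
    logs = [math.log(s) for s in signals]
    mean_t = sum(echo_times_s) / n
    mean_l = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in echo_times_s)
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(echo_times_s, logs))
    slope = sxy / sxx          # slope of ln(S) vs TE equals -1/T2
    return -1.0 / slope


def relaxivity(t_obs_s: float, t_solvent_s: float, conc_mM: float) -> float:
    """Relaxivity in mM^-1 s^-1: (1/T_obs - 1/T_solvent) / concentration.
    The solvent baseline subtraction is an assumption of this sketch."""
    return (1.0 / t_obs_s - 1.0 / t_solvent_s) / conc_mM


# Synthetic echo train generated with T2 = 0.1 s (not measured data).
echo_times = [0.01, 0.02, 0.04, 0.08]
signals = [100.0 * math.exp(-te / 0.1) for te in echo_times]
t2 = fit_t2(echo_times, signals)

# Hypothetical solvent T2 of 2.0 s and a 1 mM sample.
print(t2, relaxivity(t2, t_solvent_s=2.0, conc_mM=1.0))
```

The same `relaxivity` helper applies to T1 once a T1 estimate is available; fitting the inversion-recovery curve is a nonlinear problem and is left out of this sketch.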
