• Title/Summary/Keyword: Text Semantics

Search Result 51, Processing Time 0.026 seconds

Automatic Compiler Generator for Visual Languages using Semantic Actions based on Classes (클래스 기반의 의미수행코드 명세를 이용한 시각언어 컴파일러 자동 생성)

  • 김경아
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.6
    • /
    • pp.1088-1099
    • /
    • 2003
  • The syntax-directed translation using semantic actions is frequently used in construction of compiler for text programming languages. it is very useful for the language designers to develop compiler back-end using a syntax structure of a source programming language. Due to the lack of the integrated representation method for a parse tree node and modeling method of syntax structures, it is very hard to construct compiler using syntax-directed translation in visual languages. In this Paper, we propose a visual language compiler generation method for constructing a visual languages compiler automatically, using syntax-directed translation. Our method uses the Picture Layout Grammar as a underlying grammar formalism. This grammar allows our approach to generate parser efficiently u sing And-Or-Waiting Graph and encapsulating syntax definition as one unit. Unlike other systems, we suggest separating the specification and the generation of semantic actions. Because of this, it provides a very efficient method for modification.

  • PDF

A Design on Informal Big Data Topic Extraction System Based on Spark Framework (Spark 프레임워크 기반 비정형 빅데이터 토픽 추출 시스템 설계)

  • Park, Kiejin
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.11
    • /
    • pp.521-526
    • /
    • 2016
  • As on-line informal text data have massive in its volume and have unstructured characteristics in nature, there are limitations in applying traditional relational data model technologies for data storage and data analysis jobs. Moreover, using dynamically generating massive social data, social user's real-time reaction analysis tasks is hard to accomplish. In the paper, to capture easily the semantics of massive and informal on-line documents with unsupervised learning mechanism, we design and implement automatic topic extraction systems according to the mass of the words that consists a document. The input data set to the proposed system are generated first, using N-gram algorithm to build multiple words to capture the meaning of the sentences precisely, and Hadoop and Spark (In-memory distributed computing framework) are adopted to run topic model. In the experiment phases, TB level input data are processed for data preprocessing and proposed topic extraction steps are applied. We conclude that the proposed system shows good performance in extracting meaningful topics in time as the intermediate results come from main memories directly instead of an HDD reading.

King Sejo's Establishment of the Thirteen-story Stone Pagoda of Wongaksa Temple and Its Semantics (세조의 원각사13층석탑 건립과 그 의미체계)

  • Nam, Dongsin
    • MISULJARYO - National Museum of Korea Art Journal
    • /
    • v.101
    • /
    • pp.12-46
    • /
    • 2022
  • Completed in 1467, the Thirteen-story Stone Pagoda of Wongaksa Temple is the last Buddhist pagoda erected at the center of the capital (present-day Seoul) of the Joseon Dynasty. It was commissioned by King Sejo, the final Korean king to favor Buddhism. In this paper, I aim to examine King Sejo's intentions behind celebrating the tenth anniversary of his enthronement with the construction of the thirteen-story stone pagoda in the central area of the capital and the enshrinement of sarira from Shakyamuni Buddha and the Newly Translated Sutra of Perfect Enlightenment (圓覺經). This paper provides a summary of this examination and suggests future research directions. The second chapter of the paper discusses the scriptural background for thirteen-story stone pagodas from multiple perspectives. I was the first to specify the Latter Part of the Nirvana Sutra (大般涅槃經後分) as the most direct and fundamental scripture for the erection of a thirteen-story stone pagoda. I also found that this sutra was translated in Central Java in the latter half of the seventh century and was then circulated in East Asia. Moreover, I focused on the so-called Kanishka-style stupa as the origin of thirteen-story stone pagodas and provided an overview of thirteen-story stone pagodas built around East Asia, including in Korea. In addition, by consulting Buddhist references, I prove that the thirteen stories symbolize the stages of the practice of asceticism towards enlightenment. In this regard, the number thirteen can be viewed as a special and sacred number to Buddhist devotees. The third chapter explores the Buddhist background of King Sejo's establishment of the Thirteen-story Stone Pagoda of Wongaksa Temple. I studied both the Dictionary of Sanskrit-Chinese Translation of Buddhist Terms (翻譯名義集) (which King Sejo personally purchased in China and published for the first time in Korea) and the Sutra of Perfect Enlightenment. King Sejo involved himself in the first translation of the Sutra of Perfect Enlightenment into Korean. The Dictionary of Sanskrit-Chinese Translation of Buddhist Terms was published in the fourteenth century as a type of Buddhist glossary. King Sejo is presumed to have been introduced to the Latter Part of the Nirvana Sutra, the fundamental scripture regarding thirteen-story pagodas, through the Dictionary of Sanskrit-Chinese Translation of Buddhist Terms, when he was set to erect a pagoda at Wongaksa Temple. King Sejo also enshrined the Newly Translated Sutra of Perfect Enlightenment inside the Wongaksa pagoda as a scripture representing the entire Tripitaka. This enshrined sutra appears to be the vernacular version for which King Sejo participated in the first Korean translation. Furthermore, I assert that the original text of the vernacular version is the Abridged Commentary on the Sutra of Perfect Enlightenment (圓覺經略疏) by Zongmi (宗密, 780-841), different from what has been previously believed. The final chapter of the paper elucidates the political semantics of the establishment of the Wongaksa pagoda by comparing and examining stone pagodas erected at neungsa (陵寺) or jinjeonsawon (眞殿寺院), which were types of temples built to protect the tombs of royal family members near their tombs during the early Joseon period. These stone pagodas include the Thirteen-story Pagoda of Gyeongcheonsa Temple, the Stone Pagoda of Gaegyeongsa Temple, the Stone Pagoda of Yeongyeongsa Temple, and the Multi-story Stone Pagoda of Silleuksa Temple. The comparative analysis of these stone pagodas reveals that King Sejo established the Thirteen-story Stone Pagoda at Wongaksa Temple as a political emblem to legitimize his succession to the throne. In this paper, I attempt to better understand the scriptural and political semantics of the Wongaksa pagoda as a thirteen-story pagoda. By providing a Korean case study, this attempt will contribute to the understanding of Buddhist pagoda culture that reached its peak during the late Goryeo and early Joseon periods. It also contributes to the research on thirteen-story pagodas in East Asia that originated with Kanishka stupa and were based on the Latter Part of the Nirvana Sutra.

Public Perception and Usage Pattern of Science Museum by Social Media Big Data Analysis (소셜 빅데이터 분석을 통해 알아본 대중의 과학관에 대한 인식 및 사용 행태)

  • Yun, Eunjeong;Park, Yunebae
    • Journal of The Korean Association For Science Education
    • /
    • v.37 no.6
    • /
    • pp.1005-1014
    • /
    • 2017
  • Focusing on the role of the science museum as an institution to improve the scientific literacy of the public, this study investigated public perception and behavior about science museum to know how much science museums affect the public by using social media big data analysis. For this purpose, we extracted texts containing 'science museum' in Naver blogs and Twitter, analyzed them by using network, frequency, co-ocurrence, and semantics analysis and compared them with the results in English speaking countries. As a result, blogs were mainly concerned with science museum among parents who have young children, while in Twitter posts from many students who visited as a group appeared. Therefore, the Korean public used science museum mainly as a space for children's experience, and in this case, programs and exhibitions of science museums are perceived positively. On the other hand, students who visited as a group showed some negative emotions. The result of comparison with the cases of foreign countries in terms of the function of the third generation science museum such as communications with the science museum and the public and the participation of the public in science, the Korean public hardly mentioned the scientific contents, words related to communications such as 'argue', and curators or staff after visiting the science museum. In contrast to many verbs related to meaningful activities such as 'learn', 'participate', 'listen', 'read', 'ask', 'think' appeared in English, only a small number of verbs include 'ask' and 'thin' appeared in Korean. Therefore, science museum need to improve impression, communicating with public, and involving activity with impact and variety after visit.

A Study on the Contemporary Definition of 'GARDEN' - Keyword Analysis used Literature Research and Big Data - ('정원'의 시대적 정의에 관한 연구 - 문헌연구와 빅데이터를 활용한 키워드 분석을 중심으로-)

  • Woo, Kyungsook;Suh, Joo Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.44 no.5
    • /
    • pp.1-11
    • /
    • 2016
  • There has been an increasingly high interest in gardens and garden design in Korea recently. However, the usage of the term 'garden' is extremely varied and complex, and there has been very little academic research made on the meaning of garden. Therefore, this research attempts to investigate the ideas of current gardens and to elucidate their changing patterns by means of extensive literature research and big data analysis. The notion of garden in the past was broad including not only private space such as Madang(마당) and Teul(뜰), but also even field and grass land as public outdoor space. Yet, the meaning has become smaller to merely private space due to the change of dwelling systems due to high industrial development of the 20th century. Furthermore, the introduction of urban parks as an interactive space between nature and humans, the similar spatial function of gardens, has blurred the boundary between garden and park, which created confusion in understanding the concept of a garden. After all, garden is a subject for humans. The meanings of garden need to be recognized from various points of view since garden itself is a creation by the sum of diverse fields such as natural and social sciences as well as culturology. This discussion on the meaning of garden in the present day will give a conceptual foundation for future research on gardens and garden design. Also, the big data analysis employed here as a research method can help other similar research topics, particularly semantics in landscape architecture.

Maritime Safety Tribunal Ruling Analysis using SentenceBERT (SentenceBERT 모델을 활용한 해양안전심판 재결서 분석 방법에 대한 연구)

  • Bori Yoon;SeKil Park;Hyerim Bae;Sunghyun Sim
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.29 no.7
    • /
    • pp.843-856
    • /
    • 2023
  • The global surge in maritime traffic has resulted in an increased number of ship collisions, leading to significant economic, environmental, physical, and human damage. The causes of these maritime accidents are multifaceted, often arising from a combination of crew judgment errors, negligence, complexity of navigation routes, weather conditions, and technical deficiencies in the vessels. Given the intricate nuances and contextual information inherent in each incident, a methodology capable of deeply understanding the semantics and context of sentences is imperative. Accordingly, this study utilized the SentenceBERT model to analyze maritime safety tribunal decisions over the last 20 years in the Busan Sea area, which encapsulated data on ship collision incidents. The analysis revealed important keywords potentially responsible for these incidents. Cluster analysis based on the frequency of specific keyword appearances was conducted and visualized. This information can serve as foundational data for the preemptive identification of accident causes and the development of strategies for collision prevention and response.

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.

Latent topics-based product reputation mining (잠재 토픽 기반의 제품 평판 마이닝)

  • Park, Sang-Min;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.39-70
    • /
    • 2017
  • Data-drive analytics techniques have been recently applied to public surveys. Instead of simply gathering survey results or expert opinions to research the preference for a recently launched product, enterprises need a way to collect and analyze various types of online data and then accurately figure out customer preferences. In the main concept of existing data-based survey methods, the sentiment lexicon for a particular domain is first constructed by domain experts who usually judge the positive, neutral, or negative meanings of the frequently used words from the collected text documents. In order to research the preference for a particular product, the existing approach collects (1) review posts, which are related to the product, from several product review web sites; (2) extracts sentences (or phrases) in the collection after the pre-processing step such as stemming and removal of stop words is performed; (3) classifies the polarity (either positive or negative sense) of each sentence (or phrase) based on the sentiment lexicon; and (4) estimates the positive and negative ratios of the product by dividing the total numbers of the positive and negative sentences (or phrases) by the total number of the sentences (or phrases) in the collection. Furthermore, the existing approach automatically finds important sentences (or phrases) including the positive and negative meaning to/against the product. As a motivated example, given a product like Sonata made by Hyundai Motors, customers often want to see the summary note including what positive points are in the 'car design' aspect as well as what negative points are in thesame aspect. They also want to gain more useful information regarding other aspects such as 'car quality', 'car performance', and 'car service.' Such an information will enable customers to make good choice when they attempt to purchase brand-new vehicles. In addition, automobile makers will be able to figure out the preference and positive/negative points for new models on market. In the near future, the weak points of the models will be improved by the sentiment analysis. For this, the existing approach computes the sentiment score of each sentence (or phrase) and then selects top-k sentences (or phrases) with the highest positive and negative scores. However, the existing approach has several shortcomings and is limited to apply to real applications. The main disadvantages of the existing approach is as follows: (1) The main aspects (e.g., car design, quality, performance, and service) to a product (e.g., Hyundai Sonata) are not considered. Through the sentiment analysis without considering aspects, as a result, the summary note including the positive and negative ratios of the product and top-k sentences (or phrases) with the highest sentiment scores in the entire corpus is just reported to customers and car makers. This approach is not enough and main aspects of the target product need to be considered in the sentiment analysis. (2) In general, since the same word has different meanings across different domains, the sentiment lexicon which is proper to each domain needs to be constructed. The efficient way to construct the sentiment lexicon per domain is required because the sentiment lexicon construction is labor intensive and time consuming. To address the above problems, in this article, we propose a novel product reputation mining algorithm that (1) extracts topics hidden in review documents written by customers; (2) mines main aspects based on the extracted topics; (3) measures the positive and negative ratios of the product using the aspects; and (4) presents the digest in which a few important sentences with the positive and negative meanings are listed in each aspect. Unlike the existing approach, using hidden topics makes experts construct the sentimental lexicon easily and quickly. Furthermore, reinforcing topic semantics, we can improve the accuracy of the product reputation mining algorithms more largely than that of the existing approach. In the experiments, we collected large review documents to the domestic vehicles such as K5, SM5, and Avante; measured the positive and negative ratios of the three cars; showed top-k positive and negative summaries per aspect; and conducted statistical analysis. Our experimental results clearly show the effectiveness of the proposed method, compared with the existing method.

A Destructive Method in the Connection of the Algorithm and Design in the Digital media - Centered on the Rapid Prototyping Systems of Product Design - (디지털미디어 환경(環境)에서 디자인 특성(特性)에 관한 연구(硏究) - 실내제품(室內製品) 디자인을 중심으로 -)

  • Kim Seok-Hwa
    • Journal of Science of Art and Design
    • /
    • v.5
    • /
    • pp.87-129
    • /
    • 2003
  • The purpose of this thesis is to propose a new concept of design of the 21st century, on the basis of the study on the general signification of the structures and the signs of industrial product design, by examining the difference between modern and post-modern design, which is expected to lead the users to different design practice and interpretation of it. The starting point of this study is the different styles and patterns of 'Gestalt' in the post-modern design of the late 20th century from modern design - the factor of determination in industrial product design. That is to say, unlike functional and rational styles of modern product design, the late 20th century is based upon the pluralism characterized by complexity, synthetic and decorativeness. So far, most of the previous studies on design seem to have excluded visual aspects and usability, focused only on effective communication of design phenomena. These partial studies on design, blinded by phenomenal aspects, have resulted in failure to discover a principle of fundamental system. However, design varies according to the times; and the transformation of design is reflected in Design Pragnanz to constitute a new text of design. Therefore, it can be argued that Design Pragnanz serves as an essential factor under influence of the significance of text. In this thesis, therefore, I delve into analysis of the 20th century product design, in the light of Gestalt theory and Design Pragnanz, which have been functioning as the principle of the past design. For this study, I attempted to discover the fundamental elements in modern and post-modern designs, and to examine the formal structure of product design, the users' aesthetic preference and its semantics, from the integrative viewpoint. Also, with reference to history and theory of design my emphasis is more on fundamental visual phenomena than on structural analysis or process of visualization in product design, in order to examine the formal properties of modern and post-modern designs. Firstly, In Chapter 1, 'Issues and Background of the Study', I investigated the Gestalt theory and Design Pragnanz, on the premise of formal distinction between modern and post-modern designs. These theories are founded upon the discussion on visual perception of Gestalt in Germany in 1910's, in pursuit of the principle of perception centered around visual perception of human beings. In Chapter 2, I dealt with functionalism of modern design, as an advance preparation for the further study on the product design of the late 20th century. First of all, in Chapter 2-1, I examined the tendency of modern design focused on functionalism, which can be exemplified by the famous statement 'Form follows function'. Excluding all unessential elements in design - for example, decoration, this tendency has attained the position of the international style based on the spirit of Bauhause - universality and regularity - in search of geometric order, standardization and rationalization. In Chapter 2-2, I investigated the anthropological viewpoint that modern design started representing culture in a symbolic way including overall aspects of the society - politics, economics and ethics, and its criticism on functionalist design that aesthetic value is missing in exchange of excessive simplicity in style. Moreover, I examined the pluralist phenomena in post-modern design such as kitsch, eclecticism, reactionism, hi-tech and digital design, breaking away from functionalist purism of modern design. In Chapter 3, I analyzed Gestalt Pragnanz in design in a practical way, against the background of design trends. To begin with, I selected mass product design among those for the 20th century products as a target of analysis, highlighting representative styles in each category of the products. For this analysis, I adopted the theory of J. M Lehnhardt, who gradated in percentage the aesthetic and semantic levels of Pragnantz in design expression, and that of J. K. Grutter, who expressed it in a formula of M = O : C. I also employed eight units of dichotomies, according to the G. D. Birkhoff's aesthetic criteria, for the purpose of scientific classification of the degree of order and complexity in design; and I analyzed phenomenal aspects of design form represented in each unit. For Chapter 4, I executed a questionnaire about semiological phenomena of Design Pragnanz with 28 units of antonymous adjectives, based upon the research in the previous chapter. Then, I analyzed the process of signification of Design Pragnanz, founded on this research. Furthermore, the interpretation of the analysis served as an explanation to preference, through systematic analysis of Gestalt and Design Pragnanz in product design of the late 20th century. In Chapter 5, I determined the position of Design Pragnanz by integrating the analyses of Gestalt and Pragnanz in modern and post-modern designs In this process, 1 revealed the difference of each Design Pragnanz in formal respect, in order to suggest a vision of the future as a result, which will provide systemic and structural stimulation to current design.

  • PDF

A Study on the Rhythm of Sijo Using Prosodie Analysis - Centering on < Ouga > by Seon-do Yun - (프로조디(prosodie) 분석을 통한 시조의 가락 고찰 시론(試論) - 윤선도(尹善道)의 <오우가(五友歌)>를 대상으로 -)

  • Kim, Seong-Moon
    • Sijohaknonchong
    • /
    • v.43
    • /
    • pp.41-66
    • /
    • 2015
  • A study on rhythm of a sijo was mostly conducted based on rhythm theory. As it is considered to define the rhythm of a formal sijo based on three verses, its significance has been recognized. However, if rhythm is understood to be superior to cadence or versification, it seems necessary to examine the rhythm of a sijo as a verse with a fixed form as well as a highly individual rhythm of each and every lyric poet, which is informal rhythm, in order to fully understand them. In this case, prosodie analysis by H. Meschonnic (1932~ 2009) can be a significant methodology. As this study gropes for a possibility to examine the rhythm of a sijo from a new perspective instead of existing rhythm theory through the application of H. Meschonnic's prosodie analysis, it can be regarded as an essay. Prosodie newly suggested by Meschonnic is referred to as linguistic organization of consonants and vowels and indication of their paradigm, and it conflicts the perspective that traditionally separates linguistic sound from meaning for dichotomous understanding. It is due to the fact that the organization of consonants and vowels is a unit that constitutes a complicated layer of significant sound and meaning. Accordingly, prosodie analysis that is irregularly and aperiodically distributed within poetic text can be considered as methodology aimed at explaining how a poem is integrated in terms of sound and semantics. The core of prosodie analysis is to examine how the phonologic system stands against the theme of a poem. It ultimately has the same way of establishing literary style of a poet as it is to explain a unique aesthetic structure that individual poems have and show distinct characteristics of linguistic use by a poet. Prior to application of the prosodie analysis to sijo in general, the study preparatorily conducted prosodie analysis on < Ouga > by Gosan Seon-do Yun.

  • PDF