• Title/Summary/Keyword: Public Big data

Search Result 697, Processing Time 0.032 seconds

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.

Analysis of Behavior of Seoullo 7017 Visitors - With a Focus on Text Mining and Social Network Analysis - (서울로 7017 방문자들의 이용행태 분석 -텍스트 마이닝과 소셜 네트워크 분석을 중심으로-)

  • Woo, Kyung-Sook;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.6
    • /
    • pp.16-24
    • /
    • 2020
  • The purpose of this study is to analyze the usage behavior of Seoullo 7017, the first public garden in Korea, to understand the usage status by analyzing blogs, and to present usage behavior and improvement plans for Seoullo 7017. From June 2017 to May 2020, after Seoullo 7017 was open to citizens, character data containing 'Seoullo 7017' in the title and contents of NAVER and·DAUM blogs were converted to text mining and socialization, a Big Data technique. The analysis was conducted using social network analysis. The summary of the research results is as follows. First of all, the ratio of men and women searching for Seoullo 7017 online is similar, and the regions that searched most are in the order of Seoul and Gyeonggi, and those in their 40s and 50s were the most interested. In other words, it can be seen that there is a lack of interest in regions other than Seoul and Gyeonggi and among those in their 10s, 20s, and 30s. The main behaviors of Seoullo 7017 are' night view' and 'walking', and the factors that affect culture and art are elements related to culture and art. If various programs and festivals are opened and actively promoted, the main behavior will be more varied. On the other hand, the main behavior that the users of Seoullo 7017 want is 'sit', which is a static behavior, but the physical conditions are not sufficient for the behavior to occur. Therefore, facilities that can cause sitting behavior, such as shades and benches must be improved to meet the needs of visitors. The peculiarity of the change in the behavior of Seoullo 7017 is that it is recognized as a good place to travel alone and a good place to walk alone as a public multi-use facility and group activities are restricted due to COVID-19. Accordingly, in a situation like the COVD-19 pandemic, more diverse behaviors can be derived in facilities where people can take a walk, etc., and the increase of various attractions and the satisfaction of users can be increased. Seoullo 7017, as Korea's first public pedestrian area, was created for urban regeneration and the efficient use of urban resources in areas beyond the meaning of public spaces and is a place with various values such as history, nature, welfare, culture, and tourism. However, as a result of the use behavior analysis, various behaviors did not occur in Seoullo 7017 as expected, and elements that hinder those major behaviors were derived. Based on these research results, it is necessary to understand the usage behavior of Seoullo 7017 and to establish a plan for spatial system and facility improvement, so that Seoullo 7017 can be an important place for urban residents and a driving force to revitalize the city.

Exploring the 4th Industrial Revolution Technology from the Landscape Industry Perspective (조경산업 관점에서 4차 산업혁명 기술의 탐색)

  • Choi, Ja-Ho;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.47 no.2
    • /
    • pp.59-75
    • /
    • 2019
  • This study was carried out to explore the 4th Industrial Revolution technology from the perspective of the landscape industry to provide the basic data necessary to increase the virtuous circle value. The 4th Industrial Revolution, the characteristics of the landscape industry and urban regeneration were considered and the methodology was established and studied including the technical classification system suitable for systematic research, which was selected as a framework. First, the 4th Industrial Revolution technology based on digital data was selected, which could be utilized to increase the value of the virtuous circle for the landscape industry. From 'Element Technology Level', and 'Core Technology' such as the Internet of Things, Cloud Computing, Big Data, Artificial Intelligence, Robot, 'Peripheral Technology', Virtual or Augmented Reality, Drones, 3D 4D Printing, and 3D Scanning were highlighted as the 4th Industrial Revolution technology. It has been shown that it is possible to increase the value of the virtuous circle when applied at the 'Trend Level', in particular to the landscape industry. The 'System Level' was analyzed as a general-purpose technology, and based on the platform, the level of element technology(computers, and smart devices) was systematically interconnected, and illuminated with the 4th Industrial Revolution technology based on digital data. The application of the 'Trend Level' specific to the landscape industry has been shown to be an effective technology for increasing the virtuous circle values. It is possible to realize all synergistic effects and implementation of the proposed method at the trend level applying the element technology level. Smart gardens, smart parks, etc. have been analyzed to the level they should pursue. It was judged that Smart City, Smart Home, Smart Farm, and Precision Agriculture, Smart Tourism, and Smart Health Care could be highly linked through the collaboration among technologies in adjacent areas at the Trend Level. Additionally, various utilization measures of related technology applied at the Trend Level were highlighted in the process of urban regeneration, public service space creation, maintenance, and public service. In other words, with the realization of ubiquitous computing, Hyper-Connectivity, Hyper-Reality, Hyper-Intelligence, and Hyper-Convergence were proposed, reflecting the basic characteristics of digital technology in the landscape industry can be achieved. It was analyzed that the landscaping industry was effectively accommodating and coordinating with the needs of new characters, education and consulting, as well as existing tasks, even when participating in urban regeneration projects. In particular, it has been shown that the overall landscapig area is effective in increasing the virtuous circle value when it systems the related technology at the trend level by linking maintenance with strategic bridgehead. This is because the industrial structure is effective in distributing data and information produced from various channels. Subsequent research, such as demonstrating the fusion of the 4th Industrial Revolution technology based on the use of digital data in creation, maintenance, and service of actual landscape space is necessary.

The Comparative Analysis of Outcomes on Patents and Papers of Railway Research Institutes in Korea, China and Japan (한국, 중국, 일본 철도연구기관 특허 및 논문실적 비교분석)

  • Baek, Sunghyun;Yi, Yoonju
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.6
    • /
    • pp.455-460
    • /
    • 2020
  • The governments of Korea, China, and Japan have operated comprehensive research institutes for railway technologies. Korea Railroad Research Institute (KRRI), China Academy of Railway Sciences Corporation Limited (CARS), and Railway Technical Research Institute (RTRI) are representatives of comprehensive railway research institutes in each country. KRRI was found to be the most advanced in the quantitative competitiveness of patents. In terms of qualitative competitiveness, KRRI has strength in civil engineering, whereas RTRI has strength in electricity. KRRI was found to have the greatest efforts in securing competitiveness in overseas property rights. By comparing the publication of papers, CARS published the most papers. On the other hand, from 2015, KRRI showed an upward trend and published the most papers. By examining the impact of the papers by the citation, KRRI was found to have higher competitiveness than the other two institutions. In the future, it will be necessary to perform big data analysis on patents and papers of the three organizations, derive the key research areas and promising technology areas for each institute, and establish a mid-to-long-term development plan for railway technology based on scientific evidence.

A Study on Mental Health of Single Aged Persons in Home Perceived by Daughter-in-law (재가 독신노인을 부양하는 주부가 인지하는 노인의 정신건강에 관한 연구)

  • Yun Suk-Rye
    • Journal of Korean Public Health Nursing
    • /
    • v.7 no.1
    • /
    • pp.31-48
    • /
    • 1993
  • Nowaday, there have a lot of changes in the demands of the aged persons. Their problems also came to the fore with diverse forms under the influences of industrialization, urbanization and nuclear family. To make the matter worse, the aged population is mounting rapidly. Also, such structure as nuclear family is widely disseminating uncomfortable to the aged. People is mainly being guided by self interest above everything else. Indeed, they had, all together, bad effects on our traditional value system regarding 'respect for the aged and devotion to patients'. It seems unfortunately obvious that the family responsibility is gradually weakening to support the old who is a dependent family. The result is that the aged must have suffered all sorts of hardships in lightenning psychological, physical and economical difficulties. First, to grasp the situations and conditions supporting for single aged persons from each view of psychological, emotional, family-relational, healthy, social and economical standpoints, and second, to analyze their own recognition levels thinking of their health conditions and the relationships between the supporting environments of old family dependants and their psychological healths and then finally, to propose suggestions being able to be helpful for living comfortably in an old age and thereby, building up good family relation. The statistical techniques used to analyze 115 respondents living in Puchun city are frequency, $x^2$ test, t-Jest, ANOVA, Pearson's Correlation Coefficiency and Regression analysis (SPSS package), pertnent to prove the hypothesis suggested in this paper. Of course, it is needless to say that more data are needed on this point. However, several main research findings can be summarized as follows: First, the better single aged persons may be in the habit of eating a meal and the higher they may think of their physical health conditions and movement, the more they want to participate in economic activities to be free from economical dependence upon their children and to overcome lonliness. Second, single aged women appear to have had higher ability to take care their health for themself than single aged men do. It is why signle aged women do not, in general, have big problems to manage their health. But, as shown in this paper, single aged person"s were more liable to the diseases of the aged and, thereby, requiring special medical treatment badly to be healthy. Third. single aged persons revealed potential desires to free themself from socio economic dependence upon their children even in simple labor Job which can draw a monthly salary of about W200, 000. Fourth, they are generally satisfied with their children's filial piety toward them. Nonethless, most of them appear to be reluctantly dependent upon their children and live lonly lives very much. Fifth, they seem to have some hesitation in expressing their candid opinions as that then are some others along with family environmental factors for psychological and emotiona stability. Accordingly, it is safe to conclude by saying that much attention should' be paid no only to socio-economic supports and better medical services for the aged but also to political supports of the society and towards their children for the aged's emotiona support, for improving the quality of their lives in old age and promoting efficiency in suporting for old family dependants.

  • PDF

Health Status and Medical Utilization of Women in Rural Area (농촌지역 여성의 건강수준과 의료이용에 대한 연구)

  • Shin, Hyung-Chul;Kang, Ji-Young;Park, Woong-Sub;Kim, Sang-A
    • Journal of agricultural medicine and community health
    • /
    • v.34 no.1
    • /
    • pp.67-75
    • /
    • 2009
  • Objectives: This study was conducted to examine health inequality for gender and region in Korea. Especially it focused on health status such as disease prevalence and medical utilization of rural women. Methods: Data from the Korea national health and nutrition survey in 2001 were used. The final sample size was 37,108 individuals with age 20 and over. This study applied the logistic regression for nominal variables such as disease prevalence and unmet care needs and with the regression for continuos variables such as the length and costs of medical services. Results: Rates of disease prevalence and unmet care needs for chronic disease in rural area are higher than those in middle cities and big cities, and regional differences of those for women are more than those for mens with controlling ages. There could be interaction effect with region and sex. Conclusions: This study suggests that health policy maker should take consider of special status of rural women who are in health inequality.

A Study on Mode Choice of Trips to Sport Facilities Using SP Survey Data (SP조사자료를 활용한 스포츠시설 이용 수단선택에 관한 연구)

  • KIM, Joo Young;LEE, Seungjae;KIM, Jae-Young;PARK, Hyeon
    • Journal of Korean Society of Transportation
    • /
    • v.35 no.3
    • /
    • pp.197-209
    • /
    • 2017
  • With the advent of age that people spend more time and money on leisure activities, there is increasing interest in professional sport games. The location of large scale sport facilities has substantial impacts on existing transportation pattern because the facility attracts and generates massive traffic volume within a short period of time. This study aims to develop a mode choice model of leisure trips of which the destinations are a sport facility. A structured SP (stated preference) survey questionnaires were developed through an experimental design, and professional sport spectators were asked to state their preference in the choice of transport mode to the sport facility. The survey results show that public transportation is preferred to passenger cars for their trip to big sports event, implying that the convenience of back home trip after the event is an important factor of their mode choice. This study is a rare research on the trip pattern to sports complex in Korea, which provides policy implications on the provision of mass transit including subway system to large scale sport complexes. And it is also expected that this study contributes to future researches on leisure trip pattern.

Comparing Corporate and Public ESG Perceptions Using Text Mining and ChatGPT Analysis: Based on Sustainability Reports and Social Media (텍스트마이닝과 ChatGPT 분석을 활용한 기업과 대중의 ESG 인식 비교: 지속가능경영보고서와 소셜미디어를 기반으로)

  • Jae-Hoon Choi;Sung-Byung Yang;Sang-Hyeak Yoon
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.347-373
    • /
    • 2023
  • As the significance of ESG (Environmental, Social, and Governance) management amplifies in driving sustainable growth, this study delves into and compares ESG trends and interrelationships from both corporate and societal viewpoints. Employing a combination of Latent Dirichlet Allocation Topic Modeling (LDA) and Semantic Network Analysis, we analyzed sustainability reports alongside corresponding social media datasets. Additionally, an in-depth examination of social media content was conducted using Joint Sentiment Topic Modeling (JST), further enriched by Semantic Network Analysis (SNA). Complementing text mining analysis with the assistance of ChatGPT, this study identified 25 different ESG topics. It highlighted differences between companies aiming to avoid risks and build trust, and the general public's diverse concerns like investment options and working conditions. Key terms like 'greenwashing,' 'serious accidents,' and 'boycotts' show that many people doubt how companies handle ESG issues. The findings from this study set the foundation for a plan that serves key ESG groups, including businesses, government agencies, customers, and investors. This study also provide to guide the creation of more trustworthy and effective ESG strategies, helping to direct the discussion on ESG effectiveness.

Development of Topic Trend Analysis Model for Industrial Intelligence using Public Data (텍스트마이닝을 활용한 공개데이터 기반 기업 및 산업 토픽추이분석 모델 제안)

  • Park, Sunyoung;Lee, Gene Moo;Kim, You-Eil;Seo, Jinny
    • Journal of Technology Innovation
    • /
    • v.26 no.4
    • /
    • pp.199-232
    • /
    • 2018
  • There are increasing needs for understanding and fathoming of business management environment through big data analysis at industrial and corporative level. The research using the company disclosure information, which is comprehensively covering the business performance and the future plan of the company, is getting attention. However, there is limited research on developing applicable analytical models leveraging such corporate disclosure data due to its unstructured nature. This study proposes a text-mining-based analytical model for industrial and firm level analyses using publicly available company disclousre data. Specifically, we apply LDA topic model and word2vec word embedding model on the U.S. SEC data from the publicly listed firms and analyze the trends of business topics at the industrial and corporate levels. Using LDA topic modeling based on SEC EDGAR 10-K document, whole industrial management topics are figured out. For comparison of different pattern of industries' topic trend, software and hardware industries are compared in recent 20 years. Also, the changes of management subject at firm level are observed with comparison of two companies in software industry. The changes of topic trends provides lens for identifying decreasing and growing management subjects at industrial and firm level. Mapping companies and products(or services) based on dimension reduction after using word2vec word embedding model and principal component analysis of 10-K document at firm level in software industry, companies and products(services) that have similar management subjects are identified and also their changes in decades. For suggesting methodology to develop analysis model based on public management data at industrial and corporate level, there may be contributions in terms of making ground of practical methodology to identifying changes of managements subjects. However, there are required further researches to provide microscopic analytical model with regard to relation of technology management strategy between management performance in case of related to various pattern of management topics as of frequent changes of management subject or their momentum. Also more studies are needed for developing competitive context analysis model with product(service)-portfolios between firms.

Current Trends for National Bibliography through Analyzing the Status of Representative National Bibliographies (주요국 국가서지 현황조사를 통한 국가서지의 최신 경향 분석)

  • Lee, Mihwa;Lee, Ji-Won
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.32 no.1
    • /
    • pp.35-57
    • /
    • 2021
  • This paper is to grasp the current trends of national bibliographies through analyzing representative national bibliographies using literature review, analysis of national bibliographies' web pages and survey. First, in order to conform to the definition of a national bibliography as a record of a national publication, it attempts to include a variety of materials from print to electronic resources, but in reality it cannot contain all the materials, so there are exceptions. It is impossible to create a general selection guide for national bibliography coverage, and a plan that reflects the national characteristics and prepares a valid and comprehensive coverage based on analysis is needed. Second, cooperation with publishers and libraries is being made to efficiently generate national bibliography. For the efficiency of national bibliography generation, changes should be sought such as the standardization and consistency, the collection level metadata description for digital resources, and the creation of national bibliography using linked data. Third, national bibliography is published through the national bibliographic online search system, linked data search, MARC download using PDF, OAI-PMH, SRU, Z39.50, and mass download in RDF/XML format, and is integrated with the online public access catalog or also built separately. Above all, national bibliographies and online public access catalogs need to be built in a way of data reuse through an integrated library system. Fourth, as a differentiated function for national bibliography, various services such as user tagging and national bibliographic statistics are provided along with various browsing functions. In addition, services of analysis of national bibliographic big data, links to electronic publications, and mass download of linked data should be provided, and it is necessary to identify users' needs and provide open services that reflect them in order to develop differentiated services. Through the current trends and considerations of the national bibliographies analyzed in this study, it will be possible to explore changes in national and international national bibliography.