• Title/Summary/Keyword: 빅데이터 분석 기법

Search Result 588, Processing Time 0.03 seconds

Analysis of Behavior of Seoullo 7017 Visitors - With a Focus on Text Mining and Social Network Analysis - (서울로 7017 방문자들의 이용행태 분석 -텍스트 마이닝과 소셜 네트워크 분석을 중심으로-)

  • Woo, Kyung-Sook;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.6
    • /
    • pp.16-24
    • /
    • 2020
  • The purpose of this study is to analyze the usage behavior of Seoullo 7017, the first public garden in Korea, to understand the usage status by analyzing blogs, and to present usage behavior and improvement plans for Seoullo 7017. From June 2017 to May 2020, after Seoullo 7017 was open to citizens, character data containing 'Seoullo 7017' in the title and contents of NAVER and·DAUM blogs were converted to text mining and socialization, a Big Data technique. The analysis was conducted using social network analysis. The summary of the research results is as follows. First of all, the ratio of men and women searching for Seoullo 7017 online is similar, and the regions that searched most are in the order of Seoul and Gyeonggi, and those in their 40s and 50s were the most interested. In other words, it can be seen that there is a lack of interest in regions other than Seoul and Gyeonggi and among those in their 10s, 20s, and 30s. The main behaviors of Seoullo 7017 are' night view' and 'walking', and the factors that affect culture and art are elements related to culture and art. If various programs and festivals are opened and actively promoted, the main behavior will be more varied. On the other hand, the main behavior that the users of Seoullo 7017 want is 'sit', which is a static behavior, but the physical conditions are not sufficient for the behavior to occur. Therefore, facilities that can cause sitting behavior, such as shades and benches must be improved to meet the needs of visitors. The peculiarity of the change in the behavior of Seoullo 7017 is that it is recognized as a good place to travel alone and a good place to walk alone as a public multi-use facility and group activities are restricted due to COVID-19. Accordingly, in a situation like the COVD-19 pandemic, more diverse behaviors can be derived in facilities where people can take a walk, etc., and the increase of various attractions and the satisfaction of users can be increased. Seoullo 7017, as Korea's first public pedestrian area, was created for urban regeneration and the efficient use of urban resources in areas beyond the meaning of public spaces and is a place with various values such as history, nature, welfare, culture, and tourism. However, as a result of the use behavior analysis, various behaviors did not occur in Seoullo 7017 as expected, and elements that hinder those major behaviors were derived. Based on these research results, it is necessary to understand the usage behavior of Seoullo 7017 and to establish a plan for spatial system and facility improvement, so that Seoullo 7017 can be an important place for urban residents and a driving force to revitalize the city.

A Robust Object Detection and Tracking Method using RGB-D Model (RGB-D 모델을 이용한 강건한 객체 탐지 및 추적 방법)

  • Park, Seohee;Chun, Junchul
    • Journal of Internet Computing and Services
    • /
    • v.18 no.4
    • /
    • pp.61-67
    • /
    • 2017
  • Recently, CCTV has been combined with areas such as big data, artificial intelligence, and image analysis to detect various abnormal behaviors and to detect and analyze the overall situation of objects such as people. Image analysis research for this intelligent video surveillance function is progressing actively. However, CCTV images using 2D information generally have limitations such as object misrecognition due to lack of topological information. This problem can be solved by adding the depth information of the object created by using two cameras to the image. In this paper, we perform background modeling using Mixture of Gaussian technique and detect whether there are moving objects by segmenting the foreground from the modeled background. In order to perform the depth information-based segmentation using the RGB information-based segmentation results, stereo-based depth maps are generated using two cameras. Next, the RGB-based segmented region is set as a domain for extracting depth information, and depth-based segmentation is performed within the domain. In order to detect the center point of a robustly segmented object and to track the direction, the movement of the object is tracked by applying the CAMShift technique, which is the most basic object tracking method. From the experiments, we prove the efficiency of the proposed object detection and tracking method using the RGB-D model.

A Study on the Defect Detection of Fabrics using Deep Learning (딥러닝을 이용한 직물의 결함 검출에 관한 연구)

  • Eun Su Nam;Yoon Sung Choi;Choong Kwon Lee
    • Smart Media Journal
    • /
    • v.11 no.11
    • /
    • pp.92-98
    • /
    • 2022
  • Identifying defects in textiles is a key procedure for quality control. This study attempted to create a model that detects defects by analyzing the images of the fabrics. The models used in the study were deep learning-based VGGNet and ResNet, and the defect detection performance of the two models was compared and evaluated. The accuracy of the VGGNet and the ResNet model was 0.859 and 0.893, respectively, which showed the higher accuracy of the ResNet. In addition, the region of attention of the model was derived by using the Grad-CAM algorithm, an eXplainable Artificial Intelligence (XAI) technique, to find out the location of the region that the deep learning model recognized as a defect in the fabric image. As a result, it was confirmed that the region recognized by the deep learning model as a defect in the fabric was actually defective even with the naked eyes. The results of this study are expected to reduce the time and cost incurred in the fabric production process by utilizing deep learning-based artificial intelligence in the defect detection of the textile industry.

A study on the Application of Optimal Evacuation Route through Evacuation Simulation System in Case of Fire (화재발생 시 대피시뮬레이션 시스템을 통한 최적대피경로 적용에 관한 연구)

  • Kim, Daeill;Jeong, Juahn;Park, Sungchan;Go, Jooyeon;Yeom, Chunho
    • Journal of the Society of Disaster Information
    • /
    • v.16 no.1
    • /
    • pp.96-110
    • /
    • 2020
  • Recently, due to global warming, it is easily exposed to various disasters such as fire, flood, and earthquake. In particular, large-scale disasters have continuously been occurring in crowded areas such as traditional markets, facilities for the elderly and children, and public facilities where various people stay. Purpose: This study aims to detect a fire occurred in crowded facilities early in the event to analyze and provide an optimal evacuation route using big data and advanced technology. Method: The researchers propose a new algorithm through context-aware 3D object model technology and A* algorithm optimization and propose a scenario-based optimal evacuation route selection technique. Result: Using the HPA* E algorithm, the evacuation simulation in the event of a fire was reproduced as a 3D model and the optimal evacuation route and evacuation time were calculated for each scenario. Conclusion: It is expected to reduce fatalities and injuries through the evacuation induction technique that enables evacuation of the building in the shortest path by analyzing in real-time via fire detection sensors that detects the temperature, flame, and smoke.

Analysis of the Weight of SWOT Factors of Korean Venture Companies Based on the Industry 4.0 (4차 산업혁명 기반 한국 벤처기업의 SWOT요인에 대한 중요도 분석)

  • Lee, Dongik;Lee, Sangsuk
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.16 no.4
    • /
    • pp.115-133
    • /
    • 2021
  • This study examines the concept and related technologies of the 4th industrial revolution that has been mixed so far and examines the socio-economic changes and influences resulting from it, and the cases of responding to the 4th industrial revolution in major countries. Based on this, by deriving SWOT factors and calculating the importance of each factor for Korean venture companies to prepare for the forth industrial revolution, it was intended to help the government and policymakers in suggesting directions for establishing related policies. Furthermore, the purpose of this study was to suggest a direction for securing global competitiveness to Korean venture entrepreneurs and to help with basic and systematic analysis for further academic in-depth research. For this study, a total of 21 items derived through extensive literature research and data research to understand what are the necessary competency factors for internal and external environmental changes in order for Korean venture companies to have global competitiveness in the era of the 4th Industrial Revolution. After reviewing SWOT factors by three expert groups and confirming them through Delphi survey, the importance of each item was analyzed by using AHP, a systematic decision-making technique. As a result of the analysis, it was shown that Strength(48%), Opportunity(25%), Threat(16%), Weakness(11%) were considered important in order. In terms of sub-items, 'quick and flexible commercialization capability', 'platform/big data/non-face-to-face service activation', and 'ICT infrastructure and it's utilization' were shown to be of the comparatively high importance. On the other hand, in the lower three items, 'macro-economic stability and social infrastructure', 'difficulty in entering overseas markets due to global protectionism', and 'absolutely inferior in foreign investment' were found to have low priority. As a result of the correlation verification by item to see differences in opinions by industry, academia, and policy expert groups, there was no significant difference of opinion, as industry and academic experts showed a high correlation and industry experts and policy experts showed a moderate correlation. The correlation between the academic and policy experts was not statistically significant (p<0.01), so it was analyzed that there was a difference of opinion on importance. This was due to the fact that policy experts highly valued 'quick and flexible commercialization', which are strengths, and 'excellent educational system and high-quality manpower' and 'creation of new markets' which are opportunity items, while academic experts placed great importance on 'support part of government policy', which are strengths. The implication of this study is that in order for Korean venture companies to secure competitiveness in the field of the 4th industrial revolution, it is necessary to have a policy that preferentially supports the relevant items of strengths and opportunity factors. The difference in the details of strength factors and opportunity factors, which shows a high level of variability, suggests that it is necessary to actively review it and reflect it in the policy.

Analysis of the Effects of E-commerce User Ratings and Review Helfulness on Performance Improvement of Product Recommender System (E-커머스 사용자의 평점과 리뷰 유용성이 상품 추천 시스템의 성능 향상에 미치는 영향 분석)

  • FAN, LIU;Lee, Byunghyun;Choi, Ilyoung;Jeong, Jaeho;Kim, Jaekyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.311-328
    • /
    • 2022
  • Because of the spread of smartphones due to the development of information and communication technology, online shopping mall services can be used on computers and mobile devices. As a result, the number of users using the online shopping mall service increases rapidly, and the types of products traded are also growing. Therefore, to maximize profits, companies need to provide information that may interest users. To this end, the recommendation system presents necessary information or products to the user based on the user's past behavioral data or behavioral purchase records. Representative overseas companies that currently provide recommendation services include Netflix, Amazon, and YouTube. These companies support users' purchase decisions by recommending products to users using ratings, purchase records, and clickstream data that users give to the items. In addition, users refer to the ratings left by other users about the product before buying a product. Most users tend to provide ratings only to products they are satisfied with, and the higher the rating, the higher the purchase intention. And recently, e-commerce sites have provided users with the ability to vote on whether product reviews are helpful. Through this, the user makes a purchase decision by referring to reviews and ratings of products judged to be beneficial. Therefore, in this study, the correlation between the product rating and the helpful information of the review is identified. The valuable data of the evaluation is reflected in the recommendation system to check the recommendation performance. In addition, we want to compare the results of skipping all the ratings in the traditional collaborative filtering technique with the recommended performance results that reflect only the 4 and 5 ratings. For this purpose, electronic product data collected from Amazon was used in this study, and the experimental results confirmed a correlation between ratings and review usefulness information. In addition, as a result of comparing the recommendation performance by reflecting all the ratings and only the 4 and 5 points in the recommendation system, the recommendation performance of remembering only the 4 and 5 points in the recommendation system was higher. In addition, as a result of reflecting review usefulness information in the recommendation system, it was confirmed that the more valuable the review, the higher the recommendation performance. Therefore, these experimental results are expected to improve the performance of personalized recommendation services in the future and provide implications for e-commerce sites.

The development of symmetrically and attributably pure confidence in association rule mining (연관성 규칙에서 활용 가능한 대칭적 기여 순수 신뢰도의 개발)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.3
    • /
    • pp.601-609
    • /
    • 2014
  • The most widely used data mining technique for big data analysis is to generate meaningful association rules. This method has been used to find the relationship between set of items based on the association criteria such as support, confidence, lift, etc. Among them, confidence is the most frequently used, but it has the drawback that we can not know the direction of association by it. The attributably pure confidence was developed to compensate for this drawback, but the value was changed by the position of two item sets. In this paper, we propose four symmetrically and attributably pure confidence measures to compensate the shortcomings of confidence and the attributably pure confidence. And then we prove three conditions of interestingness measure by Piatetsky-Shapiro, and comparative studies with confidence, attributably pure confidence, and four symmetrically and attributably pure confidence measures are shown by numerical examples. The results show that the symmetrically and attributably pure confidence measures are better than confidence and the attributably pure confidence. Also the measure NSAPis found to be the best among these four symmetrically and attributably pure confidence measures.

A Methodology for Automatic Multi-Categorization of Single-Categorized Documents (단일 카테고리 문서의 다중 카테고리 자동확장 방법론)

  • Hong, Jin-Sung;Kim, Namgyu;Lee, Sangwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.3
    • /
    • pp.77-92
    • /
    • 2014
  • Recently, numerous documents including unstructured data and text have been created due to the rapid increase in the usage of social media and the Internet. Each document is usually provided with a specific category for the convenience of the users. In the past, the categorization was performed manually. However, in the case of manual categorization, not only can the accuracy of the categorization be not guaranteed but the categorization also requires a large amount of time and huge costs. Many studies have been conducted towards the automatic creation of categories to solve the limitations of manual categorization. Unfortunately, most of these methods cannot be applied to categorizing complex documents with multiple topics because the methods work by assuming that one document can be categorized into one category only. In order to overcome this limitation, some studies have attempted to categorize each document into multiple categories. However, they are also limited in that their learning process involves training using a multi-categorized document set. These methods therefore cannot be applied to multi-categorization of most documents unless multi-categorized training sets are provided. To overcome the limitation of the requirement of a multi-categorized training set by traditional multi-categorization algorithms, we propose a new methodology that can extend a category of a single-categorized document to multiple categorizes by analyzing relationships among categories, topics, and documents. First, we attempt to find the relationship between documents and topics by using the result of topic analysis for single-categorized documents. Second, we construct a correspondence table between topics and categories by investigating the relationship between them. Finally, we calculate the matching scores for each document to multiple categories. The results imply that a document can be classified into a certain category if and only if the matching score is higher than the predefined threshold. For example, we can classify a certain document into three categories that have larger matching scores than the predefined threshold. The main contribution of our study is that our methodology can improve the applicability of traditional multi-category classifiers by generating multi-categorized documents from single-categorized documents. Additionally, we propose a module for verifying the accuracy of the proposed methodology. For performance evaluation, we performed intensive experiments with news articles. News articles are clearly categorized based on the theme, whereas the use of vulgar language and slang is smaller than other usual text document. We collected news articles from July 2012 to June 2013. The articles exhibit large variations in terms of the number of types of categories. This is because readers have different levels of interest in each category. Additionally, the result is also attributed to the differences in the frequency of the events in each category. In order to minimize the distortion of the result from the number of articles in different categories, we extracted 3,000 articles equally from each of the eight categories. Therefore, the total number of articles used in our experiments was 24,000. The eight categories were "IT Science," "Economy," "Society," "Life and Culture," "World," "Sports," "Entertainment," and "Politics." By using the news articles that we collected, we calculated the document/category correspondence scores by utilizing topic/category and document/topics correspondence scores. The document/category correspondence score can be said to indicate the degree of correspondence of each document to a certain category. As a result, we could present two additional categories for each of the 23,089 documents. Precision, recall, and F-score were revealed to be 0.605, 0.629, and 0.617 respectively when only the top 1 predicted category was evaluated, whereas they were revealed to be 0.838, 0.290, and 0.431 when the top 1 - 3 predicted categories were considered. It was very interesting to find a large variation between the scores of the eight categories on precision, recall, and F-score.

Improvement Issues of Personal Information Protection Laws through Meta-Analysis (메타분석을 통한 개인정보보호법의 개선과제)

  • Cho, Myunggeun;Lee, Hwansoo
    • Journal of Digital Convergence
    • /
    • v.15 no.9
    • /
    • pp.1-14
    • /
    • 2017
  • As we enter the era of big data, the value of personal information is becoming ever more important. However, personal information protection laws in Korea have several issues. Furthermore, existing research are limited in their ability to facilitate a comprehensive understanding of measures to improve personal information protection laws. Accordingly, this study analyzes improvements to be made in the current personal information protection laws based on existing research. A total of 39 research articles discussing the problems of the personal information protection law were selected and analyzed by applying the meta - analysis technique. According to the results, the various issues such as the meaning and scope of personal information, the role and obligations of relevant parties, provision of personal information to third parties, and redundant and imbalanced regulations in special acts in each field. that exist in the current personal information protection laws were confirmed. This study contributes to the improvement of inconsistency between information protection laws and related special laws in each field in practice. Academically, it will contribute to understanding the problems of th law from the macro perspective and suggesting the integrated improvement ways of the law.

A Study on the Correlation between Uniaxial Compressive Strength of Rock by Elastic Wave Velocity and Elastic Modulus of Granite in Seoul and Gyeonggi Region (서울·경기지역 화강암의 탄성파속도와 탄성계수에 의한 암석의 일축압축강도와의 상관성 연구)

  • Son, In-Hwan;Kim, Byong-kuk;Lee, Byok-Kyu;Jang, Seung-jin;Lee, Su-Gon
    • Journal of the Society of Disaster Information
    • /
    • v.15 no.2
    • /
    • pp.249-258
    • /
    • 2019
  • Purpose: The purpose of this study is to attain the correlation analysis and thereby to deduce the uniaxial compressive strength of rock specimens through the elastic wave velocity and the elastic modulus among the physical characteristics measured from the rock specimens collected during drilling investigations in Seoul and Gyeonggi region. Method: Experiments were conducted in the laboratory with 119 granite specimens in order to derive the correlation between the compressive strength of the rocks and elastic wave velocity and elastic modulus. Results: In the case of granite, the results of the analysis of the interaction between the compressive strength of a rock and the elastic wave velocity and elastic modulus were found to be less reliable in the relation equation as a whole. And it is believed that the estimation of the compressive strength by the elastic wave velocity and elastic modulus is less used because of the composition of non-homogeneous particles of granite. Conclusion: In this study, the analysis of correlation between the compressive strength of a rock and the elastic wave velocity and elastic modulus was performed with simple regression analysis and multiple regression analysis. The coefficient determination ($R^2$) of simple regression analysis was shown between 0.61 and 0.67. Multiple regression analysis was 0.71. Thus, using multiple regression analysis when estimating compressive strength can increase the reliability of the correlation. Also, in the future, a variety of statistical analysis techniques such as recovery analysis, and artificial neural network analysis, and big data analysis can lead to more reliable results when estimating the compressive sterength of a rock based on the elastic wave velocity and elastic modulus.