• Title/Summary/Keyword: 데이터 분석론

Search Result 1,383, Processing Time 0.03 seconds

A study on the rainfall-runoff reduction efficiency on each design rainfall for the green infrastructure-baesd stormwater management (그린인프라 기반 빗물 관리를 위한 설계강우량별 강우-유출저감 효율성 분석 연구)

  • Kim, Byungsung;Kim, Jaemoon;Lee, Sangjin
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.8
    • /
    • pp.613-621
    • /
    • 2022
  • Due to the global climate change, the rainfall volume and frequency on the Korean Peninsula are predicted to increase at the end of the 21st century. In addition, impervious surface areas have increased due to rapid urbanization which has caused the urban water cycle to deteriorate. Green Infrastructure (GI) researches have been conducted to improve the water cycle soundness; the efficiency of this technique has been verified through various studies. However, there are still no suitable GI design guidelines for this aspect. Therefore, the rainfall scenarios are set up for each percentile (60, 70, 80, 90) based on the volume and frequency analysis using 10-year rainfall data (Busan Meteorological Station). After determining the GI areas for each scenario, the runoff reduction characteristics are analyzed based on Storm Water Management Model (SWMM) 10-year rainfall-runoff-simulations. The total runoff reduction efficiency for each GI areas are computed to have a range of 13.1~52.1%. As a results of the quantitative analysis, the design rainfall for GI is classified into the 80~85 percentile in the study site.

A Study on customer experience centered innovation model for Funeral Mutual Enterprise - Centered on Funeral service - (상조기업의 고객경험 기반 혁신모델 연구 - 장례서비스 산업을 중심으로 -)

  • Ahn, Jinho;Lee, Jeungsun
    • Journal of Service Research and Studies
    • /
    • v.11 no.2
    • /
    • pp.67-77
    • /
    • 2021
  • This study is a study on the methodology of establishing an innovation strategy centering on the customer experience, which is essential in order to transform the existing collection and preservation-centered mutual aid company service into a visitor-centered service. To this end, we conducted literature research on environmental changes in the funeral industry from the perspective of service science and the significance and value of customer experiences within them, good customer experiences and bad customer experiences from the perspective of customer experience management. A study was conducted to present and prove a specific model. The customer experience-oriented innovation strategy of the funeral industry means to search for various alternatives that can reach the target state from the present state, focusing on the customer, and select the most appropriate transformation plan among them. As an effect of application, it was found that it is a source of differentiation by generating positive emotions to customers, and that customer experience data is highly helpful in making important decisions for the actual resource input of the parent company. This innovation model was presented, and its value was firstly proved by analyzing the difference from the existing evaluation method. Finally, as a result of analyzing the causal relationship through regression analysis using the customer experience measurement procedure, customer experience diagnosis/evaluation, customer experience innovation strategy, and cooperative company's performance as variables, the relationship proved to be significant.

A Study on customer experience centered innovation model for culture and arts organizations - Centered on museum service - (문화예술 기관의 고객경험 중심 혁신모형 연구 - 박물관 서비스를 중심으로 -)

  • Ahn, Jinho;Kim, Yeunhee
    • Journal of Service Research and Studies
    • /
    • v.11 no.1
    • /
    • pp.21-30
    • /
    • 2021
  • This study examined methodologies for establishing a customer experience-focused innovation strategy, which is essential to transforming existing collection and preservation-oriented museum service into a visitor-oriented service. To this end, a review of studies examining good customer experience and bad customer experience was conducted, focusing on change in museum environments and customer experience from the service science and customer experience management perspectives. Research was conducted to present and prove. Implementing a customer experience-oriented innovation strategy in a museum requires exploring the various approaches that can be used to reach the target state from the present state, focusing on the customer, and selecting the most appropriate transformation plan. This study found that effective approaches are distinguished by the fact that they generate positive emotions in customers and use customer experience data to make important decisions regarding the establishment of practical resources in museums. This innovation model was developed and validated by analyzing how it differed from existing evaluation methods. Finally, a regression analysis using the customer experience measurement procedure, customer experience diagnosis/evaluation, customer experience innovation strategy, and museum performance as variables revealed a significant causal relationship.

Assessment of Applicability of CNN Algorithm for Interpretation of Thermal Images Acquired in Superficial Defect Inspection Zones (포장층 이상구간에서 획득한 열화상 이미지 해석을 위한 CNN 알고리즘의 적용성 평가)

  • Jang, Byeong-Su;Kim, YoungSeok;Kim, Sewon ;Choi, Hyun-Jun;Yoon, Hyung-Koo
    • Journal of the Korean Geotechnical Society
    • /
    • v.39 no.10
    • /
    • pp.41-48
    • /
    • 2023
  • The presence of abnormalities in the subgrade of roads poses safety risks to users and results in significant maintenance costs. In this study, we aimed to experimentally evaluate the temperature distributions in abnormal areas of subgrade materials using infrared cameras and analyze the data with machine learning techniques. The experimental site was configured as a cubic shape measuring 50 cm in width, length, and depth, with abnormal areas designated for water and air. Concrete blocks covered the upper part of the site to simulate the pavement layer. Temperature distribution was monitored over 23 h, from 4 PM to 3 PM the following day, resulting in image data and numerical temperature values extracted from the middle of the abnormal area. The temperature difference between the maximum and minimum values measured 34.8℃ for water, 34.2℃ for air, and 28.6℃ for the original subgrade. To classify conditions in the measured images, we employed the image analysis method of a convolutional neural network (CNN), utilizing ResNet-101 and SqueezeNet networks. The classification accuracies of ResNet-101 for water, air, and the original subgrade were 70%, 50%, and 80%, respectively. SqueezeNet achieved classification accuracies of 60% for water, 30% for air, and 70% for the original subgrade. This study highlights the effectiveness of CNN algorithms in analyzing subgrade properties and predicting subsurface conditions.

A Framework Development for Sketched Data-Driven Building Information Model Creation to Support Efficient Space Configuration and Building Performance Analysis (효율적 공간 형상화 및 건물성능분석을 위한 스케치 정보 기반 BIM 모델 자동생성 프레임워크 개발)

  • Kong, ByungChan;Jeong, WoonSeong
    • Korean Journal of Construction Engineering and Management
    • /
    • v.25 no.1
    • /
    • pp.50-61
    • /
    • 2024
  • The market for compact houses is growing due to the demand for floor plans prioritizing user needs. However, clients often have difficulty communicating their spatial requirements to professionals including architects because they lack the means to provide evidence, such as spatial configurations or cost estimates. This research aims to create a framework that can translate sketched data-driven spatial requirements into 3D building components in BIM models to facilitate spatial understanding and provide building performance analysis to aid in budgeting in the early design phase. The research process includes developing a process model, implementing, and validating the framework. The process model describes the data flow within the framework and identifies the required functionality. Implementation involves creating systems and user interfaces to integrate various systems. The validation verifies that the framework can automatically convert sketched space requirements into walls, floors, and roofs in a BIM model. The framework can also automatically calculate material and energy costs based on the BIM model. The developed frame enables clients to efficiently create 3D building components based on the sketched data and facilitates users to understand the space and analyze the building performance through the created BIM models.

An Analysis of Military Strategies in the Israel-Hamas War (2023): Asymmetric Tactics and Implications for International Politics (이스라엘-하마스 전쟁(2023)의 군사전략 분석: 비대칭 전술과 국제정치적 함의)

  • Seung-Hyun Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.389-395
    • /
    • 2024
  • This study aims to deeply analyze the military strategies and tactics used in the battles between Israel and Hamas, to understand the military approaches, technical capabilities, and their impact on the outcomes of the conflict. To achieve this, methodologies such as literature review, data analysis, and case studies were utilized. The research findings confirm that Hamas employed asymmetric tactics, such as rocket attacks and surprise attacks through underground tunnels, to counter Israel's military superiority. On the other hand, Israel responded to Hamas's attacks with the Iron Dome interception system and intelligence-gathering capabilities, but faced difficulties due to Hamas's underground tunnel network. After six months of fighting, the casualties in the Gaza Strip exceeded 30,000, and more than 1.7 million people became refugees. Israel also suffered over 1,200 deaths. Militarily, neither side achieved a decisive victory, resulting in a war of attrition. This study suggests that the Israel-Hamas war exemplifies the complexity of modern asymmetric warfare. Furthermore, it recommends that political compromise between the two sides and active mediation efforts by the international community are necessary for the peaceful resolution of the Israel-Palestine conflict.

A Methodology for Automatic Multi-Categorization of Single-Categorized Documents (단일 카테고리 문서의 다중 카테고리 자동확장 방법론)

  • Hong, Jin-Sung;Kim, Namgyu;Lee, Sangwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.3
    • /
    • pp.77-92
    • /
    • 2014
  • Recently, numerous documents including unstructured data and text have been created due to the rapid increase in the usage of social media and the Internet. Each document is usually provided with a specific category for the convenience of the users. In the past, the categorization was performed manually. However, in the case of manual categorization, not only can the accuracy of the categorization be not guaranteed but the categorization also requires a large amount of time and huge costs. Many studies have been conducted towards the automatic creation of categories to solve the limitations of manual categorization. Unfortunately, most of these methods cannot be applied to categorizing complex documents with multiple topics because the methods work by assuming that one document can be categorized into one category only. In order to overcome this limitation, some studies have attempted to categorize each document into multiple categories. However, they are also limited in that their learning process involves training using a multi-categorized document set. These methods therefore cannot be applied to multi-categorization of most documents unless multi-categorized training sets are provided. To overcome the limitation of the requirement of a multi-categorized training set by traditional multi-categorization algorithms, we propose a new methodology that can extend a category of a single-categorized document to multiple categorizes by analyzing relationships among categories, topics, and documents. First, we attempt to find the relationship between documents and topics by using the result of topic analysis for single-categorized documents. Second, we construct a correspondence table between topics and categories by investigating the relationship between them. Finally, we calculate the matching scores for each document to multiple categories. The results imply that a document can be classified into a certain category if and only if the matching score is higher than the predefined threshold. For example, we can classify a certain document into three categories that have larger matching scores than the predefined threshold. The main contribution of our study is that our methodology can improve the applicability of traditional multi-category classifiers by generating multi-categorized documents from single-categorized documents. Additionally, we propose a module for verifying the accuracy of the proposed methodology. For performance evaluation, we performed intensive experiments with news articles. News articles are clearly categorized based on the theme, whereas the use of vulgar language and slang is smaller than other usual text document. We collected news articles from July 2012 to June 2013. The articles exhibit large variations in terms of the number of types of categories. This is because readers have different levels of interest in each category. Additionally, the result is also attributed to the differences in the frequency of the events in each category. In order to minimize the distortion of the result from the number of articles in different categories, we extracted 3,000 articles equally from each of the eight categories. Therefore, the total number of articles used in our experiments was 24,000. The eight categories were "IT Science," "Economy," "Society," "Life and Culture," "World," "Sports," "Entertainment," and "Politics." By using the news articles that we collected, we calculated the document/category correspondence scores by utilizing topic/category and document/topics correspondence scores. The document/category correspondence score can be said to indicate the degree of correspondence of each document to a certain category. As a result, we could present two additional categories for each of the 23,089 documents. Precision, recall, and F-score were revealed to be 0.605, 0.629, and 0.617 respectively when only the top 1 predicted category was evaluated, whereas they were revealed to be 0.838, 0.290, and 0.431 when the top 1 - 3 predicted categories were considered. It was very interesting to find a large variation between the scores of the eight categories on precision, recall, and F-score.

Analyzing Contextual Polarity of Unstructured Data for Measuring Subjective Well-Being (주관적 웰빙 상태 측정을 위한 비정형 데이터의 상황기반 긍부정성 분석 방법)

  • Choi, Sukjae;Song, Yeongeun;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.83-105
    • /
    • 2016
  • Measuring an individual's subjective wellbeing in an accurate, unobtrusive, and cost-effective manner is a core success factor of the wellbeing support system, which is a type of medical IT service. However, measurements with a self-report questionnaire and wearable sensors are cost-intensive and obtrusive when the wellbeing support system should be running in real-time, despite being very accurate. Recently, reasoning the state of subjective wellbeing with conventional sentiment analysis and unstructured data has been proposed as an alternative to resolve the drawbacks of the self-report questionnaire and wearable sensors. However, this approach does not consider contextual polarity, which results in lower measurement accuracy. Moreover, there is no sentimental word net or ontology for the subjective wellbeing area. Hence, this paper proposes a method to extract keywords and their contextual polarity representing the subjective wellbeing state from the unstructured text in online websites in order to improve the reasoning accuracy of the sentiment analysis. The proposed method is as follows. First, a set of general sentimental words is proposed. SentiWordNet was adopted; this is the most widely used dictionary and contains about 100,000 words such as nouns, verbs, adjectives, and adverbs with polarities from -1.0 (extremely negative) to 1.0 (extremely positive). Second, corpora on subjective wellbeing (SWB corpora) were obtained by crawling online text. A survey was conducted to prepare a learning dataset that includes an individual's opinion and the level of self-report wellness, such as stress and depression. The participants were asked to respond with their feelings about online news on two topics. Next, three data sources were extracted from the SWB corpora: demographic information, psychographic information, and the structural characteristics of the text (e.g., the number of words used in the text, simple statistics on the special characters used). These were considered to adjust the level of a specific SWB. Finally, a set of reasoning rules was generated for each wellbeing factor to estimate the SWB of an individual based on the text written by the individual. The experimental results suggested that using contextual polarity for each SWB factor (e.g., stress, depression) significantly improved the estimation accuracy compared to conventional sentiment analysis methods incorporating SentiWordNet. Even though literature is available on Korean sentiment analysis, such studies only used only a limited set of sentimental words. Due to the small number of words, many sentences are overlooked and ignored when estimating the level of sentiment. However, the proposed method can identify multiple sentiment-neutral words as sentiment words in the context of a specific SWB factor. The results also suggest that a specific type of senti-word dictionary containing contextual polarity needs to be constructed along with a dictionary based on common sense such as SenticNet. These efforts will enrich and enlarge the application area of sentic computing. The study is helpful to practitioners and managers of wellness services in that a couple of characteristics of unstructured text have been identified for improving SWB. Consistent with the literature, the results showed that the gender and age affect the SWB state when the individual is exposed to an identical queue from the online text. In addition, the length of the textual response and usage pattern of special characters were found to indicate the individual's SWB. These imply that better SWB measurement should involve collecting the textual structure and the individual's demographic conditions. In the future, the proposed method should be improved by automated identification of the contextual polarity in order to enlarge the vocabulary in a cost-effective manner.

Heart Rate Variability and Lipid Profile in Patients with Major Depressive Disorder (주요우울장애 환자에서의 심박변이도와 혈중 지질 농도와의 연관성)

  • Ahn, Eun-Jung;Choi, Jin-Sook;Jang, Yong-Lee;Lee, Hae-Woo;Sim, Hyun-Bo
    • Sleep Medicine and Psychophysiology
    • /
    • v.19 no.1
    • /
    • pp.27-34
    • /
    • 2012
  • Objectives: The analysis of heart rate variability (HRV) is a useful non-invasive tool to investigate the autonomic nerve function. Previous studies on the relationship between HRV and depression have been reported controversial results. Similarly, the correlation between the serum lipids and depression is debatable. The purpose of this study is to examine the relationship between heart rate variability, lipid profile and depression. Methods: A total of 42 patients with major depressive disorder (MDD) and 32 age and sex-matched normal subjects who had no previous history of major medical and mental illnesses were recruited for this study. A structured-interview was used to assess the general characteristics and psychiatric illness. HRV measures were assessed by time-domain and frequency-domain analyses. Psychological symptoms were measured using the Hamilton rating scale for anxiety (HAM-A), Hamilton rating scale for depression (HAM-D). In addition, the evaluation for lipid profile was performed by blood test. Results: In serum lipid profile test, MDD group showed higher cholesterol ($197.68{\pm}42.94$ mg/dL vs. $176.85{\pm}34.68$ mg/dL, p=0.044), TG ($139.45{\pm}92.54$ mg/dL vs. $91.4{\pm}65.68$ mg/dL, p=0.018), LDL ($130.03{\pm}33.18$ vs. $106.62{\pm}27.08$, p=0.004) level than normal control group. In HRV time domain analyses, the standard deviation of the NN interval (SDNN) was decreased in MDD group than normal control group, but was not significant ($32.82{\pm}14.33$ ms vs. $40.36{\pm}21.40$ms, p=0.078). ApEn (Approximate Entrophy) was significantly increased in MDD group than normal control group ($1.13{\pm}0.11$ vs. $0.91{\pm}0.18$, p<0.001). ApEn was correlated with LDL level (r=0.277, p=0.028), HAM-D scores (r=0.534, p<0.001) and HAM-A scores (r=0.470, p<0.001). Conclusions: MDD patients showed increased ApEn, one of the HRV measurement. And this ApEn was correlated with LDL, HAM-D and HAM-A scores. In this study, the analysis of ApEn would be a useful test of MDD.

Aspect-Based Sentiment Analysis Using BERT: Developing Aspect Category Sentiment Classification Models (BERT를 활용한 속성기반 감성분석: 속성카테고리 감성분류 모델 개발)

  • Park, Hyun-jung;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.1-25
    • /
    • 2020
  • Sentiment Analysis (SA) is a Natural Language Processing (NLP) task that analyzes the sentiments consumers or the public feel about an arbitrary object from written texts. Furthermore, Aspect-Based Sentiment Analysis (ABSA) is a fine-grained analysis of the sentiments towards each aspect of an object. Since having a more practical value in terms of business, ABSA is drawing attention from both academic and industrial organizations. When there is a review that says "The restaurant is expensive but the food is really fantastic", for example, the general SA evaluates the overall sentiment towards the 'restaurant' as 'positive', while ABSA identifies the restaurant's aspect 'price' as 'negative' and 'food' aspect as 'positive'. Thus, ABSA enables a more specific and effective marketing strategy. In order to perform ABSA, it is necessary to identify what are the aspect terms or aspect categories included in the text, and judge the sentiments towards them. Accordingly, there exist four main areas in ABSA; aspect term extraction, aspect category detection, Aspect Term Sentiment Classification (ATSC), and Aspect Category Sentiment Classification (ACSC). It is usually conducted by extracting aspect terms and then performing ATSC to analyze sentiments for the given aspect terms, or by extracting aspect categories and then performing ACSC to analyze sentiments for the given aspect category. Here, an aspect category is expressed in one or more aspect terms, or indirectly inferred by other words. In the preceding example sentence, 'price' and 'food' are both aspect categories, and the aspect category 'food' is expressed by the aspect term 'food' included in the review. If the review sentence includes 'pasta', 'steak', or 'grilled chicken special', these can all be aspect terms for the aspect category 'food'. As such, an aspect category referred to by one or more specific aspect terms is called an explicit aspect. On the other hand, the aspect category like 'price', which does not have any specific aspect terms but can be indirectly guessed with an emotional word 'expensive,' is called an implicit aspect. So far, the 'aspect category' has been used to avoid confusion about 'aspect term'. From now on, we will consider 'aspect category' and 'aspect' as the same concept and use the word 'aspect' more for convenience. And one thing to note is that ATSC analyzes the sentiment towards given aspect terms, so it deals only with explicit aspects, and ACSC treats not only explicit aspects but also implicit aspects. This study seeks to find answers to the following issues ignored in the previous studies when applying the BERT pre-trained language model to ACSC and derives superior ACSC models. First, is it more effective to reflect the output vector of tokens for aspect categories than to use only the final output vector of [CLS] token as a classification vector? Second, is there any performance difference between QA (Question Answering) and NLI (Natural Language Inference) types in the sentence-pair configuration of input data? Third, is there any performance difference according to the order of sentence including aspect category in the QA or NLI type sentence-pair configuration of input data? To achieve these research objectives, we implemented 12 ACSC models and conducted experiments on 4 English benchmark datasets. As a result, ACSC models that provide performance beyond the existing studies without expanding the training dataset were derived. In addition, it was found that it is more effective to reflect the output vector of the aspect category token than to use only the output vector for the [CLS] token as a classification vector. It was also found that QA type input generally provides better performance than NLI, and the order of the sentence with the aspect category in QA type is irrelevant with performance. There may be some differences depending on the characteristics of the dataset, but when using NLI type sentence-pair input, placing the sentence containing the aspect category second seems to provide better performance. The new methodology for designing the ACSC model used in this study could be similarly applied to other studies such as ATSC.