• Title/Summary/Keyword: Analysis of Category Data

Economic Effectiveness of Advanced Enterprise Welfare System (선진기업복지제도 도입지원사업의 경제적 효과성 분석)

  • Kwon, Jin-A;Ahn, Young-Gyu;Kim, Hyun-Soo;Park, Kyung-Il
    • The Journal of the Korea Contents Association / v.17 no.10 / pp.216-230 / 2017
  • The purpose of this study is to analyze the economic effectiveness of the advanced enterprise welfare system using DEA (Data Envelopment Analysis) and to contribute to the adoption and implementation of the system administered by the Korea Workers' Compensation & Welfare Service (COMWEL). We classified 48 samples into three categories: category A (basic and intensive consulting, adoption) with 36, category B (basic and intensive consulting, non-adoption) with 5, and category C (basic consulting only, adoption) with 7. The consulting fee is used as the input variable, and earnings per employee and average employee tenure are used as the output variables. The DEA results show that the economic effectiveness of category A is higher than that of categories B and C, which leads to the conclusion that the consulting service provided by COMWEL has a positive effect on the adoption and implementation of the advanced enterprise welfare system. COMWEL should therefore provide consulting services to small and medium-sized businesses more actively and examine why some businesses hesitate to adopt the system.

Study on the effectiveness of english-medium class (영어강의의 효과성에 대한 연구)

  • Cho, Jang Sik
    • Journal of the Korean Data and Information Science Society / v.23 no.6 / pp.1137-1144 / 2012
  • Many universities increasingly stress the importance of English-medium classes in order to improve their international competitiveness and internationalization. In this paper, we compare English-medium classes with Korean-medium classes using course evaluation scores. We also analyze the factors that affect the effectiveness of English-medium classes as measured by the course evaluation score. First, logistic regression analysis is used to examine the main effects of subject and individual characteristics. Second, decision tree analysis is used to examine the interaction effects of subject and individual characteristics. The results are as follows. Grade, department category, class size, GPA and screening method affect the effectiveness of English-medium classes. The group with the highest effectiveness consists of freshmen in humanities departments. The group with the second-highest effectiveness consists of freshmen with high GPAs in natural science and art departments.

Analysis of employee's characteristic using data visualization (데이터 시각화를 이용한 취업자 특성분석)

  • Cho, Jang Sik
    • Journal of the Korean Data and Information Science Society / v.25 no.4 / pp.727-736 / 2014
  • This paper analyzes, from the viewpoint of data visualization, how certain characteristics affect the employment of new college graduates. We use individual and department characteristic data for K-university graduates in 2010. We apply multiple correspondence analysis, decision tree analysis, association rules and social network analysis for data visualization. The results are summarized as follows. First, an analysis of the determinants of employment shows that GPA, department category, age, number of majors and recruiting time affect the employment rate. Second, a higher GPA and a natural science department category positively affect the employment rate. Finally, lower age, a single major and an early recruiting time also positively affect the employment rate.

Aspect-Based Sentiment Analysis Using BERT: Developing Aspect Category Sentiment Classification Models (BERT를 활용한 속성기반 감성분석: 속성카테고리 감성분류 모델 개발)

  • Park, Hyun-jung;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems / v.26 no.4 / pp.1-25 / 2020
  • Sentiment Analysis (SA) is a Natural Language Processing (NLP) task that analyzes, from written texts, the sentiments that consumers or the public feel about an arbitrary object. Aspect-Based Sentiment Analysis (ABSA) is a fine-grained analysis of the sentiments towards each aspect of an object. Because it has more practical value for business, ABSA is drawing attention from both academic and industrial organizations. For example, given a review that says "The restaurant is expensive but the food is really fantastic", general SA evaluates the overall sentiment towards the 'restaurant' as 'positive', while ABSA identifies the restaurant's 'price' aspect as 'negative' and its 'food' aspect as 'positive'. Thus, ABSA enables a more specific and effective marketing strategy. To perform ABSA, it is necessary to identify the aspect terms or aspect categories included in the text and to judge the sentiments towards them. Accordingly, there are four main areas in ABSA: aspect term extraction, aspect category detection, Aspect Term Sentiment Classification (ATSC), and Aspect Category Sentiment Classification (ACSC). ABSA is usually conducted by extracting aspect terms and then performing ATSC to analyze sentiments for the given aspect terms, or by extracting aspect categories and then performing ACSC to analyze sentiments for the given aspect categories. Here, an aspect category is expressed by one or more aspect terms, or indirectly inferred from other words. In the preceding example sentence, 'price' and 'food' are both aspect categories, and the aspect category 'food' is expressed by the aspect term 'food' included in the review. If the review sentence includes 'pasta', 'steak', or 'grilled chicken special', these can all be aspect terms for the aspect category 'food'. An aspect category referred to by one or more specific aspect terms in this way is called an explicit aspect. 
On the other hand, an aspect category like 'price', which has no specific aspect term but can be indirectly inferred from an emotional word such as 'expensive', is called an implicit aspect. So far, the term 'aspect category' has been used to avoid confusion with 'aspect term'. From now on, we treat 'aspect category' and 'aspect' as the same concept and mostly use the word 'aspect' for convenience. Note that ATSC analyzes the sentiment towards given aspect terms, so it deals only with explicit aspects, whereas ACSC handles both explicit and implicit aspects. This study seeks answers to the following issues, which previous studies ignored when applying the BERT pre-trained language model to ACSC, in order to derive superior ACSC models. First, is it more effective to reflect the output vectors of the aspect category tokens than to use only the final output vector of the [CLS] token as the classification vector? Second, is there any performance difference between the QA (Question Answering) and NLI (Natural Language Inference) types in the sentence-pair configuration of the input data? Third, is there any performance difference according to the position of the sentence containing the aspect category in the QA or NLI type sentence-pair configuration? To achieve these research objectives, we implemented 12 ACSC models and conducted experiments on 4 English benchmark datasets. As a result, we derived ACSC models that outperform existing studies without expanding the training dataset. In addition, reflecting the output vector of the aspect category token was found to be more effective than using only the output vector of the [CLS] token as the classification vector. It was also found that QA type input generally performs better than NLI type input, and that the order of the sentence containing the aspect category in the QA type does not affect performance. 
There may be some differences depending on the characteristics of the dataset, but when using NLI type sentence-pair input, placing the sentence containing the aspect category second seems to provide better performance. The new methodology for designing the ACSC model used in this study could be similarly applied to other studies such as ATSC.
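
The QA and NLI sentence-pair configurations discussed in this abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual templates, so the function name `build_pair`, the question wording, and the bare-aspect NLI pseudo-sentence are all hypothetical. In practice a BERT tokenizer would insert the [CLS] and [SEP] tokens itself; they are written out here only to make the layout visible.

```python
def build_pair(review, aspect, style, aspect_first=False):
    """Return an illustrative BERT-style sentence-pair string.

    The auxiliary-sentence templates below are assumptions made for
    illustration, not the templates used in the study.
    """
    if style == "QA":
        # Hypothetical question template mentioning the aspect category.
        auxiliary = f"what do you think of the {aspect} of it ?"
    elif style == "NLI":
        # Hypothetical pseudo-sentence: just the aspect category itself.
        auxiliary = aspect
    else:
        raise ValueError("style must be 'QA' or 'NLI'")

    # The third research question concerns which sentence comes first.
    if aspect_first:
        first, second = auxiliary, review
    else:
        first, second = review, auxiliary
    return f"[CLS] {first} [SEP] {second} [SEP]"


review = "The restaurant is expensive but the food is really fantastic"
qa_pair = build_pair(review, "price", "QA")
nli_pair = build_pair(review, "price", "NLI")
nli_aspect_first = build_pair(review, "price", "NLI", aspect_first=True)
```

Here `nli_pair` places the aspect category second, the ordering that the abstract reports as seeming to work better for NLI type input.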

Research on the evaluation model for the impact of AI services

  • Soonduck Yoo
    • International Journal of Internet, Broadcasting and Communication / v.15 no.3 / pp.191-202 / 2023
  • This study proposes a framework for evaluating the impact of artificial intelligence (AI) services, based on the concept of AI service impact. It also suggests a model for evaluating this impact and identifies relevant factors and measurement approaches for each item of the model. The study classifies the impact of AI services into five categories: ethics, safety and reliability, compliance, user rights, and environmental friendliness. It discusses these five categories from a broad perspective and provides 21 detailed factors for evaluating each category. In terms of ethics, the study adds three factors (accessibility, openness, and fairness) to the ten items initially developed by KISDI. In the safety and reliability category, the study excludes factors such as dependability, policy, compliance, and awareness improvement, as they can be better addressed from a technical perspective. The compliance category includes factors such as human rights protection, privacy protection, non-infringement, publicness, accountability, safety, transparency, policy compliance, and explainability. For the user rights category, the study excludes factors such as publicness, data management, policy compliance, awareness improvement, recoverability, openness, and accuracy. The environmental friendliness category encompasses diversity, publicness, dependability, transparency, awareness improvement, recoverability, and openness. By establishing a model for evaluating the impact of AI services, this study lays the foundation for further related research and contributes to the establishment of relevant policies. Future research is required to assess the validity of the developed indicators and to provide specific evaluation items for practical use, based on expert evaluations.

A Prediction of the Land-cover Change Using Multi-temporal Satellite Imagery and Land Statistical Data: Case Study for Cheonan City and Asan City, Korea (다중시기 위성영상과 토지 통계자료를 이용한 토지피복 변화 예측: 천안시·아산시를 사례로)

  • KIM, Chansoo;PARK, Ji-Hoon;JANG, Dong-Ho
    • Journal of The Geomorphological Association of Korea / v.18 no.1 / pp.41-56 / 2011
  • This study analyzes land-cover change based on satellite imagery in order to produce future land-cover maps, and estimates changes in land category using land category statistics. To estimate land category change, this study applied the double exponential smoothing method. The land-cover classification by year using satellite imagery showed that, in the cities of Cheonan and Asan, the land-cover type with the largest increase in area was artificial structure, followed by water, grass field and bare land, whereas forest, paddy, marsh and dry field decreased. Further, the time-series analysis of the land category statistics gave results similar to those of the land-cover classification using satellite imagery. In particular, the estimation of land category change using the double exponential smoothing method showed that paddy, dry field, forest and marsh are expected to decrease consistently in area from 2010 to 2100, whereas artificial structure, water, bare land and grass field are expected to increase consistently. These results can be used as basic data for estimating land-cover change under climate change in order to prepare climate change response strategies.
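
The abstract names double exponential smoothing without stating which variant or smoothing constant was used. The sketch below assumes Brown's one-parameter form, in which the series is smoothed twice and the two smoothed values yield a level and a linear trend. The function name, the value of `alpha`, and the paddy-field areas are all hypothetical and chosen only for illustration.

```python
def brown_double_exponential_smoothing(series, alpha, horizon):
    """Forecast `horizon` steps ahead with Brown's double exponential smoothing.

    This is one common variant; the study may have used a different one.
    """
    if not 0 < alpha < 1:
        raise ValueError("alpha must be strictly between 0 and 1")
    if len(series) < 2:
        raise ValueError("need at least two observations")

    # Initialise both smoothed series with the first observation.
    s1 = s2 = series[0]
    for x in series[1:]:
        s1 = alpha * x + (1 - alpha) * s1   # first smoothing of the data
        s2 = alpha * s1 + (1 - alpha) * s2  # second smoothing of s1

    level = 2 * s1 - s2                     # estimated current level
    trend = alpha / (1 - alpha) * (s1 - s2)  # estimated slope per step
    return [level + m * trend for m in range(1, horizon + 1)]


# Hypothetical paddy-field areas (km^2) for six successive survey years.
paddy_area = [120.0, 117.5, 115.2, 112.0, 110.1, 107.4]
forecast = brown_double_exponential_smoothing(paddy_area, alpha=0.5, horizon=3)
```

Because the forecast is a straight line from the last estimated level, a declining series extrapolated over a very long horizon would eventually cross zero, so long-range results from a sketch like this would need a sanity check.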

A Methodology for Automatic Multi-Categorization of Single-Categorized Documents (단일 카테고리 문서의 다중 카테고리 자동확장 방법론)

  • Hong, Jin-Sung;Kim, Namgyu;Lee, Sangwon
    • Journal of Intelligence and Information Systems / v.20 no.3 / pp.77-92 / 2014
  • Recently, numerous documents, including unstructured data and text, have been created due to the rapid increase in the use of social media and the Internet. Each document is usually assigned a specific category for the convenience of users. In the past, categorization was performed manually. However, manual categorization cannot guarantee accuracy and requires a large amount of time and cost. Many studies have addressed the automatic creation of categories to overcome the limitations of manual categorization. Unfortunately, most of these methods cannot be applied to complex documents with multiple topics because they assume that a document can be assigned to only one category. To overcome this limitation, some studies have attempted to categorize each document into multiple categories. However, they are also limited in that their learning process requires training on a multi-categorized document set. These methods therefore cannot be applied to the multi-categorization of most documents unless multi-categorized training sets are provided. To remove the requirement of a multi-categorized training set imposed by traditional multi-categorization algorithms, we propose a new methodology that can extend the category of a single-categorized document to multiple categories by analyzing relationships among categories, topics, and documents. First, we attempt to find the relationship between documents and topics by using the result of topic analysis for single-categorized documents. Second, we construct a correspondence table between topics and categories by investigating the relationship between them. Finally, we calculate the matching scores of each document for multiple categories. 
The results imply that a document can be classified into a certain category if and only if the matching score is higher than the predefined threshold. For example, we can classify a certain document into three categories that have larger matching scores than the predefined threshold. The main contribution of our study is that our methodology can improve the applicability of traditional multi-category classifiers by generating multi-categorized documents from single-categorized documents. Additionally, we propose a module for verifying the accuracy of the proposed methodology. For performance evaluation, we performed intensive experiments with news articles. News articles are clearly categorized by theme, and they contain less vulgar language and slang than other typical text documents. We collected news articles from July 2012 to June 2013. The number of articles varies widely across categories. This is because readers have different levels of interest in each category and because the frequency of events differs across categories. To minimize the distortion caused by differing numbers of articles per category, we extracted 3,000 articles from each of the eight categories. Therefore, the total number of articles used in our experiments was 24,000. The eight categories were "IT Science," "Economy," "Society," "Life and Culture," "World," "Sports," "Entertainment," and "Politics." Using the collected news articles, we calculated the document/category correspondence scores from the topic/category and document/topic correspondence scores. The document/category correspondence score indicates the degree to which each document corresponds to a certain category. As a result, we could present two additional categories for each of the 23,089 documents. 
Precision, recall, and F-score were 0.605, 0.629, and 0.617, respectively, when only the top 1 predicted category was evaluated, whereas they were 0.838, 0.290, and 0.431 when the top 1-3 predicted categories were considered. We also found large variation in precision, recall, and F-score across the eight categories.
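
The scoring step described in this abstract can be sketched in Python as follows. The abstract says only that document/category scores are derived from document/topic and topic/category scores, so combining them as a sum of products is an assumption, and the function names, weights, category subset, and threshold are hypothetical. The `f_score` helper is the standard harmonic mean of precision and recall, and it reproduces the F-scores reported above from the reported precision and recall.

```python
def document_category_scores(doc_topic, topic_category):
    """Combine one document's topic weights with topic/category weights.

    Assumption: the score for a category is the sum over topics of
    (document/topic weight) * (topic/category weight).
    """
    n_categories = len(topic_category[0])
    return [
        sum(doc_topic[t] * topic_category[t][c] for t in range(len(doc_topic)))
        for c in range(n_categories)
    ]


def assign_categories(scores, names, threshold):
    """Return the categories whose score exceeds the threshold, best first."""
    ranked = sorted(zip(names, scores), key=lambda pair: pair[1], reverse=True)
    return [name for name, score in ranked if score > threshold]


def f_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical weights: one document over three topics, three categories.
categories = ["Economy", "Politics", "Sports"]
doc_topic = [0.6, 0.3, 0.1]
topic_category = [
    [0.8, 0.2, 0.0],  # topic 0
    [0.3, 0.7, 0.0],  # topic 1
    [0.0, 0.1, 0.9],  # topic 2
]

scores = document_category_scores(doc_topic, topic_category)
assigned = assign_categories(scores, categories, threshold=0.3)

top1_f = round(f_score(0.605, 0.629), 3)
top3_f = round(f_score(0.838, 0.290), 3)
```

With these illustrative numbers the document keeps its strongest category and gains a second one, which mirrors the idea of extending a single-categorized document to multiple categories.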

Illness Experience of Adolescents with Hematologic Malignancies (혈액종양 청소년의 질병 경험)

  • Son, Sun-Young
    • Journal of Korean Academy of Nursing / v.41 no.5 / pp.603-612 / 2011
  • Purpose: The purpose of this study was to describe the experience process of adolescents with hematologic malignancies. The question for the study was "What is the experience of adolescents with hematologic malignancies like?" Methods: The grounded theory methodology was used for this study. The data were collected through in-depth interviews with 10 adolescents with hematologic malignancies. Data collection took place from January to June 2007. Theoretical sampling was used until the data reached saturation. Results: The analysis identified "Reconstructing self-image from deviated and suspended life" as the core category. Eleven subcategories were identified and integrated into the core category. 'Establishment of expanded and matured self' was identified as the consequence. Conclusion: The results of the study provide a frame for effective individualized nursing intervention strategies to help adolescents with hematologic malignancies adjust.

E-customized Product: User-centered Co-design Experiences

  • Li, Pei;Liu, Zi Yang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.9 / pp.3680-3692 / 2020
  • The purpose of this study is to orient users' touchpoints in the co-design experience, to identify their needs via a visualized experience map, and to recommend valid design information in online e-customization services. A user-centered co-design experience map (UCEM) is adopted to analyze the relation between users' desire and time spent, so as to evaluate online co-design experiences. Based on an evolutionary algorithm and fuzzy theory, data for this study were collected from 30 participants. The data were analyzed by descriptive analysis in SPSS, and by frequency query and word cloud in NVivo. Using the design category and evaluating users' time spent, the findings are that (a) vamp color matching is consistent with the interview data; (b) supported by qualitative feedback, the virtual experience map played an important role in the co-design process and the visualized interaction process; and (c) participants prefer to get more information and professional help on color matching and exterior design. Based on the findings in the design category, future work should focus on developing a better understanding of design resource recommendations and multi-stakeholder communication.

Impact Analysis of Partition Utility Score in Cluster Analysis (군집분석의 분할 유용도 점수의 영향 분석)

  • Lee, Gye Sung
    • The Journal of the Convergence on Culture Technology / v.7 no.3 / pp.481-486 / 2021
  • Machine learning algorithms adopt a criterion function as a key component to measure the quality of the model derived from data. Cluster analysis also uses such a function to rate the clustering result. In general, every criterion function exhibits some favoritism toward particular kinds of clusters when producing high-quality clusters. These clusters are then described by attributes and their values. Category utility and partition utility play an important role in cluster analysis. This research analyzes both in depth, particularly in terms of how they relate to the favoritism in the final results. Several data sets are selected and analyzed to show how these criterion functions induce different results.
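
The abstract does not give formulas for the two scores. The Python sketch below assumes a commonly cited formulation for nominal attributes: the category utility of a cluster is its relative size times the gain in summed squared attribute-value probabilities over the whole data set, and the partition utility is the average of the category utilities over the clusters. The function names and the toy data are hypothetical, and every cluster is assumed to be non-empty.

```python
from collections import Counter


def sum_squared_probs(rows):
    """Sum over attributes i and values j of P(A_i = V_ij)^2 within `rows`."""
    n = len(rows)
    total = 0.0
    for i in range(len(rows[0])):
        counts = Counter(row[i] for row in rows)
        total += sum((count / n) ** 2 for count in counts.values())
    return total


def category_utilities(clusters):
    """Per-cluster category utility under the assumed formulation."""
    all_rows = [row for cluster in clusters for row in cluster]
    baseline = sum_squared_probs(all_rows)  # predictability without clusters
    n = len(all_rows)
    return [
        len(cluster) / n * (sum_squared_probs(cluster) - baseline)
        for cluster in clusters
    ]


def partition_utility(clusters):
    """Average category utility over the clusters of a partition."""
    utilities = category_utilities(clusters)
    return sum(utilities) / len(utilities)


# Toy data: each row is (color, shape).
pure_partition = [
    [("red", "round"), ("red", "round")],
    [("blue", "square"), ("blue", "square")],
]
mixed_partition = [
    [("red", "round"), ("blue", "square")],
    [("red", "round"), ("blue", "square")],
]

pu_pure = partition_utility(pure_partition)
pu_mixed = partition_utility(mixed_partition)
```

The partition that separates the two kinds of objects scores higher than the one that mixes them, because knowing the cluster makes attribute values more predictable. Dividing by the number of clusters is one source of the kind of favoritism the abstract mentions, since it penalizes partitions with many clusters.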