• Title/Summary/Keyword: Association Rules Analysis

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Data mining techniques are used to find hidden knowledge, unexpected patterns, and new rules in massive data. Data mining methods include decision trees, association rules, clustering, neural networks, and so on. Clustering is the process of grouping data into clusters so that objects within a cluster are highly similar to one another. It has been widely used in many applications, such as pattern analysis and recognition, data analysis, image processing, and offline or online market research. We analyze the 2001 Gyeongnam social indicator survey data using the twostep clustering technique to obtain environmental information. Twostep clustering is classified as a partitional clustering method. The twostep clustering outputs can be applied to environmental preservation and improvement.

Analysis of Educational Issues through Topic Modeling of National Petitions Text (국민청원글의 토픽 모델링을 통한 교육이슈 분석)

  • Shim, Jaekwoun
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.4
    • /
    • pp.633-640
    • /
    • 2021
  • Education-related issues are social problems in which various groups and situations are intricately linked to each other, so it is difficult to find issues by analyzing social phenomena related to education. Korean text, however, can be analyzed quantitatively. With the development of text analysis techniques, research results have recently been achieved, and these techniques can be fully utilized to derive educational issues from text data in Korean. In this study, petition articles in the field of childcare/education were collected from the online board of the Blue House National Petition website, and text analysis was used to derive issues in the education world. The analysis derived 6 topics through Latent Dirichlet Allocation (LDA), one of the topic modeling techniques. The association rules of major keywords were analyzed and visualized as graphs. Beyond the existing questionnaire-based approach to deriving educational issues, the study offers implications for future research directions and policies in that issues can be adequately discovered through text-based analysis methods.

Development of Intelligent Job Classification System based on Job Posting on Job Sites (구인구직사이트의 구인정보 기반 지능형 직무분류체계의 구축)

  • Lee, Jung Seung
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.123-139
    • /
    • 2019
  • The job classification systems of major job sites differ from site to site and also differ from the job classification system of the 'SQF(Sectoral Qualifications Framework)' proposed by the SW field. Therefore, a new job classification system is needed that SW companies, SW job seekers, and job sites can all understand. The purpose of this study is to establish a standard job classification system that reflects market demand by analyzing the SQF based on job offer information from major job sites and the NCS(National Competency Standards). For this purpose, association analysis is conducted between the occupations of major job sites, and between the SQF and those occupations, to derive association rules between occupations. Using these association rules, we propose an intelligent job classification system based on data that maps the job classification systems of major job sites to the SQF. First, major job sites are selected to obtain information on the job classification system of the SW market. Then we identify ways to collect job information from each site and collect data through open APIs. Focusing on the relationships in the data, we keep only the job postings that appear on multiple job sites at the same time and delete the rest. Next, we map the job classification systems between job sites using the association rules derived from the association analysis. We complete the mapping between these market segments, discuss it with experts, further map the SQF, and finally propose a new job classification system. As a result, more than 30,000 job listings were collected in XML format using open APIs from 'WORKNET,' 'JOBKOREA,' and 'saramin', which are the main job sites in Korea. After filtering down to about 900 job postings simultaneously posted on multiple job sites, 800 association rules were derived by applying the Apriori algorithm, a frequent pattern mining method.
Based on the 800 association rules, the job classification systems of WORKNET, JOBKOREA, and saramin and the SQF job classification system were mapped and classified into first- through fourth-level categories. In the new job taxonomy, the first primary classification, job systems related to IT consulting, computer systems, networks, and security, consisted of three secondary classifications, five tertiary classifications, and five fourth-level classifications. The second primary classification, job systems related to databases and system operation, consisted of three secondary classifications, three tertiary classifications, and four fourth-level classifications. The third primary classification, Web Planning, Web Programming, Web Design, and Game, was composed of four secondary classifications, nine tertiary classifications, and two fourth-level classifications. The last primary classification, job systems related to ICT management and computer and communication engineering technology, consisted of three secondary classifications and six tertiary classifications. In particular, the new job classification system has a relatively flexible depth of classification, unlike other existing classification systems. WORKNET divides jobs down to the third level, whereas JOBKOREA divides jobs down to the second level and expresses the subdivided jobs as keywords. saramin likewise divides jobs down to the second level and expresses the subdivided jobs as keywords. The newly proposed standard job classification system accepts some keyword-based jobs and treats some product names as jobs. In this classification system, some jobs stop at the second level, while others are subdivided down to the fourth level. This reflects the idea that not all jobs can be broken down into the same number of steps. We also combined the rules derived from association analysis of the collected market data with experts' opinions.
Therefore, the newly proposed job classification system can be regarded as a data-based intelligent job classification system that reflects market demand, unlike the existing job classification systems. This study is meaningful in that it suggests a new job classification system that reflects market demand by mapping occupations on the basis of data, through association analysis between occupations, rather than on the intuition of a few experts. However, this study has a limitation in that it cannot fully reflect market demand that changes over time, because the data were collected at a single point in time. As market demands change over time, including seasonal factors and the timing of major corporate public recruitment, continuous data monitoring and repeated experiments are needed to achieve more accurate matching. The results of this study can be used to suggest directions for improving the SQF in the SW industry, and the experience gained in the SW industry is expected to transfer to other industries.
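
The abstract above names the Apriori algorithm but gives no implementation details. The following is a minimal sketch of Apriori and rule extraction on toy data, not the authors' pipeline. The category labels (`worknet:web_dev` and so on), the thresholds, and the function names `apriori` and `rules` are all hypothetical; each "transaction" stands for one job posting that appears on more than one site, tagged with the category each site assigned to it.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)
    # Level-1 candidates: every distinct single item.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count the support of each candidate and keep the frequent ones.
        level = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t) / n
            if support >= min_support:
                level[c] = support
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-itemset candidates.
        k += 1
        candidates = {a | b for a, b in combinations(list(level), 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


def rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) for each strong rule."""
    found = []
    for itemset, support in frequent.items():
        for r in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(itemset, r)):
                # confidence(A -> B) = support(A and B) / support(A)
                confidence = support / frequent[antecedent]
                if confidence >= min_confidence:
                    found.append((antecedent, itemset - antecedent, support, confidence))
    return found


# Hypothetical postings: the category each site assigned to the same posting.
postings = [
    {"worknet:web_dev", "jobkorea:web_programming"},
    {"worknet:web_dev", "jobkorea:web_programming"},
    {"worknet:web_dev", "jobkorea:web_design"},
    {"worknet:dba", "jobkorea:database"},
]

for antecedent, consequent, support, confidence in rules(apriori(postings, 0.5), 0.9):
    print(sorted(antecedent), "->", sorted(consequent), support, confidence)
```

A rule such as `jobkorea:web_programming -> worknet:web_dev` with high confidence is the kind of evidence that could support mapping one site's category onto another's; how the study actually chose thresholds and resolved conflicts is not stated in the abstract.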

A Comparative Study on Value Orientation about Family Norm between the older Generations and University Students (가정규범에 관한 기성세대와 대학생간의 가치의식 비교연구)

  • 이길표
    • Journal of the Korean Home Economics Association
    • /
    • v.32 no.3
    • /
    • pp.135-146
    • /
    • 1994
  • This study proposes a more practical approach to educating today's families in the norms of daily life, based on the family rules of traditional society, by comparing the views of the older generation and college students on family norm education. The study analysed family rules from the Chosun Dynasty, and a questionnaire was drawn up on that basis. The subjects of this study were college students from one of the largest cities and their 800 parents. The collected data were processed by frequency analysis, ANOVA, correlation, and regression using SPSS. The results show that the acceptance level is higher among the older generation, but that the need for family norm education is urgent across generations. People who have lived with grandparents also feel a greater need for family norm education. When this education cannot take place in families because of parents' excessive protection, examination-centered education, and the harmful effects of mass media, emphasis has to be placed on creating a culture of daily life in which family norms are continuously maintained through education at schools, education and culture centers, and public facilities.

Basin-Wide Multi-Reservoir Operation Using Reinforcement Learning (강화학습법을 이용한 유역통합 저수지군 운영)

  • Lee, Jin-Hee;Shim, Myung-Pil
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2006.05a
    • /
    • pp.354-359
    • /
    • 2006
  • The analysis of large-scale water resources systems is often complicated by the presence of multiple reservoirs and diversions, the uncertainty of unregulated inflows and demands, and conflicting objectives. Reinforcement learning is presented herein as a new approach to solving the challenging problem of stochastic optimization of multi-reservoir systems. The Q-Learning method, one of the reinforcement learning algorithms, is used to generate integrated monthly operation rules for the Keum River basin in Korea. The Q-Learning model is evaluated by comparison with implicit stochastic dynamic programming and sampling stochastic dynamic programming approaches. The evaluation of the stochastic basin-wide operational models considered several options relating to the choice of hydrologic state and discount factors, as well as various stochastic dynamic programming models. The Q-Learning model outperforms the other models in handling the uncertainty of inflows.
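
For readers unfamiliar with Q-Learning, the sketch below shows the tabular update rule on a toy single-reservoir problem. It is not the model from the paper: the storage levels, the 0/1 release action, the random inflow, the reward (release minus a spill penalty), and all parameter values are assumptions made only for illustration.

```python
import random


def q_update(q, state, action, reward, next_state, alpha, gamma):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def train(episodes=2000, steps=12, capacity=3, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a release policy for a toy reservoir with storage levels 0..capacity."""
    rng = random.Random(seed)
    actions = [0, 1]  # hypothetical actions: release 0 or 1 unit per step
    q = [[0.0 for _ in actions] for _ in range(capacity + 1)]
    for _ in range(episodes):
        storage = rng.randrange(capacity + 1)
        for _ in range(steps):
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = max(actions, key=lambda a: q[storage][a])
            release = min(action, storage)   # cannot release more than is stored
            inflow = rng.choice([0, 1, 2])   # uncertain inflow (assumed distribution)
            unclipped = storage - release + inflow
            spill = max(0, unclipped - capacity)
            next_storage = min(capacity, unclipped)
            reward = release - 0.5 * spill   # assumed objective: supply water, avoid spills
            q_update(q, storage, action, reward, next_storage, alpha, gamma)
            storage = next_storage
    return q


if __name__ == "__main__":
    table = train()
    for level, values in enumerate(table):
        print(level, [round(v, 2) for v in values])
```

The learned table plays the role of an operation rule: for each storage state, the action with the larger Q-value is the recommended release. A basin-wide model would use a much richer state (several reservoirs, a hydrologic state, the month), which this sketch does not attempt.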

An Analysis of the Research Methodologies and Techniques in the Industrial Engineering Using Text Mining (텍스트 마이닝을 이용한 산업공학 연구기법의 분석)

  • Cho, Geun Ho;Lim, Si Yeong;Hur, Sun
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.40 no.1
    • /
    • pp.52-59
    • /
    • 2014
  • We survey 3,857 journal articles published in four domestic academic journals in the industrial engineering field from 1975 to 2012. The titles, abstracts, and keywords of the papers are searched by means of a text mining technique to extract information on the methodologies and techniques adopted in the papers, and similar ones are then aggregated and merged to obtain a final set of 38 representative methodologies and techniques. Trends in these methodologies and techniques are studied by analyzing frequencies, clustering, and finding association rules among them. The results of the paper can shed light on the choice of tools for future education and research in areas related to industrial engineering.

A Post-analysis of the Association Rule Mining Applied to Internet Shopping Mall

  • Kim, Jae-Kyeong;Song, Hee-Seok
    • Proceedings of the Korea Intelligent Information System Society Conference
    • /
    • 2001.06a
    • /
    • pp.253-260
    • /
    • 2001
  • Understanding and adapting to changes in customer behavior is important for a company to survive in a continuously changing environment. The aim of this paper is to develop a methodology that automatically detects changes in customer behavior from customer profiles and sales data at different time snapshots. For this purpose, we first define three types of change: emerging patterns, unexpected changes, and added/perished rules. Then we develop similarity and difference measures for rule matching to detect all types of change. Finally, the degree of change is evaluated to detect significantly changed rules. The proposed methodology can evaluate the degree of change as well as automatically detect all kinds of change from data at different time snapshots. A case study for evaluation and the practical business implications of this methodology are also provided.

The Usage of Buildings in Tiantong Temple in the Song Era - Through Rules of Purity for the Chan Monastery and Five Mountains Ten Checks Figures - (송대(宋代) 천동사(天童寺)의 전각과 이용 - "선원청규(禪院淸規)"와 "오산십찰도"의 문헌을 중심으로 -)

  • Seo, A-Ri;Hong, Dae-Hyung
    • Journal of architectural history
    • /
    • v.14 no.2 s.42
    • /
    • pp.7-20
    • /
    • 2005
  • Ceremony is important to Buddhism as a part of religious practice. Buddhist ceremony is a kind of discipline, and it rules life in the Chan monastery. This discipline, called 「Qinggui(淸規)」, also forms a part of the practice for enlightenment in the Chan monastery(禪宗). Qinggui is derived from 「Baizhang's monastic code(百丈淸規)」, which no longer exists. 「Chanyuan qinggui(禪院淸規)」 is considered the oldest surviving Chinese monastic discipline. Its success is partly due to the emphasis in the Chan monastery on the succession of monks to abbothood. Qinggui has been called the only discipline in Buddhist monastic life. Whether it is also the discipline of the architectural space of the Chan temples is the focus of this thesis. The examination of this assumption may expand the meaning of Qinggui as embodying not only the religious form of discipline but also a fundamental part of the architectural archive. The majority of the buildings in the Chan monastery in Qinggui are related to 「Five Mountains Ten Checks Figures」. Above all, the analysis of the activities in each building clarifies how the elements of Qinggui are expressed. This proves that Qinggui has become a stipulation not only for the regulation of monastery life but also for the architectural code of the Chan temples. In conclusion, this study shows how the meaning of ceremony and monastery life in 「Chanyuan qinggui」 can be expanded to include the design program of temples. The research proves that there is a basic code in the Chan temples for designing the structure of the monastery space. Similarly, 「Five Mountains Ten Checks Figures」 was a diagram for examination and analysis as well as a tool for creating drawings of the temples in the Song era.

Emotional Display Rules and Emotional Labor Strategy of Childcare Teachers (보육교사의 정서표현규칙과 정서노동 수행전략에 관한 연구)

  • Lee, Yeon Jun;Suh, Young Sook
    • Korean Journal of Childcare and Education
    • /
    • v.11 no.5
    • /
    • pp.19-37
    • /
    • 2015
  • The purpose of this study was to identify the linkage between emotional display rules and emotional labor strategy, and the effects of the display rule factors on the emotional labor strategy. The participants were 268 childcare teachers in Seoul, and the collected data were analyzed using correlation analysis and multiple regression analysis. The results were as follows. First, display rule perception was positively related to deep acting and surface acting, and deep acting was positively related to display rule education, commitment, fairness of display rules, and explicit display rules. Second, display rule perception had a positive effect on deep acting and surface acting, and commitment to display rules had a positive effect on deep acting. This study provides practical implications for supporting childcare teachers' emotional labor and suggests directions for education programs on the emotional competence of childcare teachers.

Drug-likeness and Oral bioavailability for Chemical Compounds of Medicinal Materials Constituting Oryeong-san (오령산 구성약재 성분의 Drug-likeness와 Oral bioavailability)

  • Kim, Sang-Kyun;Lee, Seungho
    • The Korea Journal of Herbology
    • /
    • v.33 no.5
    • /
    • pp.19-37
    • /
    • 2018
  • Objectives : Oryeong-san is composed of Alismatis Rhizoma, Atractylodis Rhizoma Alba, Poria Sclerotium, Polyporus, and Cinnamomi Cortex, and is known to contain hundreds of chemical compounds. The aim of this study was to screen the chemical compounds constituting Oryeong-san for drug-likeness and oral bioavailability by analyzing their physicochemical properties. Methods : A list of chemical compounds of Oryeong-san was obtained from TM-MC (database of medicinal materials and chemical compounds in Northeast Asian traditional medicine). To remove redundant compounds, the SMILES (Simplified Molecular Input Line Entry System) string of each compound was identified. All of the physicochemical properties of the compounds were calculated using DruLiTo (Drug Likeness Tool). Drug-likeness was estimated by QED (Quantitative Estimate of Druglikeness), and OB (oral bioavailability) was checked based on Veber's rules. Results : A total of 475 compounds were obtained by eliminating duplicates among the 544 compounds of the 5 medicinal materials. Analysis of the physicochemical properties revealed that the most common values were MW (molecular weight) 200~300 g/mol, ALOGP (octanol-water partition coefficient) 1~2, HBA (number of hydrogen bond acceptors) 0~1, HBD (number of hydrogen bond donors) 0, PSA (polar surface area) 0~50 Å², ROTB (number of rotatable bonds) 1, AROM (number of aromatic rings) 0, and ALERT (number of structural alerts) 1. For QED, 93% of the values were between 0.2 and 0.7, and OB was TRUE for 90% of the compounds. Conclusions : In this paper, we screened the candidate active compounds of Oryeong-san using QED and Veber's rules. In the future, we will use the screening results to analyze the mechanism of Oryeong-san based on systems pharmacology.
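
The two screening steps described above, removing duplicates by SMILES string and checking Veber's rules, can be sketched as follows. This is an illustration under assumptions, not the authors' workflow: the compound records and descriptor values are placeholders rather than DruLiTo output, the helper names are hypothetical, and the SMILES strings are assumed to be already canonical so that identical compounds have identical strings. The thresholds are the commonly cited form of Veber's criteria (at most 10 rotatable bonds and a polar surface area of at most 140 Å²).

```python
def dedupe_by_smiles(compounds):
    """Keep the first record for each SMILES string (assumes canonical SMILES)."""
    unique = {}
    for compound in compounds:
        unique.setdefault(compound["smiles"], compound)
    return list(unique.values())


def passes_veber(rotatable_bonds, polar_surface_area):
    """Veber's criteria: <= 10 rotatable bonds and PSA <= 140 square angstroms."""
    return rotatable_bonds <= 10 and polar_surface_area <= 140.0


# Placeholder records: descriptor values are illustrative, not measured or computed.
compounds = [
    {"name": "compound_a", "smiles": "CCO", "rotb": 0, "psa": 20.2},
    {"name": "compound_a_dup", "smiles": "CCO", "rotb": 0, "psa": 20.2},
    {"name": "compound_b", "smiles": "c1ccccc1", "rotb": 0, "psa": 0.0},
    {"name": "compound_c", "smiles": "hypothetical-large-molecule", "rotb": 14, "psa": 180.0},
]

unique = dedupe_by_smiles(compounds)
for c in unique:
    print(c["name"], passes_veber(c["rotb"], c["psa"]))
```

Veber's criteria are also sometimes stated with a limit of 12 total hydrogen bond donors and acceptors in place of the PSA limit; the abstract does not say which form was applied.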