• Title/Summary/Keyword: Weight Mining

Search Result 157, Processing Time 0.023 seconds

Study on Classification Algorithm based on Weight of Support and Confidence Degree (지지도와 신뢰도의 가중치에 기반한 분류알고리즘에 관한 연구)

  • Kim, Keun-Hyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.13 no.4
    • /
    • pp.700-713
    • /
    • 2009
  • Most of any existing classification algorithm in data mining area have focused on goals improving efficiency, which is to generate decision tree more rapidly by utilizing just less computing resources. In this paper, we focused on the efficiency as well as effectiveness that is able to generate more meaningful classification rules in application area, which might consist of the ontology automatic generation, business environment and so on. For this, we proposed not only novel function with the weight of support and confidence degree but also analyzed the characteristics of the weighted function in theoretical viewpoint. Furthermore, we proposed novel classification algorithm based on the weighted function and the characteristics. In the result of evaluating the proposed algorithm, we could perceive that the novel algorithm generates more classification rules with significance more rapidly.

Exploring Issues Related to the Metaverse from the Educational Perspective Using Text Mining Techniques - Focusing on News Big Data (텍스트마이닝 기법을 활용한 교육관점에서의 메타버스 관련 이슈 탐색 - 뉴스 빅데이터를 중심으로)

  • Park, Ju-Yeon;Jeong, Do-Heon
    • Journal of Industrial Convergence
    • /
    • v.20 no.6
    • /
    • pp.27-35
    • /
    • 2022
  • The purpose of this study is to analyze the metaverse-related issues in the news big data from an educational perspective, explore their characteristics, and provide implications for the educational applicability of the metaverse and future education. To this end, 41,366 cases of metaverse-related data searched on portal sites were collected, and weight values of all extracted keywords were calculated and ranked using TF-IDF, a representative term weight model, and then word cloud visualization analysis was performed. In addition, major topics were analyzed using topic modeling(LDA), a sophisticated probability-based text mining technique. As a result of the study, topics such as platform industry, future talent, and extension in technology were derived as core issues of the metaverse from an educational perspective. In addition, as a result of performing secondary data analysis under three key themes of technology, job, and education, it was found that metaverse has issues related to education platform innovation, future job innovation, and future competency innovation in future education. This study is meaningful in that it analyzes a vast amount of news big data in stages to draw issues from an education perspective and provide implications for future education.

Blast Coefficient for Bench Blasting (벤치발파 설계에서 발파계수 설정에 관한 연구)

  • Kim, Hee-Do;Kim, Jung-Kyu;Ko, Young-Hun;Noh, You-Song;Shin, Myeong-Jin;Yang, Hyung-Sik
    • Explosives and Blasting
    • /
    • v.33 no.1
    • /
    • pp.1-12
    • /
    • 2015
  • In this study, the domestic bench blasting sites were researched to set the blast coefficient C according to the type of rock and type of industry. With the use of the experimental data on the representative industrial explosives and the data of the manufacturers'data on explosives, powder coefficient e was set up. The blast coefficient C was 0.21~0.30 when the average value for 5 representative kinds of rocks including granite was searched. The blast coefficient C for quarrying, mining and construction sites were 0.22, 0.13 and 0.26 respectively. On the other hand, powder coefficient e was obtained in four elements such as reactive energy, ballistic mortar test, VOD, Langefors'strength per unit weight. e value for emulsion which is one of the representative explosives was found to be 1 while those of high performance emulsion and ANFO were found to be 0.9 and 1, respectively.

The Mechanical Properties of Limestones Distributed in Jecheon (제천지역 석회암의 역학적 특성에 관한 연구)

  • Kim, Jong Woo;Kim, Min Sik;Kim, Pyoung Gi;Nor, Seung Jae;Park, Chan;Jo, Young Do;Park, Sam Gyu
    • Tunnel and Underground Space
    • /
    • v.22 no.5
    • /
    • pp.354-364
    • /
    • 2012
  • In order to evaluate the physical properties of rock which might serve as a database for both mining and civil works, a lot of laboratory tests for Jecheon limestones were conducted to find unit weight, absorption ratio, porosity, elastic wave velocity, uniaxial compressive strength, Young's modulus, poisson's ratio, tensile strength, shore hardness, friction angle and cohesion. On investigation of the mechanical properties of both the gray limestone and the clayey limestone distributed in the studied region, the clayey limestone turned out to have more weak mechanical properties which might come from low unit weight, high absorption ratio and high porosity of rocks. The failure criteria of Jecheon limestones were discussed by means of both Mohr-Coulomb criterion and Hoek-Brown criterion. Regression analyses of the physical properties obtained from a lot of laboratory tests were also conducted by means of both linear and multiple regression analyses.

Analysis and Prediction Algorithms on the State of User's Action Using the Hidden Markov Model in a Ubiquitous Home Network System (유비쿼터스 홈 네트워크 시스템에서 은닉 마르코프 모델을 이용한 사용자 행동 상태 분석 및 예측 알고리즘)

  • Shin, Dong-Kyoo;Shin, Dong-Il;Hwang, Gu-Youn;Choi, Jin-Wook
    • Journal of Internet Computing and Services
    • /
    • v.12 no.2
    • /
    • pp.9-17
    • /
    • 2011
  • This paper proposes an algorithm that predicts the state of user's next actions, exploiting the HMM (Hidden Markov Model) on user profile data stored in the ubiquitous home network. The HMM, recognizes patterns of sequential data, adequately represents the temporal property implicated in the data, and is a typical model that can infer information from the sequential data. The proposed algorithm uses the number of the user's action performed, the location and duration of the actions saved by "Activity Recognition System" as training data. An objective formulation for the user's interest in his action is proposed by giving weight on his action, and change on the state of his next action is predicted by obtaining the change on the weight according to the flow of time using the HMM. The proposed algorithm, helps constructing realistic ubiquitous home networks.

Similarity Measurement with Interestingness Weight for Improving the Accuracy of Web Transaction Clustering (웹 트랜잭션 클러스터링의 정확성을 높이기 위한 흥미가중치 적용 유사도 비교방법)

  • Kang, Tae-Ho;Min, Young-Soo;Yoo, Jae-Soo
    • The KIPS Transactions:PartD
    • /
    • v.11D no.3
    • /
    • pp.717-730
    • /
    • 2004
  • Recently. many researches on the personalization of a web-site have been actively made. The web personalization predicts the sets of the most interesting URLs for each user through data mining approaches such as clustering techniques. Most existing methods using clustering techniques represented the web transactions as bit vectors that represent whether users visit a certain WRL or not to cluster web transactions. The similarity of the web transactions was decided according to the match degree of bit vectors. However, since the existing methods consider only whether users visit a certain URL or not, users' interestingness on the URL is excluded from clustering web transactions. That is, it is possible that the web transactions with different visit proposes or inclinations are classified into the same group. In this paper. we propose an enhanced transaction modeling with interestingness weight to solve such problems and a new similarity measuring method that exploits the proposed transaction modeling. It is shown through performance evaluation that our similarity measuring method improves the accuracy of the web transaction clustering over the existing method.

Design of Black Plastics Classifier Using Data Information (데이터 정보를 이용한 흑색 플라스틱 분류기 설계)

  • Park, Sang-Beom;Oh, Sung-Kwun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.67 no.4
    • /
    • pp.569-577
    • /
    • 2018
  • In this paper, with the aid of information which is included within data, preprocessing algorithm-based black plastic classifier is designed. The slope and area of spectrum obtained by using laser induced breakdown spectroscopy(LIBS) are analyzed for each material and its ensuing information is applied as the input data of the proposed classifier. The slope is represented by the rate of change of wavelength and intensity. Also, the area is calculated by the wavelength of the spectrum peak where the material property of chemical elements such as carbon and hydrogen appears. Using informations such as slope and area, input data of the proposed classifier is constructed. In the preprocessing part of the classifier, Principal Component Analysis(PCA) and fuzzy transform are used for dimensional reduction from high dimensional input variables to low dimensional input variables. Characteristic analysis of the materials as well as the processing speed of the classifier is improved. In the condition part, FCM clustering is applied and linear function is used as connection weight in the conclusion part. By means of Particle Swarm Optimization(PSO), parameters such as the number of clusters, fuzzification coefficient and the number of input variables are optimized. To demonstrate the superiority of classification performance, classification rate is compared by using WEKA 3.8 data mining software which contains various classifiers such as Naivebayes, SVM and Multilayer perceptron.

A Study on the Construction of Fisheries Producer Price Index (수산물 생산자물가지수 산정방식에 관한 고찰;-연근해 어획물을 중심으로-)

  • 이광진
    • The Journal of Fisheries Business Administration
    • /
    • v.27 no.1
    • /
    • pp.67-90
    • /
    • 1996
  • As an important factor of capitalism economics, price of commodities represents a certain country's economic index. For having correct price policy, there should be an appropriate mechanism to make and use systematic statistical data on price. Price statistics are made by indexes and price indexes are categorized into producer price index(PPI) and consumer price index(CPI). The Bank of Korea is publishing producer price index every year, but the producer price index contains some problems. These include as follows : (a) the impractical selection of fisheries products sample (b) uncorrect measure of aquatic products weights (c) investigating sample places. This study try to substitute producer price index of aquatic products and change construction of fisheries producer price index with experimental research on representative fisheries, weight of each fisheries, and suitability of investigating sample places. It is possible to improve practical fisheries producer price index with the results of this research. The findings are as follow. (a) It will be helpful for the government to make the fisheries price policy. (b) It can be used to understand trends of accurate price and price increase of aquatic products, and it's possible to compare with it other industrial indexes including the mining, agricultural, and manufacturing industry and understand relative price movement. (c) When free sales systems of fisheries products as expected, it will be helpful to analyze price movement of producing fisheries cooperatives, producing fisheries market and consuming fisheries market, analysis of market, and formation and consideration of budget. (d) It can be an important index to determine labor wage.

  • PDF

LANDSLIDE SUSCEPTIBILITY ANALYSIS USING GIS AND ARTIFICIAL NEURAL NETWORK

  • Lee, Moung-Jin;Won, Joong-Sun;Lee, Saro
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.256-272
    • /
    • 2002
  • The purpose of this study is to develop landslide susceptibility analysis techniques using artificial neural network and to apply the newly developed techniques to the study area of Boun in Korea. Landslide locations were identified in the study area from interpretation of aerial photographs, field survey data, and a spatial database of the topography, soil type, timber cover, geology and land use. The landslide-related factors (slope, aspect, curvature, topographic type, soil texture, soil material, soil drainage, soil effective thickness, timber type, timber age, and timber diameter, timber density, geology and land use) were extracted from the spatial database. Using those factors, landslide susceptibility was analyzed by artificial neural network methods. For this, the weights of each factor were determinated in 3 cases by the backpropagation method, which is a type of artificial neural network method. Then the landslide susceptibility indexes were calculated and the susceptibility maps were made with a GIS program. The results of the landslide susceptibility maps were verified and compared using landslide location data. A GIS was used to efficiently analyze the vast amount of data, and an artificial neural network was turned out be an effective tool to maintain precision and accuracy.

  • PDF

A New Ensemble System using Dynamic Weighting Method (동적 중요도 결정 방법을 이용한 새로운 앙상블 시스템)

  • Seo, Dong-Hun;Lee, Won-Don
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.6
    • /
    • pp.1213-1220
    • /
    • 2011
  • In this paper, a new ensemble system using dynamic weighting method with added weight information into classifiers is proposed. The weights used in the traditional ensemble system are those after the training phase. Once extracted, the weights in the traditional ensemble system remain fixed regardless of the test data set. One way to circumvent this problem in the gating networks is to update the weights dynamically by adding processes making architectural hierarchies, but it has the drawback of added processes. A simple method to update weights dynamically, without added processes, is proposed, which can be applied to the already established ensemble system without much of the architectural modification. Experiment shows that this method performs better than AdaBoost.