• Title/Summary/Keyword: Distributed Data Mining

Search Result 110, Processing Time 0.028 seconds

A Database Schema Integration Method Using XML Schema (XML Schema를 이용한 이질의 데이터베이스 스키마 통합)

  • 박우창
    • Journal of Internet Computing and Services
    • /
    • v.3 no.2
    • /
    • pp.39-56
    • /
    • 2002
  • In distributed computing environments, there are many database applications that should share data each other such as data warehousing and data mining with autonomy on local databases. The first step to such applications is the integration of heterogeneous database schema, but there is no accepted common data model for the integration and also are difficulties on the construction of integration program. In this paper, we use the XML Schema for the representation of common data model and exploit XSLT for reducing the programming difficulties. We define the schema integration operations and develop a methodology for the semi-automatic schema integration according to schema conflicts types. Our integration method has benefits on standardization, extendibility on schema integration process comparing to existing methodologies.

  • PDF

Study on the Metal Ore Deposits of Gyeongsang buk-do Area (경상북도(慶尙北道) 일원(一圓)에 부존(賦存)하고 있는 금속지하자원(金屬地下資源)의 지질광상학적(地質鑛床學的) 연구(硏究))

  • Kim, Y.K.;Lee, J.Y.;Kim, S.W.;Koh, I.S.
    • Economic and Environmental Geology
    • /
    • v.9 no.3
    • /
    • pp.143-156
    • /
    • 1976
  • The Cretaceous metal ore deposits in the Gyeongsang basin of Gyeongsangbuk-do are characterized by the formation of metallogenic provinces which show zonal distribution pattern around Yeonil province where pneumatolytic type is dominated and hydrothermal type are distributed in the order of decreasing temperature type outward. Some Cretaceous granitic rocks include zoned alkali feldspars which reflect rapid variation of $H_2O$ during emplacement and crystallization of the water-saturated granitic magma. The ore deposits are considered to be originated from upward transportation of ore solution from the excess of water exhausted from uprising magma, which seems to be intimately related to the fact that the majority of the ore deposits in Daegu area are cummulated around the granites including zoned alkali feldspars. In order to collect geochemical data necessary for geochemical exploration in the study area, certain trace elements were chosen as pathfinders from monzonite and soil in the vicinity of Dalsung Tungsten Mine by studying the dispersion patterns of trace elements: Ba and Sr show trends to decrease toward ore deposit while Cu, Pb, and Mo increase. Around mining area there are distributed apparently Equisetum arvense Linne and Mentha sachinensis Kudo which may be used as index plants. In the viewpoint of geologic structure, the trends of the ore veins in contact aureole around the Palgongsan granite body correspond with the pre- and syn- plutonism joint pattern in hornfels in the area.

  • PDF

Geophysical and Geological Exploration of Cobalt-rich Ferromanganese Crusts on a Seamount in the Western Pacific (서태평양 해저산 고코발트 망간각 자원평가를 위한 광역 탐사 방안)

  • Kim, Jonguk;Ko, Young-Tak;Hyeong, Kiseong;Moon, Jai-Woon
    • Economic and Environmental Geology
    • /
    • v.46 no.6
    • /
    • pp.569-580
    • /
    • 2013
  • Co-rich ferromanganese crusts (Fe-Mn crusts) distributed on the seamounts in the western Pacific are potential economic resources for cobalt, nickel, platinum, and other rare metals in the future. Regulations for prospecting and exploration of Fe-Mn crusts in the Area, which enables the process to obtain an exclusive exploration right for blocks of the fixed size, were enacted recently by the International Seabed Authority, which led to public attention on its potential for commercial development. Evaluation and selection of a mining site can be established based on abundance and grade of Fe-Mn crusts in the site as well as topography that should be smooth enough for mining efficiency. Therefore, acquisition of shipboard echo-sounding and acoustic backscatter data are prerequisite to select potential mine sites in addition to visual and sampling operations. Acoustic backscatter data can be used to locate crust-covered areas in a regional scale with the understanding of acoustic properties of crust through its correlation with visual and sampling data. KIOST had collected the topographic and geologic data to assess the resources potential for Fe-Mn crusts in the west Pacific region from 1994 to 2001. However, they could not obtain acoustic backscatter data that is crucial for the selection of prospective mining sites. Therefore, additional exploration surveys are required to carry out side scan sonar mapping combined with seafloor observation and sampling to decide the blocks for application of an exclusive exploration right.

A Study on the Effects of Online Word-of-Mouth on Game Consumers Based on Sentimental Analysis (감성분석 기반의 게임 소비자 온라인 구전효과 연구)

  • Jung, Keun-Woong;Kim, Jong Uk
    • Journal of Digital Convergence
    • /
    • v.16 no.3
    • /
    • pp.145-156
    • /
    • 2018
  • Unlike the past, when distributors distributed games through retail stores, they are now selling digital content, which is based on online distribution channels. This study analyzes the effects of eWOM (electronic Word of Mouth) on sales volume of game sold on Steam, an online digital content distribution channel. Recently, data mining techniques based on Big Data have been studied. In this study, emotion index of eWOM is derived by emotional analysis which is a text mining technique that can analyze the emotion of each review among factors of eWOM. Emotional analysis utilizes Naive Bayes and SVM classifier and calculates the emotion index through the SVM classifier with high accuracy. Regression analysis is performed on the dependent variable, sales variation, using the emotion index, the number of reviews of each game, the size of eWOM, and the user score of each game, which is a rating of eWOM. Regression analysis revealed that the size of the independent variable eWOM and the emotion index of the eWOM were influential on the dependent variable, sales variation. This study suggests the factors of eWOM that affect the sales volume when Korean game companies enter overseas markets based on steam.

The evaluation of Distributed Data Mining System using USA census Database (미국 인구통계 데이터를 이용한 분산형 데이터마이닝 시스템 성능평가)

  • Kim, Choong-Gon;Woo, Jung-Geun;Kim, Sung-Guk;Baik, Sung-Wook
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2007.10c
    • /
    • pp.191-194
    • /
    • 2007
  • 본 논문에서는 분산형 환경에 적합한 새로운 의사결정나무 알고리즘을 제안하고 그 실용성을 확인하기 위해 분산형 데이터마이닝 시스템을 구현하였다. 그리고 본 논문에서 구현한 시스템을 평가하기 위해 데이터의 신뢰성이 높은 방대한 양의 미국의 인구통계 데이터(Census bureau database)를 사용하였다. 본 논문에서 구현한 시스템을 이용하여 신뢰성을 테스트하였고 그 결과가 다른 시스템의 알고리즘과 유사한 신뢰성을 나타내었다.

  • PDF

Intrusion Detection System Model using agent teaming in network (네트워크에서 에이전트 학습을 이용한 침입탐지시스템 모델)

  • 정종근;김용호;이윤배
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.8
    • /
    • pp.1346-1351
    • /
    • 2002
  • It is very complex to construct Intrusion Detection System in distributed network environment than simple ones. Especially, In the collecting and analysis of logdata from out different operating system break out much problem. So In this paper, We present a Intrusion Detection System model applying agent teaming system to solve these problem. We apply the data Mining algorithm for agent learning.

Trace Element and Mineral Chemistry of the Cretaceous Granites in the Southern Mungyeong Area (문경남부일대(聞慶南部一帶)에 분포(分布)하는 백악기(白堊紀) 화강암류(花崗岩類)의 미량원소(微量元素) 및 광물화학(鑛物化學))

  • Yun, Hyun Soo
    • Economic and Environmental Geology
    • /
    • v.24 no.4
    • /
    • pp.379-391
    • /
    • 1991
  • The studied Cretaceous granties are widely distributed at the southern Mungyeong area in the southwestern part of Ogcheon Fold Belt. From the mineralogical and geochemical compositions, it is suggested that they show the characteristics of I-type and magnetite-series and formed under the conditions of high oxygen fugacity. The mineral chemistry of plagioclase, alkali feldspar and biotite in the granites by EMPA, was revealed as albite to oligoclase, microcline to microcline perthite and orthoclase perthite, and annite compositions, respectively. The granites have the distribution patterns of enriched LREE and depleted HREE, and show Eu negative anomalies suggesting mainly due to the feldspar fractionation in the residual magma. The geochemical data of Eu, EU/$^*Eu$, Sm and Gd suggest that the granites of the area have more abundant alkali feldspar crystallization than plagioclase. From the geochemical characteristics of Sr/Ba, La/Sm vs. Ce/Yb and other trace element evidences, the granites were the late stage products of differentiation and fractionated from a homogeneous parental granitic magma.

  • PDF

Privacy Preserving Distributed Data Mining of Sequential Patterns on Horizontally Partitioned Databases (수평 분산 데이터베이스 상의 세부 데이터 유출이 없는 순차 패턴 마이닝 기법)

  • Kim, Seung-Woo;Won, Jung-Im;Park, Sang-Hyun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.61-63
    • /
    • 2005
  • 본 논문에서는 수평 분산 데이터베이스에서 각 로컬 데이터베이스의 세부 데이터를 유출하지 않는 순차패턴 마이닝 기법을 제안한다. 데이터 마이닝은 대용량 데이터베이스에서 유용한 지식을 추출하는 기법으로서 각광을 받고 있다. 그러나 분산 데이터베이스를 대상으로 마이닝을 수행하는 경우, 데이터 공유에 따른 개인 혹인 집단의 프라이버시가 유출될 수 있다는 문제점이 존재한다. 따라서 본 논문에서는 프라이버시 보호를 위하여 각 로컬 데이터베이스의 세부 데이터를 보호하면서도, 마이닝 결과의 정확성을 보장할 수 있는 새로운 순차 패턴 마이닝 기법을 제안한다. 제안된 기법에서는 우선, 세부 데이터의 유출을 방지하기 위하여 마이닝의 대상이 되는 항목과 항목간의 시간 선후 관계의 성립 여부를 벡터로 표현한 후, 이들 벡터간의 스칼라 프로덕트 연산을 수행하여 얻어진 결과를 패턴의 지지도로 활용하는 방안을 제안하였다. 또한, 연산 결과에 영향을 미치지 않는 벡터를 미리 제거하여 스칼라 프로덕트 연산에 따른 비용을 감소시키는 방안을 제안하였다.

  • PDF

An Interpretation of Interoperability Definitions Using Association Rules Discovery (연관성 규칙 탐사를 이용한 상호운용성 정의의 해석)

  • Heo, Hwan;Kim, Ja-Hee
    • The Journal of Society for e-Business Studies
    • /
    • v.16 no.2
    • /
    • pp.39-71
    • /
    • 2011
  • Lately, developing systems fully interoperable with others is considered an essential element for successful projects, as not only do e-commerce becomes ubiquitous but also distributed systems' paradigm spreads. However, since definitions of interoperability vary by viewpoints, it is still difficult to have the same understanding and evaluation criteria on interoperability. For instance, various interoperability parties in military use different definitions of interoperability, and its T&E is not conducted according to the definition, but only to levels of information exchange. In this paper, we proposed a new definition of interoperability as followsm First of all, we collected existing and various interoperability definitions, extracting key components in each of them. Second, we statistically analyzed those components and applied the association rules discovery in data mining. We compared existing interoperability definitions to ours. From this research, we found associations among the components from various definitions applying market-basketanalysis, redefining interoperability. Key findings of this research can contribute to a unified viewpoint on the definition, level, and evaluation items of interoperability.

Predicting the Direction of the Stock Index by Using a Domain-Specific Sentiment Dictionary (주가지수 방향성 예측을 위한 주제지향 감성사전 구축 방안)

  • Yu, Eunji;Kim, Yoosin;Kim, Namgyu;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.1
    • /
    • pp.95-110
    • /
    • 2013
  • Recently, the amount of unstructured data being generated through a variety of social media has been increasing rapidly, resulting in the increasing need to collect, store, search for, analyze, and visualize this data. This kind of data cannot be handled appropriately by using the traditional methodologies usually used for analyzing structured data because of its vast volume and unstructured nature. In this situation, many attempts are being made to analyze unstructured data such as text files and log files through various commercial or noncommercial analytical tools. Among the various contemporary issues dealt with in the literature of unstructured text data analysis, the concepts and techniques of opinion mining have been attracting much attention from pioneer researchers and business practitioners. Opinion mining or sentiment analysis refers to a series of processes that analyze participants' opinions, sentiments, evaluations, attitudes, and emotions about selected products, services, organizations, social issues, and so on. In other words, many attempts based on various opinion mining techniques are being made to resolve complicated issues that could not have otherwise been solved by existing traditional approaches. One of the most representative attempts using the opinion mining technique may be the recent research that proposed an intelligent model for predicting the direction of the stock index. This model works mainly on the basis of opinions extracted from an overwhelming number of economic news repots. News content published on various media is obviously a traditional example of unstructured text data. Every day, a large volume of new content is created, digitalized, and subsequently distributed to us via online or offline channels. Many studies have revealed that we make better decisions on political, economic, and social issues by analyzing news and other related information. In this sense, we expect to predict the fluctuation of stock markets partly by analyzing the relationship between economic news reports and the pattern of stock prices. So far, in the literature on opinion mining, most studies including ours have utilized a sentiment dictionary to elicit sentiment polarity or sentiment value from a large number of documents. A sentiment dictionary consists of pairs of selected words and their sentiment values. Sentiment classifiers refer to the dictionary to formulate the sentiment polarity of words, sentences in a document, and the whole document. However, most traditional approaches have common limitations in that they do not consider the flexibility of sentiment polarity, that is, the sentiment polarity or sentiment value of a word is fixed and cannot be changed in a traditional sentiment dictionary. In the real world, however, the sentiment polarity of a word can vary depending on the time, situation, and purpose of the analysis. It can also be contradictory in nature. The flexibility of sentiment polarity motivated us to conduct this study. In this paper, we have stated that sentiment polarity should be assigned, not merely on the basis of the inherent meaning of a word but on the basis of its ad hoc meaning within a particular context. To implement our idea, we presented an intelligent investment decision-support model based on opinion mining that performs the scrapping and parsing of massive volumes of economic news on the web, tags sentiment words, classifies sentiment polarity of the news, and finally predicts the direction of the next day's stock index. In addition, we applied a domain-specific sentiment dictionary instead of a general purpose one to classify each piece of news as either positive or negative. For the purpose of performance evaluation, we performed intensive experiments and investigated the prediction accuracy of our model. For the experiments to predict the direction of the stock index, we gathered and analyzed 1,072 articles about stock markets published by "M" and "E" media between July 2011 and September 2011.