• Title/Summary/Keyword: On-Line Mining

Search Result 129, Processing Time 0.024 seconds

Integrated System of On-Off Line in Agricultural Products Electronic Commerce Based on Data Mining (데이터 마이닝을 이용한 농산물 전자상거래의 온 오프라인 통합시스템)

  • 주종문;황승국
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.25 no.3
    • /
    • pp.58-63
    • /
    • 2002
  • The Internet, as a commercial tool, presented a new market that connects producers with consumers through the E-commerce. Now, E-commerce spreads over almost all industries through the Internet excluding some. This research indicates the reason why the E-commerce is not activated in agricultural Industry, which is less developed than other industries. And it suggests a good example of E-commerce on the agricultural products combining on and off line markets. In addition, data-mining technique is suggested to analyze whole information in system.

An Application of Data Mining Techniques in Electronic Commerce (전자상거래에서 지식탐사기법의 활용에 관한 연구)

  • Sung Tae-Kyung;Chu Seok-Chin;Kim Joong-Han;Hong Jun-Seok
    • The Journal of Information Systems
    • /
    • v.14 no.2
    • /
    • pp.277-292
    • /
    • 2005
  • This paper uses a data mining approach to develop bankruptcy prediction models suitable for traditional (off-line) companies and electronic (on-line) companies. It observes the differences in the composition prediction models between these two types of companies and provides interpretation of bankruptcy classifications. The bankruptcy prediction models revealed the major variables in predicting bankruptcy to be 'cash flow to total assets' and 'gross value-added to net sales' for traditional off-line companies while 'cash flow to liabilities','gross value-added to net sales', and 'current ratio' for electronic companies. The accuracy rates of final prediction models for traditional off-line and electronic companies were found to be $84.7\%\;and\;82.4\%$, respectively. When the model for traditional off-line companies was applied for electronic companies, prediction accuracy dropped significantly in the case of bankruptcy classification (from $70.4\%\;to\;45.2\%$) at the level of a blind guess ($41.30\%$). Therefore, the need for different models for traditional off-line and electronic companies is justified.

  • PDF

A Study on Constructing the Prediction System Using Data Mining Techniques to Find Medium-Voltage Customers Causing Distribution Line Faults (특별고압 수전설비 관리에 데이터 마이닝 기법을 적용한 파급고장 발생가능고객 예측시스템 구현 연구)

  • Bae, Sung-Hwan;Kim, Ja-Hee;Lim, Han-Seung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.12
    • /
    • pp.2453-2461
    • /
    • 2009
  • Faults caused by medium-voltage customers have been increased and enlarged their portion in total distribution faults even though we have done many efforts. In the previous paper, we suggested the fault prediction model and fault prevention method for these distribution line faults. However we can't directly apply this prediction model in the field. Because we don't have an useful program to predict those customers causing distribution line faults. This paper presents the construction method of data warehouse in ERP system and the program to find customers who cause distribution line faults in medium-voltage customer's electric facility management applying data mining techniques. We expect that this data warehouse and prediction program can effectively reduce faults resulted from medium-voltage customer facility.

A study on Reliability Enhancement Method and the Prediction Model Construction of Medium-Voltage Customers Causing Distribution Line Fault Using Data Mining Techniques (데이터 마이닝 기법을 이용한 특별고압 파급고장 발생가능 고객 예측모델 구축 및 신뢰도 향상방안에 관한 연구)

  • Bae, Sung-Hwan;Kim, Ja-Hee;Hong, Jung-Sik;Lim, Han-Seung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.10
    • /
    • pp.1869-1880
    • /
    • 2009
  • Distribution line fault has been reduced gradually by the efforts on improving the quality of electrical materials and distribution system maintenance. However faults caused by medium voltage customers have been increased gradually even though we have done many efforts. The problem is that we don't know which customer will cause the fault. This paper presents the concept to find these customers using data mining techniques, which is based on accumulated fault records of medium voltage customers in the past. It also suggests the prediction model construction of medium voltage customers causing distribution line fault and methods to enhance the reliability of distribution system. We expect that we can effectively reduce faults resulted from medium voltage customers, which is 30% of total faults.

A Case Study of OLAP and Data Mining on the Analytical Knowledge Creation in Organizations (OLAP과 데이터마이닝을 이용한 조직내 분석지 생성에 관한 사례연구)

  • Cho, Jae-Hee
    • Knowledge Management Research
    • /
    • v.5 no.1
    • /
    • pp.69-82
    • /
    • 2004
  • Prior research on knowledge management focused more on the experiential knowledge based on individual's experience or knowhow than on the analytical knowledge extracted from corporate data. This study examines the effects of the data warehouse technology, especially OLAP(on line analytical processing) and data mining techniques, on the analytical knowledge creation in organizations, linking analytical knowledge creation to data analysis method through real world case studies.

  • PDF

Geoacoustic Characteristics of P-Wave Velocity in Donghae City - Ulleung Island Line, East Sea: Preliminary Results (동해시-울릉도 해저 측선에서의 P파 속도 지음향 특성: 예비 결과)

  • Ryang, Woo-Hun;Kwon, Yi-Kyun;Jin, Jae-Hwa;Kim, Hyun-Tae;Lee, Chi-Won;Jung, Ja-Hun;Kim, Dae-Choul;Choi, Jin-Hyuk;Kim, Young-Gyu;Kim, Sung-Il
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.2E
    • /
    • pp.44-49
    • /
    • 2007
  • Donghae City - Ulleung Island Line (DC-UI Line) is a representative line for underwater and geoacoustic modeling in the middle western East Sea. In this line, an integrated model of P-wave velocity is proposed for a low-frequency range target (<200 Hz), based on high-resolution seismic profiles (2 - 7 kHz sonar and air-gun), shallow and deep cores (grab, piston, and Portable Remote Operated Drilling), and outcrop geology (Tertiary rocks and the basement on land). The basement comprises 3 geoacoustic layers of P-wave velocity ranging from 3750 to 5550 m/s. The overlying sediments consist of 7 layers of P-wave velocities ranging from 1500 to 1900 m/s. The bottom model shows that the structure is very irregular and the velocity is also variable with both vertical and lateral extension. In this area, seabed and underwater acousticians should consider that low-frequency acoustic modeling is very range-dependent and a detailed geoacoustic model is necessary for better modeling of acoustic propagation such as long-range surveillance of submarines and monitoring of currents.

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.10a
    • /
    • pp.59-69
    • /
    • 2005
  • Data mining technique is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are decision tree, association rules, clustering, neural network and so on. Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. It has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on off-line or on-line and so on. We analyze Gyeongnam social indicator survey data by 2001 using twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

  • PDF

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Data mining technique is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are decision tree, association rules, clustering, neural network and so on. Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. It has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on off-line or on-line and so on. We analyze Gyeongnam social indicator survey data by 2001 using twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

  • PDF

Web Mining for successful e-Business based on Artificial Intelligence Techniques (성공적인 e-Business를 위한 인공지능 기법 기반 웹 마이닝)

  • 이장희;유성진;박상찬
    • Journal of Intelligence and Information Systems
    • /
    • v.8 no.2
    • /
    • pp.159-175
    • /
    • 2002
  • Web mining is an emerging science of applying modem data mining technologies to the problem of extracting valid, comprehensible, and actionable information from large databases of web in e-Business environment and of using it to make crucial e-Business decisions. In this paper, we present the noble framework of data visualization system based on web mining for analyzing the characteristics of on-line customers in e-Business. We also propose the framework of forecasting system for providing the forecasting information of sales/purchase through the use of web mining based on artificial intelligence techniques such as back-propagation network, memory-based reasoning, and self-organizing map.

  • PDF

Parallel Algorithm for Spatial Data Mining Using CUDA

  • Oh, Byoung-Woo
    • Journal of Advanced Information Technology and Convergence
    • /
    • v.9 no.2
    • /
    • pp.89-97
    • /
    • 2019
  • Recently, there is an increasing demand for applications utilizing maps and locations such as autonomous vehicles and location-based services. Since these applications are developed based on spatial data, interest in spatial data processing is increasing and various studies are being conducted. In this paper, I propose a parallel mining algorithm using the CUDA library to efficiently analyze large spatial data. Spatial data includes both geometric (spatial) and non-spatial (aspatial) attributes. The proposed parallel spatial data mining algorithm analyzes both the geometric and non-spatial relationships between two layers. The experiment was performed on graphics cards containing CUDA cores based on TIGER/Line data, which is the actual spatial data for the US census. Experimental results show that the proposed parallel algorithm using CUDA greatly improves spatial data mining performance.