• Title/Summary/Keyword: mining system

Search Result 1,840, Processing Time 0.032 seconds

Text Mining in Online Social Networks: A Systematic Review

  • Alhazmi, Huda N
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.3
    • /
    • pp.396-404
    • /
    • 2022
  • Online social networks contain a large amount of data that can be converted into valuable and insightful information. Text mining approaches allow exploring large-scale data efficiently. Therefore, this study reviews the recent literature on text mining in online social networks in a way that produces valid and valuable knowledge for further research. The review identifies text mining techniques used in social networking, the data used, tools, and the challenges. Research questions were formulated, then search strategy and selection criteria were defined, followed by the analysis of each paper to extract the data relevant to the research questions. The result shows that the most social media platforms used as a source of the data are Twitter and Facebook. The most common text mining technique were sentiment analysis and topic modeling. Classification and clustering were the most common approaches applied by the studies. The challenges include the need for processing with huge volumes of data, the noise, and the dynamic of the data. The study explores the recent development in text mining approaches in social networking by providing state and general view of work done in this research area.

An Efficient Algorithm for Mining Frequent Sequences In Spatiotemporal Data

  • Vhan Vu Thi Hong;Chi Cheong-Hee;Ryu Keun-Ho
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 2005.11a
    • /
    • pp.61-66
    • /
    • 2005
  • Spatiotemporal data mining represents the confluence of several fields including spatiotemporal databases, machine loaming, statistics, geographic visualization, and information theory. Exploration of spatial data mining and temporal data mining has received much attention independently in knowledge discovery in databases and data mining research community. In this paper, we introduce an algorithm Max_MOP for discovering moving sequences in mobile environment. Max_MOP mines only maximal frequent moving patterns. We exploit the characteristic of the problem domain, which is the spatiotemporal proximity between activities, to partition the spatiotemporal space. The task of finding moving sequences is to consider all temporally ordered combination of associations, which requires an intensive computation. However, exploiting the spatiotemporal proximity characteristic makes this task more cornputationally feasible. Our proposed technique is applicable to location-based services such as traffic service, tourist service, and location-aware advertising service.

  • PDF

Data Mining System in the Service Industry : Delphi Study

  • Hyun, Sung-Hyup;Huh, Jin;Hahm, Sung-Pil
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.10 no.4
    • /
    • pp.128-136
    • /
    • 2005
  • The use of technology is increasing within the service industry, but there is some doubt as to whether the benefits of employing this technology have been efficiently harnessed such as data mining. Data mining is the process of extracting certain predictive information from databases that can evolve from currently used restaurant management systems. The potential of harnessing this predictive information can have an enormous impact on the restaurant's operation on the whole, particularly in the area customer retention and competition. Since there is insufficient literature on the use of data mining in the restaurant industry, this study is both seminal and investigative, done via a Delphi survey to explore and describe the current and future applications of this process.

  • PDF

A Comparison on the Efficiency of Data Mining Softwares (데이터마이닝 소프트웨어의 기능 및 효율성 비교에 관한 사례연구)

  • 한상태;강현철;이성건;이덕기
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.201-211
    • /
    • 2002
  • Data is being generated at an ever increasing rate in recent years, mainly due to technological advances in system architecture, processor speed, and storage structures. In this respect, data mining has attracted considerable attention and many commercial softwares for data mining have been developed. In this study, we compare the differences of functions and efficiency of application about several commercial data mining softwares which are widely used in real field.

DSS Architectures to Support Data Mining Activities for Supply Chain Management (데이터 마이닝을 활용한 공급사슬관리 의사결정지원시스템의 구조에 관한 연구)

  • Jhee, Won-Chul;Suh, Min-Soo
    • Asia pacific journal of information systems
    • /
    • v.8 no.3
    • /
    • pp.51-73
    • /
    • 1998
  • This paper is to evaluate the application potentials of data mining in the areas of Supply Chain Management (SCM) and to suggest the architectures of Decision Support Systems (DSS) that support data mining activities. We first briefly introduce data mining and review the recent literatures on SCM and then evaluate data mining applications to SCM in three aspects: marketing, operations management and information systems. By analyzing the cases about pricing models in distribution channels, demand forecasting and quality control, it is shown that artificial intelligence techniques such as artificial neural networks, case-based reasoning and expert systems, combined with traditional analysis models, effectively mine the useful knowledge from the large volume of SCM data. Agent-based information system is addressed as an important architecture that enables the pursuit of global optimization of SCM through communication and information sharing among supply chain constituents without loss of their characteristics and independence. We expect that the suggested architectures of intelligent DSS provide the basis in developing information systems for SCM to improve the quality of organizational decisions.

  • PDF

A Comparative Analysis for the knowledge of Data Mining Techniques with Experties (Data Mining 기법들과 전문가들로부터 추출된 지식에 관한 실증적 비교 연구)

  • 김광용;손광기;홍온선
    • Journal of Intelligence and Information Systems
    • /
    • v.4 no.1
    • /
    • pp.41-58
    • /
    • 1998
  • 본 연구는 여러 가지 Data Mining 기법들로부터 도출된 지식과 AHP를 이용하여 도출된 전문가의 지식을 사용된 정보의 특성에 따라 조사하고, 이러한 각각의 지식들을 중심으로 부도예측 모형을 설계한 후, 각 모형의 특성 및 부도예측력에 대한 실증적 비교연구에 그 목적을 두고 있다. 사용된 Data Mining 기법들은 통계적 다중판별분석 모형, ID3 모형, 인공신경망 모형이며, 전문가 지식의 추출은 AHP를 사용하여 45명의 전문가로부터 부도와 관련하여 인터뷰 및 설문조사를 실시하였다. 특히 부도예측에 사용된 변수의 특성을 정량적 재무정보와 정성적 비재무정보로 나누어서 각 모형의 특성을 비교연구하였다. 연구결과 부도예측시 정성적정보의 중요성을 확인하였으며, 전문가의 지식을 기반으로한 AHP 모형이 위험예측모형으로 사용될 수 있음을 실증적으로 보여주었다.

  • PDF

A LF based Answer Indexing Method for Encyclopedia Question-Answering System (백과사전 질의응답을 위한 구문정보기반 정답색인방법)

  • Kim Hyeon-Jin;Lee Chung-Hee;Oh Hyo-Jung;Wang Ji-Hyun;Jang Myung-Gil
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.511-513
    • /
    • 2005
  • 본 논문은 정답 색인 방법을 이용하여 응답 속도가 빠르고 정확한 백과사전 질의응답 시스템을 구현하는 방법을 제안한다. 논문에서 제안한 정답 색인 방법은 대상 문서에서 160여 개의 정답 유형 범주에 해당하는 정답 후보를 인식하고, 정답 후보와 색인 범주에 속하는 키워드를 색인단위로 정의하여 저장하였다. 특히 용언정보에 대해서는 LF(Logical Form)단위로 색인하여 색인 정확도를 높였다. 정답 랭킹에서는 사용자 질문에서 각 단어별로 문장 성분. 단어 가중치 정보 등을 이용하여, 필수단어를 산정하고 이를 정답랭킹의 방법으로 활용하였다. 이러한 방법론은 용언 정보를 활용해야 효과적인 백과사전이라는 문서 도메인의 특성을 반영하고, 빠른 질문 응답 시간을 보장하는 백과사전 질의응답 시스템에 적합하다.

  • PDF

Abnormal Sound Detection and Identification in Surveillance System (감시 시스템에서의 비정상 소리 탐지 및 식별)

  • Joo, Young-min;Lee, Eui-jong;Kim, Jeong-sik;Oh, Seung-geun;Park, Dai-hee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.592-595
    • /
    • 2010
  • 본 논문에서는 감시카메라 환경에서 취득한 오디오 데이터를 입력으로 하여, 비정상 상황을 인식하는 시스템을 제안한다. 제안된 시스템은 단일클래스 SVM의 대표적인 모델인 SVDD와 최근 얼굴 인식 분야에서 성공적인 업적을 보여주고 있는 신호 처리 분야의 SRC를 계층적으로 결합한 구조로써, 첫 번째 계층에서는 SVDD로 비정상 소리를 신속하게 탐지하여 관리자에게 알람 경고하고, 두 번째 계층의 SRC는 탐지된 비정상 소리를 유형별로 세분화 식별하여 관리자에게 비상 상황을 보고함으로써 관리자의 위기 상황 대처를 돕는다. 제안된 시스템은 실시간 처리가 가능하며, 점증적 갱신의 학습 능력으로 인하여 비정상 오디오 데이터베이스의 변화에도 능동적으로 적응할 수 있다. 실험을 통하여 제안된 시스템의 성능을 검증한다.

An Active Mining Framework Design using Spatial-Temporal Ontology (시공간 온톨로지를 이용한 능동 마이닝 프레임워크 설계)

  • Hwang, Jeong-Hee;Noh, Si-Choon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.9
    • /
    • pp.3524-3531
    • /
    • 2010
  • In order to supply suitable services to users in ubiquitous computing environments, it is important to consider both location and time information which is related to all object and user's activity. To do this, in this paper, we design a spatial-temporal ontology considering user context and propose a system architecture for active mining user activity and service pattern. The proposed system is a framework for active mining user activity and service pattern by considering the relation between user context and object based on trigger system.

BIOLOGY ORIENTED TARGET SPECIFIC LITERATURE MINING FOR GPCR PATHWAY EXTRACTION (GPCR 경로 추출을 위한 생물학 기반의 목적지향 텍스트 마이닝 시스템)

  • KIm, Eun-Ju;Jung, Seol-Kyoung;Yi, Eun-Ji;Lee, Gary-Geunbae;Park, Soo-Jun
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.86-94
    • /
    • 2003
  • Electronically available biological literature has been accumulated exponentially in the course of time. So, researches on automatically acquiring knowledge from these tremendous data by text mining technology become more and more prosperous. However, most of the previous researches are technology oriented and are not well focused in practical extraction target, hence result in low performance and inconvenience for the bio-researchers to actually use. In this paper, we propose a more biology oriented target domain specific text mining system, that is, POSTECH bio-text mining system (POSBIOTM), for signal transduction pathway extraction, especially for G protein-coupled receptor (GPCR) pathway. To reflect more domain knowledge, we specify the concrete target for pathway extraction and define the minimal pathway domain ontology. Under this conceptual model, POSBIOTM extracts interactions and entities of pathways from the full biological articles using a machine learning oriented extraction method and visualizes the pathways using JDesigner module provided in the system biology workbench (SBW) [14]

  • PDF