• Title/Summary/Keyword: process model discovery

Search Result 108, Processing Time 0.027 seconds

A Language Model and Clue based Machine Learning Method for Discovering Technology Trends from Patent Text (특허 문서 텍스트로부터의 기술 트렌드 탐지를 위한 언어 모델 및 단서 기반 기계학습 방법)

  • Tian, Yingshi;Kim, Young-Ho;Jeong, Yoon-Jae;Ryu, Ji-Hee;Myaeng, Sung-Hyon
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.5
    • /
    • pp.420-429
    • /
    • 2009
  • Patent text is a rich source for discovering technological trends. In order to automate such a discovery process, we attempt to identify phrases corresponding to the problem and its solution method which together form a technology. Problem and solution phrases are identified by a SVM classifier using features based on a combination of a language modeling approach and linguistic clues. Based on the occurrence statistics of the phrases, we identify the time span of each problem and solution and finally generate a trend. Based on our experiment, we show that the proposed semantic phrase identification method is promising with its accuracy being 77% in R-precision. We also show that the unsupervised method for discovering technological trends is meaningful.

A Possible Scientific Inquiry Model based on Hypothetico-Deduction Method Involving Abduction

  • Oh, Jun-Young
    • Journal of The Korean Association For Science Education
    • /
    • v.32 no.3
    • /
    • pp.486-501
    • /
    • 2012
  • The aims of this study are to investigate two main problems for the hypothetico-deduction method and to develop a scientific inquiry model to resolve these problems. The structure of this scientific inquiry model consists of accounts of the context of discovery and justification that the hypothetico-deduction holds as two main problems : 1) the heuristic flaw in the hypothetico-deduction method is that there is no limit to creating hypotheses to explain natural phenomena; 2) Logically, this brings into question affirming the consequent and modus tollens. The features of the model are as follows: first, the generation of hypotheses using an analogical abduction and the selection of hypotheses using consilience and simplicity; second, the expansion phase as resolution for the fallacy of affirming the consequent and the recycle phase as resolution for modus tollens involving auxiliary hypotheses. Finally, we examine the establishment process of Copernicus's Heliocentric Hypothesis and the main role of the history of science for the historical invalidity of this scientific inquiry model based on three examples of If/and/then type of explanation testing suggested by Lawson (International journal of science and Mathematics Education, 2005a, 3(1): 1-5) We claim that this hypotheticho-deduction process involving abduction approach produced favorable in scientific literacy rising for science teacher as well as students.

Defining and Discovering Cardinalities of the Temporal Workcases from XES-based Workflow Logs

  • Yun, Jaeyoung;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.20 no.3
    • /
    • pp.77-84
    • /
    • 2019
  • Workflow management system is a system that manages the workflow model which defines the process of work in reality. We can define the workflow process by sequencing jobs which is performed by the performers. Using the workflow management system, we can also analyze the flow of the process and revise it more efficiently. Many researches are focused on how to make the workflow process model more efficiently and manage it more easily. Recently, many researches use the workflow log files which are the execution history of the workflow process model performed by the workflow management system. Ourresearch group has many interests in making useful knowledge from the workflow event logs. In this paper we use XES log files because there are many data using this format. This papersuggests what are the cardinalities of the temporal workcases and how to get them from the workflow event logs. Cardinalities of the temporal workcases are the occurrence pattern of critical elements in the workflow process. We discover instance cardinalities, activity cardinalities and organizational resource cardinalities from several XES-based workflow event logs and visualize them. The instance cardinality defines the occurrence of the workflow process instances, the activity cardinality defines the occurrence of the activities and the organizational cardinality defines the occurrence of the organizational resources. From them, we expect to get many useful knowledge such as a patterns of the control flow of the process, frequently executed events, frequently working performer and etc. In further, we even expect to predict the original process model by only using the workflow event logs.

A Study on the Method of Combining Empirical Data and Deterministic Model for Fuel Failure Prediction (핵연료 파손 예측을 위한 경험적 자료와 결정론적 모델의 접합 방법)

  • Cho, Byeong-Ho;Yoon, Young-Ku;Chang, Soon-Heung
    • Nuclear Engineering and Technology
    • /
    • v.19 no.4
    • /
    • pp.233-241
    • /
    • 1987
  • Difficulties are encountered when the behavior of complex systems (i.e., fuel failure probability) that have unreliable deterministic models is predicted. For more realistic prediction of the behavior of complex systems with limited observational data, the present study was undertaken to devise an approach of combining predictions from the deterministic model and actual observational data. Predictions by this method of combining are inferred to be of higher reliability than separate predictions made by either model taken independently. A systematic method of hierarchical pattern discovery based on the method developed in the SPEAR was used for systematic search of weighting factors and pattern boundaries for the present method. A sample calculation was performed for prediction of CANDU fuel failures that had occurred due to power ramp during refuelling process. It was demonstrated by this sample calculation that there exists a region of feature space in which fuel failure probability from the PROFIT model nearly agree with that from observational data.

  • PDF

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

A Study on Factory Building Fire Risk Reduction Management (공장건물의 화재리스크 경감방안에 관한 연구)

  • Chung, Eui-Soo;Kang, Kyung-Sik
    • Journal of the Korea Safety Management & Science
    • /
    • v.10 no.3
    • /
    • pp.43-53
    • /
    • 2008
  • This study is carried out for the fire safety of the factory building, the fire risk reduction measure in compliance with an example approached in fire risk reduction systematically, contribute to reduce the fire risk. The analytical fire risk process of discovering, identifying, estimating and evaluating risk and control measure as risk reduction measures are core concept, applies loss prevention with loss control techniques. The painting process in the workplace where the fire hazard and death accident accompanies coexists. Loss prevention problem of creation prevention of dangerous atmosphere at workplace is health and human services problem of normal circumstances, must be inspected with problem of combustible gases at the time of fire explosion. Static electricity measure accomplished the risk control process thoroughly as the fire risk reduction process model with the ignition sources measure which is presented. Fire risk from within organizing will be able to classify with each field by detailedly but risk treatment process will be able to apply basically all the same concept. Consequently about risk management example from before, this study is proposed risk management techniques that standardized rightly in the actual condition of organization with one plan, with discovery of fire risk, the feedback process in compliance with a fire risk reduction and the review which control the result is joint responsibility of engineer, technical expert and manager as part of safety management to practice with the fact must be supervised.

Design of Process Management System based on Data Mining and Artificial Modelling for the Etching Process (데이터 마이닝과 지능 모델링에 기반한 에칭공정의 공정관리시스템 설계)

  • Bae, Hyeon;Kim, Sung-shin;Woo, Kwang-Bang
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.4
    • /
    • pp.390-395
    • /
    • 2004
  • A semiconductor manufacturing process is the complicate and dynamic process, and consists of many sub-processes. An etching process is the most important process in the semiconductor fabrication. In this paper, the decision support system based upon data mining and knowledge discovery is an important factor to improve the productivity and yield. The proposed decision support system consists of a neural network model and an inference system based on fuzzy logic Firstly, the product results are predicted by the neural network model constructed by the product patterns that represent the quality of the etching process. And the product patters are classified by expert's knowledge. Finally, the product conditions are estimated by the fuzzy inference system using the rules extracted from the classified patterns. Prediction of product qualities can be linked to each input and process variables. We employ data mining and intelligent techniques to find the best condition of the etching process. The proposed decision support system is efficient and easy to be implemented for the process management based upon expert's knowledge.

The Intraday Lead-Lag Relationships between the Stock Index and the Stock Index Futures Market in Korea and China (한국과 중국의 현물시장과 주가지수선물시장간의 선-후행관계에 관한 연구)

  • Seo, Sang-Gu
    • Management & Information Systems Review
    • /
    • v.32 no.4
    • /
    • pp.189-207
    • /
    • 2013
  • Using high-frequency data for 2 years, this study investigates intraday lead-lag relationship between stock index and stock index futures markets in Korea and China. We found that there are some differences in price discovery and volatility transmission between Korea and China after the stock index futures markets was introduced. Following Stoll-Whaley(1990) and Chan(1992), the multiple regression is estimated to examine the lead-lag patterns between the two markets by Newey-West's(1987) heteroskedasticity and autocorrelation consistent covariance matrix(HAC matrix). Empirical results of KOSPI 200 shows that the futures market leads the cash market and weak evidence that the cash market leads the futures market. New market information disseminates in the futures market before the stock market with index arbitrageurs then stepping in quickly to bring the cost-of-carry relation back into alignment. The regression tests for the conditional volatility which is estimated using EGARCH model do not show that there is a clear pattern of the futures market leading the stock market in terms of the volatility even though controlling nonsynchronous trading effects. This implies that information in price innovations that originate in the futures market is transmitted to the volatility of the cash market. Empirical results of CSI 300 shows that the cash market is found to play a more dominant role in the price discovery process after the Chinese index started a sharp decline immediately after the stock index futures were introduced. The new stock index futures markets does not function well in its price discovery performance at its infancy stage, apparently due to high barriers to entry into this emerging futures markets. Based on EGAECH model, the results uncover strong bi-directional dependence in the intraday volatility of both markets.

  • PDF

MicroSPECT and MicroPET Imaging of Small Animals for Drug Development

  • Jang, Beom-Su
    • Toxicological Research
    • /
    • v.29 no.1
    • /
    • pp.1-6
    • /
    • 2013
  • The process of drug discovery and development requires substantial resources and time. The drug industry has tried to reduce costs by conducting appropriate animal studies together with molecular biological and genetic analyses. Basic science research has been limited to in vitro studies of cellular processes and ex vivo tissue examination using suitable animal models of disease. However, in the past two decades new technologies have been developed that permit the imaging of live animals using radiotracer emission, X-rays, magnetic resonance signals, fluorescence, and bioluminescence. The main objective of this review is to provide an overview of small animal molecular imaging, with a focus on nuclear imaging (single photon emission computed tomography and positron emission tomography). These technologies permit visualization of toxicodynamics as well as toxicity to specific organs by directly monitoring drug accumulation and assessing physiological and/or molecular alterations. Nuclear imaging technology has great potential for improving the efficiency of the drug development process.

Reverse Engineering of a Gene Regulatory Network from Time-Series Data Using Mutual Information

  • Barman, Shohag;Kwon, Yung-Keun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.849-852
    • /
    • 2014
  • Reverse engineering of gene regulatory network is a challenging task in computational biology. To detect a regulatory relationship among genes from time series data is called reverse engineering. Reverse engineering helps to discover the architecture of the underlying gene regulatory network. Besides, it insights into the disease process, biological process and drug discovery. There are many statistical approaches available for reverse engineering of gene regulatory network. In our paper, we propose pairwise mutual information for the reverse engineering of a gene regulatory network from time series data. Firstly, we create random boolean networks by the well-known $Erd{\ddot{o}}s-R{\acute{e}}nyi$ model. Secondly, we generate artificial time series data from that network. Then, we calculate pairwise mutual information for predicting the network. We implement of our system on java platform. To visualize the random boolean network graphically we use cytoscape plugins 2.8.0.