• Title/Summary/Keyword: Bayesian information

Search Result 1,230, Processing Time 0.027 seconds

Genetic Differentiation between Up- and Downstream Populations of Tribolodon hakonensis (Pieces: Cyprinidae) (삼척오십천 상.하류에 분포하는 황어, Tribolodon hakonensis (잉어과) 집단의 유전적 분화)

  • Lee, Sihn-Ae;Lee, Wan-Ok;Suk, Ho-Young
    • Korean Journal of Environment and Ecology
    • /
    • v.26 no.4
    • /
    • pp.475-483
    • /
    • 2012
  • Tribolodon hakonensis(Cypriniformes; Leuciscinae) is anadromous; they are born in freshwater, migrate back to the ocean, then return to their home stream for spawning from mid-March to early-June. Here, five microsatellites were used to assess the level of gene flow among T. hakonensis populations from the Samcheok-Oship Stream, South Korea. The frequencies of dominant alleles across several loci differed between down-and upstream populations divided by several weirs, and pairwise multilocus $F_{ST}$ estimate was significantly high(0.083). However, there were no signs of any loss of genetic variation in the upstream population. Assignment tests of individuals in admixture model(K=2) to a set of baseline samples showed fairly correct assignment to each cluster; all of upstream individuals sere included in the first cluster, while the majority of downstream individuals(65%) comprise the second cluster. These results indicate reduced gene flow between up- and downstream populations but allowing passive downstream drift. It is likely that man-made structures might at least partially be a factor for creating and consolidating the current distribution patterns of genetic variation among T. hakonensis populations in the Samcheok-Oship Stream. This information will assist governing agencies in making informed decisions regarding conservation of anadromous fishes in Korean drainage systems.

Performance Improvement of Collaborative Filtering System Using Associative User′s Clustering Analysis for the Recalculation of Preference and Representative Attribute-Neighborhood (선호도 재계산을 위한 연관 사용자 군집 분석과 Representative Attribute -Neighborhood를 이용한 협력적 필터링 시스템의 성능향상)

  • Jung, Kyung-Yong;Kim, Jin-Su;Kim, Tae-Yong;Lee, Jung-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.10B no.3
    • /
    • pp.287-296
    • /
    • 2003
  • There has been much research focused on collaborative filtering technique in Recommender System. However, these studies have shown the First-Rater Problem and the Sparsity Problem. The main purpose of this Paper is to solve these Problems. In this Paper, we suggest the user's predicting preference method using Bayesian estimated value and the associative user clustering for the recalculation of preference. In addition to this method, to complement a shortcoming, which doesn't regard the attribution of item, we use Representative Attribute-Neighborhood method that is used for the prediction when we find the similar neighborhood through extracting the representative attribution, which most affect the preference. We improved the efficiency by using the associative user's clustering analysis in order to calculate the preference of specific item within the cluster item vector to the collaborative filtering algorithm. Besides, for the problem of the Sparsity and First-Rater, through using Association Rule Hypergraph Partitioning algorithm associative users are clustered according to the genre. New users are classified into one of these genres by Naive Bayes classifier. In addition, in order to get the similarity value between users belonged to the classified genre and new users, and this paper allows the different estimated value to item which user evaluated through Naive Bayes learning. As applying the preference granted the estimated value to Pearson correlation coefficient, it can make the higher accuracy because the errors that cause the missing value come less. We evaluate our method on a large collaborative filtering database of user rating and it significantly outperforms previous proposed method.

Dependency-based Framework of Combining Multiple Experts for Recognizing Unconstrained Handwritten Numerals (무제약 필기 숫자를 인식하기 위한 다수 인식기를 결합하는 의존관계 기반의 프레임워크)

  • Kang, Hee-Joong;Lee, Seong-Whan
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.8
    • /
    • pp.855-863
    • /
    • 2000
  • Although Behavior-Knowledge Space (BKS) method, one of well known decision combination methods, does not need any assumptions in combining the multiple experts, it should theoretically build exponential storage spaces for storing and managing jointly observed K decisions from K experts. That is, combining K experts needs a (K+1)st-order probability distribution. However, it is well known that the distribution becomes unmanageable in storing and estimating, even for a small K. In order to overcome such weakness, it has been studied to decompose a probability distribution into a number of component distributions and to approximate the distribution with a product of the component distributions. One of such previous works is to apply a conditional independence assumption to the distribution. Another work is to approximate the distribution with a product of only first-order tree dependencies or second-order distributions as shown in [1]. In this paper, higher order dependency than the first-order is considered in approximating the distribution and a dependency-based framework is proposed to optimally approximate the (K+1)st-order probability distribution with a product set of dth-order dependencies where ($1{\le}d{\le}K$), and to combine multiple experts based on the product set using the Bayesian formalism. This framework was experimented and evaluated with a standardized CENPARMI data base.

  • PDF

Assessing the public preference and acceptance for renewable energy participation initiatives - focusing on photovoltaic power (재생에너지 사업 참여에 대한 국민 선호와 수용성 분석 - 태양광 발전을 중심으로)

  • Ham, AeJung;Kang, SeungJin
    • Journal of Energy Engineering
    • /
    • v.27 no.4
    • /
    • pp.36-49
    • /
    • 2018
  • This study analyzed the public preference and acceptance regarding renewable energy projects through Choice Based Conjoint Analysis. The results show that the surveyed respondents consider the leading authority of the projects, as the most important factor when considering participating in renewable energy initiatives. Following this, the mode of participation and profit distribution and the power plant location are also viewed as important, whereas participation through decision making regarding the projects was less important. Also when participating in renewable energy projects, respondents tend to prefer to financially participating through loans or owning shares rather than volunteering support for the business such as sharing information, stating one's views, or providing cooperation and coordination. Therefore, the focus is on distributional justice, such as financial investment and profit distribution, rather than procedural justice, for instance decision making. When analyzing the part-worths utilities for the participation attribute, the respondents most preferred to receiving dividends based on earnings by owning shares with the local government in charge of the entire projects. As a consequence, the results suggest that it is important to have local government get involved and have trust-worthy governing systems in place for the initiation of the public participating-renewable energy projects.

Managing the Reverse Extrapolation Model of Radar Threats Based Upon an Incremental Machine Learning Technique (점진적 기계학습 기반의 레이더 위협체 역추정 모델 생성 및 갱신)

  • Kim, Chulpyo;Noh, Sanguk
    • The Journal of Korean Institute of Next Generation Computing
    • /
    • v.13 no.4
    • /
    • pp.29-39
    • /
    • 2017
  • Various electronic warfare situations drive the need to develop an integrated electronic warfare simulator that can perform electronic warfare modeling and simulation on radar threats. In this paper, we analyze the components of a simulation system to reversely model the radar threats that emit electromagnetic signals based on the parameters of the electronic information, and propose a method to gradually maintain the reverse extrapolation model of RF threats. In the experiment, we will evaluate the effectiveness of the incremental model update and also assess the integration method of reverse extrapolation models. The individual model of RF threats are constructed by using decision tree, naive Bayesian classifier, artificial neural network, and clustering algorithms through Euclidean distance and cosine similarity measurement, respectively. Experimental results show that the accuracy of reverse extrapolation models improves, while the size of the threat sample increases. In addition, we use voting, weighted voting, and the Dempster-Shafer algorithm to integrate the results of the five different models of RF threats. As a result, the final decision of reverse extrapolation through the Dempster-Shafer algorithm shows the best performance in its accuracy.

Building battery deterioration prediction model using real field data (머신러닝 기법을 이용한 납축전지 열화 예측 모델 개발)

  • Choi, Keunho;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.243-264
    • /
    • 2018
  • Although the worldwide battery market is recently spurring the development of lithium secondary battery, lead acid batteries (rechargeable batteries) which have good-performance and can be reused are consumed in a wide range of industry fields. However, lead-acid batteries have a serious problem in that deterioration of a battery makes progress quickly in the presence of that degradation of only one cell among several cells which is packed in a battery begins. To overcome this problem, previous researches have attempted to identify the mechanism of deterioration of a battery in many ways. However, most of previous researches have used data obtained in a laboratory to analyze the mechanism of deterioration of a battery but not used data obtained in a real world. The usage of real data can increase the feasibility and the applicability of the findings of a research. Therefore, this study aims to develop a model which predicts the battery deterioration using data obtained in real world. To this end, we collected data which presents change of battery state by attaching sensors enabling to monitor the battery condition in real time to dozens of golf carts operated in the real golf field. As a result, total 16,883 samples were obtained. And then, we developed a model which predicts a precursor phenomenon representing deterioration of a battery by analyzing the data collected from the sensors using machine learning techniques. As initial independent variables, we used 1) inbound time of a cart, 2) outbound time of a cart, 3) duration(from outbound time to charge time), 4) charge amount, 5) used amount, 6) charge efficiency, 7) lowest temperature of battery cell 1 to 6, 8) lowest voltage of battery cell 1 to 6, 9) highest voltage of battery cell 1 to 6, 10) voltage of battery cell 1 to 6 at the beginning of operation, 11) voltage of battery cell 1 to 6 at the end of charge, 12) used amount of battery cell 1 to 6 during operation, 13) used amount of battery during operation(Max-Min), 14) duration of battery use, and 15) highest current during operation. Since the values of the independent variables, lowest temperature of battery cell 1 to 6, lowest voltage of battery cell 1 to 6, highest voltage of battery cell 1 to 6, voltage of battery cell 1 to 6 at the beginning of operation, voltage of battery cell 1 to 6 at the end of charge, and used amount of battery cell 1 to 6 during operation are similar to that of each battery cell, we conducted principal component analysis using verimax orthogonal rotation in order to mitigate the multiple collinearity problem. According to the results, we made new variables by averaging the values of independent variables clustered together, and used them as final independent variables instead of origin variables, thereby reducing the dimension. We used decision tree, logistic regression, Bayesian network as algorithms for building prediction models. And also, we built prediction models using the bagging of each of them, the boosting of each of them, and RandomForest. Experimental results show that the prediction model using the bagging of decision tree yields the best accuracy of 89.3923%. This study has some limitations in that the additional variables which affect the deterioration of battery such as weather (temperature, humidity) and driving habits, did not considered, therefore, we would like to consider the them in the future research. However, the battery deterioration prediction model proposed in the present study is expected to enable effective and efficient management of battery used in the real filed by dramatically and to reduce the cost caused by not detecting battery deterioration accordingly.

Comparison of Development times of Myzus persicae (Hemiptera:Aphididae) between the Constant and Variable Temperatures and its Temperature-dependent Development Models (항온과 변온조건에서 복숭아혹진딧물의 발육비교 및 온도 발육모형)

  • Kim, Do-Ik;Choi, Duck-Soo;Ko, Suk-Ju;Kang, Beom-Ryong;Park, Chang-Gyu;Kim, Seon-Gon;Park, Jong-Dae;Kim, Sang-Soo
    • Korean journal of applied entomology
    • /
    • v.51 no.4
    • /
    • pp.431-438
    • /
    • 2012
  • The developmental time of the nymphs of Myzus persicae was studied in the laboratory (six constant temperatures from 15 to $30^{\circ}C$ with 50~60% RH, and a photoperiod of 14L:10D) and in a green-pepper plastic house. Mortality of M. persicae in laboratory was high in the first(6.7~13.3%) and second instar nymphs(6.7%) at low temperatures and high in the third (17.8%) and fourth instar nymphs(17.8%) at high temperatures. Mortality was 66.7% at $33^{\circ}C$ in laboratory and $26.7^{\circ}C$ in plastic house. The total developmental time was the longest at $14.6^{\circ}C$ (14.4 days) and shortest at $26.7^{\circ}C$ (6.0 days) in plastic house. The lower threshold temperature of the total nymphal stage was $3.0^{\circ}C$ in laboratory. The thermal constant required for nymphal stage was 111.1DD. The relationship between developmental rate and temperature was fitted nonlinear model by Logan-6 which has the lowest value on Akaike information criterion (AIC) and Bayesian information criterion (BIC). The distribution of completion of each developmental stage was well described by the 3-parameter Weibull function ($r^2=0.95{\sim}0.97$). This model accurately described the predicted and observed occurrences. Thus the model is considered to be good for use in predicting the optimal spray time for Myzus persicae.

Comparison of Temperature-dependent Development Model of Aphis gossypii (Hemiptera: Aphididae) under Constant Temperature and Fluctuating Temperature (실내 항온과 온실 변온조건에서 목화진딧물의 온도 발육비교)

  • Kim, Do-Ik;Ko, Suk-Ju;Choi, Duck-Soo;Kang, Beom-Ryong;Park, Chang-Gyu;Kim, Seon-Gon;Park, Jong-Dae;Kim, Sang-Soo
    • Korean journal of applied entomology
    • /
    • v.51 no.4
    • /
    • pp.421-429
    • /
    • 2012
  • The developmental time period of Aphis gossypii was studied in laboratory (six constant temperatures from 15 to $30^{\circ}C$ with 50~60% RH, and a photoperiod of 14L:10D) and in a cucumber plastic house. The mortality of A. gossypii in the laboratory was high in the 2nd (20.0%) and 3rd stage(13.3%) at low temperature but high in the 3rd (26.7%) and 4th stage (33.3%) at high temperatures. Mortality in the plastic house was high in the 1st and 2nd stage but there was no mortality in the 4th stage at low temperature. The total developmental period was longest at $15^{\circ}C$ (12.2 days) in the laboratory and shortest at $28.5^{\circ}C$ (4.09 days) in the plastic house. The lower threshold temperature at the total nymphal stage was $6.8^{\circ}C$ in laboratory. The thermal constant required to reach the total nymphal stage was 111.1DD. The relationship between the developmental rate and temperature fit the nonlinear model of Logan-6 which has the lowest value for the Akaike information criterion(AIC) and Bayesian information criterion(BIC). The distribution of completion of each development stage was well described by the 3-parameter Weibull function ($r^2=0.89{\sim}0.96$). This model accurately described the predicted and observed outcomes. Thus it is considered that the model can be used for predicting the optimal spray time for Aphis gossypii.

A Study on Differences of Contents and Tones of Arguments among Newspapers Using Text Mining Analysis (텍스트 마이닝을 활용한 신문사에 따른 내용 및 논조 차이점 분석)

  • Kam, Miah;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.53-77
    • /
    • 2012
  • This study analyses the difference of contents and tones of arguments among three Korean major newspapers, the Kyunghyang Shinmoon, the HanKyoreh, and the Dong-A Ilbo. It is commonly accepted that newspapers in Korea explicitly deliver their own tone of arguments when they talk about some sensitive issues and topics. It could be controversial if readers of newspapers read the news without being aware of the type of tones of arguments because the contents and the tones of arguments can affect readers easily. Thus it is very desirable to have a new tool that can inform the readers of what tone of argument a newspaper has. This study presents the results of clustering and classification techniques as part of text mining analysis. We focus on six main subjects such as Culture, Politics, International, Editorial-opinion, Eco-business and National issues in newspapers, and attempt to identify differences and similarities among the newspapers. The basic unit of text mining analysis is a paragraph of news articles. This study uses a keyword-network analysis tool and visualizes relationships among keywords to make it easier to see the differences. Newspaper articles were gathered from KINDS, the Korean integrated news database system. KINDS preserves news articles of the Kyunghyang Shinmun, the HanKyoreh and the Dong-A Ilbo and these are open to the public. This study used these three Korean major newspapers from KINDS. About 3,030 articles from 2008 to 2012 were used. International, national issues and politics sections were gathered with some specific issues. The International section was collected with the keyword of 'Nuclear weapon of North Korea.' The National issues section was collected with the keyword of '4-major-river.' The Politics section was collected with the keyword of 'Tonghap-Jinbo Dang.' All of the articles from April 2012 to May 2012 of Eco-business, Culture and Editorial-opinion sections were also collected. All of the collected data were handled and edited into paragraphs. We got rid of stop-words using the Lucene Korean Module. We calculated keyword co-occurrence counts from the paired co-occurrence list of keywords in a paragraph. We made a co-occurrence matrix from the list. Once the co-occurrence matrix was built, we used the Cosine coefficient matrix as input for PFNet(Pathfinder Network). In order to analyze these three newspapers and find out the significant keywords in each paper, we analyzed the list of 10 highest frequency keywords and keyword-networks of 20 highest ranking frequency keywords to closely examine the relationships and show the detailed network map among keywords. We used NodeXL software to visualize the PFNet. After drawing all the networks, we compared the results with the classification results. Classification was firstly handled to identify how the tone of argument of a newspaper is different from others. Then, to analyze tones of arguments, all the paragraphs were divided into two types of tones, Positive tone and Negative tone. To identify and classify all of the tones of paragraphs and articles we had collected, supervised learning technique was used. The Na$\ddot{i}$ve Bayesian classifier algorithm provided in the MALLET package was used to classify all the paragraphs in articles. After classification, Precision, Recall and F-value were used to evaluate the results of classification. Based on the results of this study, three subjects such as Culture, Eco-business and Politics showed some differences in contents and tones of arguments among these three newspapers. In addition, for the National issues, tones of arguments on 4-major-rivers project were different from each other. It seems three newspapers have their own specific tone of argument in those sections. And keyword-networks showed different shapes with each other in the same period in the same section. It means that frequently appeared keywords in articles are different and their contents are comprised with different keywords. And the Positive-Negative classification showed the possibility of classifying newspapers' tones of arguments compared to others. These results indicate that the approach in this study is promising to be extended as a new tool to identify the different tones of arguments of newspapers.

Robo-Advisor Algorithm with Intelligent View Model (지능형 전망모형을 결합한 로보어드바이저 알고리즘)

  • Kim, Sunwoong
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.39-55
    • /
    • 2019
  • Recently banks and large financial institutions have introduced lots of Robo-Advisor products. Robo-Advisor is a Robot to produce the optimal asset allocation portfolio for investors by using the financial engineering algorithms without any human intervention. Since the first introduction in Wall Street in 2008, the market size has grown to 60 billion dollars and is expected to expand to 2,000 billion dollars by 2020. Since Robo-Advisor algorithms suggest asset allocation output to investors, mathematical or statistical asset allocation strategies are applied. Mean variance optimization model developed by Markowitz is the typical asset allocation model. The model is a simple but quite intuitive portfolio strategy. For example, assets are allocated in order to minimize the risk on the portfolio while maximizing the expected return on the portfolio using optimization techniques. Despite its theoretical background, both academics and practitioners find that the standard mean variance optimization portfolio is very sensitive to the expected returns calculated by past price data. Corner solutions are often found to be allocated only to a few assets. The Black-Litterman Optimization model overcomes these problems by choosing a neutral Capital Asset Pricing Model equilibrium point. Implied equilibrium returns of each asset are derived from equilibrium market portfolio through reverse optimization. The Black-Litterman model uses a Bayesian approach to combine the subjective views on the price forecast of one or more assets with implied equilibrium returns, resulting a new estimates of risk and expected returns. These new estimates can produce optimal portfolio by the well-known Markowitz mean-variance optimization algorithm. If the investor does not have any views on his asset classes, the Black-Litterman optimization model produce the same portfolio as the market portfolio. What if the subjective views are incorrect? A survey on reports of stocks performance recommended by securities analysts show very poor results. Therefore the incorrect views combined with implied equilibrium returns may produce very poor portfolio output to the Black-Litterman model users. This paper suggests an objective investor views model based on Support Vector Machines(SVM), which have showed good performance results in stock price forecasting. SVM is a discriminative classifier defined by a separating hyper plane. The linear, radial basis and polynomial kernel functions are used to learn the hyper planes. Input variables for the SVM are returns, standard deviations, Stochastics %K and price parity degree for each asset class. SVM output returns expected stock price movements and their probabilities, which are used as input variables in the intelligent views model. The stock price movements are categorized by three phases; down, neutral and up. The expected stock returns make P matrix and their probability results are used in Q matrix. Implied equilibrium returns vector is combined with the intelligent views matrix, resulting the Black-Litterman optimal portfolio. For comparisons, Markowitz mean-variance optimization model and risk parity model are used. The value weighted market portfolio and equal weighted market portfolio are used as benchmark indexes. We collect the 8 KOSPI 200 sector indexes from January 2008 to December 2018 including 132 monthly index values. Training period is from 2008 to 2015 and testing period is from 2016 to 2018. Our suggested intelligent view model combined with implied equilibrium returns produced the optimal Black-Litterman portfolio. The out of sample period portfolio showed better performance compared with the well-known Markowitz mean-variance optimization portfolio, risk parity portfolio and market portfolio. The total return from 3 year-period Black-Litterman portfolio records 6.4%, which is the highest value. The maximum draw down is -20.8%, which is also the lowest value. Sharpe Ratio shows the highest value, 0.17. It measures the return to risk ratio. Overall, our suggested view model shows the possibility of replacing subjective analysts's views with objective view model for practitioners to apply the Robo-Advisor asset allocation algorithms in the real trading fields.