• 제목/요약/키워드: Shannon's entropy

검색결과 45건 처리시간 0.023초

Diversity and Genotypic Structure of ECOR Collection Determined by Repetitive Extragenic Palindromic PCR Genome Fingerprinting

  • HWANG KEUM-OK;JANG HYO-MI;CHO JAE-CHANG
    • Journal of Microbiology and Biotechnology
    • /
    • 제15권3호
    • /
    • pp.672-677
    • /
    • 2005
  • The standard reference collection of strains for E. coli, the ECOR collection, was analyzed by a genome-based typing method. Seventy-one ECOR strains were subjected to repetitive extragenic palindromic PCR genome fingerprinting with BOX primers (BOX-PCR). Using a similarity value of 0.8 or more after cluster analysis of BOX-PCR fingerprinting patterns to define the same genotypes, we identified 28 genotypes in the ECOR collection. Shannon's entropy-based diversity index was 3.07, and the incident-based coverage estimator indicated potentially 420 genotypes among E. coli populations. Chi-square test of goodness-of-fit showed statistically significant association between the genotypes defined by BOX-PCR fingerprinting and the groups previously defined by multi-locus enzyme electrophoresis. This study suggests that the diversification of E. coli strains in natural populations is actively ongoing, and rep-PCR fingerprinting is a convenient and reliable method to type E. coli strains for the purposes ranging from ecology to quarantine.ine.

검색효율 측정척도에 관한 연구 (A Study on measuring techniques of retrieval effectiveness)

  • 윤구호
    • 한국문헌정보학회지
    • /
    • 제16권
    • /
    • pp.177-205
    • /
    • 1989
  • Retrieval effectiveness is the principal criteria for measuring the performance of an information retrieval system. This paper deals with the characteristics of 'relevance' of information and various measuring techniques of retrieval effectivess. The outlines of this study are as follows: 1) Relevance decision for evaluation should be devided into the user-oriented and the system-oriented decisions. 2) The recall-precision measure seems to be user-oriented, and the recall-fallout measure to be system-oriented. 3) Many of composite measures can not be justified III any rational manner unfortunately. 4) The Swets model has demonstrated that it yields, in general, a straight line instead of a curve of varying curvature and emphasized the fundamentally probabilistic nature of information retrieval. 5) The Cooper model seems to be a good substitute for precision and a useful measure for systems which ranked documents. 6) The Rocchio model were proposed for the evaluation of retreval systems which ranked documents, and were designed to be independent of cut-off. 7) The Cawkell model suggested that the Shannon's equation for entropy can be applied to measuring of retrieval effectiveness.

  • PDF

Relationship between Diversity and Productivity at Ratargul Fresh Water Swamp Forest in Bangladesh

  • Sharmin, Mahmuda;Dey, Sunanda;Chowdhury, Sangita
    • Journal of Forest and Environmental Science
    • /
    • 제32권3호
    • /
    • pp.291-301
    • /
    • 2016
  • One of the most concerned topics in ecology is the relationship between biodiversity and ecosystem functioning. However, there are few field studies, carried out in forests, although many studies have been done in controlled experiments in grasslands. In this paper, we describe the relationship pattern between three facets of diversity and productivity at Ratargul Fresh Water Swamp Forest (RFWSF) in Bangladesh, which is the only remaining fresh water swamp forest of the country. Sixty sample plots were selected from RFWSF and included six functional traits including leaf area (LA), specific leaf area (SLA), leaf dry matter content (LDMC), tree height, bark thickness and wood density. In analyzing TD, we used Shannon diversity and richness indices, functional diversity was measured by Rao's quadratic entropy (Rao 1982) and Faith's (1992) index was used for phylogenetic diversity (PD). It was found that, TD, FD and PD were positively related with productivity (basal area) due to resource use complementarity but surprisingly the best predictor of tree productivity was FD. The results contribute to the understanding the effects of biodiversity loss and it is essential for conservation decision-making and policy-making of Ratargul Fresh Water Swamp Forest.

Multi-objective Optimization of Pedestrian Wind Comfort and Natural Ventilation in a Residential Area

  • H.Y. Peng;S.F. Dai;D. Hu;H.J. Liu
    • 국제초고층학회논문집
    • /
    • 제11권4호
    • /
    • pp.315-320
    • /
    • 2022
  • With the rapid development of urbanization the problems of pedestrian-level wind comfort and natural ventilation of tall buildings are becoming increasingly prominent. The velocity at the pedestrian level ($\overline{MVR}$) and variation of wind pressure coefficients $\overline{{\Delta}C_p}$ between windward and leeward surfaces of tall buildings were investigated systematically through numerical simulations. The examined parameters included building density ρ, height ratio of building αH, width ratio of building αB, and wind direction θ. The linear and quadratic regression analyses of $\overline{MVR}$ and $\overline{{\Delta}C_p}$ were conducted. The quadratic regression had better performance in predicting $\overline{MVR}$ and $\overline{{\Delta}C_p}$ than the linear regression. $\overline{MVR}$ and $\overline{{\Delta}C_p}$ were optimized by the NSGA-II algorithm. The LINMAP and TOPSIS decision-making methods demonstrated better capability than the Shannon's entropy approach. The final optimal design parameters of buildings were ρ = 20%, αH = 4.5, and αB = 1, and the wind direction was θ = 10°. The proposed method could be used for the optimization of pedestrian-level wind comfort and natural ventilation in a residential area.

확률론적 이론을 이용한 종단면에서의 단방향 이동거리에 관한 연구 (A Study on the One-Way Distance in the Longitudinal Section Using Probabilistic Theory)

  • 김성률;문지현;전해성;서종철;추연문
    • 한국산학기술학회논문지
    • /
    • 제21권12호
    • /
    • pp.87-96
    • /
    • 2020
  • 유속은 수리학적 구조물을 운영 관리하는데 필수적인 요소임에도 불구하고 경제적, 인력적 이유로 인해 유량, 유속 측정이 충분히 이루어지지 않고 있다. 또한, 홍수기에 하천의 유속을 산정하는데 사용되는 공식은 Chezy와 Manning 등류 공식으로 일반적으로 등류가 아닌 자연하천에 이 공식을 그대로 사용하는 것은 문제가 있다. 이에 따라 본 연구는 측정치와 경험적인 방법으로 유량과 유속을 다른 수리학적 요소로 나타내는 방법이 아닌 이론적 방법으로 유속에 접근하였다. 이전의 기존 유속 공식과 분포식의 한계를 엔트로피 이론으로 해결한 Chiu (1987)의 연구를 따라 개수로의 유속을 엔트로피 이론에 근거하여 전개하였고 효용성을 검증하기 위해 지점의 유속이 측정된 수로의 데이터를 활용하여 식을 검증하였다. 이동거리의 R2값은 0.9993, 유속의 R2값은 0.8051~0.9483으로 예측치가 실제 적용 가능함을 확인하였다. 식을 활용하면 실제 유속 측정을 여러 지점에서 매시간 하지 않아도 특성 인자를 통해 시간에 따라 이동하는 하천의 유속과 이동지점을 동시에 구할 수 있고 실시간 유속이 필요하지만 빈번한 실측이 불가능한 홍수기의 유속 산정이 가능하다. 이를 활용하여 하천의 수평·수직거리를 이용해 제작되는 하천 종단면도에 활용 가능하며 GIS와 연계하여 하천 특성인자의 정확성을 높일 수 있을것으로 판단된다. GIS의 공간 모델에 하천의 거리와 유속 정보를 결합해 홍수기의 경보·예측 시스템에 활용이 가능할 것으로 보인다.

Acoustic emission source location and noise cancellation for crack detection in rail head

  • Kuanga, K.S.C.;Li, D.;Koh, C.G.
    • Smart Structures and Systems
    • /
    • 제18권5호
    • /
    • pp.1063-1085
    • /
    • 2016
  • Taking advantage of the high sensitivity and long-distance detection capability of acoustic emission (AE) technique, this paper focuses on the crack detection in rail head, which is one of the most vulnerable parts of rail track. The AE source location and noise cancellation were studied on the basis of practical rail profile, material and operational noise. In order to simulate the actual AE events of rail head cracks, field tests were carried out to acquire the AE waves induced by pencil lead break (PLB) and operational noise of the railway system. Wavelet transform (WT) was first utilized to investigate the time-frequency characteristics and dispersion phenomena of AE waves. Here, the optimal mother wavelet was selected by minimizing the Shannon entropy of wavelet coefficients. Regarding the obvious dispersion of AE waves propagating along the rail head and the high operational noise, the wavelet transform-based modal analysis location (WTMAL) method was then proposed to locate the AE sources (i.e. simulated cracks) respectively for the PLB-induced AE signals with and without operational noise. For those AE signals inundated with operational noise, the Hilbert transform (HT)-based noise cancellation method was employed to improve the signal-to-noise ratio (SNR). Finally, the experimental results demonstrated that the proposed crack detection strategy could locate PLB-simulated AE sources effectively in the rail head even at high operational noise level, highlighting its potential for field application.

A Risk-Return Analysis of Loan Portfolio Diversification in the Vietnamese Banking System

  • HUYNH, Japan;DANG, Van Dan
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제7권9호
    • /
    • pp.105-115
    • /
    • 2020
  • The study empirically examines the effects of loan portfolio diversification on bank risk and return in the nascent banking market of Vietnam. Loan portfolio diversification is captured through the Hirschman-Herfindahl index and the Shannon Entropy with sectoral exposures. We access each bank's financial reports to collect the required data, especially the breakdown of sectoral loan portfolios, thus constituting a unique dataset. To compute bank return, we use the traditional accounting indicators, including return-on-assets, return-on-equity, and net-interest margin. For bank risk, we utilize the loan-loss provisions and non-performing loans relative to gross customer loans. Using a sample of 30 commercial banks over the period from 2008 to 2019 and the system generalized method of moments estimator for the dynamic panel, we indicate the downsides of portfolio diversification. Concretely, we observe that all diversification measures exhibit significantly negative signs in all regressions across different bank return proxies. At the same time, the estimates display the significant and positive impact of diversification on the non-performing loan ratio. Hence, sectoral loan portfolio diversification significantly hampers bank performance in both aspects of lower return and higher credit risk. The results are robust across a rich set of bank performance and portfolio diversification measures.

LSTM 및 정보이득 기반의 악성 안드로이드 앱 탐지연구 (A Study on Detection of Malicious Android Apps based on LSTM and Information Gain)

  • 안유림;홍승아;김지연;최은정
    • 한국멀티미디어학회논문지
    • /
    • 제23권5호
    • /
    • pp.641-649
    • /
    • 2020
  • As the usage of mobile devices extremely increases, malicious mobile apps(applications) that target mobile users are also increasing. It is challenging to detect these malicious apps using traditional malware detection techniques due to intelligence of today's attack mechanisms. Deep learning (DL) is an alternative technique of traditional signature and rule-based anomaly detection techniques and thus have actively been used in numerous recent studies on malware detection. In order to develop DL-based defense mechanisms against intelligent malicious apps, feeding recent datasets into DL models is important. In this paper, we develop a DL-based model for detecting intelligent malicious apps using KU-CISC 2018-Android, the most up-to-date dataset consisting of benign and malicious Android apps. This dataset has hardly been addressed in other studies so far. We extract OPcode sequences from the Android apps and preprocess the OPcode sequences using an N-gram model. We then feed the preprocessed data into LSTM and apply the concept of Information Gain to improve performance of detecting malicious apps. Furthermore, we evaluate our model with numerous scenarios in order to verify the model's design and performance.

퍼지뉴럴 시스템을 위한 초기 입력공간분할의 최적화 : Measure of Fuzziness (The Optimal Partition of Initial Input Space for Fuzzy Neural System : Measure of Fuzziness)

  • 백덕수;박인규
    • 대한전자공학회논문지TE
    • /
    • 제39권3호
    • /
    • pp.97-104
    • /
    • 2002
  • 이 논문에서는 퍼지뉴럴 시스템을 위하여 measure of fuzziness에 의한 입력공간의 분할을 최적화하는 방법을 제안한다. 이에 따라 최적화된 퍼지 부공간에 대하여 퍼지 제어규칙을 자동으로 생성하는 방법을 제안한다. 또한 시계열 예측 문제에서 입력패턴의 간격을 조정하여 그 성능을 검증한다. 이 방법은 샤논 함수와 index of fuzziness를 이용하여 입력공간을 분할하고, 분할된 부 공간에 대해 입력 데이터와 부합할 수 있는 각각의 규칙에 등급을 정하여 불필요한 제어규칙을 제거하여 최적의 규칙베이스를 구성하도록 한다. 적용되는 퍼지 신경망의 기본적인 구조는 퍼지 제어기의 규칙베이스와 추론의 과정을 신경회로망을 이용하여 구현하며 퍼지 제어규칙의 매개변수들은 최대 급경사 강하법에 의해 적응되어진다. 제안된 알고리즘을 토대로 여덟 가지의 입력패턴에 대하여 추론한 결과 입력공간의 최적분할에 의하여 수렴과정에서 초기에 오차(RMSE)가 빠르게 수렴함을 알 수 있었다.

세계 각국의 자원에 대한 통계적 고찰 (Statistical Consideration on the Resources of the Countries in the World)

  • 허문열;최병수;이승천
    • 응용통계연구
    • /
    • 제22권1호
    • /
    • pp.41-57
    • /
    • 2009
  • 본 논문에서는 세계 232 개국에 대한 인구, 경제 및 기타 자원에 관한 자료를 사용하여 국가의 개발정도, 인간개발지수, 경제력 그리고 OECD 가입 여부에 어떤 자원이 어떻게 영향을 미치는가를 통계적으로 고찰해보고자 한다. 여기서 사용하는 국가별 자원 자료는 연속형 자료와 이산형 자료가 혼재되어있는 혼합형이며 많은 결측값이 포함되어 있어 기존의 방법으로는 분석하는 데 한계가 있다. 이 논문에서는 시각적 방법을 동원하여 복합형 자료를 탐색하는 과정을 제시하고 이러한 방법의 한계점을 보이고자한다. 이러한 한계점을 극복하고 객관적인 판단기준을 적용하여 주어진 문제에 대한 과학적인 결론을 유도하기 위해 Shannon (1948)의 엔트로피 이론에 기본을 둔 상호정보(MI)를 활용하고자 한다. 상호정보를 추정하는 방법은 여러 가지가 있으며 각 방법에 따라 결과가 매우 다르게 나타난다. 본 논문에서는 Fayyad와 Irani (1992)의 이산화 방법을 적용하여 MI를 추정하는 방법을 적용한다. 여기서 이루어지는 모든 과정은 다차원 자료의 시각적 탐색 도구인 DAVIS (Huh와 Song, 2002)를 사용하였다.