• Title/Summary/Keyword: Family Tree

Search Result 308, Processing Time 0.058 seconds

API Feature Based Ensemble Model for Malware Family Classification (악성코드 패밀리 분류를 위한 API 특징 기반 앙상블 모델 학습)

  • Lee, Hyunjong;Euh, Seongyul;Hwang, Doosung
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.29 no.3
    • /
    • pp.531-539
    • /
    • 2019
  • This paper proposes the training features for malware family analysis and analyzes the multi-classification performance of ensemble models. We construct training data by extracting API and DLL information from malware executables and use Random Forest and XGBoost algorithms which are based on decision tree. API, API-DLL, and DLL-CM features for malware detection and family classification are proposed by analyzing frequently used API and DLL information from malware and converting high-dimensional features to low-dimensional features. The proposed feature selection method provides the advantages of data dimension reduction and fast learning. In performance comparison, the malware detection rate is 93.0% for Random Forest, the accuracy of malware family dataset is 92.0% for XGBoost, and the false positive rate of malware family dataset including benign is about 3.5% for Random Forest and XGBoost.

Diagnostic Classification Scheme in Iranian Breast Cancer Patients using a Decision Tree

  • Malehi, Amal Saki
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.14
    • /
    • pp.5593-5596
    • /
    • 2014
  • Background: The objective of this study was to determine a diagnostic classification scheme using a decision tree based model. Materials and Methods: The study was conducted as a retrospective case-control study in Imam Khomeini hospital in Tehran during 2001 to 2009. Data, including demographic and clinical-pathological characteristics, were uniformly collected from 624 females, 312 of them were referred with positive diagnosis of breast cancer (cases) and 312 healthy women (controls). The decision tree was implemented to develop a diagnostic classification scheme using CART 6.0 Software. The AUC (area under curve), was measured as the overall performance of diagnostic classification of the decision tree. Results: Five variables as main risk factors of breast cancer and six subgroups as high risk were identified. The results indicated that increasing age, low age at menarche, single and divorced statues, irregular menarche pattern and family history of breast cancer are the important diagnostic factors in Iranian breast cancer patients. The sensitivity and specificity of the analysis were 66% and 86.9% respectively. The high AUC (0.82) also showed an excellent classification and diagnostic performance of the model. Conclusions: Decision tree based model appears to be suitable for identifying risk factors and high or low risk subgroups. It can also assists clinicians in making a decision, since it can identify underlying prognostic relationships and understanding the model is very explicit.

Creating Level Set Trees Using One-Class Support Vector Machines (One-Class 서포트 벡터 머신을 이용한 레벨 셋 트리 생성)

  • Lee, Gyemin
    • Journal of KIISE
    • /
    • v.42 no.1
    • /
    • pp.86-92
    • /
    • 2015
  • A level set tree provides a useful representation of a multidimensional density function. Visualizing the data structure as a tree offers many advantages for data analysis and clustering. In this paper, we present a level set tree estimation algorithm for use with a set of data points. The proposed algorithm creates a level set tree from a family of level sets estimated over a whole range of levels from zero to infinity. Instead of estimating density function then thresholding, we directly estimate the density level sets using one-class support vector machines (OC-SVMs). The level set estimation is facilitated by the OC-SVM solution path algorithm. We demonstrate the proposed level set tree algorithm on benchmark data sets.

Composition and Diversity of Tree Species in Kamalachari Natural Forest of Chittagong South Forest Division, Bangladesh

  • Hossain, M. Akhter;Hossain, M. Kamal;Alam, M. Shafiul;Uddin, M. Main
    • Journal of Forest and Environmental Science
    • /
    • v.31 no.3
    • /
    • pp.192-201
    • /
    • 2015
  • Information on plant diversity and community structure are required to chalk out necessary actions for conservation management. The present study assessed the composition and diversity of tree species in Kamalachari Natural Forest of Chittagong South Forest Division, Bangladesh, during April 2010 to November 2011. A total of 107 tree species belonging to 72 genera and 37 families were recorded, where Moraceae family was represented by maximum (11) species. Density, Basal area and volume of tree species were $418{\pm}20.09stem/ha$, $21.10{\pm}2.62m^2/ha$ and $417.4{\pm}79.8m^3/ha$ respectively. Diameter and height class distribution of tree species revealed an almost reverse J-shaped curve. Both the number of species and percentage of tree individuals were maximum in the lower DBH and height ranges. Anthropogenic disturbances like illegal tree cutting, over extraction, settlement inside forest area etc. were noticed during the study, which are supposed to cause gradual decrease of both tree species and individuals in the higher DBH and height classes. However, Artocarpus chama was found dominant showing maximum IVI followed by Schima wallichii, Aporosa wallichii, and Lithocarpus acuminata. The quantitative structure of the tree species of Kamalachari natural forest is comparable to other tree species rich tropical natural forests. The findings of the study may help in monitoring future plant population changes of the identified species and adopting species specific conservation programs in Kamalachari natural forest.

Gene Structure and Phylogenetic Analysis of Cytohesin Family

  • Kim, Heui-Soo;Shin, Kyung-Mi;Lee, Ji-Won;Yi, Joo-Mi
    • Journal of Life Science
    • /
    • v.11 no.1
    • /
    • pp.39-41
    • /
    • 2001
  • Cytohesin family has been thought to participate in inside-outside signaling linking growth factor receptor stimulation of PI 3-kinase to cell adhesion and stimulate nucleotide exchange of ARF through its Sec7 domain. The genomic structure of the cytohesin family was analyzed by BLAST search using cDNA and genomic DNA sequences from the GeneBank database. The cytohesin-2 was encoded by 12 exons. while the cytohesin-4 was encoded by 13 exons. The Sec7 and PH domains were not encoded by separate exons. In an analysis of retroviral integration, those two families did not contain any retroviral elements in introns or exons. The phylogenetic tree calculated by the neighbor-joining method suggests that the cytohesin-1 family was closely related to cytohesin-3 (ARNO3) family. These date could be of great use in further studies for resolving the exact function and evolution of the cytohesin family.

  • PDF

Wind-induced fragility assessment of urban trees with structural uncertainties

  • Peng, Yongbo;Wang, Zhiheng;Ai, Xiaoqiu
    • Wind and Structures
    • /
    • v.26 no.1
    • /
    • pp.45-56
    • /
    • 2018
  • Wind damage of urban trees arises to be a serious issue especially in the typhoon-prone areas. As a family of tree species widely-planted in Southeast China, the structural behaviors of Plane tree is investigated. In order to accommodate the complexities of tree morphology, a fractal theory based finite element modeling method is proposed. On-site measurement of Plane trees is performed for physical definition of structural parameters. It is revealed that modal frequencies of Plane trees distribute in a manner of grouped dense-frequencies; bending is the main mode of structural failure. In conjunction with the probability density evolution method, the fragility assessment of urban trees subjected to wind excitations is then proceeded. Numerical results indicate that small-size segments such as secondary branches feature a relatively higher failure risk in a low wind level, and a relatively lower failure risk in a high wind level owing to windward shrinks. Besides, the trunk of Plane tree is the segment most likely to be damaged than other segments in case of high winds. The failure position tends to occur at the connection between trunk and primary branches, where the logical protections and reinforcement measures can be implemented for mitigating the wind damage.

Bit-Vector-Based Space Partitioning Indexing Scheme for Improving Node Utilization and Information Retrieval (노드 이용률과 검색 속도 개선을 위한 비트 벡터 기반 공간 분할 색인 기법)

  • Yeo, Myung-Ho;Seong, Dong-Ook;Yoo, Jae-Soo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.7
    • /
    • pp.799-803
    • /
    • 2010
  • The KDB-tree is a traditional indexing scheme for retrieving multidimensional data. Much research for KDB-tree family frequently addresses the low storage utilization and insufficient retrieval performance as their two bottlenecks. The bottlenecks occur due to a number of unnecessary splits caused by data insertion orders and data skewness. In this paper, we propose a novel index structure, called as $KDB_{CS}^+$-tree, to process skewed data efficiently and improve the retrieval performance. The $KDB_{CS}^+$-tree increases the number of fan-outs by exploiting bit-vectors for representing splitting information and pointer elimination. It also improves the storage utilization by representing entries as a hierarchical structure in each internal node.

Environmental Predictors of Atopic Dermatitis in Children - Using Answer Tree Analysis - (아동 아토피 피부염을 예측하는 환경적 요인들 - 의사결정 나무분석의 적용 -)

  • Lee, Ju-Lie
    • Korean Journal of Child Studies
    • /
    • v.31 no.2
    • /
    • pp.183-195
    • /
    • 2010
  • This study sought to investigate the environmental predictors of atopic dermatitis in children. The participants were 1050 (age 3-5) children taken from data data from the Ministry for Health, Welfare and Family Affairs. A data mining decision tree model revealed that the factors of medical neglect, breakfast, attachment to mother, and mother's depression influenced atopic dermatitis in children. Our results revealed that in the factors considered above, medical neglect had the greatest influence upon atopic dermatitis in children.

Phylogeny, host-parasite relationship and zoogeography

  • Hasegawa, Hideo
    • Parasites, Hosts and Diseases
    • /
    • v.37 no.4
    • /
    • pp.197-213
    • /
    • 1999
  • Phylogeny is the evolutionary history of a group or the lineage of organisms and is reconstructed based on morphological, molecular and other characteristics. The genealogical relationship of a group of taxa is often expressed as a phylogenetic tree. The difficulty in categorizing the phylogeny is mainly due to the existence of frequent homoplasies that deceive observers. At the present time, cladistic analysis is believed to be one of the most effective methods of reconstructing a phylogenetic tree. Excellent computer program software for phylogenetic analysis is available. As an example, cladistic analysis was applied for nematode genera of the family Acuariidae, and the phylogenetic tree formed was compared with the system used currently. Nematodes in the genera Nippostrongylus and Heligmonoides were also analyzed, and the validity of the reconstructed phylogenetic trees was observed from a zoogeographical point of view. Some of the theories of parasite evolution were briefly reviewed as well. Coevolution of parasites and humans was discussed with special reference to the evolutionary relationship between Enterobius and primates.

  • PDF