• Title/Summary/Keyword: Selection Signature Analysis

Search Result 27, Processing Time 0.018 seconds

Genome-wide analyses of the Jeju, Thoroughbred, and Jeju crossbred horse populations using the high density SNP array

  • Kim, Nam Young;Seong, Ha-Seung;Kim, Dae Cheol;Park, Nam Geon;Yang, Byoung Chul;Son, Jun Kyu;Shin, Sang Min;Woo, Jae Hoon;Shin, Moon Cheol;Yoo, Ji Hyun;Choi, Jung-Woo
    • Genes and Genomics
    • /
    • v.40 no.11
    • /
    • pp.1249-1258
    • /
    • 2018
  • The Jeju horse is an indigenous Korean horse breed that is currently registered with the Food and Agriculture Organization of the United Nations. However, there is severe lack of genomic studies on Jeju horse. This study was conducted to investigate genetic characteristics of horses including Jeju horse, Thoroughbred and Jeju crossbred (Jeju${\times}$Thoroughbred) populations. We compared the genomes of three horse populations using the Equine SNP70 Beadchip array. Short-range Linkage disequilibrium was the highest in Thoroughbred, whereas $r^2$ values were lowest in Jeju horse. Expected heterozygosity was the highest in Jeju crossbred (0.351), followed by the Thoroughbred (0.337) and Jeju horse (0.311). The level of inbreeding was slightly higher in Thoroughbred (-0.009) than in Jeju crossbred (-0.035) and Jeju horse (-0.038). $F_{ST}$ value was the highest between Jeju horse and Thoroughbred (0.113), whereas Jeju crossbred and Thoroughbred showed the lowest value (0.031). The genetic relationship was further assessed by principal component analysis, suggesting that Jeju crossbred is more genetically similar to Thoroughbred than Jeju horse population. Additionally, we detected potential selection signatures, for example, in loci located on LCORL/NCAPG and PROP1 genes that are known to influence body. Genome-wide analyses of the three horse populations showed that all the breeds had somewhat a low level of inbreeding within each population. In the population structure analysis, we found that Jeju crossbred was genetically closer to Thoroughbred than Jeju horse. Furthermore, we identified several signatures of selection which might be associated with traits of interest. To our current knowledge, this study is the first genomic research, analyzing genetic relationships of Jeju horse, Thoroughbred and Jeju crossbred.

Offline Based Ransomware Detection and Analysis Method using Dynamic API Calls Flow Graph (다이나믹 API 호출 흐름 그래프를 이용한 오프라인 기반 랜섬웨어 탐지 및 분석 기술 개발)

  • Kang, Ho-Seok;Kim, Sung-Ryul
    • Journal of Digital Contents Society
    • /
    • v.19 no.2
    • /
    • pp.363-370
    • /
    • 2018
  • Ransomware detection has become a hot topic in computer security for protecting digital contents. Unfortunately, current signature-based and static detection models are often easily evadable by compress, and encryption. For overcoming the lack of these detection approach, we have proposed the dynamic ransomware detection system using data mining techniques such as RF, SVM, SL and NB algorithms. We monitor the actual behaviors of software to generate API calls flow graphs. Thereafter, data normalization and feature selection were applied to select informative features. We improved this analysis process. Finally, the data mining algorithms were used for building the detection model for judging whether the software is benign software or ransomware. We conduct our experiment using more suitable real ransomware samples. and it's results show that our proposed system can be more effective to improve the performance for ransomware detection.

Performance Improvement of the Statistical Information based Traffic Identification System (통계 정보 기반 트래픽 분석 방법론의 성능 향상)

  • An, Hyun Min;Ham, Jae Hyun;Kim, Myung Sup
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.2 no.8
    • /
    • pp.335-342
    • /
    • 2013
  • Nowadays, the traffic type and behavior are extremely diverse due to the growth of network speed and the appearance of various services on Internet. For efficient network operation and management, the importance of application-level traffic identification is more and more increasing in the area of traffic analysis. In recent years traffic identification methodology using statistical features of traffic flow has been broadly studied. However, there are several problems to be considered in the identification methodology base on statistical features of flow to improve the analysis accuracy. In this paper, we recognize these problems by analyzing the ground-truth traffic and propose the solution of these problems. The four problems considered in this paper are the distance measurement of features, the selection of the representative value of features, the abnormal behavior of TCP sessions, and the weight assignment to the feature. The proposed solutions were verified by showing the performance improvement through experiments in campus network.

Genetic diversity and divergence among Korean cattle breeds assessed using a BovineHD single-nucleotide polymorphism chip

  • Kim, Seungchang;Cheong, Hyun Sub;Shin, Hyoung Doo;Lee, Sung-Soo;Roh, Hee-Jong;Jeon, Da-Yeon;Cho, Chang-Yeon
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.11
    • /
    • pp.1691-1699
    • /
    • 2018
  • Objective: In Korea, there are three main cattle breeds, which are distinguished by coat color: Brown Hanwoo (BH), Brindle Hanwoo (BRH), and Jeju Black (JB). In this study, we sought to compare the genetic diversity and divergence among there Korean cattle breeds using a BovineHD chip genotyping array. Methods: Sample data were collected from 168 cattle in three populations of BH (48 cattle), BRH (96 cattle), and JB (24 cattle). The single-nucleotide polymorphism (SNP) genotyping was performed using the Illumina BovineHD SNP 777K Bead chip. Results: Heterozygosity, used as a measure of within-breed genetic diversity, was higher in BH (0.293) and BRH (0.296) than in JB (0.266). Linkage disequilibrium decay was more rapid in BH and BRH than in JB, reaching an average $r^2$ value of 0.2 before 26 kb in BH and BRH, whereas the corresponding value was reached before 32 kb in JB. Intra-population, interpopulation, and Fst analyses were used to identify candidate signatures of positive selection in the genome of a domestic Korean cattle population and 48, 11, and 11 loci were detected in the genomic region of the BRH breed, respectively. A Neighbor-Joining phylogenetic tree showed two main groups: a group comprising BH and BRH on one side and a group containing JB on the other. The runs of homozygosity analysis between Korean breeds indicated that the BRH and JB breeds have high inbreeding within breeds compared with BH. An analysis of differentiation based on a high-density SNP chip showed differences between Korean cattle breeds and the closeness of breeds corresponding to the geographic regions where they are evolving. Conclusion: Our results indicate that although the Korean cattle breeds have common features, they also show reliable breed diversity.

Machine Learning-Based Malicious URL Detection Technique (머신러닝 기반 악성 URL 탐지 기법)

  • Han, Chae-rim;Yun, Su-hyun;Han, Myeong-jin;Lee, Il-Gu
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.32 no.3
    • /
    • pp.555-564
    • /
    • 2022
  • Recently, cyberattacks are using hacking techniques utilizing intelligent and advanced malicious codes for non-face-to-face environments such as telecommuting, telemedicine, and automatic industrial facilities, and the damage is increasing. Traditional information protection systems, such as anti-virus, are a method of detecting known malicious URLs based on signature patterns, so unknown malicious URLs cannot be detected. In addition, the conventional static analysis-based malicious URL detection method is vulnerable to dynamic loading and cryptographic attacks. This study proposes a technique for efficiently detecting malicious URLs by dynamically learning malicious URL data. In the proposed detection technique, malicious codes are classified using machine learning-based feature selection algorithms, and the accuracy is improved by removing obfuscation elements after preprocessing using Weighted Euclidean Distance(WED). According to the experimental results, the proposed machine learning-based malicious URL detection technique shows an accuracy of 89.17%, which is improved by 2.82% compared to the conventional method.

A Study on Ransomware Detection Methods in Actual Cases of Public Institutions (공공기관 실제 사례로 보는 랜섬웨어 탐지 방안에 대한 연구)

  • Yong Ju Park;Huy Kang Kim
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.33 no.3
    • /
    • pp.499-510
    • /
    • 2023
  • Recently, an intelligent and advanced cyber attack attacks a computer network of a public institution using a file containing malicious code or leaks information, and the damage is increasing. Even in public institutions with various information protection systems, known attacks can be detected, but unknown dynamic and encryption attacks can be detected when existing signature-based or static analysis-based malware and ransomware file detection methods are used. vulnerable to The detection method proposed in this study extracts the detection result data of the system that can detect malicious code and ransomware among the information protection systems actually used by public institutions, derives various attributes by combining them, and uses a machine learning classification algorithm. Results are derived through experiments on how the derived properties are classified and which properties have a significant effect on the classification result and accuracy improvement. In the experimental results of this paper, although it is different for each algorithm when a specific attribute is included or not, the learning with a specific attribute shows an increase in accuracy, and later detects malicious code and ransomware files and abnormal behavior in the information protection system. It is expected that it can be used for property selection when creating algorithms.

Current Status of Cattle Genome Sequencing and Analysis using Next Generation Sequencing (차세대유전체해독 기법을 이용한 소 유전체 해독 연구현황)

  • Choi, Jung-Woo;Chai, Han-Ha;Yu, Dayeong;Lee, Kyung-Tai;Cho, Yong-Min;Lim, Dajeong
    • Journal of Life Science
    • /
    • v.25 no.3
    • /
    • pp.349-356
    • /
    • 2015
  • Thanks to recent advances in next-generation sequencing (NGS) technology, diverse livestock species have been dissected at the genome-wide sequence level. As for cattle, there are currently four Korean indigenous breeds registered with the Domestic Animal Diversity Information System of the Food and Agricultural Organization of the United Nations: Hanwoo, Chikso, Heugu, and Jeju Heugu. These native genetic resources were recently whole-genome resequenced using various NGS technologies, providing enormous single nucleotide polymorphism information across the genomes. The NGS application further provided biological such that Korean native cattle are genetically distant from some cattle breeds of European origins. In addition, the NGS technology was successfully applied to detect structural variations, particularly copy number variations that were usually difficult to identify at the genome-wide level with reasonable accuracy. Despite the success, those recent studies also showed an inherent limitation in sequencing only a representative individual of each breed. To elucidate the biological implications of the sequenced data, further confirmatory studies should be followed by sequencing or validating the population of each breed. Because NGS sequencing prices have consistently dropped, various population genomic theories can now be applied to the sequencing data obtained from the population of each breed of interest. There are still few such population studies available for the Korean native cattle breeds, but this situation will soon be improved with the recent initiative for NGS sequencing of diverse native livestock resources, including the Korean native cattle breeds.