Search | Korea Science

A Study on Clustering and Identifying Gene Sequences using Suffix Tree Clustering Method and BLAST (서픽스트리 클러스터링 방법과 블라스트를 통합한 유전자 서열의 클러스터링과 기능검색에 관한 연구)

Han, Sang-Il;Lee, Sung-Gun;Kim, Kyung-Hoon;Lee, Ju-Yeong;Kim, Young-Han;Hwang, Kyu-Suk
- Journal of Institute of Control, Robotics and Systems / v.11 no.10 / pp.851-856 / 2005
DNA and protein data from diverse species are discovered daily and deposited in public archives in established formats. The database systems of these archives provide not only an easy-to-use, flexible public interface but also in silico analysis tools for unidentified sequence data. Among these tools, multiple sequence alignment [1] methods, which rely on pairwise alignment and the Smith-Waterman algorithm [2], make it possible to identify unknown DNA or protein sequences and the phylogenetic relationships among species. In existing multiple alignment methods, however, the runtime increases exponentially as the number of sequences grows. To remedy this problem, we adopted a parallel-processing suffix tree algorithm that can search for common subsequences in a single pass without pairwise alignment. Because the common subsequences found may include cross-matching subsequences that trigger inexact matches, this paper also proposes a cross-matching masking process. To identify the function of the clusters generated by suffix tree clustering, BLAST was combined with the clustering tool. The clustering and annotation tool works in four steps: (1) construction of the suffix tree; (2) masking of cross-matching pairs; (3) clustering of gene sequences; and (4) annotation of gene clusters by BLAST search. The system was successfully evaluated on 22 gene sequences from the pyruvate pathway of bacteria, producing 7 clusters and finding representative common subsequences for each cluster.
https://doi.org/10.5302/J.ICROS.2005.11.10.851 KSCI
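
The idea of finding subsequences common to several sequences without pairwise alignment can be illustrated with a minimal sketch. It is not the authors' parallel suffix tree implementation: it builds an uncompressed generalized suffix trie (quadratic in sequence length, so suitable only for short inputs), and the names `build_suffix_trie`, `longest_common_substring`, and `min_support` are hypothetical choices made for this illustration.

```python
from typing import Dict, List, Set, Tuple


class TrieNode:
    """One node of an uncompressed generalized suffix trie."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        # Ids of the sequences that have a suffix passing through this node.
        self.seq_ids: Set[int] = set()


def build_suffix_trie(sequences: List[str]) -> TrieNode:
    """Insert every suffix of every sequence into one shared trie."""
    root = TrieNode()
    for seq_id, seq in enumerate(sequences):
        for start in range(len(seq)):
            node = root
            for ch in seq[start:]:
                node = node.children.setdefault(ch, TrieNode())
                node.seq_ids.add(seq_id)
    return root


def longest_common_substring(root: TrieNode, min_support: int) -> Tuple[str, Set[int]]:
    """Return the longest substring shared by at least `min_support` sequences."""
    best: Tuple[str, Set[int]] = ("", set())
    stack = [(root, "")]
    while stack:
        node, prefix = stack.pop()
        for ch, child in node.children.items():
            # Support can only shrink along a path, so weak branches are pruned.
            if len(child.seq_ids) >= min_support:
                candidate = prefix + ch
                if len(candidate) > len(best[0]):
                    best = (candidate, set(child.seq_ids))
                stack.append((child, candidate))
    return best


# Toy sequences (made up for the example, not taken from the paper).
toy = ["ATGCGTAC", "TTGCGTAA", "CCGCGTAG", "AAAAAAAA"]
trie = build_suffix_trie(toy)
motif, members = longest_common_substring(trie, min_support=3)
print(motif, sorted(members))
```

A single traversal of the shared trie yields both the common subsequence and the group of sequences that contain it, which is the property that makes a suffix structure attractive for clustering.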

Query Space Exploration Model Using Genetic Algorithm

Lee, Jae-Hoon;Lee, Sung-Joo
- International Journal of Fuzzy Logic and Intelligent Systems / v.3 no.2 / pp.222-226 / 2003
Information retrieval must be able to find the documents most suitable to the user's needs in a document set. If document relevance is predicted only from the degree of similarity between a document and the query language (QL), documents that the searcher does not want are also retrieved. This paper shows that a genetic algorithm, searching the whole document space, can find the documents most suitable to the user's request, and uses knowledge-based operators to solve the problems of various models.
https://doi.org/10.5391/IJFIS.2003.3.2.222 KSCI

Finding a Temperature Control Method in Microwave Oven using Genetic Algorithm (Genetic Algorithm을 이용한 전자레인지 온도 최적 제어패턴 구현)

최이존;이승구;임형택;김성현;전홍태
- Proceedings of the Korean Institute of Intelligent Systems Conference / 1995.10b / pp.98-103 / 1995
In this paper, a method is presented for finding an optimal temperature control pattern in a microwave oven using a genetic algorithm. The power spectrum of the temperature variance of charcoal is obtained, and oven system modeling with a fuzzy neural network is explained. Fan on/off timing is encoded as strings in the gene pool, and genetic iterations then bring the power spectrum of the simulated temperature variance of the microwave oven closer to that of charcoal.

The effect of the new stopping criterion on the genetic algorithm performance

Kaya, Mustafa;Genc, Asim
- Computers and Concrete / v.27 no.1 / pp.63-71 / 2021
In this study, a new stopping criterion, called the "backward controlled stopping criterion" (BCSC), was proposed for use in genetic algorithms. The available stopping criteria (adaptive stopping criterion, evolution time, fitness threshold, fitness convergence, population convergence, and gene convergence) and the developed stopping criterion were applied to four comparison problems: high-strength concrete mix design, pre-stressed precast concrete beam, travelling salesman, and reinforced concrete deep beam. The analysis found that the developed stopping criterion performed better than the available criteria and was able to search a much larger area of the design space, yielding higher fitness values.
https://doi.org/10.12989/cac.2021.27.1.063 KSCI
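
Two of the conventional criteria named in the abstract, fitness threshold and fitness convergence, can be sketched on a toy problem. This is only an illustration under stated assumptions: the BCSC itself is not described in the abstract and is not implemented here, the OneMax objective stands in for the engineering problems, and `run_ga`, `patience`, and the other parameter names are hypothetical.

```python
import random
from typing import List, Optional, Tuple


def one_max(bits: List[int]) -> int:
    """Toy fitness: the number of 1-bits (a stand-in objective)."""
    return sum(bits)


def run_ga(
    n_bits: int = 20,
    pop_size: int = 30,
    max_generations: int = 200,
    fitness_threshold: Optional[int] = None,
    patience: int = 15,
    seed: int = 0,
) -> Tuple[int, int, str]:
    """Run a simple GA and report (best fitness, generations used, stop reason)."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    best = max(one_max(ind) for ind in pop)
    stale = 0  # generations since the best fitness last improved

    for gen in range(1, max_generations + 1):
        children = []
        while len(children) < pop_size:
            # Binary tournament selection of two parents.
            parents = []
            for _ in range(2):
                a, b = rng.sample(pop, 2)
                parents.append(a if one_max(a) >= one_max(b) else b)
            # One-point crossover followed by bit-flip mutation.
            cut = rng.randrange(1, n_bits)
            child = parents[0][:cut] + parents[1][cut:]
            child = [1 - g if rng.random() < 1.0 / n_bits else g for g in child]
            children.append(child)
        pop = children

        gen_best = max(one_max(ind) for ind in pop)
        if gen_best > best:
            best, stale = gen_best, 0
        else:
            stale += 1

        # Criterion 1: fitness threshold reached.
        if fitness_threshold is not None and best >= fitness_threshold:
            return best, gen, "fitness threshold"
        # Criterion 2: fitness convergence (no improvement for `patience` generations).
        if stale >= patience:
            return best, gen, "fitness convergence"

    # Fallback: generation budget exhausted.
    return best, max_generations, "generation limit"


print(run_ga(fitness_threshold=20))
```

The reported stop reason makes it easy to compare how early each criterion halts the search on the same seed.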

A Study on the Design of a Biologizing Control System

Park, Byung-Jae;Wang, Paul P.
- Journal of the Korean Institute of Intelligent Systems / v.14 no.5 / pp.630-634 / 2004
As society becomes more information-oriented, more human-friendly systems are required. Such systems can be implemented with intelligent algorithms. In this paper we explore the possibility of implementing an intelligent algorithm derived from gene behavior in human beings, which has properties such as self-organization and self-regulation. The regulation of gene behavior is widely analyzed using Boolean networks. The SORE (Self Organizable and Regulating Engine) is another such algorithm. This paper does not report detailed research results; rather, it studies the feasibility of gene behavior in biocontrol systems based upon computer simulations.
https://doi.org/10.5391/JKIIS.2004.14.5.630 KSCI
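
How a Boolean network models gene regulation can be shown with a small synchronous simulation. The three-gene ring below (A is repressed by C, B copies A, C copies B) is a hypothetical example, not a network from the paper, and `step` and `find_attractor` are names chosen for this sketch.

```python
from typing import Callable, Dict, List, Sequence, Tuple

State = Tuple[bool, ...]
Rule = Callable[[State], bool]

# Hypothetical regulatory rules for genes (A, B, C).
RULES: List[Rule] = [
    lambda s: not s[2],  # A is on unless C represses it
    lambda s: s[0],      # B follows A
    lambda s: s[1],      # C follows B
]


def step(state: State, rules: Sequence[Rule]) -> State:
    """Synchronous update: every gene reads the same previous state."""
    return tuple(rule(state) for rule in rules)


def find_attractor(state: State, rules: Sequence[Rule]) -> List[State]:
    """Iterate until a state repeats and return the cycle that was entered."""
    first_seen: Dict[State, int] = {}
    trajectory: List[State] = []
    while state not in first_seen:
        first_seen[state] = len(trajectory)
        trajectory.append(state)
        state = step(state, rules)
    return trajectory[first_seen[state]:]


cycle = find_attractor((True, False, False), RULES)
for s in cycle:
    print("".join("1" if g else "0" for g in s))
```

Because the state space is finite, every trajectory ends in a fixed point or a cycle; these attractors are what a Boolean-network analysis of self-regulation typically examines.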

A Finite Mixture Model for Gene Expression and Methylation Profiles in a Bayesian Framework

Jeong, Jae-Sik
- The Korean Journal of Applied Statistics / v.24 no.4 / pp.609-622 / 2011
The pattern of methylation draws significant attention from cancer researchers because it is believed that DNA methylation and gene expression have a causal relationship. As interest in the role of methylation patterns in cancer studies (especially drug-resistant cancers) increases, many studies have investigated the association between gene expression and methylation. However, a model-based approach is still urgently needed. We developed a finite mixture model in the Bayesian framework to find a possible relationship between gene expression and methylation. For inference, we employ the Expectation-Maximization (EM) algorithm to deal with latent (unobserved) variables, producing estimates of the parameters in the model. We then validated our model through a simulation study and applied the method to real data: wild-type and hydroxytamoxifen (OHT)-resistant MCF7 breast cancer cell lines.
https://doi.org/10.5351/KJAS.2011.24.4.609 KSCI
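
The role of EM in a finite mixture model can be sketched with the simplest case, a one-dimensional mixture of two Gaussians, where the latent variable is each observation's component membership. This is a generic textbook EM, not the paper's Bayesian model for expression and methylation; the function name `em_two_gaussians`, the initialization, and the simulated data are assumptions of this example.

```python
import math
import random
from typing import List, Tuple


def normal_pdf(x: float, mean: float, var: float) -> float:
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def em_two_gaussians(
    data: List[float], n_iter: int = 100
) -> Tuple[List[float], List[float], List[float]]:
    """Fit a two-component Gaussian mixture; return (weights, means, variances)."""
    n = len(data)
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    # Simple initialization (an assumption): extremes as means, pooled variance.
    mean = [min(data), max(data)]
    var = [overall_var, overall_var]
    weight = [0.5, 0.5]

    for _ in range(n_iter):
        # E-step: posterior probability that each point belongs to each component.
        resp = []
        for x in data:
            p = [weight[k] * normal_pdf(x, mean[k], var[k]) for k in range(2)]
            total = p[0] + p[1]
            resp.append((p[0] / total, p[1] / total))
        # M-step: re-estimate parameters from the soft assignments.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weight[k] = nk / n
            mean[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            spread = sum(r[k] * (x - mean[k]) ** 2 for r, x in zip(resp, data)) / nk
            var[k] = max(spread, 1e-6)  # floor guards against a collapsing component
    return weight, mean, var


# Simulated data (made up): two well-separated groups.
rng = random.Random(0)
sample = [rng.gauss(0.0, 1.0) for _ in range(200)] + [rng.gauss(10.0, 1.0) for _ in range(200)]
weights, means, variances = em_two_gaussians(sample)
print(weights, means, variances)
```

The E-step fills in the unobserved memberships with their expected values, and the M-step treats those expectations as weights when updating the parameters.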

Comparison of the Cluster Validation Techniques using Gene Expression Data (유전자 발현 자료를 이용한 군집 타당성분석 기법 비교)

Jeong, Yun-Kyoung;Baek, Jang-Sun
- 한국데이터정보과학회:학술대회논문집 / 2006.04a / pp.63-76 / 2006
Several clustering algorithms for analyzing gene expression data, and cluster validation techniques that assess the quality of their outcomes, have been suggested, but these validation techniques have seldom been evaluated. In this paper we compared various cluster validity indices on simulated data and real genomic data, and found that Dunn's index is more effective and robust in small simulations and with real gene expression data.
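
Dunn's index is commonly defined as the smallest distance between points in different clusters divided by the largest cluster diameter, so higher values indicate compact, well-separated clusters. The sketch below uses that common definition with Euclidean distance; the abstract does not state which variant the authors used, and the toy points are made up.

```python
import math
from itertools import combinations
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dunn_index(clusters: Sequence[Sequence[Point]]) -> float:
    """Minimum between-cluster distance divided by maximum within-cluster diameter."""
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")
    min_between = min(
        math.dist(p, q)
        for a, b in combinations(clusters, 2)
        for p in a
        for q in b
    )
    max_diameter = max(
        (math.dist(p, q) for c in clusters for p, q in combinations(c, 2)),
        default=0.0,
    )
    if max_diameter == 0.0:
        raise ValueError("every cluster is a single point or has zero spread")
    return min_between / max_diameter


# The same four points partitioned two ways.
good: List[List[Point]] = [[(0.0, 0.0), (0.0, 1.0)], [(5.0, 0.0), (5.0, 1.0)]]
poor: List[List[Point]] = [[(0.0, 0.0), (5.0, 0.0)], [(0.0, 1.0), (5.0, 1.0)]]
print(dunn_index(good), dunn_index(poor))
```

Comparing the two partitions of identical points shows how the index rewards the grouping whose clusters are tight relative to their separation.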

Novel Diagnostic Algorithm Using tuf Gene Amplification and Restriction Fragment Length Polymorphism is Promising Tool for Identification of Nontuberculous Mycobacteria

Shin, Ji-Hyun;Cho, Eun-Jin;Lee, Jung-Yeon;Yu, Jae-Yon;Kang, Yeon-Ho
- Journal of Microbiology and Biotechnology / v.19 no.3 / pp.323-330 / 2009
Nontuberculous mycobacteria (NTM) are a major cause of opportunistic infections in immunocompromised patients, making the reliable and rapid identification of NTM to the species level very important for the treatment of such patients. Therefore, this study evaluated the usefulness of the novel target genes tuf and tmRNA for the identification of NTM to the species level, using a PCR-restriction fragment length polymorphism analysis (PRA). A total of 44 reference strains and 17 clinical isolates of the genus Mycobacterium were used. The 741 bp or 744 bp tuf genes were amplified, restricted with two restriction enzymes (HaeIII/MboI), and sequenced. The tuf gene-PRA patterns were compared with those for the tmRNA (AvaII), hsp65 (HaeIII/HphI), rpoB (MspI/HaeIII), and 16S rRNA (HaeIII) genes. For the reference strains, the tuf gene-PRA yielded 43 HaeIII patterns, of which 35 (81.4%) showed unique patterns on the species level, whereas the tmRNA, hsp65, rpoB, and 16S rRNA-PRAs only showed 10 (23.3%), 32 (74.4%), 19 (44.2%), and 3 (7%) unique patterns after single digestion, respectively. The tuf gene-PRA produced a clear distinction between closely related NTM species, such as M. abscessus (557-84-58) and M. chelonae (477-84-80-58), and M. kansasii (141-136-80-63-58-54-51) and M. gastri (141-136-117-80-58-51). No difference was observed between the tuf-PRA patterns for the reference strains and clinical isolates. Thus, a diagnostic algorithm using a tuf gene-targeting PRA is a promising tool with more advantages than the previously used hsp65, rpoB, and 16S rRNA genes for the identification of NTM to the species level.
https://doi.org/10.4014/jmb.0804.267 KSCI
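
Fragment patterns such as 557-84-58 are the lengths produced when an amplicon is cut at every recognition site of an enzyme. A minimal in silico digest might look like the sketch below, assuming a linear amplicon and the standard cut positions of HaeIII (GG^CC) and MboI (^GATC). The input sequence is made up, not a tuf gene, and the helper names are hypothetical.

```python
from typing import Dict, List, Tuple

# Recognition site and cut offset within the site (top strand).
# Both sites are palindromic, so scanning one strand is sufficient.
ENZYMES: Dict[str, Tuple[str, int]] = {
    "HaeIII": ("GGCC", 2),  # GG^CC, blunt cut
    "MboI": ("GATC", 0),    # ^GATC, cut before the site
}


def fragment_lengths(sequence: str, enzyme: str) -> List[int]:
    """Return the fragment lengths of a linear sequence, longest first."""
    site, offset = ENZYMES[enzyme]
    cuts: List[int] = []
    pos = sequence.find(site)
    while pos != -1:
        cuts.append(pos + offset)
        pos = sequence.find(site, pos + 1)
    bounds = [0] + cuts + [len(sequence)]
    # Skip zero-length pieces, e.g. when a cut falls at the very start.
    lengths = [end - start for start, end in zip(bounds, bounds[1:]) if end > start]
    return sorted(lengths, reverse=True)


def pattern(sequence: str, enzyme: str) -> str:
    """Format fragment lengths in the dash-separated style used in the abstract."""
    return "-".join(str(n) for n in fragment_lengths(sequence, enzyme))


# Made-up amplicon with two HaeIII sites.
amplicon = "A" * 10 + "GGCC" + "T" * 20 + "GGCC" + "A" * 5
print(pattern(amplicon, "HaeIII"))
```

Two species can then be told apart whenever their patterns differ, which is how the listed abscessus/chelonae and kansasii/gastri pairs are distinguished.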

Unfolding the Eigen Shin-Tou-Jil (Proper Body.Earth Materials) by the Algorithm of Human-ware (고유신토질(固有身土質)의 휴먼웨어적 전개)

서윤정;유왕진
- Journal of Korean Society for Quality Management / v.28 no.1 / pp.13-26 / 2000
Material factors alone can hardly bring a fundamental improvement in quality of life, because they are only a partial means for the survival and activity of life. The development of Eigen Shin-Tou-Jil (Proper Body·Earth materials) must therefore concentrate on providing people with the essential meaning of life, not merely with economic advantage. Eigen Shin-Tou-Jil (Proper Body·Earth materials) are formed over a long passage of time in an original environment that includes the climate and natural features of a particular region; representative examples include Korean Bong-sam (a kind of ginseng) of the Yellow Earth. Unfolding the Eigen Shin-Tou-Jil (Proper Body·Earth materials) by the Algorithm of Human-ware means development that manifests individual eigen motives and traits as the subject of behavior (Gene-ware). This is because all plants, animals, and inanimate objects, including humans, have evolved with their own values in the ecosystem. It was reported that a Baeksong (white pine tree) that now grows well in TongEeDong, Seoul, Korea, had hardly grown during the period of Japanese rule. Through developments in bioengineering, it has also been found that 40% of the gene base sequence of C. elegans (a kind of worm) is identical to that of humans. For this reason, considering the common characteristics of humans and nature, the development of Eigen Shin-Tou-Jil (Proper Body·Earth materials) must begin with an epoch of manifesting and understanding each individual's eigen motives and traits as the subject of behavior (Gene-ware).

Support vector machine and multifactor dimensionality reduction for detecting major gene interactions of continuous data (서포트 벡터 머신 알고리즘을 활용한 연속형 데이터의 다중인자 차원축소방법 적용)

Lee, Jea-Young;Lee, Jong-Hyeong
- Journal of the Korean Data and Information Science Society / v.21 no.6 / pp.1271-1280 / 2010
The multifactor dimensionality reduction (MDR) method has generally been used to study gene-gene interaction effects in statistical models. However, the MDR method could not be applied to continuous data. In this paper, we propose using the support vector machine (SVM) algorithm to make continuous data usable with the MDR method, and we introduce the technique. We also apply the method to identify major interaction effects of single nucleotide polymorphisms (SNPs) responsible for economic traits in a Korean cattle population.
KSCI

