• Title/Summary/Keyword: Similarity Matrix

Search Result 316, Processing Time 0.026 seconds

The RNA Base Over 95% of Onju Citrus and Coffee Genes Cut & Paste Based on The BCJM Matrix with Chargaff-Shannon Entropy (BCJM 행렬 및 Chargaff 법칙과 Shannon Entropy에 의한 RNA 유전자 비율이 95%이상인 온주감귤과 귤의 유전자 조합)

  • Lee, Sung Kook;Kim, Jeong Su;Lee, Moon Ho
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.415-422
    • /
    • 2022
  • The heterogeneous Onju citrus genes (A=20.57, C=32.71, G=30.01, U=16.71%) and coffee genes (A=20.66, C=31.76, G=30.187, U=16.71%) have the same genetic ratio of 95% or more. It is known that gene compatibility is generally not possible with this group. However, it can be grafted if the conditions of Chargaff rule and Shannon Entropy are met with gene functional-similarity of more than 95%, and it becomes a new breed of Coffrange. We calculated the world's first BCJM matrix for DNA-RNA and published it in US patents and international journals. All animals and viruses are similar to human genes. Based on this, it was announced in June in the British matrix textbook by solving the genetic characteristics of COVID-19 and the human body. In plants, it is treated with BCJM-Transposon treatment, a technique that easily changes gene location. Simulation predicted that the matrix could be successful with Cut & Paste and Transpose.

A Recommendation Technique using Weight of User Information (사용자 정보 가중치를 이용한 추천 기법)

  • Yun, So-Young;Youn, Sung-Dae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.4
    • /
    • pp.877-885
    • /
    • 2011
  • A collaborative filtering(CF) is the most widely used technique in recommender system. However, CF has sparsity and scalability problems. These problems reduce the accuracy of recommendation and extensive studies have been made to solve these problems, In this paper, we proposed a method that uses a weight so as to solve these problems. After creating a user-item matrix, the proposed method analyzes information about users who prefer the item only by using data with a rating over 4 for enhancing the accuracy in the recommendation. The proposed method uses information about the genre of the item as well as analyzed user information as a weight during the calculation of similarity, and it calculates prediction by using only data for which the similarity is over a threshold and uses the data as the rating value of unrated data. It is possible simultaneously to reduce sparsity and to improve accuracy by calculating prediction through an analysis of the characteristics of an item. Also, it is possible to conduct a quick classification based on the analyzed information once a new item and a user are registered. The experiment result indicated that the proposed method has been more enhanced the accuracy, compared to item based, genre based methods.

A study on searching image by cluster indexing and sequential I/O (연속적 I/O와 클러스터 인덱싱 구조를 이용한 이미지 데이타 검색 연구)

  • Kim, Jin-Ok;Hwang, Dae-Joon
    • The KIPS Transactions:PartD
    • /
    • v.9D no.5
    • /
    • pp.779-788
    • /
    • 2002
  • There are many technically difficult issues in searching multimedia data such as image, video and audio because they are massive and more complex than simple text-based data. As a method of searching multimedia data, a similarity retrieval has been studied to retrieve automatically basic features of multimedia data and to make a search among data with retrieved features because exact match is not adaptable to a matrix of features of multimedia. In this paper, data clustering and its indexing are proposed as a speedy similarity-retrieval method of multimedia data. This approach clusters similar images on adjacent disk cylinders and then builds Indexes to access the clusters. To minimize the search cost, the hashing is adapted to index cluster. In addition, to reduce I/O time, the proposed searching takes just one I/O to look up the location of the cluster containing similar object and one sequential file I/O to read in this cluster. The proposed schema solves the problem of multi-dimension by using clustering and its indexing and has higher search efficiency than the content-based image retrieval that uses only clustering or indexing structure.

Query Processing Model Using Two-level Fuzzy Knowledge Base (2단계 퍼지 지식베이스를 이용한 질의 처리 모델)

  • Lee, Ki-Young;Kim, Young-Un
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.4 s.36
    • /
    • pp.1-16
    • /
    • 2005
  • When Web-based special retrieval systems for scientific field extremely restrict the expression of user's information request, the process of the information content analysis and that of the information acquisition become inconsistent. Accordingly, this study suggests the re-ranking retrieval model which reflects the content based similarity between user's inquiry terms and index words by grasping the document knowledge structure. In order to accomplish this, the former constructs a thesaurus and similarity relation matrix to provide the subject analysis mechanism and the latter propose the algorithm which establishes a search model such as query expansion in order to analyze the user's demands. Therefore, the algorithm that this study suggests as retrieval utilizing the information structure of a retrieval system can be content-based retrieval mechanism to establish a 2-step search model for the preservation of recall and improvement of accuracy which was a weak point of the previous fuzzy retrieval model.

  • PDF

A Question Example Generation System for Multiple Choice Tests by utilizing Concept Similarity in Korean WordNet (한국어 워드넷에서의 개념 유사도를 활용한 선택형 문항 생성 시스템)

  • Kim, Young-Bum;Kim, Yu-Seop
    • The KIPS Transactions:PartA
    • /
    • v.15A no.2
    • /
    • pp.125-134
    • /
    • 2008
  • We implemented a system being able to suggest example sentences for multiple choice tests, considering the level of students. To build the system, we designed an automatic method for sentence generation, which made it possible to control the difficulty degree of questions. For the proper evaluation in the multiple choice tests, proper size of question pools is required. To satisfy this requirement, a system which can generate various and numerous questions and their example sentences in a fast way should be used. In this paper, we designed an automatic generation method using a linguistic resource called WordNet. For the automatic generation, firstly, we extracted keywords from the existing sentences with the morphological analysis and candidate terms with similar meaning to the keywords in Korean WordNet space are suggested. When suggesting candidate terms, we transformed the existing Korean WordNet scheme into a new scheme to construct the concept similarity matrix. The similarity degree between concepts can be ranged from 0, representing synonyms relationships, to 9, representing non-connected relationships. By using the degree, we can control the difficulty degree of newly generated questions. We used two methods for evaluating semantic similarity between two concepts. The first one is considering only the distance between two concepts and the second one additionally considers positions of two concepts in the Korean Wordnet space. With these methods, we can build a system which can help the instructors generate new questions and their example sentences with various contents and difficulty degree from existing sentences more easily.

Evaluation on Development Performances of E-Commerce for 50 Major Cities in China (중국 주요 50개 도시의 전자상거래 발전성과에 대한 평가)

  • Jeong, Dong-Bin;Wang, Qiang
    • Journal of Distribution Science
    • /
    • v.14 no.1
    • /
    • pp.67-74
    • /
    • 2016
  • Purpose - In this paper, the degree of similarity and dissimilarity between pairs of 50 major cities in China can be shown on the basis of three evaluation variables(internet businessman index, internet shopping index and e-commerce development index). Dissimilarity distance matrix is used to analyze both similarity and dissimilarity between each fifty city in China by calculating dissimilarity as distance. Higher value signifies higher degree of dissimilarity between two cities. Cluster analysis is exploited to classify 50 cities into a number of different groups such that similar cities are placed in the same group. In addition, multidimensional scaling(MDS) technique can obtain visual representation for exploring the pattern of proximities among 50 major cities in China based on three development performance attributes. Research design, data, and methodology - This research is performed by the 2013 report provided with AliResearch in China(1/1/2013~11/30/2013) and utilized multivariate methods such as dissimilarity distance matrix, cluster analysis and MDS by using CLUSTER, KMEANS, PROXIMITIES and ALSCAL procedures in SPSS 21.0. Results - This research applies two types of cluster analysis and MDS on three development performances based on the 2013 report of Aliresearch. As a result, it is confirmed that grouping is possible by categorizing the types into four clusters which share similar characteristics. MDS is exploited to carry out positioning of both grouped locations of cluster and 50 major cities belonging to each cluster. Since all the values corresponding to Shenzhen, Guangzhou and Hangzhou(which belong to cluster 1 among 50 major cities) are very large, these cities are superior to other cities in all three evaluation attributes. Twelve cities(Beijing, ShangHai, Jinghua, ZhuHai, XiaMen, SuZhou, NanJing, DongWan, ZhangShan, JiaXing, NingBo and FoShan), which belong to cluster 3, are inferior to those of cluster 1 in terms of all three attributes, but they can be expected to be the next e-commerce revolution. The rest of major cities, in particular, which belong to cluster 4 are relatively inferior in all three attributes, so that this automatically evokes creative innovation, which leads to e-commerce development as a whole in China. In terms of internet businessman index, on the other hand, Tainan, Taizhong, and Gaoxiong(which belong to cluster 2) are situated superior to others. However, these three cities are inferior to others in an internet shopping index sense. The rest of major cities, in particular, which belong to cluster 4 are relatively inferior in all three evaluation attributes, so that this automatically evokes innovation and entrepreneurship, which leads to e-commerce development as a whole in China. Conclusions - This study suggests the implications to help e-governmental officers and companies make strategies in both Korea and China. This is expected to give some useful information in understanding the recent situation of e-commerce in China, by looking over development performances of 50 major cities. Therefore, we should develop marketing, branding and communication relevant to online Chinese consumers. One of these efforts will be incentives like loyalty points and coupons that can encourage consumers and building in-house logistics networks.

Discrimination of Bacillus anthracis Spores by Direct in-situ Analysis of Matrix-Assisted Laser Desorption/Ionization Time-Of-Flight Mass Spectrometry

  • Jeong, Young-Su;Lee, Jonghee;Kim, Seong-Joo
    • Bulletin of the Korean Chemical Society
    • /
    • v.34 no.9
    • /
    • pp.2635-2639
    • /
    • 2013
  • The rapid and accurate identification of biological agents is a critical step in the case of bio-terror and biological warfare attacks. Recently, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry has been widely used for the identification of microorganisms. In this study, we describe a method for the rapid and accurate discrimination of Bacillus anthracis spores using MALDI-TOF MS. Our direct in-situ analysis of MALDI-TOF MS does not involve subsequent high-resolution mass analyses and sample preparation steps. This method allowed the detection of species-specific biomarkers from each Bacillus spores. Especially, B. anthracis spores had specific biomarker peaks at 2503, 3089, 3376, 6684, 6698, 6753, and 6840 m/z. Cluster and PCA analyses of the mass spectra of Bacillus spores revealed distinctively separated clusters and within-groups similarity. Therefore, we believe that this method is effective in the real-time identification of biological warfare agents such as B. anthracis as well as other microorganisms in the field.

Centrifugal Infiltration Process of Fibrous Tubular Preform by Al-Cu Alloy

  • Li, Yanhong;Wang, Kai;Su, Yongkang;Hu, Guoxin
    • Advanced Composite Materials
    • /
    • v.18 no.4
    • /
    • pp.381-394
    • /
    • 2009
  • The kinetics of centrifugal infiltration of fibrous tubular preform is built theoretically, and simulations are conducted to study the effects of various casting conditions on infiltration kinetics and macrosegregation by combining with the energy, mass and kinetic equations. A similarity way is used to simplify the one-dimensional model and the parameter is ascertained by an iterative method. The results indicate that the increase of superheat, initial preform temperature, porosity tends to enlarge the remelting region and decrease copper solute concentration at the infiltration front. Higher angular velocity leads to smaller remelting region and solute concentration at the tip. The pressure in the infiltrated region increase significantly when the angular velocity is much higher, which requires a stronger preform. It is observed that the pressure distribution is mainly determined by the angular velocity, and the macrosegregation in the centrifugal casting is greatly dependent on the superheat of inlet metal matrix, initial temperature and porosity of the preform, and the angular velocity.

Application of RAPD markers for characterization of ${\gamma}$-ray-induced rose mutants and assessment of genetic diversity

  • Chakrabarty, D.;Datta, S.K.
    • Plant Biotechnology Reports
    • /
    • v.4 no.3
    • /
    • pp.237-242
    • /
    • 2010
  • Six parent and their 12 gamma ray-induced somatic flower colour mutants of garden rose were characterized to discriminate the mutants from their respective parents and understanding the genetic diversity using Random amplification of polymorphic DNA (RAPD) markers. Out of 20 primers screened, 14 primers yielded completely identical fragments patterns. The other 7 primers gave highly polymorphic banding patterns among the radiomutants. All the cultivars were identified by using only 7 primers. Moreover, individual mutants were also distinguished by unique RAPD marker bands. Based on the presence or absence of the 48 polymorphic bands, the genetic variations within and among the 18 cultivars were measured. Genetic distance between all 18 cultivars varied from 0.40 to 0.91, as revealed by Jaccard's coefficient matrix. A dendrogram was constructed based on the similarity matrix using the Neighbor Joining Tree method showed three main clusters. The present RAPD analysis can be used not only for estimating genetic diversity present in gamma ray-induced mutants but also for correct identification of mutant/new varieties for their legal protection under plant variety rights.

Stability Robustness of Unified Decentralized Systems (단일 분산시스템의 강인안정성 해석)

  • Lee, Dong-Gi;Heo, Gwang-Hee;Oh, Do-Chang;Lee, Giu;Lee, Woo-Sang
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.44 no.2 s.314
    • /
    • pp.1-9
    • /
    • 2007
  • In this paper, new results for perturbation bounds for unified decentralized systems by a unified approach using $\delta$ (defined as a shift operator at unified approach) are presented. Robust stability analysis of unified decentralized system is investigated by new robust stability bound under system uncertainties. New unified stability bounds are developed based on the unified Lyapunov matrix equation. It is shown that the system maintains its stability when new unified bounds are applied. Numerical example is presented to illustrate the proposed analysis.