• Title/Summary/Keyword: cluster coefficient

Search Result 222, Processing Time 0.032 seconds

Improving the Performance of Document Clustering with Distributional Similarities (분포유사도를 이용한 문헌클러스터링의 성능향상에 대한 연구)

  • Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.267-283
    • /
    • 2007
  • In this study, measures of distributional similarity such as KL-divergence are applied to cluster documents instead of traditional cosine measure, which is the most prevalent vector similarity measure for document clustering. Three variations of KL-divergence are investigated; Jansen-Shannon divergence, symmetric skew divergence, and minimum skew divergence. In order to verify the contribution of distributional similarities to document clustering, two experiments are designed and carried out on three test collections. In the first experiment the clustering performances of the three divergence measures are compared to that of cosine measure. The result showed that minimum skew divergence outperformed the other divergence measures as well as cosine measure. In the second experiment second-order distributional similarities are calculated with Pearson correlation coefficient from the first-order similarity matrixes. From the result of the second experiment, secondorder distributional similarities were found to improve the overall performance of document clustering. These results suggest that minimum skew divergence must be selected as document vector similarity measure when considering both time and accuracy, and second-order similarity is a good choice for considering clustering accuracy only.

A Study on the Quantitative Rehabilitation Extent Evaluation Method Using High-Order Function Waveform Analysis of EMG Signal (근전도 신호의 고차함수분석법을 이용한 정량적 재활정도 평가에 관한 연구)

  • Moon, D.J.;Kim, J.Y.;Noh, S.C.;Choi, H.H.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.8 no.4
    • /
    • pp.305-312
    • /
    • 2014
  • In this study, in order to quantitatively confirm walking rehabilitation degree, we analyzed EMG pattern simulated abnormal gait and normal gait by applying a curve fitting. We calculated the suitable high-order function for EMG signal, and classified them into 5 groups by using cluster analysis. Depending on the distance from normal pattern group, we listed the pattern group and then the distribution of each variables were confirmed. The amplitude-decreased pattern was the most similar to the normal pattern, but the reversed pattern showed the lowest similarity. Due to the smaller overlapping range, the distribution of the groups were possible to classify using the value of variable. The standard deviation of each term coefficient was compared to indicate the quantitative rehabilitation extent, and the higher value was confirmed as the pattern is close to the normal pattern. Consequently, the representation of quantitative rehabilitation extent is expected to contribute to the more effective rehabilitation method study.

  • PDF

Genetic Diversity of Korean Rice Breeding Parents as Measured by DNA Fingerprinting with Simple Sequence Repeat (SSR) Markers

  • Song, Moon-Tae;Lee, Jeom-Ho;Lee, Sang-Bok;Cho, Youn-Sang;Ku, Ja-hwan;Seo, Kyoung-In;Choi, Seong-ho;Hwang, Heung-Goo
    • Plant Resources
    • /
    • v.6 no.1
    • /
    • pp.16-26
    • /
    • 2003
  • Molecular markers are useful tools for evaluating genetic diversity and determining cultivar identity. Present study was conducted to evaluate the genetic diversity within a diverse collection of rice accessions used for Korean breeding programs. Two hundred eighty-seven rice cultivars, composed of temperate japonica, tropical japonica, indica, and Tongil-type of Korean crossing parents were evaluated by means of 15 simple sequence repeat (SSR) markers. A total of 99 alleles were detected, and the number of alleles per marker ranged from 4 to 11, with an average of 6.6 per locus. Polymorphism information content (PIC) for each of the SSR markers ranged from 0.2924 to 0.8102 with an average of 0.5785. These results, with the result that use of only 15 SSR markers made all rice cultivars examined could be uniquely distinguished, imply the efficiency of SSR markers for analysis of genetic diversity in rice. Cluster analysis was performed on similar coefficient matrics calculated from SSR markers to generate a dendogram in which two major groups corresponding to japonica (Group I) and indica and Tongil type rice (group II) with additional subclasses within both major groups. The narrowness of the Korean breeding germplasm was revealed by the fact that most of the Korean-bred and Japan-bred temperate japonica cultivars were concentrated into only 2 of the sub-group I-1 (143 cultivars) and I-2 (58 cultivars) among six sub-groups in major group of japonica. This is because of the japonica accessions used in this study was a very closely related ones because of frequent sharing of the crossing parents with similar genetic background with synergy effect of the inherited genetic difference between indica and japonica. A rice breeding strategy with the use of molecular markers was discussed for overcoming of genetic vulnerability owing to this genetic narrowness.

  • PDF

A Study on the Pattern Development and Wear Fitness of the Bodysuit (Bodysuit의 패턴개발과 적합성에 관한 연구)

  • 최미성
    • Korean Journal of Rural Living Science
    • /
    • v.5 no.2
    • /
    • pp.93-106
    • /
    • 1994
  • The purpose of this study was to develop the pattern of bodysuit and to identify the wear fitness of it The methods of statistical analysis applied to the study were ANOVA and cluster analysis. The materials used in making bodysuit were Nylon/Polyurethane, lace, power net, binding tape, and hook eye. The try-on test was administered in two aspects ; (1) the comparison of anthropometric data before and after trying on the experimentally constructed bodysuit with those of marketing bodysuit, (2) the sensory evaluation to estimate the wear fitness in terms of appearance and motion function. The conclusions obtained are as follows ; 1. In the survey of wearing state, 52.2% of respondents had experience of wearing bodysuit. 60.6% of them responded to the item, “well-balanced body” in the question about the purpose of wearing it. 55.7% considered the item, “feel choky in the chest” as uncomfortable point in putting on bodysuit. 48.3% felt the portion of crotch drawn above in taking exercise or behaving routinely in everyday life. 2. As for the characteristics of the bodysuit design, the scooped neckline and horizontal outline without wire in lower bust was used, the adjust point being located right above the perineum point, and the length of bodysuit is as far as trochanteric point. 3. In comparing anthropometric data of the subjects, there was significant difference in the height of lower bust the distance around abdomen, and the length of bust point(right, left) between the experimentally constructed bodysuit and the marketing bodysuit. 4. Concerning the results of the try-on test in appearance, the estimates of expert panel, which were in agreement with those of subjects in mean value and composite reliability coefficient, showed that the pattern fitness of experimentally designed bodysuit was higher than that of marketing bodysuit. 5. To take try-on test in motion function, motion was classified the five steps. The results of the test showed that experimentally designed bodysuit was fitter in each steps of motion than marketing bodysuit.

  • PDF

Relationships Between Leisure Competence, Leisure Flow, and Leisure Satisfaction of University Students Participating in Leisure Activities (대학생의 여가유능감과 여가몰입, 여가만족도의 관계)

  • Song, Kang-Young;Lim, Young-Sam;Ahn, Byoung-Wook
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.10
    • /
    • pp.425-433
    • /
    • 2011
  • The purpose of this study was to investigate the relationships between leisure competence, leisure flow, and leisure satisfaction of university students participating in leisure activity. The subjects were selected by stratified cluster random sampling method. They were composed of 308 students who had been leisure activity participating in university students. The Leisure competence(Ahn, 2005), Leisure flow(Lee, 2006), Leisure satisfaction(Ahn, 2009) were used for collecting data. In consequence of exploratory factor analysis, 3sub-factors(leisure competence), 5sub-factors(leisure flow), and 5sub-factor(leisure satisfaction were found. Cronbach's ${\alpha}$ coefficient were .726~.850, .537~.887, .764~.943 respectively. For the statistical analysis, SPSS 15.0 and AMOS 7.0 were utilized. The relationship between research variables were examined by the frequency, explore factor, reliability, corelation, structural equation modeling analysis. The significance level of all test was p<.05. The findings were as follows: First, leisure competence did have a positive influence on leisure flow. Second, leisure competence didn't have influence on leisure satisfaction. Final, leisure flow did have positive influence on leisure satisfactions.

Molecular Characterization of 170 New gDNA-SSR Markers for Genetic Diversity in Button Mushroom (Agaricus bisporus)

  • An, Hyejin;Jo, Ick-Hyun;Oh, Youn-Lee;Jang, Kab-Yeul;Kong, Won-Sik;Sung, Jwa-Kyung;So, Yoon-Sup;Chung, Jong-Wook
    • Mycobiology
    • /
    • v.47 no.4
    • /
    • pp.527-532
    • /
    • 2019
  • We designed 170 new simple sequence repeat (SSR) markers based on the whole-genome sequence data of button mushroom (Agaricus bisporus), and selected 121 polymorphic markers. A total of 121 polymorphic markers, the average major allele frequency (MAF) and the average number of alleles (NA) were 0.50 and 5.47, respectively. The average number of genotypes (NG), observed heterozygosity (HO), expected heterozygosity (HE), and polymorphic information content (PIC) were 6.177, 0.227, 0.619, and 0.569, respectively. Pearson's correlation coefficient showed that MAF was negatively correlated with NG (-0.683), NA (-0.600), HO (-0.584), and PIC (-0.941). NG, NA, HO, and PIC were positively correlated with other polymorphic parameters except for MAF. UPGMA clustering showed that 26 A. bisporus accessions were classified into 3 groups, and each accession was differentiated. The 121 SSR markers should facilitate the use of molecular markers in button mushroom breeding and genetic studies.

Rank-Size Distribution with Web Document Frequency of City Name : Case study with U.S incorporated places of 100,000 or more population (인터넷 문서빈도를 통해 본 도시순위규모에 관한 연구 -미국 10만 이상의 인구를 갖는 도시들을 사례로-)

  • Hong, Il-Young
    • Journal of the Korean association of regional geographers
    • /
    • v.13 no.3
    • /
    • pp.290-300
    • /
    • 2007
  • In this study, web document frequency of city place name is analyzed and it is used as the dataset for rank-size analysis. The search keywords are compared in the context of spatial meaning and the different domain corpus is applied. The acquired search results are applied for the further analysis. Firstly, the rank-size analysis is applied to compare the result between population and document frequency. Secondly, in case of correlation analysis, the significant changes are revealed when the spatial criteria for search keywords are increased. In case of corpus, COM, NET, and ORG shows the higher coefficient values. Lastly, the cluster analysis is applied to classify the list of cities that shows the similarity and difference. These analyses have a significant role in representing the rank-size distribution of city names that are reflected on the web documents in the information society.

  • PDF

Classification of National Highway by Factor Analysis (요인분석을 활용한 일반국도 유형분류)

  • Lim, Sung-Han;Ha, Jung-A;Oh, Ju-Sam
    • International Journal of Highway Engineering
    • /
    • v.7 no.3 s.25
    • /
    • pp.43-52
    • /
    • 2005
  • Highway classification is an essential part of defining design criteria of roads. This study is to classify highways by factor analysis. To accomplish the objectives, factor analysis is performed for classifying highways using the traffic data observed at the permanent traffic count points in 2004. A total off variables are applied : AADT, K factor, D factor, heavy vehicle proportion, day time traffic volume proportion, peak hour volume proportion, sunday factor, vacation factor and COV(Coefficient of Variation). The results of factor analysis show that variables are divided into two factors, which are the factor related to the fluctuational characteristics of traffic volume and the factor related to heavy vehicle and directional volume characteristics. According to the results of cluster analysis, 353 permanent traffic count points are categorized into such three groups as type I for urban highway, type II for rural highway, type III for recreational highway, respectively.

  • PDF

Factors Influencing Internet Addiction among Adolescents in an Area (일부 지역 청소년의 인터넷 중독에 영향을 미치는 요인)

  • Shin, Seung-Bae;Lee, Ju-Yul;Kim, Seok-Hwan
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.12 no.1
    • /
    • pp.45-58
    • /
    • 2011
  • Objectives: The purpose of this study is to analyze the fators affecting Internet addiction among adolescents in an area. Methods: By using cluster sampling, 2,479 participants representing 22 elementary school, 11 middle school, 7 high school students in a county of the Chungcheongnam-do. Data was analyzed by SPSS 12.0. using t-test, F-test, chi-square test, Pearson correlation coefficient and multiple regression. Results: Internet addiction positively correlated with high school students(dummy variable), Internet-connected computers in PC Game Room(dummy variable), Internet using time(weekday) and Internet using time(weekend). Internet addiction negatively correlated with Internet-connected computers in school(dummy variable), Internet-connected computers in friend's house(dummy variable). For the male students, Significant factors affecting Internet addiction were eating habits, Internet-connected computers in friend's house, Internet using time(weekend). For the female students, Internet using time(weekday) and Internet using time(weekend) were significant. For the elementary school students, Significant factors affecting Internet addiction were Internet using tine(weekday) and Internet using time(weekend). In the case of the middle school students, Internet using tine(weekday), Internet using time(weekend) and eating habits were significant. but, the high school students, Internet using time(weekend) was significant. Conclusions: Students who spend more time in the internet have higher tendency to become addicted to the internet. Therefore, it would be necessary to develop program to prevent internet addiction.

  • PDF

Classification of Capsicum annuum Germplasm Using Random Amplified Polymorphic DNA (RAPD를 이용한 고추(Capsicum annuum) 유전자원의 분류)

  • Nam, Seung-Hyun;Choi, Geun-Won;Yoo, Il-Woong
    • Horticultural Science & Technology
    • /
    • v.16 no.4
    • /
    • pp.503-507
    • /
    • 1998
  • This study was initiated to evaluate genetic relationship among various domestic and exotic pepper accessions using random amplified polymorphic DNA(RAPD) markers. The results suggested that the optimum conditions for PCR with random primers in Capsicum spp. could be obtained with 3mM of $MgCl_2$, 1.5U of Taq. DNA polymerase, 10ng of template DNA, $200{\mu}M$ of dNTPs, 200nM of random primer, and $42^{\circ}C$ of annealing temperature. Sixteen random primers showing high band intensity and reproducibility were selected from 80 random primers. Primers having 70% GC content were more effective in DNA amplification than primers having 60% GC content. The total 93 DNA bands including 71 polymorphic bands and 22 monomorphic bands were obtained with selected 16 random primers for 31 pepper cultivars and lines. About 4.4 polymorphic bands per primer were produced. Similarity coefficients were calculated by using 71 polymorphic bands and dendrogram based on the similarity coefficient showed clear classification of 31 peppers into three Capsicum species of Capsicum annuum, Capsicum chinense and Capsicum chacoense.

  • PDF