• Title/Summary/Keyword: Co-clustering

Search Result 221, Processing Time 0.024 seconds

An Investigation on Intellectual Structure of Social Sciences Research by Analysing the Publications of ICPSR Data Reuse (ICPSR 데이터 재이용 저작물 분석을 통한 사회과학 분야의 지적구조 분석)

  • Chung, EunKyung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.52 no.1
    • /
    • pp.341-357
    • /
    • 2018
  • Due to the paradigm of open science and advanced digital information technology, data sharing and re-use have been actively conducted and considered data-intensive in a wide variety of disciplines. This study aims to investigate the intellectual structure portrayed by the research products re-using the data sets from ICPSR. For the purpose of this study, a total of 570 research products published in 2017 from the ICPSR site were collected and analyzed in two folds. First, the authors and publications of those research products were analyzed in order to show the trends of research using ICPSR data. Authors tend to be affiliated with university or research institute in the United States. The subject areas of journals are recognized into Social Sciences, Health, and Psychology. In addition, a network with clustering analysis was conducted with using co-word occurrence from the titles of the research products. The results show that there are 12 clusters, mental health, tabocco effect, disorder in school, childhood, and adolescence, sexual risk, child injuries, physical activity, violent behavior, survey, family role, women, problem behavior, gender differences in research areas. The structure portrayed by ICPSR data re-uses demonstrates that substantial number of studies in Medicine have been conducted with a perspective of social sciences.

Analysis of Reading Domian of Men and Women Elderly Using Book Lending Data (도서 대출데이터를 활용한 남녀 노령자의 독서 주제 분석)

  • Cho, Jane
    • Journal of Korean Library and Information Science Society
    • /
    • v.50 no.1
    • /
    • pp.23-41
    • /
    • 2019
  • This study understand the subject domain of book which has been read by men and woman elderly by analizying the PFNET using library big data and confirm the difference between adult at age 30-40. This study extract co-occurrence matrix of book lending on the popular book list from library big data, for 4 group, men/woman elderly, men/woman adult. With these matrix, this study performs FP network analysis. And Pearson Correlation Analysis based on the Triangle Betweenness Centrality calculated on the loan book was performed to understand the correlation among the 4 clusters which has been created by PNNC algorithm. As a result, reading trend which has been focused on modern korean novel has been revealed in elderly regardless gender, among them, men elderly show extreme tendency concentrated on modern korean long series novel. In the correlation analysis, the male elderly showed a weak negative correlation with the adult male of r = -0.222, and the negative direction of all the other groups showed that the tendency of male elderly's loan book was opposite.

Bibliometric Analysis on Health Information-Related Research in Korea (국내 건강정보관련 연구에 대한 계량서지학적 분석)

  • Jin Won Kim;Hanseul Lee
    • Journal of the Korean Society for information Management
    • /
    • v.41 no.1
    • /
    • pp.411-438
    • /
    • 2024
  • This study aims to identify and comprehensively view health information-related research trends using a bibliometric analysis. To this end, 1,193 papers from 2002 to 2023 related to "health information" were collected through the Korea Citation Index (KCI) database and analyzed in diverse aspects: research trends by period, academic fields, intellectual structure, and keyword changes. Results indicated that the number of papers related to health information continued to increase and has been decreasing since 2021. The main academic fields of health information-related research included "biomedical engineering," "preventive medicine/occupational environmental medicine," "law," "nursing," "library and information science," and "interdisciplinary research." Moreover, a co-word analysis was performed to understand the intellectual structure of research related to health information. As a result of applying the parallel nearest neighbor clustering (PNNC) algorithm to identify the structure and cluster of the derived network, four clusters and 17 subgroups belonging to them could be identified, centering on two conglomerates: "medical engineering perspective on health information" and "social science perspective on health information." An inflection point analysis was attempted to track the timing of change in the academic field and keywords, and common changes were observed between 2010 and 2011. Finally, a strategy diagram was derived through the average publication year and word frequency, and high-frequency keywords were presented by dividing them into "promising," "growth," and "mature." Unlike previous studies that mainly focused on content analysis, this study is meaningful in that it viewed the research area related to health information from an integrated perspective using various bibliometric methods.

Genetic Traceability of Black Pig Meats Using Microsatellite Markers

  • Oh, Jae-Don;Song, Ki-Duk;Seo, Joo-Hee;Kim, Duk-Kyung;Kim, Sung-Hoon;Seo, Kang-Seok;Lim, Hyun-Tae;Lee, Jae-Bong;Park, Hwa-Chun;Ryu, Youn-Chul;Kang, Min-Soo;Cho, Seoae;Kim, Eui-Soo;Choe, Ho-Sung;Kong, Hong-Sik;Lee, Hak-Kyo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.27 no.7
    • /
    • pp.926-931
    • /
    • 2014
  • Pork from Jeju black pig (population J) and Berkshire (population B) has a unique market share in Korea because of their high meat quality. Due to the high demand of this pork, traceability of the pork to its origin is becoming an important part of the consumer demand. To examine the feasibility of such a system, we aim to provide basic genetic information of the two black pig populations and assess the possibility of genetically distinguishing between the two breeds. Muscle samples were collected from slaughter houses in Jeju Island and Namwon, Chonbuk province, Korea, for populations J and B, respectively. In total 800 Jeju black pigs and 351 Berkshires were genotyped at thirteen microsatellite (MS) markers. Analyses on the genetic diversity of the two populations were carried out in the programs MS toolkit and FSTAT. The population structure of the two breeds was determined by a Bayesian clustering method implemented in structure and by a phylogenetic analysis in Phylip. Population J exhibited higher mean number of alleles, expected heterozygosity and observed heterozygosity value, and polymorphism information content, compared to population B. The $F_{IS}$ values of population J and population B were 0.03 and -0.005, respectively, indicating that little or no inbreeding has occurred. In addition, genetic structure analysis revealed the possibility of gene flow from population B to population J. The expected probability of identify value of the 13 MS markers was $9.87{\times}10^{-14}$ in population J, $3.17{\times}10^{-9}$ in population B, and $1.03{\times}10^{-12}$ in the two populations. The results of this study are useful in distinguishing between the two black pig breeds and can be used as a foundation for further development of DNA markers.

Evaluation of Genetic Diversity among Persimmon Cultivars (Diospyros kaki Thunb.) Using Microsatellite Markers (초위성 마커를 이용한 감(Diospyros kaki Thunb.)의 유연관계 분석)

  • Hwang, Ji-Hyeon;Park, Yu-Ok;Kim, Sung-Churl;Lee, Yong-Jae;Kang, Jum-Soon;Choi, Young-Whan;Son, Beung-Gu;Park, Young-Hoon
    • Journal of Life Science
    • /
    • v.20 no.4
    • /
    • pp.632-638
    • /
    • 2010
  • The genetic diversity among 48 persimmon (Diospyros kaki Thunb.) accessions, indigenous in Korea and introduced from Japan and China, was evaluated by using simple sequence repeat (SSR) markers. From 20 SSR primer sets, a total of 114 polymorphic markers were detected among 12 pollination-constant non-astringent (PCNA), 13 pollination-variant non-astringent (PVNA), 15 pollination-variant astringent (PVA), and 8 pollination-constant astringent (PCA) cultivars. Analysis of pair-wise genetic similarity coefficient (Nei-Li) and unweighted pair-group method with arithmetic averaging (UPGMA) clustering revealed two main clusters and four subclusters for cluster I. The subclustering pattern was in accordance with the classification of persimmon cultivars based on the nature of astringency loss. Phenetic relationships among the subclusters showed a closer relatedness of the PCNA group with the PVNA group, and the PVA with the PCA group. Genetic similarity co-efficiency was 0.499 on average and the highest (0.954) similarity was observed between 'Cheongdo-Bansi' and 'Haman-Bansi'. The similarity was lowest (0.192) between 'Damopan'and 'Atago'. Identification of each cultivar with the execption of 'Cheongdo-Bansi' and 'Gyeongsan-Bansi' was possible based on the SSR fingerprints, suggesting that these SSR markers are a useful tool for protecting intellectual property on newly developed cultivars.

Automatic Clustering of Same-Name Authors Using Full-text of Articles (논문 원문을 이용한 동명 저자 자동 군집화)

  • Kang, In-Su;Jung, Han-Min;Lee, Seung-Woo;Kim, Pyung;Goo, Hee-Kwan;Lee, Mi-Kyung;Goo, Nam-Ang;Sung, Won-Kyung
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.652-656
    • /
    • 2006
  • Bibliographic information retrieval systems require bibliographic data such as authors, organizations, source of publication to be uniquely identified using keys. In particular, when authors are represented simply as their names, users bear the burden of manually discriminating different users of the same name. Previous approaches to resolving the problem of same-name authors rely on bibliographic data such as co-author information, titles of articles, etc. However, these methods cannot handle the case of single author articles, or the case when articles do not have common terms in their titles. To complement the previous methods, this study introduces a classification-based approach using similarity between full-text of articles. Experiments using recent domestic proceedings showed that the proposed method has the potential to supplement the previous meta-data based approaches.

  • PDF

Intraspecies Volatile Interactions Affect Growth Rates and Exometabolomes in Aspergillus oryzae KCCM 60345

  • Singh, Digar;Lee, Choong Hwan
    • Journal of Microbiology and Biotechnology
    • /
    • v.28 no.2
    • /
    • pp.199-209
    • /
    • 2018
  • Volatile organic compounds (VOCs) are increasingly been recognized as the chemical mediators of mold interactions, shaping their community dynamics, growth, and metabolism. Herein, we selectively examined the time-correlated (0 D-11 D, where D = incubation days) effects of intraspecies VOC-mediated interactions (VMI) on Aspergillus oryzae KCCM 60345 (S1), following co-cultivation with partner strain A. oryzae KACC 44967 (S2), in a specially designed twin plate assembly. The comparative evaluation of $S1_{VMI}$ (S1 subjected to VMI with S2) and its control ($S1_{Con}$) showed a notable disparity in their radial growth ($S1_{VMI}$ < $S1_{Con}$) at 5 D, protease activity ($S1_{VMI}$ > $S1_{Con}$) at 3-5 D, amylase activity ($S1_{VMI}$ < $S1_{Con}$) at 3-5 D, and antioxidant levels ($S1_{VMI}$ > $S1_{Con}$) at 3 D. Furthermore, we observed a distinct clustering pattern for gas chromatography-time of flight-mass spectrometry datasets from 5 D extracts of $S1_{VMI}$ and $S1_{Con}$ in principle component analysis (PC1: 30.85%; PC2: 10.31%) and partial least squares discriminant analysis (PLS-DA) (PLS1: 30.77; PLS2: 10.15%). Overall, 43 significantly discriminant metabolites were determined for engendering the metabolic variance based on the PLS-DA model (VIP > 0.7, p < 0.05). In general, a marked disparity in the relative abundance of amino acids ($S1_{VMI}$ > $S1_{Con}$) at 5 D, organic acids ($S1_{VMI}$ > $S1_{Con}$) at 5 D, and kojic acid ($S1_{VMI}$ < $S1_{Con}$) at 5-7 D were observed. Examining the headspace VOCs shared between S1 and S2 in the twin plate for 5 D incubated samples, we observed the relatively higher abundance of C-8 VOCs (1-octen-3-ol, (5Z)-octa-1,5-dien-3-ol, 3-octanone, 1-octen-3-ol acetate) having known semiochemical functions. The present study potentially illuminates the effects of VMI on commercially important A. oryzae's growth and biochemical phenotypes with subtle details of altered metabolomes.

Comparison between Planned and Actual Data of Block Assembly Process using Process Mining in Shipyards (조선 산업에서 프로세스 마이닝을 이용한 블록 조립 프로세스의 계획 및 실적 비교 분석)

  • Lee, Dongha;Park, Jae Hun;Bae, Hyerim
    • The Journal of Society for e-Business Studies
    • /
    • v.18 no.4
    • /
    • pp.145-167
    • /
    • 2013
  • This paper proposes a method to compare planned processes with actual processes of bock assembly operations in shipbuilding industry. Process models can be discovered using the process mining techniques both for planned and actual log data. The comparison between planned and actual process is focused in this paper. The analysis procedure consists of five steps : 1) data pre-processing, 2) definition of analysis level, 3) clustering of assembly bocks, 4) discovery of process model per cluster, and 5) comparison between planned and actual processes per cluster. In step 5, it is proposed to compare those processes by the several perspectives such as process model, task, process instance and fitness. For each perspective, we also defined comparison factors. Especially, in the fitness perspective, cross fitness is proposed and analyzed by the quantity of fitness between the discovered process model by own data and the other data(for example, the fitness of planned model to actual data, and the fitness of actual model to planned data). The effectiveness of the proposed methods was verified in a case study using planned data of block assembly planning system (BAPS) and actual data generated from block assembly monitoring system (BAMS) of a top ranked shipbuilding company in Korea.

A Cluster-Based Channel Assignment Algorithm for IEEE 802.11b/g Wireless Mesh Networks (IEEE 802.11b/g 무선 메쉬 네트워크를 위한 클러스터 기반 채널 할당 알고리즘)

  • Cha, Si-Ho;Ryu, Min-Woo;Cho, Kuk-Hyun;Jo, Min-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.4
    • /
    • pp.87-93
    • /
    • 2009
  • Wireless mesh networks (WMNs) are emerging technologies that provide ubiquitous environments and wireless broadband access. The aggregate capacity of WMNs can be improved by minimizing the effect of channel interference. The IEEE 802.11b/g standard which is mainly used for the network interface technology in WMNs provides 3 multiple channels. We must consider the channel scanning delay and the channel dependency problem to effectively assign channels in like these multi-channel WMNs. This paper proposes a cluster-based channel assignment (CB-CA) algorithm for multi-channel WMNs to solve such problems. The CB-CA does not perform the channel scanning and the channel switching through assigning co-channel to the inter-cluster head (CH) links. In the CB-CA, the communication between the CH and cluster member (CM) nodes uses a channel has no effect on channels being used by the inter-CH links. Therefore, the CB-CA can minimize the interference within multi-channel environments. Our simulation results show that CB-CA can improve the performance of WMNs.

Analysis of Geographic and Pairwise Distances among Chinese Cashmere Goat Populations

  • Liu, Jian-Bin;Wang, Fan;Lang, Xia;Zha, Xi;Sun, Xiao-Ping;Yue, Yao-Jing;Feng, Rui-Lin;Yang, Bo-Hui;Guo, Jian
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.26 no.3
    • /
    • pp.323-333
    • /
    • 2013
  • This study investigated the geographic and pairwise distances of nine Chinese local Cashmere goat populations through the analysis of 20 microsatellite DNA markers. Fluorescence PCR was used to identify the markers, which were selected based on their significance as identified by the Food and Agriculture Organization of the United Nations (FAO) and the International Society for Animal Genetics (ISAG). In total, 206 alleles were detected; the average allele number was 10.30; the polymorphism information content of loci ranged from 0.5213 to 0.7582; the number of effective alleles ranged from 4.0484 to 4.6178; the observed heterozygosity was from 0.5023 to 0.5602 for the practical sample; the expected heterozygosity ranged from 0.5783 to 0.6464; and Allelic richness ranged from 4.7551 to 8.0693. These results indicated that Chinese Cashmere goat populations exhibited rich genetic diversity. Further, the Wright's F-statistics of subpopulation within total (FST) was 0.1184; the genetic differentiation coefficient (GST) was 0.0940; and the average gene flow (Nm) was 2.0415. All pairwise FST values among the populations were highly significant (p<0.01 or p<0.001), suggesting that the populations studied should all be considered to be separate breeds. Finally, the clustering analysis divided the Chinese Cashmere goat populations into at least four clusters, with the Hexi and Yashan goat populations alone in one cluster. These results have provided useful, practical, and important information for the future of Chinese Cashmere goat breeding.