• Title/Summary/Keyword: Co-clustering

Search Result 221, Processing Time 0.021 seconds

Seabed Sediment Classification Algorithm using Continuous Wavelet Transform

  • Lee, Kibae;Bae, Jinho;Lee, Chong Hyun;Kim, Juho;Lee, Jaeil;Cho, Jung Hong
    • Journal of Advanced Research in Ocean Engineering
    • /
    • v.2 no.4
    • /
    • pp.202-208
    • /
    • 2016
  • In this paper, we propose novel seabed sediment classification algorithm using feature obtained by continuous wavelet transform (CWT). Contrast to previous researches using direct reflection coefficient of seabed which is function of frequency and is highly influenced by sediment types, we develop an algorithm using both direct reflection signal and backscattering signal. In order to obtain feature vector, we employ CWT of the signal and obtain histograms extracted from local binary patterns of the scalogram. The proposed algorithm also adopts principal component analysis (PCA) to reduce dimension of the feature vector so that it requires low computational cost to classify seabed sediment. For training and classification, we adopts K-means clustering algorithm which can be done with low computational cost and does not require prior information of the sediment. To verify the proposed algorithm, we obtain field data measured at near Jeju island and show that the proposed classification algorithm has reliable discrimination performance by comparing the classification results with actual physical properties of the sediments.

A Characteristic Analysis and Countermeasure Study of the Hedging of Listed Companies in China Stock Markets

  • WU, Guo-Hua;JIANG, Xiao-Ling;DENG, Su-Ya
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.10
    • /
    • pp.147-158
    • /
    • 2021
  • Due to COVID-19, the risk of price volatility in commodity and equity markets increases. The research and application of hedging is the most effective way to reduce the market risk. Hedging is a risk management strategy employed to offset losses in investments by taking an opposite position in a related asset. We use K-means and hierarchical clustering methods to cluster companies and futures products respectively, and analyze the relationship between the number of hedging firms, regional distribution, nature of firms, capital distribution, company size, profitability, number of local Futures Commission Merchants (FCMs), regional location, and listing time. The study shows that listed companies with large scale and good profitability invest more money in hedging, while state-owned enterprises' participation in hedging is more likely to be affected by the company size and the number of local futures commission merchants, and private enterprises are more likely to be affected by the company profitability and the regional location. Listed companies are more willing to choose long-listed and mature futures products for hedging. We also provide policy advice based on our conclusion. So far, there is no study on the characteristics of hedging. This paper fills the gap. The results provide a basis and guidance for people's investment and risk management. Using clustering analysis in hedging study is another innovation of this paper.

A Bibliometric Approach for Department-Level Disciplinary Analysis and Science Mapping of Research Output Using Multiple Classification Schemes

  • Gautam, Pitambar
    • Journal of Contemporary Eastern Asia
    • /
    • v.18 no.1
    • /
    • pp.7-29
    • /
    • 2019
  • This study describes an approach for comparative bibliometric analysis of scientific publications related to (i) individual or several departments comprising a university, and (ii) broader integrated subject areas using multiple disciplinary schemes. It uses a custom dataset of scientific publications (ca. 15,000 articles and reviews, published during 2009-2013, and recorded in the Web of Science Core Collections) with author affiliations to the research departments, dedicated to science, technology, engineering, mathematics, and medicine (STEMM), of a comprehensive university. The dataset was subjected, at first, to the department level and discipline level analyses using the newly available KAKEN-L3 classification (based on MEXT/JSPS Grants-in-Aid system), hierarchical clustering, correspondence analysis to decipher the major departmental and disciplinary clusters, and visualization of the department-discipline relationships using two-dimensional stacked bar diagrams. The next step involved the creation of subsets covering integrated subject areas and a comparative analysis of departmental contributions to a specific area (medical, health and life science) using several disciplinary schemes: Essential Science Indicators (ESI) 22 research fields, SCOPUS 27 subject areas, OECD Frascati 38 subordinate research fields, and KAKEN-L3 66 subject categories. To illustrate the effective use of the science mapping techniques, the same subset for medical, health and life science area was subjected to network analyses for co-occurrences of keywords, bibliographic coupling of the publication sources, and co-citation of sources in the reference lists. The science mapping approach demonstrates the ways to extract information on the prolific research themes, the most frequently used journals for publishing research findings, and the knowledge base underlying the research activities covered by the publications concerned.

Analysis Framework using Process Mining for Block Movement Process in Shipyards (조선 산업에서 프로세스 마이닝을 이용한 블록 이동 프로세스 분석 프레임워크 개발)

  • Lee, Dongha;Bae, Hyerim
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.39 no.6
    • /
    • pp.577-586
    • /
    • 2013
  • In a shipyard, it is hard to predict block movement due to the uncertainty caused during the long period of shipbuilding operations. For this reason, block movement is rarely scheduled, while main operations such as assembly, outfitting and painting are scheduled properly. Nonetheless, the high operating costs of block movement compel task managers to attempt its management. To resolve this dilemma, this paper proposes a new block movement analysis framework consisting of the following operations: understanding the entire process, log clustering to obtain manageable processes, discovering the process model and detecting exceptional processes. The proposed framework applies fuzzy mining and trace clustering among the process mining technologies to find main process and define process models easily. We also propose additional methodologies including adjustment of the semantic expression level for process instances to obtain an interpretable process model, definition of each cluster's process model, detection of exceptional processes, and others. The effectiveness of the proposed framework was verified in a case study using real-world event logs generated from the Block Process Monitoring System (BPMS).

Author Graph Generation based on Author Disambiguation (저자 식별에 기반한 저자 그래프 생성)

  • Kang, In-Su
    • Journal of Information Management
    • /
    • v.42 no.1
    • /
    • pp.47-62
    • /
    • 2011
  • While an ideal author graph should have its nodes to represent authors, automatically-generated author graphs mostly use author names as their nodes due to the difficulty of resolving author names into individuals. However, employing author names as nodes of author graphs merges namesakes, otherwise separate nodes in the author graph, into the same node, which may distort the characteristics of the author graph. This study proposes an algorithm which resolves author ambiguities based on co-authorship and then yields an author graph consisting of not author name nodes but author nodes. Scientific collaboration relationship this algorithm depends on tends to produce the clustering results which minimize the over-clustering error at the expense of the under-clustering error. In experiments, the algorithm is applied to the real citation records where Korean namesakes occur, and the results are discussed.

Document Summarization Based on Sentence Clustering Using Graph Division (그래프 분할을 이용한 문장 클러스터링 기반 문서요약)

  • Lee Il-Joo;Kim Min-Koo
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.149-154
    • /
    • 2006
  • The main purpose of document summarization is to reduce the complexity of documents that are consisted of sub-themes. Also it is to create summarization which includes the sub-themes. This paper proposes a summarization system which could extract any salient sentences in accordance with sub-themes by using graph division. A document can be represented in graphs by using chosen representative terms through term relativity analysis based on co-occurrence information. This graph, then, is subdivided to represent sub-themes through connected information. The divided graphs are types of sentence clustering which shows a close relationship. When salient sentences are extracted from the divided graphs, summarization consisted of core elements of sentences from the sub-themes can be produced. As a result, the summarization quality will be improved.

Energy Efficient Topology Control based on Sociological Cluster in Wireless Sensor Networks

  • Kang, Sang-Wook;Lee, Sang-Bin;Ahn, Sae-Young;An, Sun-Shin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.6 no.1
    • /
    • pp.341-360
    • /
    • 2012
  • The network topology for a wide area sensor network has to support connectivity and a prolonged lifetime for the many applications used within it. The concepts of structure and group in sociology are similar to the concept of cluster in wireless sensor networks. The clustering method is one of the preferred ways to produce a topology for reduced electrical energy consumption. We herein propose a cluster topology method based on sociological structures and concepts. The proposed sociological clustering topology (SOCT) is a method that forms a network in two phases. The first phase, which from a sociological perspective is similar to forming a state within a nation, involves using nodes with large transmission capacity to set up the global area for the cluster. The second phase, which is similar to forming a city inside the state, involves using nodes with small transmission capacity to create regional clusters inside the global cluster to provide connectivity within the network. The experimental results show that the proposed method outperforms other methods in terms of energy efficiency and network lifetime.

Taxonomic implications of multivariate analyses of Egyptian Ononis L. (Fabaceae) based on morphological traits

  • FAYED, Abdel Aziz A.;EL-HADIDY, Azza M.H.;FARIED, Ahmed M.;OLWEY, Asmaa O.
    • Korean Journal of Plant Taxonomy
    • /
    • v.49 no.1
    • /
    • pp.13-27
    • /
    • 2019
  • Numerical taxonomy is employed to determine the phenetic proximity of the Egyptian taxa belonging to the genus Ononis L. A classical clustering analysis and a principal component analysis (PCA) were used to separate 57 macro- and micromorphological characters in order to circumscribe 11 taxa of Ononis. A clustering analysis using the unweighted pair-group method with the arithmetic means (UPGMA) method gives the highest co-phenetic correlation. Results from clustering and PCA revealed the segregation of five groups. Our results are in line, to some certain degree, with the traditional sub-sectional concept, as can be seen in the grouping of the representative members of the subsections Diffusae and Mittisimae together and the representative members of the subsections Viscosae and Natrix. The phenetic uniqueness of Ononis variegata and O. reclinata subsp. mollis was formally established. However, our findings contradict the classic sectional concept; this opinion was suggested earlier in previous phylogenetic circumscriptions of the genus. The most useful characters that provide taxonomic clarity were discussed.

Analysis of Reference Inquiries in the Field of Social Science in the Collaborative Reference Service Using the Co-Word Technique

  • Cho, Jane
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.1
    • /
    • pp.129-148
    • /
    • 2015
  • This study grasped the true nature of the inquiry domain by analysing the requests for collaborative reference service in the social science field using the co-word technique, and schematized the intellectual structure. First, this study extracted 748 uncontrolled keywords from inquiries for reference in the field of social science. Second, calculated similarity indices between the words on the basis of co-occurrence frequency, and performed not only clustering but also MDS mapping. Third, to grasp the difference in inquiries for reference by period, dividing the period into two parts, and performed comparative analysis. As a result, there formed 5 clusters and "Korea Education" showed an overwhelming size with 40.3% among those clusters. The result of the analysis through the period division showed there were many questions about "Education" during the first half, while a lot of inquiries with focus on "welfare and business information" during the second half.

Modeling Large S-System using Clustering and Genetic Algorithm

  • Jung, Sung-Won;Lee, Kwang-H.;Lee, Co-Heon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.197-201
    • /
    • 2005
  • When we want to find out the regulatory relationships between genes from gene expression data, dimensionality is one of the big problem. In general, the size of search space in modeling the regulatory relationships grows in O(n$^2$) while the number of genes is increasing. However, hopefully it can be reduced to O(kn) with selected k by applying divide and conquer heuristics which depend on some assumptions about genetic network. In this paper, we approach the modeling problem in divide-and-conquer manner. We applied clustering to make the problem into small sub-problems, then hierarchical model process is applied to those small sub-problems.

  • PDF