• Title/Summary/Keyword: rank analysis

KR-WordRank : An Unsupervised Korean Word Extraction Method Based on WordRank (KR-WordRank : WordRank를 개선한 비지도학습 기반 한국어 단어 추출 방법)

  • Kim, Hyun-Joong;Cho, Sungzoon;Kang, Pilsung
    • Journal of Korean Institute of Industrial Engineers / v.40 no.1 / pp.18-33 / 2014
  • A word is the smallest unit for text analysis, and the premise behind most text-mining algorithms is that the words in given documents can be perfectly recognized. However, newly coined words, spelling and spacing errors, and domain adaptation problems make it difficult to recognize words correctly. To make matters worse, obtaining a sufficient amount of training data that can be used in any situation is not only unrealistic but also inefficient. Therefore, an automatic word extraction method that does not require a training process is greatly needed. WordRank, the most widely used unsupervised word extraction algorithm for Chinese and Japanese, performs poorly in Korean because of the different language structure. In this paper, we first discuss why WordRank performs poorly in Korean, and then propose a customized WordRank algorithm for Korean, named KR-WordRank, which considers the linguistic characteristics of Korean and improves robustness to noise in text documents. Experimental results show that KR-WordRank performs significantly better than the original WordRank in Korean. In addition, the proposed algorithm not only extracts proper words but also identifies candidate keywords for effective document summarization.

An Evaluation of Constituent Factors for Port Logistics (항만물류 구성요소의 평가에 관한 연구)

  • Yeo, Gi-Tae;Jung, Hyun-Jae;Kim, Jae-Young
    • Journal of Korea Port Economic Association / v.27 no.3 / pp.273-288 / 2011
  • Recently, Korean container ports have dropped sharply in the rankings by container handling volume because of emerging Chinese ports, and efficient strategies to increase container port competitiveness have been called for. It is therefore urgent to identify the constituent factors of port logistics, weigh them, and focus on improving the factors identified. The aim of this paper is to evaluate the weights and priorities of the 'inner consisted factors' and 'outer requested factors' of port logistics using the AHP (Analytic Hierarchy Process) method. For the inner consisted factors, the results were as follows: the storage and handling system (0.288) ranked first; the information system of port logistics (0.210) second; the inland intermodal system (0.189) third; the ship entry and departure system (0.184) fourth; and the ship berthing system (0.129) fifth. For the outer requested factors, the results were as follows: logistics cost (0.360) ranked first; port service (0.128) second; connectivity (0.118) third; hinterland condition (0.116) fourth; convenience (0.106) fifth; regional center (0.095) sixth; and availability (0.077) seventh. To analyze changes in the priorities of the constituent factors, the results for 2007 and 2011 were compared. Among the inner consisted factors, 'information system of port logistics' ranked first in 2007, while 'storage and handling system' became the most important factor in 2011. Among the outer requested factors, however, logistics cost was the most important factor in both 2007 and 2011.
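The AHP weighting step mentioned in this abstract can be illustrated with a minimal sketch. It assumes a reciprocal pairwise comparison matrix and uses the row geometric mean approximation of the priority vector; the abstract does not say which prioritization variant the authors used. The function name `ahp_weights` and the example matrix are hypothetical, built from made-up weights rather than the paper's survey data.

```python
import math


def ahp_weights(matrix):
    """Approximate AHP priority weights by the row geometric mean method.

    matrix[i][j] holds how strongly factor i is preferred over factor j,
    with matrix[j][i] == 1 / matrix[i][j] and ones on the diagonal.
    """
    n = len(matrix)
    # Geometric mean of each row.
    row_means = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(row_means)
    # Normalize so the weights sum to 1.
    return [g / total for g in row_means]


# Hypothetical, perfectly consistent comparisons derived from made-up weights.
true_weights = [0.5, 0.3, 0.2]
example = [[wi / wj for wj in true_weights] for wi in true_weights]
weights = ahp_weights(example)
# Sorting factors by weight gives the priority order (rank 1 first).
priority_order = sorted(range(len(weights)), key=lambda i: -weights[i])
```

For a perfectly consistent matrix like the one above, the method recovers the underlying weights; real survey matrices are usually inconsistent, and a consistency check would be needed in practice.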

Patent citation network analysis (특허 인용 네트워크 분석)

  • Lee, Minjung;Kim, Yongdai;Jang, Woncheol
    • The Korean Journal of Applied Statistics / v.29 no.4 / pp.613-625 / 2016
  • The development of technology has changed the world drastically. Patent data analysis helps to understand modern technology trends and predict future technologies. In this paper, we analyze the patent citation network using USPTO data from 1985 to 2012 to identify technology trends. We use network centrality measures, including PageRank, to find core technologies, and statistical network models to identify groups of technologies with similar properties.

An effective evaluation method for the subjective sensibility of linen-like silk (의마 가공된 견직물의 효율적인 주관적 감성평가 방법)

  • You, Ji-Ho;Lee, Jung-Soon
    • Korean Journal of Human Ecology / v.15 no.3 / pp.439-447 / 2006
  • The purpose of this study is to examine the accuracy and reliability of subjective evaluation instruments for evaluating the sensibility of similar fabrics. Kendall's coefficient of concordance W (agreement among subjects) and the Spearman rank correlation coefficient (reproducibility after 1 week) were used to determine which instrument is more efficient. Eight kinds of linen-like silk fabrics finished with polyurethane resin were used. The subjective evaluation instruments were the rating scale method, the contrasting method against a control, the rank ordering method, paired comparison, and Quad analysis. 'Stiffness and Pliability' and 'Preference of summer fabric' were evaluated. For subjective stiffness and pliability, which relate to objective properties of the fabric, the rating scale method scored highest on Kendall's coefficient of concordance W and Quad analysis scored highest on the Spearman rank correlation coefficient. For subjective preference of summer fabric, which relates to individual sensibility, the contrasting method against a control scored highest on Kendall's coefficient of concordance W and Quad analysis scored highest on the Spearman rank correlation coefficient. Considering accuracy, reliability, and efficiency, Quad analysis was an efficient method for the subjective evaluation of linen-like silk fabrics.
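The two statistics named in this abstract can be sketched as follows. This is a minimal illustration using the standard textbook formulas for rankings without ties; the function names and the sample rankings are hypothetical and are not taken from the study.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance W for m judges ranking n items.

    rankings: list of m lists; each inner list gives the ranks 1..n that one
    judge assigned to the same n items (no ties assumed).
    """
    m = len(rankings)
    n = len(rankings[0])
    # Sum of ranks each item received across all judges.
    totals = [sum(r[i] for r in rankings) for i in range(n)]
    mean_total = m * (n + 1) / 2
    # Sum of squared deviations of the rank totals from their mean.
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))


def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation between two rankings of the same n items
    (no ties assumed)."""
    n = len(rank_a)
    d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))


# Hypothetical data: three judges ranking four fabrics.
judges = [[1, 2, 3, 4], [1, 3, 2, 4], [2, 1, 3, 4]]
agreement = kendalls_w(judges)
# Hypothetical test/retest rankings by one judge, one week apart.
reproducibility = spearman_rho([1, 2, 3, 4], [1, 3, 2, 4])
```

W ranges from 0 (no agreement among judges) to 1 (identical rankings), and Spearman's coefficient ranges from -1 to 1; with tied ranks both formulas need a correction term that this sketch omits.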

Analysing the Combined Kerberos Timed Authentication Protocol and Frequent Key Renewal Using CSP and Rank Functions

  • Kirsal-Ever, Yoney;Eneh, Agozie;Gemikonakli, Orhan;Mostarda, Leonardo
    • KSII Transactions on Internet and Information Systems (TIIS) / v.8 no.12 / pp.4604-4623 / 2014
  • Authentication mechanisms coupled with strong encryption techniques are used for network security; however, given sufficient time, well-equipped intruders can succeed in compromising system security. Authentication protocols often fail when they are analysed critically, and formal approaches have emerged to analyse protocol failures. In this study, Communicating Sequential Processes (CSP), an abstract language designed especially for describing communication patterns, is employed. Rank functions are also used for verification and analysis; they help to establish that certain critical information is not available to the intruder. To establish this, a value or rank is assigned to each piece of critical information, and it is shown that all the critical information that can be generated within the network has a particular characterizing property. This paper presents an application of the rank functions approach to an authentication protocol that combines delaying the decryption process with timed authentication while keys are dynamically renewed under pseudo-secure situations. The analysis and verification of authentication properties and results are presented and discussed.

Comparative Evaluation of Steam Gasification Reactivity of Indonesian Low Rank Coals (인도네시아 저등급 석탄의 스팀 가스화 반응성 비교 평가)

  • KIM, SOOHYUN;VICTOR, PAUL;YOO, JIHO;LEE, SIHYUN;RHIM, YOUNGJOON;LIM, JEONGHWAN;KIM, SANGDO;CHUN, DONGHYUK;CHOI, HOKYUNG
    • Journal of Hydrogen and New Energy / v.27 no.6 / pp.693-701 / 2016
  • Steam gasification of low rank coals is possible at relatively low temperature and low pressure, and thus shows higher efficiency compared to high rank coals. In this study, the gasification reactivity of four different Indonesian low rank coals (Samhwa, Eco, Roto, Kideco-L) was evaluated at $T=700-800^{\circ}C$. The low rank coals, which contained $53.8{\pm}3.4$ wt% volatile matter in proximate analysis and $71.6{\pm}1.2$ wt% carbon in ultimate analysis, showed comparable gasification reactivity. In addition, a $K_2CO_3$ catalyst rapidly accelerated the reaction rate at $700^{\circ}C$, and conversion of all the coals exceeded 90% within 1 hour. The XRD analysis showed no significant difference in carbonization between the coals, and the FT-IR spectra showed similar functional groups except for differences due to moisture and minerals. TGA results in pyrolysis ($N_2$) and $CO_2$ gasification atmospheres showed very similar behavior up to $800^{\circ}C$ regardless of the coal species, which is consistent with the steam gasification results. This confirms that the reactivity can be evaluated indirectly by the above instrumental analyses.

Journal PageRank Calculation in the Korean Science Citation Database (국내 인용 데이터베이스에서 저널 페이지랭크 측정 방안)

  • Lee, Jae-Yun
    • Journal of the Korean BIBLIA Society for library and Information Science / v.22 no.4 / pp.361-379 / 2011
  • This paper aims to propose the most appropriate method for calculating the journal PageRank in a domestic citation database. Korean journals show relatively high journal self-citation ratios and have many outgoing citations to external journals which are not included in the domestic citation database. Because the PageRank algorithm requires recursive calculation to converge, those two characteristics of domestic citation databases must be accounted for in order to measure the citation impact of Korean journals. Therefore, two PageRank calculation methods and four formulas for self-citation adjustment have been examined and tested for KSCD journals. The results of the correlation analysis and regression analysis show that the SCImago Journal Rank formula with the cr2 type self-citation adjustment method seems to be a more appropriate way to measure the relative impact of domestic journals in the Korean Science Citation Database.
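The two database characteristics this abstract highlights (journal self-citations and citations that leave the database) can be made concrete with a generic PageRank sketch. This is not the paper's method: it is plain power-iteration PageRank in which self-citations are simply dropped and journals with no remaining in-database citations have their rank spread uniformly. It does not implement the SCImago Journal Rank formula or the cr2 adjustment. The function name, parameters, and sample counts are all hypothetical.

```python
def journal_pagerank(citations, damping=0.85, drop_self=True,
                     tol=1e-10, max_iter=1000):
    """Generic PageRank over a journal citation graph.

    citations: {citing_journal: {cited_journal: count}}, restricted to
    journals inside the database (external citations already excluded).
    """
    journals = sorted(set(citations) | {j for c in citations.values() for j in c})
    n = len(journals)
    # Build weighted out-links, optionally discarding journal self-citations.
    out = {}
    for j in journals:
        out[j] = {k: v for k, v in citations.get(j, {}).items()
                  if v > 0 and not (drop_self and k == j)}
    rank = {j: 1.0 / n for j in journals}
    for _ in range(max_iter):
        # Journals with no in-database out-links spread their rank uniformly.
        dangling = sum(rank[j] for j in journals if not out[j])
        new = {j: (1 - damping) / n + damping * dangling / n for j in journals}
        for j, links in out.items():
            total = sum(links.values())
            for target, count in links.items():
                new[target] += damping * rank[j] * count / total
        converged = sum(abs(new[j] - rank[j]) for j in journals) < tol
        rank = new
        if converged:
            break
    return rank


# Hypothetical citation counts among three journals.
sample = {"A": {"A": 5, "B": 1}, "B": {"A": 1}, "C": {"A": 2}}
scores = journal_pagerank(sample)
```

Setting `drop_self=False` keeps self-citations in the graph, which makes it easy to see how strongly a high self-citation ratio can inflate a journal's score.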

AN ITERATIVE METHOD FOR ORTHOGONAL PROJECTIONS OF GENERALIZED INVERSES

  • Srivastava, Shwetabh;Gupta, D.K.
    • Journal of applied mathematics & informatics / v.32 no.1_2 / pp.61-74 / 2014
  • This paper describes an iterative method for computing the orthogonal projections $AA^+$ and $A^+A$ of an arbitrary matrix A, where $A^+$ represents the Moore-Penrose inverse. Convergence analysis and first and second order error estimates of the method are investigated. Three numerical examples are worked out to show the efficacy of our work. The first example is on a full rank matrix, whereas the other two are on full rank and rank deficient randomly generated matrices. The results obtained by the method are compared with those obtained by another iterative method. The performance measures in terms of mean CPU time (MCT) and the error bounds for computing orthogonal projections are listed in tables. If $Z_k$, k = 0,1,2,... represents the k-th iterate obtained by our method, then the sequence of traces {trace($Z_k$)} is a monotonically increasing sequence converging to the rank of A. Also, the sequence of traces {trace($I-Z_k$)} is a monotonically decreasing sequence converging to the nullity of $A^*$.
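The limits of the two trace sequences follow from a standard property of orthogonal projectors. The short derivation below is a sketch under two assumptions that the abstract does not state explicitly: that $A \in \mathbb{C}^{m \times n}$ and that the iterates $Z_k$ converge to $AA^+$.

```latex
% AA^+ is the orthogonal projector onto range(A): it is Hermitian and
% idempotent, so its eigenvalues are 1 (with multiplicity rank(A)) and 0.
\[
  \operatorname{trace}(AA^{+}) = \sum_i \lambda_i(AA^{+}) = \operatorname{rank}(A).
\]
% The complementary projector I_m - AA^+ projects onto range(A)^\perp = N(A^*):
\[
  \operatorname{trace}(I_m - AA^{+}) = m - \operatorname{rank}(A)
  = \dim \mathcal{N}(A^{*}).
\]
% Hence, if Z_k \to AA^+, continuity of the trace gives
\[
  \operatorname{trace}(Z_k) \to \operatorname{rank}(A), \qquad
  \operatorname{trace}(I_m - Z_k) \to \dim \mathcal{N}(A^{*}).
\]
```

The monotonicity of the two sequences is a property of the particular iteration and is not implied by this argument.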

Practical Validity of Weighting Methods : A Comparative Analysis Using Bootstrapping (부트스트랩핑을 이용한 가중치 결정방법의 실질적 타당성 비교)

  • Jeong, Ji-Ahn;Cho, Sung-Ku
    • Journal of Korean Institute of Industrial Engineers / v.26 no.1 / pp.27-35 / 2000
  • For a weighting method to be practically valid, it should produce weights that coincide with the relative importance of attributes as perceived by the decision maker. In this paper, 'bootstrapping' is used to compare the practical validity of five frequently used weighting methods: the rank order centroid method, the rank reciprocal method, the rank sum method, the entropic method, and the geometric mean method. Bootstrapping refers to the procedure in which the analysts let the decision maker make careful judgements on a series of similar cases and then infer statistically what weights the decision maker was implicitly using to arrive at the particular ranking. The weights produced by bootstrapping can therefore be regarded as a good reflection of the relative importance perceived by the decision maker. Bootstrapping and the five weighting methods were applied to a job selection problem. The results showed that both the rank order centroid method and the rank reciprocal method had a higher level of practical validity than the other three methods, though no large difference was found either in the resulting weights or in the corresponding solutions.
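Three of the five methods compared in this abstract derive weights from the rank order of the attributes alone. The sketch below uses the standard textbook formulas for those three methods; the function names are hypothetical, and the entropic and geometric mean methods are left out because they need more than a ranking as input.

```python
def rank_order_centroid(n):
    """ROC weights: w_i = (1/n) * sum_{k=i}^{n} 1/k, for ranks i = 1..n."""
    return [sum(1.0 / k for k in range(i, n + 1)) / n for i in range(1, n + 1)]


def rank_reciprocal(n):
    """RR weights: w_i = (1/i) / sum_{j=1}^{n} 1/j."""
    total = sum(1.0 / j for j in range(1, n + 1))
    return [(1.0 / i) / total for i in range(1, n + 1)]


def rank_sum(n):
    """RS weights: w_i = 2 * (n + 1 - i) / (n * (n + 1))."""
    return [2.0 * (n + 1 - i) / (n * (n + 1)) for i in range(1, n + 1)]


# Weights for four attributes, ordered from most to least important.
for method in (rank_order_centroid, rank_reciprocal, rank_sum):
    print(method.__name__, [round(w, 3) for w in method(4)])
```

All three schemes give weights that sum to 1 and decrease with rank; the rank order centroid method places the most weight on the top-ranked attribute, and the rank sum method the least.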

Monitoring of Gene Regulations Using Average Rank in DNA Microarray: Implementation of R

  • Park, Chang-Soon
    • Journal of the Korean Data and Information Science Society / v.18 no.4 / pp.1005-1021 / 2007
  • Traditional procedures for DNA microarray data analysis preprocess and normalize the gene expression data and then analyze the normalized data using statistical tests. The traditional methods have several drawbacks: genuine biological signal may be unintentionally eliminated together with artifacts; the limited number of arrays per gene makes it difficult to rely on the normality assumption or to use nonparametric methods in statistical tests; and genes are tested independently without considering interrelationships among genes. A novel method using the average rank in each array is proposed to eliminate these drawbacks. The average rank method monitors differentially regulated genes among genetically different groups, and the genes it selects are somewhat different from those selected by the traditional P-value method. Adding the genes selected by the average rank method to those selected by the traditional method will provide a better understanding of the genetic differences between groups.
