• Title/Summary/Keyword: Cited Documents Analysis

Search Result 21, Processing Time 0.019 seconds

Development of a Korea SCI System for Efficient Citation Analysis (효율적인 인용분석을 위한 한국 SCI 시스템의 개발)

  • 이계준;조현양;최재황;윤희준
    • Journal of KIISE:Databases
    • /
    • v.31 no.2
    • /
    • pp.174-182
    • /
    • 2004
  • In order to produce information the author usually reference other authors' work. A citation index leads users to papers by citations. Citations lead the user to desired information. In this paper, KSCI(Korea Science Citation Index) which defines the relationships between citing documents and cited documents has been constructed. KSCI System is to solve problems for recursive retrieval in ISI's SCI(Science Citation Index) Path Encoding Indexing technique was used to solve the problems. From the analysis of data, this system has efficiency about 8.98% in the aspect of data storage. In the aspect of retrieval, there was efficiency between citing documents and cited documents, especially there was over 40% of efficiency in the retrieval of cited documents. It is concluded that suggested KSCI system will provide efficient storage and retrieval system.

A Study on Information Retrieval Effectiveness by Cited References (인용문헌에 의한 정보검색 효과에 관한 고찰)

  • Lee Lanju
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.27
    • /
    • pp.265-289
    • /
    • 1994
  • Databases publicly available for online searching permit both citation and subject searching, however, subject searching has dominated the online search environment. Despite the power of citation searching, it may be underutilized This study explored the relationship between the number of cited references used in a citation search and information retrieval effectiveness, a relatively unstudied phenomenon. Three articles in the library and information science literature were chosen to represent sample questions. Cited reference searches were conducted for each article and each of its references. All searches were conducted in Social Scisearch and Scisearch on DIALOG. Relevance judgments on the retrieved citations were obtained from the authors of the original articles. This research focused on analyzing, in terms of information retrieval effectiveness, the overlap among postings sets retrieved by various combinations of cited references. The findings from the three case studies clearly showed that the more cited references used for the citation search, the better the performance, in terms of retrieving more relevant documents, up to a point of diminishing retums. In addition, generally the overall level of overlap among relevant documents sets was found to be low. Therefore, if only some of the cited references among many candidates are used for a citation search, a significant proportion of relevant documents may be missed. The analysis of the characteristics of cited references provided the ways to predict which cited refereces would be useful to improve information retrieval. The findings of this comprehensive exploratory study are of interest for both theoretical and practical reasons. They contribute to the development of a theoretical model for the effective use of the citation search. This model might also be implemented in operational online systems. In addition, the findings potentially will help online searchers improve their search strategies using the citation search so that they can better achieve their information retrieval goals: the retrieval of items relevant to a given question and the suppression of nonrelevant items.

  • PDF

A Comparative Study on the Citing Behavior of Scholars in Four Major Engineering Fields (주요 4개 공학분야 연구자의 문헌인용 행태 연구)

  • Cho, Hyun-Yang;Cho, Hyun-Sun
    • Journal of Information Management
    • /
    • v.36 no.2
    • /
    • pp.1-24
    • /
    • 2005
  • This study is aimed at investigating if there were the differences on citing behavior of researchers among different fields of engineering, in terms of five items, such as types of resources cited, the average number of documents cited, the demand of current documents, languages used in cited documents, and the life decrease phenomena of information. 29,160 cited references in 2,333 articles from 4 major selected journals, published in the year of 1999, 2001, and 2003 were analyzed. The result of this study shows that there were the differences on citing behavior of researchers among different fields of engineering on all 5 items. And also, some suggestions were given the priority of library collection and shelf arrangement for the library.

Citation Flow of the ASIST Proceeding Using Pathfinder Network Analysis (패스파인더 네트워크 분석에 의한 ASIST Proceedings 인용흐름 연구)

  • Kim, Hee-Jung
    • Journal of the Korean Society for information Management
    • /
    • v.25 no.2
    • /
    • pp.157-166
    • /
    • 2008
  • In this study, pathfinder network analysis has been carried out to identify subject domains of documents which cited articles in the ASIST Proceedings. This represents how articles in the ASIST Proceedings are flowed and used in what subjects areas. For this analysis, 240 documents were selected through a search of the Scopus database. The complete linkage clustering method was used to draw out 16 clusters from 240 documents. Through MDS and pathfinder network analysis, knowledge networks of clusters have been produced. As a result. articles in the ASIST Proceedings relating to knowledge management, bibliometrics, information retrieval and digital libraries have been cited actively by other publications. The most frequent citation flow type of ASIST proceedings was citation from proceedings(ASIST) to reviews(ARIST), via journals, and the most popular subject areas related to documents were bibliometrics.

A Study on the Citation Document Analysis of Business Administration.Economics.Trade (경영.경제.무역학분야의 인용문헌 분석에 관한 연구)

  • Chung, Jin-Sik;Won, Ji-Wook
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.20 no.1
    • /
    • pp.5-22
    • /
    • 2009
  • This study is analyzed the cited documents after selecting Management Administration Economics Trades in order to comprehend the study sphere and tendency for the three years from 2005 to 2007 in which 540 articles and 22,147 cited documents. In the analyzing result, the scientific study exchange communication activities is making progress more actively by the sole study rather than the joint study and over 77% of references are published before around 10 year-old, specifically 8.5 year-old documents have been used the most. The half-life period is 10.9 years for domestic books and 11.1 years for international books. For the journals, it is 6.0 years for domestic and 8.2 years for international which tells international articles are slightly longer half-life period than domestics.

A Research on Citing Behaviors of Researchers in Mechanical Engineering (기계공학 연구자들의 인용행태 분석 : P대학 기계공학부 박사학위논문을 중심으로)

  • Chang, Duk-Hyun;Jang, Hwan-Seok
    • Journal of Information Management
    • /
    • v.38 no.3
    • /
    • pp.111-135
    • /
    • 2007
  • The purpose of this study is to identify the citing behaviors of researchers in the field of mechanical engineering. It tries to verify if there is a significant difference on citing behavior of researchers between the past and the present with dissertations produced in P University as samples. For the comparison, years 1996 and 2004 are selected for the citation analysis. It analyzed four aspects, such as types of resources cited, languages used in cited documents, years since their publication of cited documents, and the journals indexed in SCI. The results of the analysis are; First, journals are the most cited than any other types of information resources. The citation of WWW resources which are gradually increasing for research is not shown in 1996, but there were some cited in 2004. Second, doctoral candidates usually cite document in English for their study. Statistics show that the use of resources in Japanese is on the decrease. Third, doctoral candidates in the discipline prefer materials published within 4-7 years, 8-11 years rather than 0-3 years since their publication. Last, journals indexed in SCI among the citation in dissertations are about 33 percent for both 1996 and 2004.

A Study of Citing Patterns of Korean Scientists on Korean Journals (국내 과학기술 연구자의 한국 학술지 인용패턴 연구)

  • Choi, Seon-Heui;Kim, Byung-Kyu;Kang, Mu-Yeong;You, Beom-Jong;Lee, Jong-Wook;Park, Jae-Won
    • Journal of the Korean Society for information Management
    • /
    • v.28 no.2
    • /
    • pp.97-115
    • /
    • 2011
  • A large and reliable citation database is necessary to identify and analyze citation behavior of Korean researchers in science and technology. Korea Institute of Science and Technology Information (KISTI) built the Korea Science Citation Database (KSCD), and have provided Korea Science Citation Index (KSCI) and Korea Journal Citation Reports (KJCR) services. In this article, citing behavior of Korean scientists on Korean journals was examined by using the KSCD that covers 459 Korean core journals. This research dealt with (1) statistical numeric information of journals in KSCD, (2) analysis of document types cited, (3) ratio of domestic to international documents cited and ratio of citing different disciplines, (4) analysis on immediacy index, peak time, and half-life of cited documents, and (5) analysis on impact of journals based on KJCR citation indicators. From this research, we could find the immediacy citation rate (average 2.36%), peak-time (average 1.7 years) and half-life (average 5.2 years) of cited journals in Korea. We also found that the average journal self-citation rate is more than 50% in every field. In sum, citing behavior of Korean scientists on Korean journals was comprehensively identified from this research.

Citing Behaviors of Researchers in Korea Civil Engineering (우리나라 토목공학분야 연구자의 인용행태에 관한 연구)

  • Nam, Young-Joon;Seo, Hyun-Jung;Kim, Gyu-Hwan
    • Journal of the Korean Society for information Management
    • /
    • v.28 no.4
    • /
    • pp.201-220
    • /
    • 2011
  • This study analyzes types of primary sources cited by South Korean civil engineers. The results are as follows: 1) primary sources by preference are academic journal (55.7%), book (15.6%), seminar contents (10.2%). 2) documents published within last 10 years (26.1%) are cited most often. 3) domestic journal is the primary academic journal cited, and the finding is similar in preference of top-ranked primary reference (domestic and foreign combined). 4) In terms of time, domestic sources are preferred for up-to-date publications, and foreign sources for relatively non-recent publications. 5) The indices of influence and extemporaneity for both domestic and foreign sources do not show high numbers simultaneously.

Deep Learning Research Trends Analysis with Ego Centered Topic Citation Analysis (자아 중심 주제 인용분석을 활용한 딥러닝 연구동향 분석)

  • Lee, Jae Yun
    • Journal of the Korean Society for information Management
    • /
    • v.34 no.4
    • /
    • pp.7-32
    • /
    • 2017
  • Recently, deep learning has been rapidly spreading as an innovative machine learning technique in various domains. This study explored the research trends of deep learning via modified ego centered topic citation analysis. To do that, a few seed documents were selected from among the retrieved documents with the keyword 'deep learning' from Web of Science, and the related documents were obtained through citation relations. Those papers citing seed documents were set as ego documents reflecting current research in the field of deep learning. Preliminary studies cited frequently in the ego documents were set as the citation identity documents that represents the specific themes in the field of deep learning. For ego documents which are the result of current research activities, some quantitative analysis methods including co-authorship network analysis were performed to identify major countries and research institutes. For the citation identity documents, co-citation analysis was conducted, and key literatures and key research themes were identified by investigating the citation image keywords, which are major keywords those citing the citation identity document clusters. Finally, we proposed and measured the citation growth index which reflects the growth trend of the citation influence on a specific topic, and showed the changes in the leading research themes in the field of deep learning.

Methods for Integration of Documents using Hierarchical Structure based on the Formal Concept Analysis (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

  • Kim, Tae-Hwan;Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.3
    • /
    • pp.63-77
    • /
    • 2011
  • The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources as personal home pasges, online digital libraries and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte.syze precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not nessarily appear at the top of the query output order. Also, current search tools can not retrieve the documents related with retrieved document from gigantic amount of documents. The most important problem for lots of current searching systems is to increase the quality of search. It means to provide related documents or decrease the number of unrelated documents as low as possible in the results of search. For this problem, CiteSeer proposed the ACI (Autonomous Citation Indexing) of the articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. For details of this work, references contained in academic articles are used to give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the articleswith the cited works. Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language, and words in thetitle, keywords or document. A citation index allows navigation backward in time (the list of cited articles) and forwardin time (which subsequent articles cite the current article?) But CiteSeer can not indexes the links between articles that researchers doesn't make. Because it indexes the links between articles that only researchers make when they cite other articles. Also, CiteSeer is not easy to scalability. Because CiteSeer can not indexes the links between articles that researchers doesn't make. All these problems make us orient for designing more effective search system. This paper shows a method that extracts subject and predicate per each sentence in documents. A document will be changed into the tabular form that extracted predicate checked value of possible subject and object. We make a hierarchical graph of a document using the table and then integrate graphs of documents. The graph of entire documents calculates the area of document as compared with integrated documents. We mark relation among the documents as compared with the area of documents. Also it proposes a method for structural integration of documents that retrieves documents from the graph. It makes that the user can find information easier. We compared the performance of the proposed approaches with lucene search engine using the formulas for ranking. As a result, the F.measure is about 60% and it is better as about 15%.