• Title/Summary/Keyword: Document research

Search Result 1,350, Processing Time 0.029 seconds

Differentiation of Signature Traits $vis-\grave{a}-vis$ Mobile- and Table-Based Digitizers

  • Elliott, Stephen J.
    • ETRI Journal
    • /
    • v.26 no.6
    • /
    • pp.641-646
    • /
    • 2004
  • As the use of signatures for identification purposes is pervasive in society and has a long history in business, dynamic signature verification (DSV) could be an answer to authenticating a document signed electronically and establishing the identity of that document in a dispute. DSV has the advantage in that traits of the signature can be collected on a digitizer. The research question of this paper is to understand how the individual variables vary across devices. In applied applications, this is important because if the signature variables change across the digitizers this will impact performance and the ability to use those variable. Understanding which traits are consistent across devices will aid dynamic signature algorithm designers to create more robust algorithms.

  • PDF

A study on the current archival system of government document in korea (한국의 현행 정부기록보존제도에 대한 고찰)

  • 김상호
    • Journal of Korean Library and Information Science Society
    • /
    • v.25
    • /
    • pp.225-256
    • /
    • 1996
  • The purpose of this study is to find the ways that will be helpful to improve archival system and to establish National Archives. The contents of this study were focused on comparing the characteristics of archival system in governmental administration,: Executive, Legislature, and Judiciary. They were also focused on analyzing the problems of those current system. This research was basically conducted depending on the detailed articles of the legislation and regulation pertinent to the archives. Two major results of the research are 1) There are much differences among the governmental administrations in structuring and organizing for archival administrations. Archival works of government document are divided primarily according to the period of conservation and it is necessary to establish the regional archives and central management system, and to employ archivists as an expert staff. 2) The principles and methods of archival process, such as transferences, classifications, preservations, access, and destructions are similar to each other. In order to improve and co-ordinate current systems, it is necessary to constitute several councils endowed with consultative and decision-making power.

  • PDF

A Study on the Exterior Form Composition of Street Buildings considering Landscape Characteristics in Gyeongju (경주시 경관특성을 고려한 가로변건축물 형태구성에 관한 연구)

  • Choi, Moo-Hyun;Hyun, Taek-Soo
    • Journal of the Korean Institute of Rural Architecture
    • /
    • v.10 no.1
    • /
    • pp.83-92
    • /
    • 2008
  • This study aims to examine the perceptual features of street buildings, which affect the urban scape, and establish design guidelines by types of buildings. According to this purpose, this study conducted document research, field survey through site visits and questionnaire survey for each subject. The field survey was carried out to study the exterior form characteristics of street buildings, and images of the streetscape in Gyeongju. For the questionnaire survey, the preference of 33 architecture-related people on the formal constituents of street buildings was investigated. The results of the document research, field survey and questionnaire survey were put together to elaborate the design guideline for urban street facade in historical city.

  • PDF

Multi-task learning with contextual hierarchical attention for Korean coreference resolution

  • Cheoneum Park
    • ETRI Journal
    • /
    • v.45 no.1
    • /
    • pp.93-104
    • /
    • 2023
  • Coreference resolution is a task in discourse analysis that links several headwords used in any document object. We suggest pointer networks-based coreference resolution for Korean using multi-task learning (MTL) with an attention mechanism for a hierarchical structure. As Korean is a head-final language, the head can easily be found. Our model learns the distribution by referring to the same entity position and utilizes a pointer network to conduct coreference resolution depending on the input headword. As the input is a document, the input sequence is very long. Thus, the core idea is to learn the word- and sentence-level distributions in parallel with MTL, while using a shared representation to address the long sequence problem. The suggested technique is used to generate word representations for Korean based on contextual information using pre-trained language models for Korean. In the same experimental conditions, our model performed roughly 1.8% better on CoNLL F1 than previous research without hierarchical structure.

An Efficient Information Retrieval System for Unstructured Data Using Inverted Index

  • Abdullah Iftikhar;Muhammad Irfan Khan;Kulsoom Iftikhar
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.7
    • /
    • pp.31-44
    • /
    • 2024
  • The inverted index is combination of the keywords and posting lists associated for indexing of document. In modern age excessive use of technology has increased data volume at a very high rate. Big data is great concern of researchers. An efficient Document indexing in big data has become a major challenge for researchers. All organizations and web engines have limited number of resources such as space and storage which is very crucial in term of data management of information retrieval system. Information retrieval system need to very efficient. Inverted indexing technique is introduced in this research to minimize the delay in retrieval of data in information retrieval system. Inverted index is illustrated and then its issues are discussed and resolve by implementing the scalable inverted index. Then existing algorithm of inverted compared with the naïve inverted index. The Interval list of inverted indexes stores on primary storage except of auxiliary memory. In this research an efficient architecture of information retrieval system is proposed particularly for unstructured data which don't have a predefined structure format and data volume.

Review of Wind Energy Publications in Korea Citation Index using Latent Dirichlet Allocation (잠재디리클레할당을 이용한 한국학술지인용색인의 풍력에너지 문헌검토)

  • Kim, Hyun-Goo;Lee, Jehyun;Oh, Myeongchan
    • New & Renewable Energy
    • /
    • v.16 no.4
    • /
    • pp.33-40
    • /
    • 2020
  • The research topics of more than 1,900 wind energy papers registered in the Korean Journal Citation Index (KCI) were modeled into 25 topics using latent directory allocation (LDA), and their consistency was cross-validated through principal component analysis (PCA) of the document word matrix. Key research topics in the wind energy field were identified as "offshore, wind farm," "blade, design," "generator, voltage, control," 'dynamic, load, noise," and "performance test." As a new method to determine the similarity between research topics in journals, a systematic evaluation method was proposed to analyze the correlation between topics by constructing a journal-topic matrix (JTM) and clustering them based on topic similarity between journals. By evaluating 24 journals that published more than 20 wind energy papers, it was confirmed that they were classified into meaningful clusters of mechanical engineering, electrical engineering, marine engineering, and renewable energy. It is expected that the proposed systematic method can be applied to the evaluation of the specificity of subsequent journals.

Text Mining of Wood Science Research Published in Korean and Japanese Journals

  • Eun-Suk JANG
    • Journal of the Korean Wood Science and Technology
    • /
    • v.51 no.6
    • /
    • pp.458-469
    • /
    • 2023
  • Text mining techniques provide valuable insights into research information across various fields. In this study, text mining was used to identify research trends in wood science from 2012 to 2022, with a focus on representative journals published in Korea and Japan. Abstracts from Journal of the Korean Wood Science and Technology (JKWST, 785 articles) and Journal of Wood Science (JWS, 812 articles) obtained from the SCOPUS database were analyzed in terms of the word frequency (specifically, term frequency-inverse document frequency) and co-occurrence network analysis. Both journals showed a significant occurrence of words related to the physical and mechanical properties of wood. Furthermore, words related to wood species native to each country and their respective timber industries frequently appeared in both journals. CLT was a common keyword in engineering wood materials in Korea and Japan. In addition, the keywords "MDF," "MUF," and "GFRP" were ranked in the top 50 in Korea. Research on wood anatomy was inferred to be more active in Japan than in Korea. Co-occurrence network analysis showed that words related to the physical and structural characteristics of wood were organically related to wood materials.

Design and Implementation of a CASE Tool with Compound Document Support (복합 문서 지원 기능을 갖는 CASE 도구의 설계 및 구현)

  • Jo, Jang-Wu;Kim, Tae-Gyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.6
    • /
    • pp.41-48
    • /
    • 2007
  • Since the advent of object-oriented(OO) technology, research and development projects have fumed out that OO technology could importantly contribute in productivity and reusability improvement. There are various research areas related to OO technology And two of major research fields in this areas are concerned in issues for CASE tools and component-based technologies. This paper discusses a trial of applying component-based technology to a CASE tool. This paper proposes the design and implementation issues obtained while we have incrementally developed OODesigner, an OO CASE tool, with compound document support functionality based on COM/OLE technology. As OODesigner with OLE container/server functionality has interoperability with other application programs of Windows system, it can provide more powerful documentation environment than other CASE tools. OODesigner can be used not only to design UML diagrams. but also enables us to make more expressive documentation cooperatively with various kinds of OLE-enabled software like Recorder and Excel. Therefore powerful and consistent documentation activity can be achieved with the tool.

  • PDF

Efficient Hangul Word Processor (HWP) Malware Detection Using Semi-Supervised Learning with Augmented Data Utility Valuation (효율적인 HWP 악성코드 탐지를 위한 데이터 유용성 검증 및 확보 기반 준지도학습 기법)

  • JinHyuk Son;Gihyuk Ko;Ho-Mook Cho;Young-Kuk Kim
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.1
    • /
    • pp.71-82
    • /
    • 2024
  • With the advancement of information and communication technology (ICT), the use of electronic document types such as PDF, MS Office, and HWP files has increased. Such trend has led the cyber attackers increasingly try to spread malicious documents through e-mails and messengers. To counter such attacks, AI-based methodologies have been actively employed in order to detect malicious document files. The main challenge in detecting malicious HWP(Hangul Word Processor) files is the lack of quality dataset due to its usage is limited in Korea, compared to PDF and MS-Office files that are highly being utilized worldwide. To address this limitation, data augmentation have been proposed to diversify training data by transforming existing dataset, but as the usefulness of the augmented data is not evaluated, augmented data could end up harming model's performance. In this paper, we propose an effective semi-supervised learning technique in detecting malicious HWP document files, which improves overall AI model performance via quantifying the utility of augmented data and filtering out useless training data.