• Title/Summary/Keyword: Text data

Search Result 2,959, Processing Time 0.029 seconds

IMPLEMENTATION OF SUBSEQUENCE MAPPING METHOD FOR SEQUENTIAL PATTERN MINING

  • Trang, Nguyen Thu;Lee, Bum-Ju;Lee, Heon-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.627-630
    • /
    • 2006
  • Sequential Pattern Mining is the mining approach which addresses the problem of discovering the existent maximal frequent sequences in a given databases. In the daily and scientific life, sequential data are available and used everywhere based on their representative forms as text, weather data, satellite data streams, business transactions, telecommunications records, experimental runs, DNA sequences, histories of medical records, etc. Discovering sequential patterns can assist user or scientist on predicting coming activities, interpreting recurring phenomena or extracting similarities. For the sake of that purpose, the core of sequential pattern mining is finding the frequent sequence which is contained frequently in all data sequences. Beside the discovery of frequent itemsets, sequential pattern mining requires the arrangement of those itemsets in sequences and the discovery of which of those are frequent. So before mining sequences, the main task is checking if one sequence is a subsequence of another sequence in the database. In this paper, we implement the subsequence matching method as the preprocessing step for sequential pattern mining. Matched sequences in our implementation are the normalized sequences as the form of number chain. The result which is given by this method is the review of matching information between input mapped sequences.

  • PDF

PDOCM : Fast Text Compression on MasPar Machine (PDOCM : MasPar머쉰상의 새로운 압축기법과 빠른 텍스트 축약)

  • Min, Yong-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.1
    • /
    • pp.40-47
    • /
    • 1995
  • Due to rapid progress in data communications, we are able to acquire the information we need with ease. One means of achieving this is a parallel machine such as the MasPar. Although the parallel machine makes it possible to receive/transmit enormous quantities of data, because of the increasing volume of information that must be processed, it is necessary to transmit only a minimal amount of data bits. This paper suggests a new coding method for the parallel machine, which compresses the data by reducing redundancy. Parallel Dynamic Octal Compact Mapping (PDOCM) compresses at least 1 byte per word, compared with other coding techniques, and achieves a 54.188-fold speedup with 64 processors to transmit 10 million characters.

  • PDF

The Future Past of Humanities Research: Musing Methodology in the Digital Convergence Era

  • Kim, Jiyun
    • International journal of advanced smart convergence
    • /
    • v.9 no.3
    • /
    • pp.161-168
    • /
    • 2020
  • Over the last half-century, computer science has revolutionarily changed the landscape of humanities research. This digital shift in research methodology has reached from the brainstorming process to preserving, constructing, collecting, visualizing, and even analyzing materials. Such transformation has brought about the birth of the new field of study: Digital Humanities (DH). DH undeniably has saved much of the physical chores and provided a new angle to interpret the text, thereby making its meteoric rise as a promising future of the humanities. Based on such innovation, electronic circuitry can seem to replace the imagination that detects relationships and significances of research data with ever-improving interfaces. However, despite hitherto technological development, the thousands-year-old essence of traditional liberal arts-human creativity-remains the heart of humanities research and always will. This paper starts by proving this proposition in the way of comparing the old and new liberal arts research methods, focusing on literary studies. Meanwhile, it thoroughly investigates how digitalized bibliographies, search engines, databases, and digital projects provide the most useful data preservation and virtual experience of browsing in the library, along with their limitations due to the intrinsic quality of humanities research data. Also, it probes the differences between traditional and digital data analysis in current methods of literary studies, ultimately presenting the ideal direction for humanities development in the era of digital convergence.

The Implementation of Real Time Vital Sign Information Management System in Patient Monitoring Systems (환자감시시스템(PMS) 실시간 생체정보관리 시스템 구현)

  • Kang, Ki-Woong;Lim, Se-Jung;Kim, Gwang-Jun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.2 no.4
    • /
    • pp.244-249
    • /
    • 2007
  • HL7 is well-Known standard protocol for text data generated in hospital information systems. In this paper, we have to design to obtain useful vital sign information, which is generated at data receiver modulor of HIS, that is offered by the central monitor. Vital sign informations of central monitor is composed of the row data of several bedsite patient monitors. We are willing to maintain vital sign information of real time and continuity that is generated from the bedsite patient monitor. It is able to apply to remote medical examination and treatment. we proposed integration method between vital sign database systems and hospital information systems. Through the proper exchange and management of patient vital sign information, real time vital sign information management will offer better workflow to all hospital employee.

  • PDF

VP Database Support for a More Efficient Cyber Shopping Mall (효과적인 사이버 쇼핑몰을 위한 VP 데이타베이스 지원)

  • Lim, Jaeguk;Kang, Hyunchul;Han, Sangyong
    • The Journal of Information Technology and Database
    • /
    • v.8 no.1
    • /
    • pp.1-11
    • /
    • 2001
  • More and more cyber shopping malls, one of the new promising Internet businesses of today, are opening business everyday. And instead of the ordinary image and text type of product display, buyers can now view products from any viewpoint through 3D images and also get more detail information on the product more easily thanks to the new VP technology, visual tools, and statecharts. However the currently used virtual prototyping supporting method does not consist of any database support for sharing the data from different virtual prototype developments and reusing the data in developing other prototypes. And in cases of custom order products, there is no linkage with the virtual product database that enables buyers in cyberspace such as cybermalls to try out the products before purchasing. This paper is purported for being applied as the basis for planning the construction of a complete CRM method applied cyber shopping mall that can accommodata all the demands and requests from customers. And the database supporting VP framework that supports data sharing and collaboration between virtual prototype developers and manufacturing custom order products is suggested for this purpose.

  • PDF

A comparative study of filter methods based on information entropy

  • Kim, Jung-Tae;Kum, Ho-Yeun;Kim, Jae-Hwan
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.40 no.5
    • /
    • pp.437-446
    • /
    • 2016
  • Feature selection has become an essential technique to reduce the dimensionality of data sets. Many features are frequently irrelevant or redundant for the classification tasks. The purpose of feature selection is to select relevant features and remove irrelevant and redundant features. Applications of the feature selection range from text processing, face recognition, bioinformatics, speaker verification, and medical diagnosis to financial domains. In this study, we focus on filter methods based on information entropy : IG (Information Gain), FCBF (Fast Correlation Based Filter), and mRMR (minimum Redundancy Maximum Relevance). FCBF has the advantage of reducing computational burden by eliminating the redundant features that satisfy the condition of approximate Markov blanket. However, FCBF considers only the relevance between the feature and the class in order to select the best features, thus failing to take into consideration the interaction between features. In this paper, we propose an improved FCBF to overcome this shortcoming. We also perform a comparative study to evaluate the performance of the proposed method.

The Design and Implementation of Automatic Communication System using Mobile Instant Messenger (모바일 인스턴스 메신저를 활용한 자동화 커뮤니케이션 시스템 설계 및 구현)

  • Kim, Tae Yeol;Lee, Dae Sik
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.10 no.3
    • /
    • pp.11-21
    • /
    • 2014
  • In this paper, concerning the various advertising and policy advertising of the election with respect to whether to deliver a message to a large number of people, we design and implement an automative system what enables sending the text messages directly from the server to the client and also fast feedback is enabled by utilizing a number of operational programs to connect to the server. Therefore, we design and implement the automative communication system which enables delivering message to each user mobile terminal from a plurality of relay mobile terminals by utilizing the mobile instant messenger, not to deliver a message from the server to the mobile instant messenger user directly. In result of comparative analysis on the number of times of data transmission, this automative communication system utilizing mobile instant messenger shows the result that it enables transmitting five times per minute as it can copy and paste in the automation system regardless of the size of the data loading, otherwise in case of transmitting manually it show the result that the number of times of data transmission is reduced if the size of the data is larger.

The Development of Technique for the Visualization of Geological Information Using Geostatistics (지구통계학을 활용한 지반정보 가시화 기법 개발)

  • 송명규;김진하;황제돈;김승렬
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2001.03a
    • /
    • pp.501-508
    • /
    • 2001
  • A graph or topographic map can often convey larger amounts of information in a shorter time than ordinary text-based methods. To visualize information precisely it is necessary to collect all the geological information at design stage, but actually it is almost impossible to bore or explore the entire area to gather the required data. So, tunnel engineers have to rely on the judgement of expert from the limited number of the results of exploration and experiment. In this study, several programs are developed to handle the results of geological investigation with various data processing techniques. The results of the typical case study are also presented. For the electric survey, eleven points are chosen at the valley to measure the resistivity using Schlumberger array. The measured data are interpolated in 3-dimensional space by kriging and the distribution of resistivity are visualized to find weak or fractured zone. The correlation length appears to be around 5 to 20 meter in depth. Regression analyses were performed to find a correlation length. No nugget effect is assumed, and the topographic map, geologic formation, fault zone, joint geometry and the distribution of resistivity are successfully visualized by using the proposed technique.

  • PDF

A Virtual Microscope System for Educational Applications (교육 분야 응용을 위한 가상 현미경 시스템)

  • Cho, Seung-Ho;Beynon, Mike;Saltz, Joel
    • The KIPS Transactions:PartD
    • /
    • v.10D no.1
    • /
    • pp.117-124
    • /
    • 2003
  • The system implemented in this paper partitions and stores specimen data captured by a light microscope on distributed or parallel systems. Users ran observe images on computers as we use a physical microscope. Based on the client-server computing model, the system consists of client, coordinator, and data manager. Three components communicate messages. For retrieving images, we implemented the client program with necessary functions for educational applications such at image mark and text annotation, and defined the communication protocol. We performed the experiment for introducing a tape storage which stores a large volume of data. The experiment results showed performance improvement by data partitioning and indexing technique.

Similarity Measurement Method of Trajectory using Indexing Information of Moving Object in Video (비디오 내 이동 객체의 색인 정보를 이용한 궤적 유사도 측정 기법)

  • Kim, Jeong In;Choi, Chang;Kim, Pan Koo
    • Smart Media Journal
    • /
    • v.1 no.3
    • /
    • pp.43-47
    • /
    • 2012
  • The recent proliferation of multimedia data necessitates the effectively and efficiently retrieving of multimedia data. These research not only focus on the retrieving methods of text matching but also on using the multimedia data features. Therefore, this paper is a similarity measurement method of trajectory using indexing information of moving object in video, for similarity measurement. This method consists of 2 steps. Firstly, Video data is processed indexing for trajectory extraction of moving objects using CCTV. Finally, we describe to compare DTW(Dynamic Time Warping) to TSR(Tansent Space Representation) algorithm.

  • PDF