• Title/Summary/Keyword: Semistructured Data

Search Result 35, Processing Time 0.021 seconds

Web Information Extraction using HTML Tag Pattern (HTML 태그페턴을 이용한 웹정보추출시스템)

  • Park, Byung-Kwon
    • Proceedings of the Korea Association of Information Systems Conference
    • /
    • 2005.05a
    • /
    • pp.79-92
    • /
    • 2005
  • To query the vast amount of web pages which are available i]l the Internet, it is necessary to extract the encoded information in the web pages for converting it into structured data (e.g. relational data for SQL) or semistructured data (e.g. XML data for XQuery), In this paper, we propose a new web information extraction system, PIES, to convert web information into XML documents. PIES is based on a user-specified target schema and HTML tag pattern descriptions. The web information is extracted by the pattern descriptions and validated by the target schema. We designed a new language to describe extraction rules, and a new regular expression to describe HTML tag patterns. We implemented PIES and applied it to the US patent web site to evaluate its correctness. It successfully extracted more than thousands of US patent data and converted them into XML documents.

  • PDF

An Efficient Technique for Extracting Lower Bound Schema from Semistructured Data (반구조적 데이터의 효율적인 최소경계 스키마 추출 기법)

  • 박경현;김록원;양은주;최은선;류근호
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10a
    • /
    • pp.27-29
    • /
    • 2000
  • 반구조적 데이터는 기존의 스키마와는 달리 고정된 스키마가 없고 주어진 데이터 인스턴스에 대해 하나 이상의 스키마가 존재한다. 따라서 여러 개의 스키마 추출이 가능한데 그중 가장 정확한 스키마를 추출해야 하는 문제(S초듬 Fxtraction)가 발생한다. 이러한 문제를 해결하기 위해 지금까지 여러 가지 스키마 추출 기번들이 제안되었는데 대표적인 것으로 데이터가이드(DataGuide)를 이용하여 최대경계 스키마를 추출하는 방법과 데이터로그(DataLog)를 이용하여 최소경계 스키마를 추출하는 방법이 있다. 이 논문에서는 기존의 데이터로그를 이용하는 방법보다 최소경계 스키마 추출 기법을 제안하고 이전의 스키마 추출 기법들과 비교함으로써 알고리즘의 성능을 살펴본다.

  • PDF

An Efficient Technique for Evaluating Queries with Multiple Regular Path Expressions (다중 정규 경로 질의 처리를 위한 효율적 기법)

  • Chung, Tae-Sun;Kim, Hyoung-Joo
    • Journal of KIISE:Databases
    • /
    • v.28 no.3
    • /
    • pp.449-457
    • /
    • 2001
  • As XML has become an emerging standard for information exchange on the World Wide Web, it has gained attention in database communities to extract information from XML seen as a database model. XML queries are based on regular path queries, which find objects reachable by given regular expressions. To answer many kinds of user queries, it is necessary to evaluate queries that have multiple regular path expressions. However, previous work such as query rewriting and query optimization in the frame work of semistructured data has dealt with a single regular expression. For queries that have multiple regular expressions we suggest a two phase optimizing technique: 1. query rewriting using views by finding the mappings from the view's body to the query's body and 2. for rewritten queries, evaluating each query conjunct and combining them. We show that our rewriting algorithm is sound and our query evaluation technique is more efficient than the previous work on optimizing semistructured queries.

  • PDF

A Study on Health/Illness Concepts in Hospitalized Children (입원아동이 지각한 건강과 질병개념에 관한 연구)

  • Sung Mi-Hae
    • Child Health Nursing Research
    • /
    • v.7 no.2
    • /
    • pp.149-160
    • /
    • 2001
  • The purpose of this study was to explore the health and illness concepts of hospitalized children. The subjects were 129 hospitalized children from 3 to 12 years old in one general hospital. Data were collected through semistructured interviews by authors. This study was conducted from Jun. 1, 2000 to Dec. 31, 2000. Data were coded and categorized by content analysis. The results were as follows : 1. Perceived health concept were physical well-being, food, exercise, powerfulness, emotional stability, obeidence, cleanliness, sleep and ability of social adaptation. 2. Perceived health behavior to maintain health were food, treatment, exercise, cleanliness, obeidence, sleep, emotional stability, power-fulness and psychological stability, physical well-being. 3. Perceived prevention of illness were food, cleanliness, treatment, exercise, obedience, sleep, powerfulness, psychological stability, emotional stability, recreation and ability of social adaptation. 4. Perceived causes of illness were illness, trauma and food. 5. Perceived treatment of illness were treatment, sleep, rest, food, obedience, emotional stability, psychological stability, cleanliness, exercise and powerfulness.

  • PDF

An Efficient Disk Block Allocation Method for XML Data (XML 데이타를 위한 효율적인 디스크 블록 할당 방법)

  • Kim, Jung-Hoon;Son, Jin-Hyun;Chung, Yon-Dohn;Kim, Myoung-Ho
    • Journal of KIISE:Databases
    • /
    • v.34 no.5
    • /
    • pp.465-472
    • /
    • 2007
  • With the recent proliferation of the use of semi-structured data such as XML, it becomes more important to efficiently store and manage the semi-structured data. The XML data can be logically modelled as a rooted tree e.g., the DOM tree. In order to process a query on the XML data, we traverse the tree structure. In this paper we present an algorithm that places the XML data to disk blocks. The proposed algorithm assigns a number to each node of the tree in a bottom-up fashion. Then, the nodes are allocated to disk blocks using the assigned number. The proposed algorithm does not need access pattern information, and provides good performance for any access pattern. The characteristics of the proposed method are presented with analysis. Through experiments, we evaluate the performance of the proposed method.

Three generations of mothers and daughters: attachment patterns and psychological well-being (3세대 모녀간의 애착.자율성 발달특성과 심리적 적응)

  • 유은희
    • Journal of Families and Better Life
    • /
    • v.14 no.4
    • /
    • pp.191-202
    • /
    • 1996
  • This research applied an attachment theory to the study of three generations of women. Questionnaire and semistructured interview techniques were employed to collect the data on intergenerational mother-daughter relationships from 140 triads of adolescent daughters middle-aged mothers an old-aged grandmothers. The focus of the study had been on the characteristics of attachment patterns which is measured by sense of attachment and autonomy across and within generations and their effects on personal well-being. Women in each their three generations perceived a high and seminilar level of attachment across and within the generations. On the other hand the level of autonomy differed by the generations with middle-aged mothers showing a higher level of perceived sense of autonomy than other two generations. Although the levels of attachment and autonomy were related to psychological well-being the level of autonomy was slightly more related to it. The results also showed that not nly one's own attachment toward mother/daughter but attachment of others toward herself were associated with the personal well-being. Overall this study reflects and supports the basis concepts of mother-daughter attachment: its continuity reciprocity and personal development in adulthood.

  • PDF

Storing and Querying XML Data using ORDBBM (ORDBMS를 이용한 XML문서의 저장 및 질의)

  • 박성희;박경현;김록원;남광우;류근호
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.04b
    • /
    • pp.81-83
    • /
    • 2000
  • 현재 XML 문서를 저장하고 이에 대한 질의를 처리하는 백엔드 저장소로써는 파일시스템, 기존의 RDBMS와 OODBMS를 이용하는 접근 방법이 있다. 또한 독자적으로 semistrucured 데이터에 대한 저장 및 질의를 처리 할 수 있는 데이터베이스 시스템이 존재한다. 따라서, 이 논문에서는 기존의 응용프로그램에서 이용하는 데이터와 통합을 잘 할 수 있는 RDBMS의 장점과 객체지향 DOM모델을 지원할 수 있는 OODBMS의 특징을 모두 수용할 수 있는 ORDBMS에서 XML 문서를 저장하고 저장된 데이터에 대한 질의를 잘 할 수 있는 XML문서 처리시스템을 설계한다. 여기서, XML문서의 논리적 구조가 정해져 있지 않는 XML문서를 ORDBMS의 테이블 형태로 저장하는 여러 가지 방법을 제시하고, semistructured 데이터에 대한 질의의 특징인 패스표현을 효율적으로 지원하기 위해 패스 인덱스의 개념을 제시한다. 이렇게 함으로써 XML문서에 대한 질의를 ORDBMS에서 처리할 때 효율성을 높일 수 있다.

  • PDF

Mystery Shopping and Well-Being of Service Workers in South Korea

  • Shin, Heeju
    • Safety and Health at Work
    • /
    • v.10 no.4
    • /
    • pp.476-481
    • /
    • 2019
  • Background: Mystery shopping is a method in which a company monitors quality of service and employee conduct and compliance with regulations using an evaluator posing as a customer. It is a typical tool of customer-centered bureaucratic control insofar as it provides overall and standardized evaluation of intangible elements of customer service as well as physical elements of service environments. The purpose of this study is to examine how mystery shopping is related to the health status of service workers in South Korea. Methods: Data from semistructured interviews with 15 workers were collected from January to April 2019 to obtain information on service worker experiences with mystery shopping. Data were analyzed using the constant comparison method. Results: Mystery shopping limits worker autonomy and stiffens the workplace environment by standardizing and monitoring labor processes for service workers. In addition, mystery shopping heightens work stress through increased labor intensity. Five mechanisms by which mystery shopping affects service worker health are identified and comprise (1) multifaceted and multilayered surveillance, (2) evaluator subjectivity and irrational requirements, (3) standardized rules combined with high pressure to achieve sales, (4) self-esteem degradation because of evaluator results, and (5) musculoskeletal disorders because of strict adherence to labor processes based on evaluator results. Conclusion: Mystery shopping as an evaluation method should be reconsidered not only in terms of health problems but also in terms of organizational efficiency and issues of human rights.

A study on Health/Illness concepts in Hospitalized Preschoolers (학령전기 입원 아동의 건강 및 질병 개념에 관한 연구)

  • Sung Mi Hae
    • Child Health Nursing Research
    • /
    • v.6 no.3
    • /
    • pp.291-304
    • /
    • 2000
  • The purpose of this study was to explore the health and illness concepts of hospitalized preschoolers. The subjects were 52 hospitalized preschoolers from 3 to 6 grade in one general hospital. Data were collected through semistructured interviews by author. this study was conducted from Mar 2, 2000 to Jun. 30, 2000. Data were coded and categorized by content analysis. The results were as follows : 1. Hospitalized preschoolers's answers about health concepts were coded and then classificated to 7 categories(physical well-being, food, powerfulness, exercise, obedience to authority, cleanliness, sleep.) 2. Hospitalized preschoolers's answers about health behavior to maintenance health were coded and then classificated to 8 categories (food, obedience to authority, treatment, exercise, cleanliness, powerfulness, sleep, psychological stability). 3. Hospitalized preschoolers's answers about prevention of illness were coded and then classificated to 9 categories(food, treatment, obedience to authority, powerfulness, emotional stability, psychological stability, exercise, physical well-being, ability of social adaption). 4. Hospitalized preschoolers's answers about cause of illness were coded and then classificated to 3 categories(illness, trauma, food). 5. Hospitalized preschoolers's answers about treatments of illness were coded and then classificated to 9 categories(treatment, rest, emotional stability, sleep, psychological stability, food, obedience, exercise, powerfulness). 6. The levels of health and illness concepts in this sample were higher than those of the physical causality.

  • PDF

A Study on the analysis of Research Data Management and Sharing of Science & Technology Government-funded Research Institutes (과학기술분야 출연연구기관 연구데이터 관리 및 공유 사례 분석 연구)

  • Park, Miyoung;Ahn, Inja;Nam, Seungjoo
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.29 no.4
    • /
    • pp.319-344
    • /
    • 2018
  • As a part of the open science policy, this study compared the perception of research data sharing and utilization by academic field. Based on this, in - depth interviews were conducted with semistructured questions to the data task managers of 27 government - funded research institutes in science and technology. Among them, nine excellent organizations were selected from the viewpoint of data management and cases of research data collection and management were specifically presented. The State of the collection and management of research data by the participating research institutes is generally a pilot project stage, and the level of collection and establishment of data also differs by institution. In terms of institutions, they are divided into three levels: the level of collection and establishment of data(KIOM), the advanced level of it (KIST), And level of steps to start sharing (KRIBB, KRICT).