• 제목/요약/키워드: publishing data

검색결과 220건 처리시간 0.025초

A Study on Performing Join Queries over K-anonymous Tables

  • Kim, Dae-Ho;Kim, Jong Wook
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권7호
    • /
    • pp.55-62
    • /
    • 2017
  • Recently, there has been an increasing need for the sharing of microdata containing information regarding an individual entity. As microdata usually contains sensitive information on an individual, releasing it directly for public use may violate existing privacy requirements. Thus, to avoid the privacy problems that occur through the release of microdata for public use, extensive studies have been conducted in the area of privacy-preserving data publishing (PPDP). The k-anonymity algorithm, which is the most popular method, guarantees that, for each record, there are at least k-1 other records included in the released data that have the same values for a set of quasi-identifier attributes. Given an original table, the corresponding k-anonymous table is obtained by generalizing each record in the table into an indistinguishable group, called the equivalent class, by replacing the specific values of the quasi-identifier attributes with more general values. However, query processing over the anonymized data is a very challenging task, due to generalized attribute values. In particular, the problem becomes more challenging with an equi-join query (which is the most common type of query in data analysis tasks) over k-anonymous tables, since with the generalized attribute values, it is hard to determine whether two records can be joinable. Thus, to address this challenge, in this paper, we develop a novel scheme that is able to effectively perform an equi-join between k-anonymous tables. The experiment results show that, through the proposed method, significant gains in accuracy over using a naive scheme can be achieved.

재사용 서비스의 등록/검색을 위한 확장된 UDDI 시스템 (Extended UDDI System for Registering and Discovering the Reusable Services)

  • 신수혜;백선재;박준석;문미경;염근혁
    • 소프트웨어공학소사이어티 논문지
    • /
    • 제24권3호
    • /
    • pp.101-110
    • /
    • 2011
  • 웹 서비스(Web Service)는 SOAP, WSDL, UDDI 등의 표준화된 XML 메시지를 통해 네트워크 상에서 상이한 시스템간의 상호작용을 가능하게 하는 소프트웨어 시스템이다. 특히, UDDI는 서비스 제공자에 의한 서비스 등록과 서비스 요청자의 서비스 검색을 지원하는 레지스트리이다. 기존의 UDDI연구는 서비스의 단순 검색과 등록에 관한 연구로, 서비스 검색 향상에 관한 UDDI 연구나 컴포넌트 단위의 재사용성 향상을 위한 확장된 UDDI설계에 관한 연구를 제시하고 있다. 본 논문에서는 기존 UDDI의 서비스 등록과 검색 기능뿐만 아니라, 재사용을 위한 서비스 모델과 이를 위한 새로운 UDDI 자료구조와 API를 제안하며, 재사용을 위한 서비스 등록과 검색 기능을 제공하는 확장된UDDI를 설계 및 구현한다. 제시된 UDDI 시스템을 통해 서비스 개발자는 이미 개발된 서비스를 사용하여 서비스 애플리케이션을 개발함으로써 개발 비용 및 시간을 줄일 수 있으며, 검증된 서비스를 재사용함으로써 품질도 보장할 수 있을 것으로 기대된다.

  • PDF

염증성 근육뼈대계 질환에 대한 미세전류의 효과: 메타분석 (Effects of Microcurrent on Inflammatory Musculoskeletal Diseases: A Meta-Analysis)

  • 이정우;고운;두영택
    • 대한통합의학회지
    • /
    • 제8권4호
    • /
    • pp.1-11
    • /
    • 2020
  • Purpose : The purpose of this meta-analysis was to examine the effects of microcurrent on inflammatory musculoskeletal diseases. Methods : Domestic databases (RISS, NDSL, KISS, DBpia, and Kmbase) were searched for studies that conducted clinical trials associated with microcurrent and its impact on inflammatory musculoskeletal diseases. A total of 606 studies published between 2002 and 2019 were identified, with 8 studies satisfying the inclusion data. The studies were classified according to patient, intervention, comparison, and outcome (PICO). The search outcomes were items associated with blood component, pain, and function. The 8 studies that were included in the study were evaluated using R meta-analysis (version 4.0). The quality of 7 randomized control trials was evaluated using Cochrane risk of bias (ROB). The quality of 1 non-randomized control trial was evaluated using risk of bias assessment tool for non-randomized studies (RoBANS). Effect sizes were computed as the corrected standard mean difference (SMD). A random-effect model was used to analyze the effect size because of the high heterogeneity among the studies. Egger's regression test was carried out to analyze the publishing bias. Results : The following factors had a large effect size involving microcurrent on inflammatory musculoskeletal diseases: blood component (Hedges's g=-2.46, 95 % CI=-4.20~-0.73), pain (Hedges's g=3.51, 95 % CI=2.44~4.77), and function (Hedges's g=3.06, 95 % CI: 1.53~4.58). Except for function (t=1.572, p=.191), Egger's regression test showed that the publishing bias had statistically significant differences. Conclusion : This study provides evidence for the effectiveness of microcurrent on inflammatory musculoskeletal diseases in terms of blood component, pain, and function. However, due to the small sample sizes used in the included studies, the results of our study should be interpreted cautiously, especially considering the publishing bias.

저자집단 분석을 통한 한국 문헌정보학의 학술커뮤니케이션 동향 연구 (A Study on Scholarly Communication Trends in Korean Library and Information Science Studies through Author Group Analysis)

  • 이재윤
    • 한국문헌정보학회지
    • /
    • 제57권2호
    • /
    • pp.409-434
    • /
    • 2023
  • 이 연구에서는 국내 문헌정보학 분야 4개 학회 학술지에 2002년부터 2021년까지 20년 동안 게재된 논문 전체의 저자를 분석하여 국내 문헌정보학 학술지를 통한 학술 커뮤니케이션 현황을 고찰하고 향후 전망을 제시하는 것을 목표로 한다. 이를 위해서 학술지별 공저자 수, 귀환저자 비율, 투고선호지수, 저자집단 변화 추세, 연구자 유인지수 등을 분석하였다. 분석 결과 4개 학술지의 공동연구 수준, 학술지별로 연관된 저자집단의 형성 정도, 저자집단이 변화된 변곡점, 신진 연구자 집단의 특성, 학술지 간 저자 공유 정도 등이 파악되었다. 전체적으로 2015년이 한국 문헌정보학 저자집단이 변화한 변곡점으로 나타났으며, 이후에 등장한 신진 연구자들은 주로 공동연구를 수행하면서 이전과 다소 다른 학술지 논문발표 행태를 보였다. 계량분석을 수행한 이 연구의 결과가 질적 연구방법을 사용한 선행 연구와 함께 활용된다면 한국 문헌정보학 학술지 발전 전략에 대한 다각화 연구를 수행한 효과를 거둘 것으로 기대된다.

뇌졸중 환자의 동작관찰훈련이 보행에 미치는 효과에 대한 메타분석; 국내연구를 중심으로 (Meta-Analysis on the Effects of Action Observation Training on Stroke Patients' Walking; Focused on Domestic Research)

  • 이정우;고운;두영택
    • 대한통합의학회지
    • /
    • 제7권4호
    • /
    • pp.119-130
    • /
    • 2019
  • Purpose : The purpose of this study was to investigate the meta-analysis on the effects of action observation training on stroke patients' walking. Methods : Domestic databases (DBpia, KISS, NDSL, and RISS) were searched for studies that conducted randomized controlled trials (RCTs) associated with action observation training in adults after stroke. The search outcomes were items associated with the walking function. The 18 studies that were included in the study were analyzed using R meta-analysis. A random-effect model was used for the analysis of the effect size because of the significant heterogeneity among the studies. Sub-group and meta-regression analysis were also used. Egger's regression test was conducted to analyze the publishing bias. Cumulative meta-analysis and sensitivity analysis were also done to analyze a data error. Results : The mean effect size was 2.77. The sub-group analysis showed a statistical difference in the number of training sessions per week. No statistically significant difference was found in the meta-regression analysis. Publishing bias was found in the data, but the results of the trim-and-fill method showed that such bias did not affect the obtained data. Also, the cumulative meta-analysis and sensitivity analysis showed no data errors. Conclusion : The meta-analysis of the studies that conducted randomized clinical trials revealed that action observation training effectively improved walking of the chronic stroke patients.

Analyzing RDF Data in Linked Open Data Cloud using Formal Concept Analysis

  • Hwang, Suk-Hyung;Cho, Dong-Heon
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권6호
    • /
    • pp.57-68
    • /
    • 2017
  • The Linked Open Data(LOD) cloud is quickly becoming one of the largest collections of interlinked datasets and the de facto standard for publishing, sharing and connecting pieces of data on the Web. Data publishers from diverse domains publish their data using Resource Description Framework(RDF) data model and provide SPARQL endpoints to enable querying their data, which enables creating a global, distributed and interconnected dataspace on the LOD cloud. Although it is possible to extract structured data as query results by using SPARQL, users have very poor in analysis and visualization of RDF data from SPARQL query results. Therefore, to tackle this issue, based on Formal Concept Analysis, we propose a novel approach for analyzing and visualizing useful information from the LOD cloud. The RDF data analysis and visualization technique proposed in this paper can be utilized in the field of semantic web data mining by extracting and analyzing the information and knowledge inherent in LOD and supporting classification and visualization.

Enhanced Regular Expression as a DGL for Generation of Synthetic Big Data

  • Kai, Cheng;Keisuke, Abe
    • Journal of Information Processing Systems
    • /
    • 제19권1호
    • /
    • pp.1-16
    • /
    • 2023
  • Synthetic data generation is generally used in performance evaluation and function tests in data-intensive applications, as well as in various areas of data analytics, such as privacy-preserving data publishing (PPDP) and statistical disclosure limit/control. A significant amount of research has been conducted on tools and languages for data generation. However, existing tools and languages have been developed for specific purposes and are unsuitable for other domains. In this article, we propose a regular expression-based data generation language (DGL) for flexible big data generation. To achieve a general-purpose and powerful DGL, we enhanced the standard regular expressions to support the data domain, type/format inference, sequence and random generation, probability distributions, and resource reference. To efficiently implement the proposed language, we propose caching techniques for both the intermediate and database queries. We evaluated the proposed improvement experimentally.

Opening the Nation: Leveraging Open Data to Create New Business and Provide Services

  • ;이홍주
    • 지식경영연구
    • /
    • 제16권4호
    • /
    • pp.157-168
    • /
    • 2015
  • Opening government data has been one of the main goals of nations building their e-government structures. Nonetheless, more than publishing government data for public viewing, the bigger concern right now is promoting the use change to "and proving the usefulness of available public data". In order to do this, governments must be able to, not only publicize data but more so, publish the kind of data usable to infomediaries and developers in order to create new products and services for citizens. This research investigates 30 open data use cases of South Korea as listed in Data.go.kr. This study aims to contribute to a better understanding of open datasets utilization in a technologically-advanced and well-developed nation and hopefully provide some useful insights on how open data is currently being used, how it is opening up new business, and more importantly, how it is contributing to the civic society by providing services to the public.

Influence of R&D intensity on Innovation Performance in the Korean Pharmaceutical Industry: Focusing on the Moderating Effects of R&D Collaboration

  • 김대중;엄기용
    • 지식경영연구
    • /
    • 제19권3호
    • /
    • pp.189-223
    • /
    • 2018
  • This paper examined the effect of innovation networks comprising research and development (R&D) collaboration on innovation performance of Korean pharmaceutical firms. As co-assigned patents and co-affiliated publications are common technical outcomes of successful R&D collaboration in the pharmaceutical industry, social network analysis technique was applied for analyzing innovation networks through patent and publication data. Results of Social network analysis indicated that a small set of highly innovative firms in the Korean pharmaceutical industry were actively involved in patenting and publishing. And the analysis of structural equation model found the followings: (1) R&D intensity significantly affected patenting, publication and new drug development, (2) the activity of patenting and publishing was positively related with the innovation performance measured by new drug development, and (3) R&D collaboration in terms of degree centrality of co-patent network played significant moderating roles on the relationships among R&D intensity, patenting, and new drug development. These findings are expected to be helpful to researchers as well as policy-makers to devise innovation-promoting policies in the Korean pharmaceutical industry. Discussions and limitations of the study are provided in the last part.

Patterns of Citing Korean DOI Journals According to CrossRef's Cited-by Linking and a Local Journal Citation Database

  • Seo, Tae-Sul;Jung, Eun-Gyeong;Kim, Hwanmin
    • Journal of Information Science Theory and Practice
    • /
    • 제1권2호
    • /
    • pp.58-68
    • /
    • 2013
  • Citing literature is a very important activity for scholars in writing articles. Many publishers and libraries build citation databases and provide citation reports on scholarly journals. Cited-by linking is a service representing what an article cites and how many times it cites a specific article within a journal database. Recently, information services based on DOIs (Digital Object Identifiers) have been increasing in number. CrossRef, a non-profit organization for the DOI registration agency, maintains the DOI system and provides the cited-by linking service. Recently, the number of Korean journals adopting DOI is also rapidly increasing. The Korea Institute of Science and Technology Information (KISTI) supports Korean learned societies in DOI related activities in collaboration with CrossRef. This study analyzes cited patterns of Korean DOI journal articles using CrossRef's cited-by linking data and a Korean journal citation database. This analysis has been performed in terms of publication country and the language of journals citing Korean journal articles. The results show that DOI, SCI(E) (Science Citation Index (Expanded)), and English journals are more likely to be cited internationally.