• Title/Summary/Keyword: data publishing

A Research on Module Arrangement of Korean Spelling Corrector to Optimize Correction Rate (교정률 최적화를 위한 한국어 철자교정기의 모듈 배열)

  • Yun Keun-Soo;Kwon Hyuk-Chul
    • Journal of KIISE:Software and Applications / v.32 no.5 / pp.366-377 / 2005
  • We seek the module arrangement that yields the optimal correction rate for a Korean spelling corrector. When a spelling corrector has many modules, the optimal correction rate is difficult to compute because N modules can be ordered in N! ways. The Korean spelling corrector studied here consists of 19 modules. Evaluating every arrangement of 19 modules is infeasible in practice, and the correction rate varies with the input data. We found the range of the correction rate by processing the modules in parallel, and the optimal correction rate by processing them sequentially. The experimental input is a set of 753,191 erroneous eojeols collected at a newspaper publishing company over several years. For this error set, the theoretical maximum correction rate of the spelling corrector is $97.28\%$ (732,764/753,191), whereas the optimal correction rate we obtained is $96.62\%$ (727,750/753,191). This optimal rate amounts to $99.31\%$ (727,750/732,764) of the theoretical maximum.
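
The figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below only recomputes the counts given above and the number of possible module orderings; the variable names are illustrative and do not come from the paper.

```python
import math

# Counts taken from the abstract above.
TOTAL_ERRORS = 753_191      # erroneous eojeols in the test set
MAX_CORRECTABLE = 732_764   # errors some module can correct (theoretical maximum)
BEST_SEQUENTIAL = 727_750   # errors corrected by the best module arrangement found
N_MODULES = 19


def percent(numerator: int, denominator: int) -> float:
    """Return numerator / denominator as a percentage."""
    return 100.0 * numerator / denominator


# Number of distinct orderings of 19 modules: 19!
orderings = math.factorial(N_MODULES)

max_rate = percent(MAX_CORRECTABLE, TOTAL_ERRORS)
best_rate = percent(BEST_SEQUENTIAL, TOTAL_ERRORS)
share_of_max = percent(BEST_SEQUENTIAL, MAX_CORRECTABLE)

print(f"orderings of {N_MODULES} modules: {orderings:,}")
print(f"theoretical maximum rate: {max_rate:.4f}%")
print(f"best rate found:          {best_rate:.4f}%")
print(f"share of the maximum:     {share_of_max:.4f}%")
```

With roughly 1.2 × 10^17 orderings, exhaustive evaluation over several hundred thousand eojeols is out of reach, which is why the abstract bounds the rate rather than enumerating arrangements. The percentages in the abstract appear to be truncated, not rounded, to two decimals.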

Retrieval of Legal Information Through Discovery Layers: A Case Study Related to Indian Law Libraries

  • Kushwah, Shivpal Singh;Singh, Ritu
    • Journal of Information Science Theory and Practice / v.4 no.3 / pp.71-83 / 2016
  • Purpose. The purpose of this paper is to analyze and evaluate discovery layer search tools for the retrieval of legal information in Indian law libraries. It covers current practices in legal information retrieval, with special reference to Indian academic law libraries, and analyzes its importance in the domain of law. Design/Methodology/Approach. A web survey and an observational study were used to collect the data. Data on the discovery tools were collected by email, followed by further discussion with the developers of the discovery layers/tools/products and their representatives. Findings. Results show that most Indian law libraries subscribe to bundles of legal information resources such as Hein Online, JSTOR, LexisNexis Academic, Manupatra, Westlaw India, SCC web, AIR Online (CDROM), and so on. International legal and academic resources are compatible with discovery tools because they support various standards for online publishing and dissemination, such as OAI-PMH, OpenURL, MARC21, and Z39.50, but Indian legal resources such as Manupatra, AIR, and SCC are not compatible with the discovery layers. The central index is one of the important components of a discovery search interface, and discovery layer services/tools could also be useful for Indian law libraries if they included multiple legal and academic resources in their central index. However, present practices and observations reveal that discovery layers do not provide a facility to cover legal information resources. Therefore, in their present form, discovery tools are not very useful; they are an incomplete, partial solution for Indian libraries because not all of the Indian legal resources available in the law libraries are covered. Originality/Value. Very limited research or published literature is available on discovery layers and their compatibility with legal information resources.

A Study on improvement of sounding density of ENCs (전자해도 수심 밀집도 개선에 관한 연구)

  • Oh, Se-Woong;Park, Jong-Min;Suh, Sang-Hyun;Lee, Moon-Jin;Jeon, Tae-Byung
    • Proceedings of the Korean Institute of Navigation and Port Research Conference / 2011.06a / pp.34-36 / 2011
  • ENCs are edited from the numerical charts used for publishing paper charts and are served in grid form. For this reason, the density of sounding information in ENCs is inconsistent and needed improvement. In this study, the K-Means and ISODATA clustering algorithms, which are used as classification methods for satellite images, were reviewed and applied in a case study. The developed components include an ENC data loading module, an algorithm for improving sounding information, and an ENC data writing module. The results of applying the algorithm confirmed the improvement.

Application of Deegree of Open Source Middleware to Geo-Portal Implementation (지오 포털 구축을 위한 공개 소스 미들웨어 Deegree의 적용)

  • Park, Yong-Jae;Lee, Ki-Won
    • Korean Journal of Remote Sensing / v.25 no.4 / pp.367-374 / 2009
  • Recently, new GIS applications such as geo portals and spatial data infrastructures have been emerging. They draw on web computing techniques and methodologies based on the web 2.0 paradigm, open APIs of portals, open source GIS, and international GIS standards, all of which are being developed independently. Such applications can be realized by linking these components. In this study, a case implementation that links the Google Maps API with an open source middleware named Deegree is carried out, and the results are discussed with respect to open source use in geo portals. The open source middleware supports various levels and types of OGC standards, so it enables web publishing in several web standard formats as well as data exchange and interoperable use with external database servers. Function extensions and a multi-tier architecture within a geo portal for specific purposes are also possible.

An Automatically Extracting Formal Information from Unstructured Security Intelligence Report (비정형 Security Intelligence Report의 정형 정보 자동 추출)

  • Hur, Yuna;Lee, Chanhee;Kim, Gyeongmin;Jo, Jaechoon;Lim, Heuiseok
    • Journal of Digital Convergence / v.17 no.11 / pp.233-240 / 2019
  • In order to predict and respond to cyber attacks, a number of security companies quickly identify the methods, types, and characteristics of attack techniques and publish Security Intelligence Reports (SIRs) on them. However, the SIRs distributed by each company are large and unstructured. In this paper, we propose a framework that uses five analytic techniques to structure a report and extract its key information, in order to reduce the time required to extract information from large unstructured SIRs. Since the SIR data have no ground-truth labels, we propose four analysis techniques based on unsupervised learning: Keyword Extraction, Topic Modeling, Summarization, and Document Similarity. Finally, we build a dataset for extracting threat information from SIRs and apply Named Entity Recognition (NER) to recognize words that denote an IP, Domain/URL, Hash, or Malware and to determine which type each word belongs to. In total, the proposed framework applies five analysis techniques, including NER.
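
To make the last step concrete, the sketch below pulls IP, Hash, and Domain/URL strings out of free text. It deliberately swaps in plain regular expressions as a much simpler baseline; it is not the NER model the abstract describes, and the patterns, the `extract_indicators` helper, and the sample sentence are all assumptions made for illustration. Malware names have no fixed surface form, so they are left out here.

```python
import re

# Hypothetical baseline patterns; a trained NER model would replace these.
PATTERNS = {
    # Dotted-quad IPv4 address (octet ranges are not validated).
    "IP": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    # Hex digests with the lengths of SHA-256, SHA-1, or MD5.
    "Hash": re.compile(r"\b(?:[a-fA-F0-9]{64}|[a-fA-F0-9]{40}|[a-fA-F0-9]{32})\b"),
    # Host names whose last label is alphabetic, so IPv4 addresses do not match.
    "Domain/URL": re.compile(r"\b(?:[a-z0-9-]+\.)+[a-z]{2,}\b", re.IGNORECASE),
}


def extract_indicators(text: str) -> dict:
    """Return every match of each pattern, keyed by entity type."""
    return {label: pattern.findall(text) for label, pattern in PATTERNS.items()}


# Made-up sentence in the style of a security report.
sample = (
    "The dropper 44d88612fea8a8f36de82e1278abb02f contacted "
    "192.0.2.10 and update.example.com."
)
found = extract_indicators(sample)
for label, values in found.items():
    print(label, values)
```

A pattern-based pass like this misses obfuscated indicators (for example `hxxp://` or `[.]` notation) and cannot use context, which is one reason a learned tagger may be preferred.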

Development of ESS Fair Trade System Linked with Blockchain (블록체인을 연계한 ESS 공정거래 시스템 개발)

  • Gun-Il Kim;Yang-Kwon Jeong;Young-Sik Kim;Jin-Suk Kim
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.1 / pp.149-156 / 2023
  • This research set out to develop an ESS electricity trading system linked with blockchain for energy-participating consumers. To publish renewable-energy ESS power amounts and demand information, we build a smart contract system on a blockchain DB, use the blockchain DB data of energy prosumers and consumers, and flexibly expand the power trading market so as to provide realistic solutions. The main elements of developing the blockchain-linked ESS power trading system are therefore: cloud-based web construction for ESS management; coin issuance and exchange registration to activate the blockchain; and, to apply blockchain technology, building a blockchain database for collecting and supplying ESS-based production and demand data, selecting a blockchain-based platform and building its foundation, and creating a smart contract.

Analyzing the Main Paths and Intellectual Structure of the Data Literacy Research Domain (데이터 리터러시 연구 분야의 주경로와 지적구조 분석)

  • Jae Yun Lee
    • Journal of the Korean Society for information Management / v.40 no.4 / pp.403-428 / 2023
  • This study investigates the development path and intellectual structure of data literacy research, aiming to identify emerging topics in the field. A comprehensive search for data literacy-related articles on the Web of Science reveals that the field is primarily concentrated in Education & Educational Research and Information Science & Library Science, accounting for nearly 60% of the total. Citation network analysis, employing the PageRank algorithm, identifies key papers with high citation impact across various topics. To accurately trace the development path of data literacy research, an enhanced PageRank main path algorithm is developed, which overcomes the limitations of existing methods confined to the Education & Educational Research field. Keyword bibliographic coupling analysis is employed to unravel the intellectual structure of data literacy research. Utilizing the PNNC algorithm, the detailed structure and clusters of the derived keyword bibliographic coupling network are revealed, including two large clusters, one with two smaller clusters and the other with five smaller clusters. The growth index and mean publishing year of each keyword and cluster are measured to pinpoint emerging topics. The analysis highlights the emergence of critical data literacy for social justice in higher education amidst the ongoing pandemic and the rise of AI chatbots. The enhanced PageRank main path algorithm, developed in this study, demonstrates its effectiveness in identifying parallel research streams developing across different fields.
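
As a rough illustration of the citation-network step, the sketch below runs the standard PageRank power iteration on a toy citation graph. It is not the enhanced PageRank main path algorithm developed in the study; the `pagerank` function, the damping factor of 0.85, and the four-paper graph are assumptions chosen only to show how rank flows toward heavily cited papers.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Standard PageRank by power iteration.

    links maps each paper to the list of papers it cites.
    Papers that cite nothing spread their rank evenly over all papers.
    """
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Every paper receives the teleportation share first.
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = links.get(v, [])
            if targets:
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling paper: distribute its rank uniformly.
                share = damping * rank[v] / n
                for t in nodes:
                    new_rank[t] += share
        rank = new_rank
    return rank


# Hypothetical citation graph: D cites A and B, C cites A, B cites A.
citations = {"D": ["A", "B"], "C": ["A"], "B": ["A"], "A": []}
ranks = pagerank(citations)
for paper, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(paper, round(score, 4))
```

In this toy graph paper A is cited by every other paper and ends up with the highest score, which is the sense in which PageRank flags high-impact papers.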

An Efficient Reasoning Method for OWL Properties using Relational Databases (관계형 데이터베이스를 이용한 효율적인 OWL 속성 추론 기법)

  • Lin, Jiexi;Lee, Ji-Hyun;Chung, Chin-Wan
    • Journal of KIISE:Databases / v.37 no.2 / pp.92-103 / 2010
  • The Web Ontology Language (OWL) has become the W3C recommendation for publishing and sharing ontologies on the Semantic Web. To derive hidden information from OWL data, a number of OWL reasoners have been proposed. Since OWL reasoners are memory-based, they cannot handle large-sized OWL data. To overcome the scalability problem, RDBMS-based systems have been proposed. These systems store OWL data into a database and perform reasoning by incorporating the use of a database. However, they do not consider complete reasoning on all types of properties defined in OWL and the database schemas they use are ineffective for reasoning. In addition, they do not manage updates to the OWL data which can occur frequently in real applications. In this paper, we compare various database schemas used by RDBMS-based systems and propose an improved schema for efficient reasoning. Also, to support reasoning for all the types of properties defined in OWL, we propose a complete and efficient reasoning algorithm. Furthermore, we suggest efficient approaches to managing the updates that may occur on OWL data. Experimental results show that our schema has improved performance in OWL data storage and reasoning, and that our approaches to managing updates to OWL data are more efficient than the existing approaches.
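
One way to picture database-backed reasoning over an OWL property is to store statements as triples and let the RDBMS compute the closure of a transitive property. The sketch below does this with SQLite and a recursive common table expression. The single `triple` table, the `locatedIn` property, and the sample data are assumptions for illustration only; this is not the schema or the algorithm proposed in the paper.

```python
import sqlite3

# In-memory database with one generic triple table (an assumed schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE triple (s TEXT, p TEXT, o TEXT)")
conn.executemany(
    "INSERT INTO triple VALUES (?, ?, ?)",
    [
        ("Gangnam", "locatedIn", "Seoul"),
        ("Seoul", "locatedIn", "Korea"),
        ("Korea", "locatedIn", "Asia"),
    ],
)


def transitive_closure(connection, prop):
    """Return all (subject, object) pairs implied by treating prop as transitive."""
    rows = connection.execute(
        """
        WITH RECURSIVE closure(s, o) AS (
            SELECT s, o FROM triple WHERE p = ?
            UNION
            SELECT c.s, t.o
            FROM closure AS c
            JOIN triple AS t ON t.s = c.o AND t.p = ?
        )
        SELECT s, o FROM closure
        """,
        (prop, prop),
    ).fetchall()
    return set(rows)


inferred = transitive_closure(conn, "locatedIn")
for subject, obj in sorted(inferred):
    print(subject, "locatedIn", obj)
```

The query returns the three asserted pairs plus three inferred ones, such as Gangnam locatedIn Asia. Recomputing the closure on every query is the simplest design; materializing inferred triples speeds up reads but makes updates harder, which is the trade-off the abstract alludes to.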

An Overview on Historical Development in Population Survey System (우리나라 인구 통계 작성제도의 변천에 관한 고찰)

  • 최봉호
    • Korea journal of population studies / v.20 no.1 / pp.5-25 / 1997
  • This historical study reveals that our ancestors maintained a system that could produce data on the number of people and households as well as on their characteristics. For example, data on the age structure of the population, the number of births, the number of deaths by age and sex, and the number of in- and out-migrants were found in a historical document for the year 755. The main purposes of maintaining the system at that time were taxation and conscription. As the system evolved, the function of identifying the legal status of people was added. The figures from those days show that omission rates for the number of people and households were high. Thus, in an effort to obtain reliable data, the annual population survey system was introduced on 1 September 1896. This date is now celebrated as Statistics Day. Since then, the survey system has been diversified. At present, there are three major data sources that produce statistics on population and households: the Civil Registration System (vital statistics), the Resident Registration System (migration statistics), and the Population Census. However, these three systems have problems in producing accurate data. The registration systems have inherent problems in coverage, in the accuracy of their contents, and in the timeliness of reporting vital events and publishing the results. The population census also has non-sampling errors such as errors in coverage, response, and non-response. Apart from these problems, conflicts arise from having three different data sources: there are overlaps in the laws and difficulties in comparative studies between regions. In the future, these problems should be taken into consideration to improve the quality of statistics on population and households.

The Determinants of Infant and Child Mortality in Korea: 1955-1973

  • Kim, Tai-Hun
    • Korea journal of population studies / v.9 no.2 / pp.93-105 / 1986