• Title/Summary/Keyword: Data Index Information


Power Investigation of the Entropy-Based Test of Fit for Inverse Gaussian Distribution by the Information Discrimination Index

  • Choi, Byungjin
    • Communications for Statistical Applications and Methods / v.19 no.6 / pp.837-847 / 2012
  • The inverse Gaussian distribution is widely used in applications to analyze and model right-skewed data. To assess the appropriateness of the distribution prior to data analysis, Mudholkar and Tian (2002) proposed an entropy-based test of fit. The test is based on the entropy power fraction (EPF) index suggested by Gokhale (1983). Simulation results report that the power of the entropy-based test is superior to that of other goodness-of-fit tests; however, this observation rests on small-scale simulations against the standard exponential, Weibull W(1, 2) and lognormal LN(0.5, 1) distributions. A large-scale simulation against various alternative distributions would be needed to evaluate the power of the entropy-based test, but a theoretical method is a more effective way to investigate the power. In this paper, using the information discrimination (ID) index defined by Ehsan et al. (1995) as a mathematical tool, we scrutinize the power of the entropy-based test. The selected alternative distributions are the gamma, Weibull and lognormal distributions, which are widely used in data analysis as alternatives to the inverse Gaussian distribution. The study results are provided and an illustrative example is analyzed.
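
The abstract does not define the ID index. As a hedged reminder, it is commonly written in terms of the Kullback-Leibler discrimination information between a density f and a reference density g; the symbols f, g and K below are notation assumed here, not taken from the abstract:

```latex
K(f : g) = \int f(x) \, \log \frac{f(x)}{g(x)} \, dx ,
\qquad
\mathrm{ID}(f : g) = 1 - \exp\{ -K(f : g) \} .
```

Under this form the index lies in [0, 1), with 0 meaning the two densities are indistinguishable, so a larger value for an alternative suggests the test should find it easier to detect.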

Development of a GPS/GIS based Real-time Congestion Index for Traffic Information (교통정보제공을 위한 GPS/GIS기반의 실시간 혼잡지표개발)

  • Choi, Kee-Choo; Jang, Jeong-Ah; Jeong, Jae-Young; Shim, Sang-Woo
    • Journal of Korean Society for Geospatial Information Science / v.12 no.4 s.31 / pp.53-60 / 2004
  • A congestion index is needed to quantify the congestion level of various areas. So far, the index has been calculated from data on multiple vehicles over a specified time interval. As a result, it has been costly to build, and its use has focused on policy development and evaluation rather than on traffic information provision. This study develops a congestion index based on a single vehicle that can represent the link congestion level and the link speed at the same time, serving both the traditional uses and information provision. A new term has been added to represent the real-time arterial congestion level, and it has been verified on a real-time basis. The index is based on single-vehicle GPS data and appears to be cost-effective to derive. With the help of the index, traffic information content can be diversified, both in providing real-time traffic information in the ITS area and in determining congestion levels in traditional transportation areas.
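
The abstract does not give the index formula. The sketch below only illustrates the general idea of deriving a link speed and a congestion level from one vehicle's GPS trace. The `GpsFix` type, the map-matched `dist_m` field, and the `1 - speed / free_flow` form are all assumptions made for illustration, not the paper's definition.

```python
from dataclasses import dataclass


@dataclass
class GpsFix:
    t: float       # timestamp in seconds
    dist_m: float  # distance along the link in metres (hypothetical map-matched value)


def link_speed_kmh(fixes):
    """Average speed over the link between the first and last fix of one vehicle."""
    if len(fixes) < 2:
        raise ValueError("need at least two GPS fixes")
    dt = fixes[-1].t - fixes[0].t
    if dt <= 0:
        raise ValueError("timestamps must increase")
    dd = fixes[-1].dist_m - fixes[0].dist_m
    return (dd / dt) * 3.6  # m/s -> km/h


def congestion_index(speed_kmh, free_flow_kmh):
    """Illustrative index: 0 means free flow, 1 means standstill (assumed form)."""
    if free_flow_kmh <= 0:
        raise ValueError("free-flow speed must be positive")
    return min(1.0, max(0.0, 1.0 - speed_kmh / free_flow_kmh))


# Example: 500 m covered in 60 s on a link with an assumed 60 km/h free-flow speed.
trace = [GpsFix(0.0, 0.0), GpsFix(60.0, 500.0)]
speed = link_speed_kmh(trace)
index = congestion_index(speed, 60.0)
```

A single value pair like `(speed, index)` is what lets one probe vehicle serve both purposes mentioned in the abstract: the speed for traveler information and the index for congestion-level assessment.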


A Novel Transmission Scheme with Spatial Modulation for Coded OFDM Systems (채널 부호화된 OFDM 시스템을 위한 공간 변조를 이용한 새로운 전송 기법)

  • Hwang, Soon-Up; Kim, Young-Ki; Jeon, Sung-Ho; Kang, Woo-Seok; Seo, Jong-Soo
    • The Journal of Korean Institute of Communications and Information Sciences / v.34 no.7A / pp.515-522 / 2009
  • In this paper, a novel transmission scheme with spatial modulation is proposed for coded orthogonal frequency division multiplexing (OFDM). Spatial modulation (SM), a multiple-input multiple-output (MIMO) technique, divides the input data into an antenna index and data signals, and transmits the data signals through the specific antenna chosen by the antenna index. To retrieve the data stream at the receiver, SM needs to detect the antenna index, that is, which antenna the data signals were transmitted from. For this reason, the channel matrix must be guaranteed to be orthogonal. In a real environment, however, a MIMO channel has difficulty maintaining orthogonality because of spatial correlation. Moreover, the receiver of conventional SM operates by hard decision, which limits its adoption in practical systems. Therefore, soft-output demappers for the conventional and proposed schemes are derived to detect the antenna index and data stream by soft decision, and a novel transmission scheme combined with spatial modulation is proposed to improve the bit error rate (BER) performance of the conventional scheme.
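
To make the antenna-index idea concrete, here is a minimal sketch of the conventional SM mapping step only. The function name `sm_map`, the QPSK constellation, and the default of four transmit antennas are illustrative assumptions; the paper's proposed scheme and its soft-output demapper are not reproduced here.

```python
import math

# Gray-mapped QPSK constellation (an illustrative choice, not taken from the paper).
QPSK = {
    (0, 0): complex(1, 1) / math.sqrt(2),
    (0, 1): complex(-1, 1) / math.sqrt(2),
    (1, 1): complex(-1, -1) / math.sqrt(2),
    (1, 0): complex(1, -1) / math.sqrt(2),
}


def sm_map(bits, n_tx=4):
    """Map one bit block to (antenna index, per-antenna transmit vector)."""
    if n_tx < 2 or n_tx & (n_tx - 1):
        raise ValueError("n_tx must be a power of two and at least 2")
    n_ant_bits = n_tx.bit_length() - 1  # log2(n_tx) bits select the antenna
    if len(bits) != n_ant_bits + 2:     # the remaining 2 bits select a QPSK symbol
        raise ValueError("wrong block length")
    antenna = int("".join(str(b) for b in bits[:n_ant_bits]), 2)
    symbol = QPSK[tuple(bits[n_ant_bits:])]
    tx = [0j] * n_tx                    # only one antenna is active per symbol period
    tx[antenna] = symbol
    return antenna, tx


# Bits 1,0 select antenna 2; bits 1,1 select the QPSK symbol sent from it.
antenna, tx = sm_map([1, 0, 1, 1])
```

Because only one entry of `tx` is non-zero, the receiver has to infer the active antenna from the channel response, which is why the abstract stresses channel orthogonality.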

A Statistical Methodology Study for Measuring Privacy Disclosure Risk in Open Data Environment (오픈 데이터 환경에서 개인정보 노출 위험 측정을 위한 통계적 방법론 연구)

  • Sieun Kim; Ieck-chae Euom
    • Journal of the Korea Institute of Information Security & Cryptology / v.34 no.2 / pp.323-333 / 2024
  • Recently, synthetic data has been in the spotlight as a technology that can protect personal information while maintaining the patterns and characteristics of actual data. Accordingly, technical and institutional research on synthetic data is being actively conducted, but the lack of clear standards and guidelines makes synthetic data difficult to put to active use. This is a preliminary study for quantifying the disclosure risk of synthetic data; it derives a privacy disclosure risk index through statistical methodology and suggests specific application measures for complying with the General Data Protection Regulation (GDPR). The privacy disclosure risk index of this study is expected to make it possible to control the balance between disclosure risk and data utility in an open data environment.

Lazy Bulk Insertion Method of Moving Objects Using Index Structure Estimation (색인 구조 예측을 통한 이동체의 지연 다량 삽입 기법)

  • Kim, Jeong-Hyun; Park, Sun-Young; Jang, Hyong-Il; Kim, Ho-Suk; Bae, Hae-Young
    • Journal of Korea Spatial Information System Society / v.7 no.3 s.15 / pp.55-65 / 2005
  • This paper presents a bulk insertion technique for efficiently inserting data items. Traditional moving object databases have focused on efficient query processing, which happens mainly after the index is built. Traditional index structures have rarely considered the disk I/O overhead of rebuilding the index as data items are inserted. To solve this problem, this paper describes a new bulk insertion technique that efficiently reflects the current positions of moving objects and greatly reduces update cost. The technique uses buffering for bulk insertion in spatial index structures such as the R-tree. To analyze node splits and merges, a secondary index is added for managing information on the leaf nodes of the primary index. Operations are also classified to reduce unnecessary insertions and deletions. The technique decides the processing order of moving objects so as to minimize the split and merge cost that results from update operations. Experimental results show that this technique reduces insertion cost compared with existing insertion techniques.
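
The buffering idea can be sketched in a few lines. This is not the paper's algorithm: the `UpdateBuffer` class, its `apply_batch` callback, and the sort-by-position ordering are hypothetical stand-ins for the secondary index and the split/merge cost estimation described in the abstract.

```python
class UpdateBuffer:
    """Collects position reports and applies them to the index in batches."""

    def __init__(self, capacity, apply_batch):
        self.capacity = capacity
        self.apply_batch = apply_batch  # callback that performs the real index update
        self.pending = {}               # object id -> latest reported (x, y)

    def report(self, obj_id, x, y):
        # A newer report replaces an older one, so superseded updates
        # never reach the index at all.
        self.pending[obj_id] = (x, y)
        if len(self.pending) >= self.capacity:
            self.flush()

    def flush(self):
        if not self.pending:
            return
        # Sorting by position is a simple stand-in for choosing a processing
        # order that keeps updates to the same leaf node together.
        batch = sorted(self.pending.items(), key=lambda kv: kv[1])
        self.pending = {}
        self.apply_batch(batch)


applied = []
buf = UpdateBuffer(capacity=3, apply_batch=applied.append)
buf.report("a", 5, 5)
buf.report("a", 1, 1)  # overwrites the earlier report for "a"
buf.report("b", 0, 2)
buf.report("c", 3, 0)  # third distinct object triggers a flush
```

Deferring and collapsing updates this way trades a short delay in index freshness for fewer node splits and merges.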


Concentric Core Fiber Design for Optical Fiber Communication

  • Nadeem, Iram; Choi, Dong-You
    • Journal of information and communication convergence engineering / v.14 no.3 / pp.163-170 / 2016
  • Because of rapid technological advancements, increased data rate support has become the key criterion for future communication medium selection. Multimode optical fibers and multicore optical fibers are well matched to high data rate throughput requirements because of their tendency to support multiple modes through one core at a time, which results in higher data rates. Using the numerical mode solver OptiFiber, we have designed a concentric core fiber by investigating certain design parameters, namely core diameter (µm), wavelength (nm), and refractive index profile, and as a result, the number of channels, material losses, bending losses, polarization mode dispersion, and the effective nonlinear refractive index have been determined. Space division multiplexing is a promising future technology that uses few-mode fibers in parallel to form a multicore fiber. The experimental tests are conducted using the standard second window wavelength of 1,550 nm and simulated results are presented.

The Effect of Discomfort Index on Outfielder's Game Record Data (불쾌지수가 외야수의 경기 기록 데이터에 미치는 영향)

  • Kim, Semin; Shin, Chwa-Cheol
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.8 / pp.978-984 / 2020
  • In this study, the correlation between sports records and weather data was analyzed using big data analysis methods. To this end, data were collected through an API and web crawling and then processed, analyzed statistically, and visualized. The subjects of this study were outfielders in the 2019 KBO League who reached the regulation number of plate appearances. The meteorological data were analyzed by dividing the discomfort index into values of 70 and above and values below 70. The results show that, for the various hitting indicators, which are records in which the pitcher is involved, the higher the discomfort index, the better the outfielders' records; records such as walks, number of pitches, pitching success rate, pitches per plate appearance, and pitches per game also indicate that the outfielders made things difficult for the pitcher. It is expected that this study will help the development of the sports data industry and the performance of baseball players, baseball teams, and coaching staff.
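
The abstract does not say how the discomfort index was computed. The sketch below assumes one widely used temperature-humidity form and applies the split at 70 mentioned in the abstract. The function names, and the choice to place a value of exactly 70 in the "high" group, are assumptions.

```python
def discomfort_index(temp_c, rel_humidity_pct):
    """Temperature-humidity discomfort index (assumed formula).

    temp_c: air temperature in degrees Celsius
    rel_humidity_pct: relative humidity in percent (0-100)
    """
    return 0.81 * temp_c + 0.01 * rel_humidity_pct * (0.99 * temp_c - 14.3) + 46.3


def di_group(di, threshold=70.0):
    """Split game days into the two groups compared in the study."""
    return "high" if di >= threshold else "low"


hot_humid = discomfort_index(30.0, 70.0)  # a hot, humid game day
mild = discomfort_index(20.0, 50.0)       # a mild game day
groups = (di_group(hot_humid), di_group(mild))
```

With each game labeled this way, a batting record can be aggregated per group and the two groups compared.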

An Analysis of the Korean manufacturing export firms' Competitiveness in EU market by Export Competitiveness Index (수출 경쟁력 지수에 의한 EU시장에서의 한국 제조 기업의 경쟁력 분석)

  • Choi, Chang-Yeoul; Jung, Han-Kyeoung
    • International Commerce and Information Review / v.9 no.2 / pp.161-182 / 2007
  • The objective of this paper is to examine the competitiveness of Korean export firms in the EU market. The market share index, the RCA index, the trade specialization index, and the market competitiveness index were used as analytical tools. On the market share index, Korea had a large share of the EU market for SITC section 7 (machinery and transport equipment). On the RCA index, Korea appeared to have high export competitiveness in the following divisions: electrical machinery, apparatus and appliances, n.e.s. (not elsewhere specified), and electrical parts thereof (77); travel goods, handbags and similar containers (83); textile yarn, fabrics, made-up articles, n.e.s., and related products (65); and iron and steel (67). On the trade specialization index, however, Korea showed a generally declining tendency. On the market competitiveness index, Korea appeared to have a competitive advantage in the following divisions: iron and steel (67); machinery specialized for particular industries (72); office machines and automatic data-processing machines (75); electrical machinery, apparatus and appliances, n.e.s., and electrical parts thereof (77); road vehicles (78); and other transport equipment (79). In 29 divisions, however, the index indicates that Korean firms' competitiveness was low. Finally, the authors discuss the implications of these findings and offer directions for future study.
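
The abstract names the indices without giving formulas. The sketch below uses the standard textbook forms for three of them (a simple import-share ratio, Balassa's revealed comparative advantage, and the net-export ratio), which may differ in detail from the paper's definitions. The market competitiveness index is left out because its definition cannot be recovered from the abstract. All figures are made-up illustrative values.

```python
def market_share(exports_to_market, market_total_imports):
    """Share of a market's imports of a product supplied by one country."""
    return exports_to_market / market_total_imports


def rca(country_product_exports, country_total_exports,
        world_product_exports, world_total_exports):
    """Balassa RCA: values above 1 suggest a revealed comparative advantage."""
    country_share = country_product_exports / country_total_exports
    world_share = world_product_exports / world_total_exports
    return country_share / world_share


def trade_specialization(exports, imports):
    """Net-export ratio in [-1, 1]: +1 is export-only, -1 is import-only."""
    return (exports - imports) / (exports + imports)


# Illustrative numbers only, not data from the paper.
share = market_share(5.0, 50.0)
advantage = rca(20.0, 100.0, 10.0, 100.0)
tsi = trade_specialization(30.0, 10.0)
```

Computing the same three functions per SITC division and per year gives the kind of division-level comparison the abstract reports.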


A Space-Efficient Inverted Index Technique using Data Rearrangement for String Similarity Searches (유사도 검색을 위한 데이터 재배열을 이용한 공간 효율적인 역 색인 기법)

  • Im, Manu; Kim, Jongik
    • Journal of KIISE / v.42 no.10 / pp.1247-1253 / 2015
  • An inverted index structure is widely used for efficient string similarity search. One of the main requirements of similarity search is a fast response time; to this end, most techniques use an in-memory index structure. Since an inverted index structure is usually very large, however, it is not practical to assume that the index will fit into main memory. To alleviate this problem, we propose a novel technique that reduces the size of an inverted index. The proposed technique rearranges the data strings so that strings containing the same q-grams are placed close to one another, and then encodes those multiple strings as a range. Through an experimental study using real data sets, we show that our technique significantly reduces the size of an inverted index without sacrificing query processing time.
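
A small sketch can show why rearrangement helps. Here plain lexicographic sorting stands in for the paper's rearrangement method, which the abstract does not describe, and the function names are hypothetical. Once similar strings receive consecutive IDs, each posting list collapses into a few inclusive ranges.

```python
from collections import defaultdict


def qgrams(s, q=2):
    """Set of all substrings of length q in s."""
    return {s[i:i + q] for i in range(len(s) - q + 1)}


def to_ranges(ids):
    """Collapse an ascending list of integer IDs into inclusive (start, end) ranges."""
    ranges = []
    for i in ids:
        if ranges and i == ranges[-1][1] + 1:
            ranges[-1] = (ranges[-1][0], i)  # extend the current run
        else:
            ranges.append((i, i))            # start a new run
    return ranges


def build_index(strings, q=2):
    """Return the reordered strings and a q-gram -> list-of-ranges index."""
    ordered = sorted(strings)  # stand-in rearrangement (assumption)
    postings = defaultdict(list)
    for sid, s in enumerate(ordered):
        for g in qgrams(s, q):
            postings[g].append(sid)  # sid increases, so each list stays sorted
    return ordered, {g: to_ranges(ids) for g, ids in postings.items()}


ordered, index = build_index(["data", "date", "index", "dated"])
```

In this toy input the q-gram "da" occurs in three strings, yet its posting list is stored as the single range `(0, 2)` instead of three separate IDs.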

KDBcs-Tree : An Efficient Cache Conscious KDB-Tree for Multidimensional Data (KDBcs-트리 : 캐시를 고려한 효율적인 KDB-트리)

  • Yeo, Myung-Ho; Min, Young-Soo; Yoo, Jae-Soo
    • Journal of KIISE:Databases / v.34 no.4 / pp.328-342 / 2007
  • We propose a new cache-conscious index structure for efficiently processing frequently updated data. The proposed index structure is based on the KDB-Tree, one of the representative index structures based on space partitioning. We propose a data compression technique and a pointer elimination technique to increase the utilization of a cache line. To show the superiority of the proposed index structure, we compare it with variants of the CR-tree (e.g. the FF CR-tree and the SE CR-tree) in a variety of environments. Our experimental results show that the proposed index structure achieves about 85%, 97%, and 86% performance improvements over the existing index structures in terms of insertion, update, and cache utilization, respectively.