• Title/Summary/Keyword: multi database

Search Result 688, Processing Time 0.024 seconds

Trajectory Indexing for Efficient Processing of Range Queries (영역 질의의 효과적인 처리를 위한 궤적 인덱싱)

  • Cha, Chang-Il;Kim, Sang-Wook;Won, Jung-Im
    • The KIPS Transactions:PartD
    • /
    • v.16D no.4
    • /
    • pp.487-496
    • /
    • 2009
  • This paper addresses an indexing scheme capable of efficiently processing range queries in a large-scale trajectory database. After discussing the drawbacks of previous indexing schemes, we propose a new scheme that divides the temporal dimension into multiple time intervals and then, by this interval, builds an index for the line segments. Additionally, a supplementary index is built for the line segments within each time interval. This scheme can make a dramatic improvement in the performance of insert and search operations using a main memory index, particularly for the time interval consisting of the segments taken by those objects which are currently moving or have just completed their movements, as contrast to the previous schemes that store the index totally on the disk. Each time interval index is built as follows: First, the extent of the spatial dimension is divided onto multiple spatial cells to which the line segments are assigned evenly. We use a 2D-tree to maintain information on those cells. Then, for each cell, an additional 3D $R^*$-tree is created on the spatio-temporal space (x, y, t). Such a multi-level indexing strategy can cure the shortcomings of the legacy schemes. Performance results obtained from intensive experiments show that our scheme enhances the performance of retrieve operations by 3$\sim$10 times, with much less storage space.

Oceanic Skin-Bulk Temperature Difference through the Comparison of Satellite-Observed Sea Surface Temperature and In-Situ Measurements (인공위성관측 해수면온도와 현장관측 수온의 비교를 통해 본 해양 피층-표층 수온의 차이)

  • Park, Kyung-Ae;Sakaida, Futoki;Kawamura, Hiroshi
    • Korean Journal of Remote Sensing
    • /
    • v.24 no.4
    • /
    • pp.273-287
    • /
    • 2008
  • Characteristics of skin-bulk sea surface temperature (SST) differences in the Northeast Asia seas were analyzed by utilizing 845 collocated matchup data between NOAA/AVHRR data and oceanic in-situ temperature measurements for selected months from 1994 to 2003. In order to understand diurnal variation of SST within a few meters of the upper ocean, the matchup database were classified into four categories according to day-night and drifter-shipboard measurements. Temperature measurements from daytime drifters showed a good agreement with satellite MCSST (Multi-Channel Sea Surface Temperature) with an RMS error of about $0.56^{\circ}C$. Poor accuracy of SST with an rrns error of $1.12^{\circ}C$ was found in the case of daytime shipboard CTD (Conductivity, Temperature, Depth) measurements. SST differences between MCSST and in-situ measurements are caused by various errors coming from atmospheric moist effect, coastal effect, and others. Most of the remarkable errors were resulted from the diurnal variation of vertical temperature structure within a few meters as well as in-situ oceanic temperatures at different depth, about 20 cm for a satellite-tracked drifting buoy and a few meters for shipboard CTD or moored buoy. This study suggests that satellite-derived SST shows significant errors of about ${\pm}3^{\circ}C$ in some cases and therefore it should be carefully used for one's purpose on the base of in-depth understanding of skin-bulk SST difference and vertical temperature structure in regional sea.

The Impact of K-Beauty Search Volumes on Export and Tourism: Based on the Google Search and YouTube Page View (K-뷰티(K-Beauty) 검색량이 수출과 관광에 미치는 영향: Google과 YouTube 검색 데이터 분석을 중심으로)

  • Lee, Sun-Jeong;Lee, Soobum
    • Review of Culture and Economy
    • /
    • v.20 no.2
    • /
    • pp.119-147
    • /
    • 2017
  • This study analyzes Big Data to understand the economic influence of K-Beauty which is expected as a fast-growing industry. Because the content of K-beauty is mainly transmitted over the Internet, Big Data about K-Beauty in the database of online services can show interest and engagement in K-Beauty. The export volume of the beauty industry and the number of foreign tourist in Korea were used as dependent variables. The volume of Google search and the volume of YouTube page view were independent variables. According to the result of a multi-regression analysis, the volume of Google search of K-Beauty had a positive influence on both dependent variables, even after controlling for GDP (Gross Domestic Product) and distances between nations. When it comes to the volume of YouTube page view of K-Beauty, it had a positive relationship with the export volume of the beauty industry, whereas there was no significant relationship between the volume of YouTube page view and the number of foreign tourists. The result indicates that the content of K-Beauty has a significant impact on the beauty industry. Moreover, this empirical study shows that web search and YouTube search have a positive relationship with the economical aspect. These results can be used to discuss public relations strategy to promote K-Beauty industry.

Multi-modal Image Processing for Improving Recognition Accuracy of Text Data in Images (이미지 내의 텍스트 데이터 인식 정확도 향상을 위한 멀티 모달 이미지 처리 프로세스)

  • Park, Jungeun;Joo, Gyeongdon;Kim, Chulyun
    • Database Research
    • /
    • v.34 no.3
    • /
    • pp.148-158
    • /
    • 2018
  • The optical character recognition (OCR) is a technique to extract and recognize texts from images. It is an important preprocessing step in data analysis since most actual text information is embedded in images. Many OCR engines have high recognition accuracy for images where texts are clearly separable from background, such as white background and black lettering. However, they have low recognition accuracy for images where texts are not easily separable from complex background. To improve this low accuracy problem with complex images, it is necessary to transform the input image to make texts more noticeable. In this paper, we propose a method to segment an input image into text lines to enable OCR engines to recognize each line more efficiently, and to determine the final output by comparing the recognition rates of CLAHE module and Two-step module which distinguish texts from background regions based on image processing techniques. Through thorough experiments comparing with well-known OCR engines, Tesseract and Abbyy, we show that our proposed method have the best recognition accuracy with complex background images.

Development of prediction model identifying high-risk older persons in need of long-term care (장기요양 필요 발생의 고위험 대상자 발굴을 위한 예측모형 개발)

  • Song, Mi Kyung;Park, Yeongwoo;Han, Eun-Jeong
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.4
    • /
    • pp.457-468
    • /
    • 2022
  • In aged society, it is important to prevent older people from being disability needing long-term care. The purpose of this study is to develop a prediction model to discover high-risk groups who are likely to be beneficiaries of Long-Term Care Insurance. This study is a retrospective study using database of National Health Insurance Service (NHIS) collected in the past of the study subjects. The study subjects are 7,724,101, the population over 65 years of age registered for medical insurance. To develop the prediction model, we used logistic regression, decision tree, random forest, and multi-layer perceptron neural network. Finally, random forest was selected as the prediction model based on the performances of models obtained through internal and external validation. Random forest could predict about 90% of the older people in need of long-term care using DB without any information from the assessment of eligibility for long-term care. The findings might be useful in evidencebased health management for prevention services and can contribute to preemptively discovering those who need preventive services in older people.

Commercial fishery assessment of Malaysian water offshore structure

  • Mohd, Mohd Hairil;Thiyahuddin, Mohd Izzat Mohd;Rahman, Mohd Asamudin A;Hong, Tan Chun;Siang, Hii Yii;Othman, Nor Adlina;Rahman, Azam Abdul;Rahman, Ahmad Rizal Abdul;Fitriadhy, Ahmad
    • Fisheries and Aquatic Sciences
    • /
    • v.25 no.9
    • /
    • pp.473-488
    • /
    • 2022
  • To have a better understanding of the impact of the PETRONAS oil and gas platform on commercial fisheries activities, Universiti Malaysia Terengganu (UMT) examined two approaches which are data collection from satellite and data collection from fishermen and anglers. By profiling the anglers who utilize reefed oil and gas structures for fishing, it can determine if the design and location of the reef platforms will benefit or negatively impacts those anglers and fisherman. Furthermore, this assessment will be contributing to the knowledge regarding the value of offshore oil and gas platforms as fisheries resources. Collectively, the apparent fishing activity data included, combined with the findings in the reefing viability index will help to inform PETRONAS's future decommissioning decisions and may help determine if the design and proposed locations for future rigs-to-reefs candidates would benefit commercial fishing groups, further qualifying them as appropriate artificial reef candidates. The method applied in this study is approaching by using a data satellite known as Google's Global Fishing Watch technology, which is one of the applications to measure commercial fishing efforts around the globe. The apparent commercial fishing effort around the selected twelve PETRONAS platforms was analyzed from January 2012 to December 2018. Using the data collection from fishermen which is the total estimation of commercial fish value cost (in Malaysia ringgit, MYR [RM]) in Peninsular Malaysia Asset, Sabah Asset, and Sarawak Operation region. The data were extracted every month from 2016 to 2018 from the National Oceanic and Atmospheric Administration database. Most of the selected platforms that show a high frequency of vessels around the year are platform KP-A, platform BG-A and platform PL-B. The estimated values of commercial fishes varied between platforms, with ranged from RM 10,209.92 to RM 89,023.78. Thus, platforms with high commercial fish value are selected for reefing in-situ and will serve multi-purposes and benefit the locals as well as the country. The current study has successfully assessed the potential reefing area of the Malaysian offshore environment with greater representativeness and this paper focused on its potential as a new fishing ground.

Position of Hungarian Merino among other Merinos, within-breed genetic similarity network and markers associated with daily weight gain

  • Attila, Zsolnai;Istvan, Egerszegi;Laszlo, Rozsa;David, Mezoszentgyorgyi;Istvan, Anton
    • Animal Bioscience
    • /
    • v.36 no.1
    • /
    • pp.10-18
    • /
    • 2023
  • Objective: In this study, we aimed to position the Hungarian Merino among other Merinoderived sheep breeds, explore the characteristics of our sampled animals' genetic similarity network within the breed, and highlight single nucleotide polymorphisms (SNPs) associated with daily weight-gain. Methods: Hungarian Merino (n = 138) was genotyped on Ovine SNP50 Bead Chip (Illumina, San Diego, CA, USA) and positioned among 30 Merino and Merino-derived breeds (n = 555). Population characteristics were obtained via PLINK, SVS, Admixture, and Treemix software, within-breed network was analysed with python networkx 2.3 library. Daily weight gain of Hungarian Merino was standardised to 60 days and was collected from the database of the Association of Hungarian Sheep and Goat Breeders. For the identification of loci associated with daily weight gain, a multi-locus mixed-model was used. Results: Supporting the breed's written history, the closest breeds to Hungarian Merino were Estremadura and Rambouillet (pairwise FST values are 0.035 and 0.036, respectively). Among Hungarian Merino, a highly centralised connectedness has been revealed by network analysis of pairwise values of identity-by-state, where the animal in the central node had a betweenness centrality value equal to 0.936. Probing of daily weight gain against the SNP data of Hungarian Merinos revealed five associated loci. Two of them, OAR8_17854216.1 and s42441.1 on chromosome 8 and 9 (-log10P>22, false discovery rate<5.5e-20) and one locus on chromosome 20, s28948.1 (-log10P = 13.46, false discovery rate = 4.1e-11), were close to the markers reported in other breeds concerning daily weight gain, six-month weight, and post-weaning gain. Conclusion: The position of Hungarian Merino among other Merino breeds has been determined. We have described the similarity network of the individuals to be applied in breeding practices and highlighted several markers useful for elevating the daily weight gain of Hungarian Merino.

Development of a Model for Predicting Modulus on Asphalt Pavements Using FWD Deflection Basins (FWD 처짐곡선을 이용한 아스팔트 포장구조체의 탄성계수 추정 모형 개발)

  • Park, Seong Wan;Hwang, Jung Joon;Hwang, Kyu Young;Park, Hee Mun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.5D
    • /
    • pp.797-804
    • /
    • 2006
  • A development of regression model for asphalt concrete pavements using Falling Weight Deflectometer deflections is presented in this paper. A backcalculation program based on layered elastic theory was used to generate the synthetic modulus database, which was used to generate 95% confidence intervals of modulus in each layer. Using deflection basins of FWD data used in developing this procedure were collected from Pavement Management System in flexible pavements. Assumptions of back-calculation are that one is 3 layered flexible pavement structure and another is depth to bedrock is finite. It is found that difference of between 95% confidence intervals and modulus ranges of other papers does not exist. So, the data of 95% confidence intervals in each layer was used to develop multiple regression models. Multiple regression equations of each layer were established by SPSS, package of Statics analysis. These models were proved by regression diagnostics, which include case analysis, multi-collinearity analysis, influence diagnostics and analysis of variance. And these models have higher degree of coefficient of determination than 0.75. So this models were applied to predict modulus of domestic asphalt concrete pavement at FWD field test.

GIS Based Distributed Flood Damage Assessment (GIS기반의 분포형 홍수피해산정 기법)

  • Yi, Choong Sung;Choi, Seung An;Shim, Myung Pil;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.3B
    • /
    • pp.301-310
    • /
    • 2006
  • Typically, we needs enormous national budget for the flood control project and so the project usually has big influence on the national economy. Therefore, the reliable estimation of flood damage is the key issue for the economic analysis of the flood control project. This study aims to provide a GIS based technique for distributed flood damage estimation. We consider two aspects of engineering and economic sides, which are the inundation analysis and MD-FDA (Multi-Dimensional Flood Damage Analysis), for the flood damage assessment. We propose the analysis framework and data processing using GIS for assessing flood damages. The proposed methodology is applied to the flood control channel project for flood disaster prevention in Mokgamcheon/Dorimcheon streams and this study presents the detailed GIS database and the assessment results of flood damages. This study may have the worth in improving practical usability of MD-FDA and also providing research direction for combining economic side with the engineering aspect. Also this distributed technique will help decision-making in evaluating the feasibility of flood damage reduction programs for structural and nonstructural measures.

An Empirical Study on Korean Stock Market using Firm Characteristic Model (한국주식시장에서 기업특성모형 적용에 관한 실증연구)

  • Kim, Soo-Kyung;Park, Jong-Hae;Byun, Young-Tae;Kim, Tae-Hyuk
    • Management & Information Systems Review
    • /
    • v.29 no.2
    • /
    • pp.1-25
    • /
    • 2010
  • This study attempted to empirically test the determinants of stock returns in Korean stock market applying multi-factor model proposed by Haugen and Baker(1996). Regression models were developed using 16 variables related to liquidity, risk, historical price, price level, and profitability as independent variables and 690 stock monthly returns as dependent variable. For the statistical analysis, the data were collected from the Kis Value database and the tests of forecasting power in this study minimized various possible bias discussed in the literature as possible. The statistical results indicated that: 1) Liquidity, one-month excess return, three-month excess return, PER, ROE, and volatility of total return affect stock returns simultaneously. 2) Liquidity, one-month excess return, three-month excess return, six-month excess return, PSR, PBR, ROE, and EPS have an antecedent influence on stock returns. Meanwhile, realized returns of decile portfolios increase in proportion to predicted returns. This results supported previous study by Haugen and Baker(1996) and indicated that firm-characteristic model can better predict stock returns than CAPM. 3) The firm-characteristic model has better predictive power than Fama-French three-factor model, which indicates that a portfolio constructed based on this model can achieve excess return. This study found that expected return factor models are accurate, which is consistent with other countries' results. There exists a surprising degree of commonality in the factors that are most important in determining the expected returns among different stocks.

  • PDF