• Title/Summary/Keyword: Data 누락 (missing data)

Search Result 263, Processing Time 0.025 seconds

Sentence Similarity Measurement Method Using a Set-based POI Data Search (집합 기반 POI 검색을 이용한 문장 유사도 측정 기법)

  • Ko, EunByul;Lee, JongWoo
    • KIISE Transactions on Computing Practices / v.20 no.12 / pp.711-716 / 2014
  • With growing interest in plagiarism detection and intelligent file content search, demand for measuring the similarity between two sentences is increasing. Sentence similarity measurement has been studied from many directions, including n-gram, edit distance, and LSA, and each of these methods has its own advantages and disadvantages. In this paper, we propose a new sentence similarity measurement method that takes a different approach. The proposed method uses the set-based POI data search, which improves search performance over the existing hard matching method when the data contains inversion, omission, insertion, or revision of characters. Using this method, we can measure the similarity between two sentences more accurately and more quickly. We modified the data loading and text search algorithms of the set-based POI data search, and added a word operation algorithm and a similarity measure between two sentences expressed as a percentage. The experimental results show that our sentence similarity measurement method performs better than n-gram and the set-based POI data search.
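
The abstract above names n-gram matching as one of its baselines and reports similarity as a percentage. As a rough illustration of that baseline only (not the paper's set-based POI search), a character n-gram similarity could be sketched as follows. The function names, the default bigram size, and the choice of Jaccard overlap are all assumptions made for this example.

```python
def char_ngrams(text, n=2):
    """Return the set of character n-grams in text, ignoring whitespace."""
    compact = "".join(text.split())
    if len(compact) < n:
        # Too short for a full n-gram: treat the whole string as one gram.
        return {compact} if compact else set()
    return {compact[i:i + n] for i in range(len(compact) - n + 1)}


def ngram_similarity(a, b, n=2):
    """Jaccard overlap of the two n-gram sets, expressed as a percentage."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a and not grams_b:
        return 100.0  # two empty sentences are treated as identical
    return 100.0 * len(grams_a & grams_b) / len(grams_a | grams_b)
```

Because the comparison works on sets, a single omitted or inserted character changes only the few n-grams that overlap it, which is why set-style matching tolerates small edits better than exact string matching.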

A Drama Bound Data Service Providing the Updated Information on Characters based on Scenes (장면 별 등장인물의 업데이트 정보를 제공하는 드라마 연동 데이터 서비스 설계)

  • Ko, Kwangil
    • Journal of Digital Contents Society / v.18 no.2 / pp.311-317 / 2017
  • A drama, generally composed of dozens of episodes, is one of the most popular broadcast program genres. The status and interrelationships of its characters change over time, and important acts affecting the story occur sporadically. Therefore, when viewers miss some episodes in the middle of a drama, it is difficult to follow the drama without updated information on the characters and notable actions. This paper proposes a data service that provides updated information on the characters and important acts based on the drama scene. The data service provides information on the characters shown in the current scene and on the people mentioned in the scene's dialogue. This paper includes a general introduction to data services, a formal definition of drama scene identification and character information, and a DVB-SI-based method for transmitting the information to the data service.

Construction of Guide and Management System for University Facility Using Multi-Imagery and Geospatial Information System (다중영상과 GIS를 이용한 대학시설물 안내 및 관리시스템 구축)

  • 손덕재;이혜진;이승환
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.20 no.1 / pp.47-57 / 2002
  • The intention of this study is to construct a spatial database and to extract attribute data that can fill in data omitted from topographical maps or construction completion maps, using image data of various forms such as satellite images, aerial photographs, and terrestrial photographs. This study used only single-frame images as raw image data, assuming cases where a rigorous photogrammetric method is not available or rapid acquisition of information is needed. The spatial and attribute data extracted from the images are used to modify and update the database and to provide visual effects useful for guiding and managing the facilities. This study aimed to develop a technique applicable where comparatively high accuracy is not required or rapid modification is necessary, and to verify the possibility of editing and updating the digital map using photographs or video images kept as old data. Much previous research on management systems for university facilities has focused on the design and construction of the database itself. In contrast, this study aimed to construct a guide and management system based on compiling a digital map of the present state using multi-imagery, with broader applicability also intended.

A study on the imputation solution for missing speed data on UTIS by using adaptive k-NN algorithm (적응형 k-NN 기법을 이용한 UTIS 속도정보 결측값 보정처리에 관한 연구)

  • Kim, Eun-Jeong;Bae, Gwang-Soo;Ahn, Gye-Hyeong;Ki, Yong-Kul;Ahn, Yong-Ju
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.13 no.3 / pp.66-77 / 2014
  • UTIS (Urban Traffic Information System) directly collects link travel times in urban areas using probe vehicles. It can therefore estimate link travel speed more accurately than other traffic detection systems. However, UTIS data include missing values caused by a lack of probe vehicles and RSEs on the road network, system failures, and other factors. In this study, we propose a new model, based on the k-NN algorithm, for imputing missing data to provide more accurate travel time information. The new imputation model is an adaptive k-NN that flexibly adjusts the number of nearest neighbors (NN) depending on the distribution of candidate objects. The evaluation results indicate that the new model successfully imputed missing speed data and significantly reduced the imputation error compared with other models (ARIMA, etc.). We plan to use the new imputation model to improve the traffic information service by applying it at the UTIS Central Traffic Information Center.
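
The abstract above describes a k-NN imputer whose neighbor count adapts to the distribution of candidates, but it does not give the adaptation rule. The sketch below illustrates the general idea under an assumed rule: start from `k_min` neighbors and keep adding candidates, up to `k_max`, while they lie within `spread` times the distance of the `k_min`-th neighbor. The function name, parameters, feature vectors, and the rule itself are hypothetical and are not taken from the paper.

```python
import math


def adaptive_knn_impute(query, history, k_min=2, k_max=5, spread=1.5):
    """Estimate a missing speed as the mean speed of nearby historical records.

    query   -- feature vector of the record whose speed is missing
    history -- list of (feature_vector, speed) pairs with known speeds
    The neighbor count grows from k_min toward k_max while the next candidate
    stays within `spread` times the distance of the k_min-th neighbor
    (an assumed adaptation rule, used here only for illustration).
    """
    if not history:
        raise ValueError("history must not be empty")
    # Rank candidates by Euclidean distance to the query.
    ranked = sorted((math.dist(query, feats), speed) for feats, speed in history)
    k = min(k_min, len(ranked))
    cutoff = ranked[k - 1][0] * spread
    while k < min(k_max, len(ranked)) and ranked[k][0] <= cutoff:
        k += 1
    return sum(speed for _, speed in ranked[:k]) / k
```

With tightly clustered candidates the loop admits more neighbors and averages over them; with one distant outlier it stops early, so the outlier does not pull the estimate.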

A Study on the Problems and Improvement Plan of Cadastral Map Data Maintenance Project (지적·임야도 자료정비 사업의 문제점 및 개선방안 연구)

  • Baek, Kyu-Yeong;Choi, Yun-Soo
    • Journal of Cadastre & Land InformatiX / v.50 no.1 / pp.63-73 / 2020
  • The Ministry of Land, Infrastructure and Transport carried out a pilot project for data maintenance in 2011 to ensure the accuracy of cadastral records nationwide, and is now trying to reduce errors such as omissions in the cadastral book and inconsistencies of land category between the cadastral account book and the cadastral map, as well as boundary discrepancies between map boundaries, scales, and administrative areas. In this study, we examined the current status and problems of the cadastral map improvement work that has been promoted by the government and the cadastral office. To revitalize the cadastral map data maintenance project, it is necessary to re-establish the step-by-step plan for overcoming the limitations of current data maintenance and the master plan for the promotion system, to develop manuals such as "Coverage and Maintenance Guidelines for Cadastral Maps" that maintain consistency and secure accuracy in cadastral map maintenance, and to secure a national budget for government-led error correction in cadastral maps.

Determining the Orientation of Accelerograph Stations in South Korea using Ambient Noise Data (배경잡음 자료를 이용한 국내 가속도 관측망의 방위각 보정값 측정)

  • Lee, Sang-Jun
    • Journal of the Korean earth science society / v.42 no.2 / pp.195-200 / 2021
  • Orientation corrections for a total of 268 accelerograph stations of the Korea Meteorological Administration (KMA) were estimated using ambient noise cross-correlation. As this method uses ambient noise data instead of teleseismic waveforms from earthquakes under certain conditions, reliable orientation corrections can be obtained using only two months of continuous seismic data from dense seismic networks in the Korean peninsula. Three-component continuous data recorded at the 268 accelerograph stations from January to February 2020 were used to estimate the orientation corrections. The results are comparable to previous results obtained from teleseismic waveforms; the overall standard deviations of the orientation corrections are less than 5°. Therefore, orientation corrections for the accelerograph station network can be tracked periodically with the ambient noise method, and the results can be used in various studies that use the horizontal components of acceleration data.

Summarization of Korean Dialogues through Dialogue Restructuring (대화문 재구조화를 통한 한국어 대화문 요약)

  • Eun Hee Kim;Myung Jin Lim;Ju Hyun Shin
    • Smart Media Journal / v.12 no.11 / pp.77-85 / 2023
  • After COVID-19, communication through online platforms has increased, leading to an accumulation of massive amounts of conversational text data. With the growing importance of summarizing this text data to extract meaningful information, there has been active research on deep learning-based abstractive summarization. However, conversational data, compared to structured texts like news articles, often contains missing or transformed information, and its unique characteristics must be considered from multiple perspectives. In particular, vocabulary omissions and unrelated expressions in the conversation can hinder effective summarization. Therefore, in this study, we restructured dialogues in light of the characteristics of Korean conversational data, fine-tuned a pre-trained text summarization model based on KoBART, and improved conversation summarization performance through a refining operation that removes redundant elements from the summary. We combined two restructuring methods: restructuring the sentences based on the order of utterances, and extracting a central speaker and restructuring the conversation around that speaker. As a result, the ROUGE-1 score improved by about 4 points. This study demonstrates the significance of our conversation restructuring approach, which considers the characteristics of dialogue, in enhancing Korean conversation summarization performance.
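
The abstract above reports its gain as a ROUGE-1 score, which measures unigram overlap between a generated summary and a reference summary. A minimal sketch of the ROUGE-1 F1 computation follows. It assumes plain whitespace tokenization, which is a simplification: Korean evaluation pipelines often tokenize by morpheme, and the abstract does not say which tokenizer or ROUGE variant the authors used. The function name is chosen for this example.

```python
from collections import Counter


def rouge1_f1(candidate, reference):
    """ROUGE-1 F1: harmonic mean of unigram precision and recall."""
    cand = Counter(candidate.split())
    ref = Counter(reference.split())
    # Clipped overlap: each token counts at most as often as it appears in both.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```

Scores are usually reported multiplied by 100, so a gain of about 4 points corresponds to roughly 0.04 on this function's 0 to 1 scale.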

A Study for the Border line Extraction technique of City Spatial Building by LiDAR Data (LiDAR 데이터와 항공사진의 통합을 위한 사각 빌딩의 경계점 설정)

  • Yeon, Sang-Ho;Lee, Young-Wook
    • Proceedings of the Korea Contents Association Conference / 2007.11a / pp.27-29 / 2007
  • The visual implementation of the 3-dimensional national environment has drawn attention because of its need and importance in fields such as national development planning, telecommunication facility deployment planning, railway construction, construction engineering, spatial city development, and safety and disaster prevention engineering. The currently used DEM system based on contour lines, which represents national geographic information using 2-D digital maps and facility information, is limited in reproducing the 3-D spatial city. Moreover, this method often neglects the altitude of railway infrastructure, which is narrow and long. Therefore, laser measurement techniques need to be applied to the spatial target object to obtain accuracy. Currently, LiDAR data, which combines laser measurement and GPS, has been introduced to obtain high-resolution accuracy in altitude measurement. In this paper, we first investigate LiDAR-based research in advanced foreign countries, then propose a data generation scheme and an algorithm for the optimal management and synthesis of the railway facility system in our 3-D spatial terrain information. To this end, LiDAR-based height data are transformed into a DEM, and vector data obtained via digital image mapping and raster data obtained via accuracy evaluation are unified in real time, making it possible to trace the generated long-distance 3-dimensional railway model for 3D track model generation.

Development of Variable Selection Technique using Stepwise Regression and Data Envelopment Analysis (단계적 회귀법과 자료봉합분석을 이용한 변수선택기법의 개발)

  • Jeong, Min-Eui;Yu, Song-Jin
    • Journal of KIISE:Software and Applications / v.41 no.8 / pp.598-604 / 2014
  • In this paper, we develop a stepwise regression data envelopment model to select important variables. We formulate a null hypothesis to assess the importance of each variable and use the Kruskal-Wallis test for this purpose. If the Kruskal-Wallis test rejects the null hypothesis, this implies a significant fluctuation in the efficiency scores relative to the base model. We then use the Conover-Inman test to check which pair of variables causes the fluctuation and so determine their importance. The proposed model helps in understanding the extent to which decision making units are misclassified as efficient or inefficient when variables are retained or discarded, and provides useful managerial prescriptions for improvement strategies.

A Study on the Database Design for Road Facility Management (도로 시설물관리를 위한 자료기반 설계에 관한 연구)

  • 박운용;차성렬;신상철
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.13 no.1 / pp.21-30 / 1995
  • As society shifts toward an information society with high economic development, every kind of work is becoming specialized and diversified in both quantity and quality. Therefore, the Geo-Spatial Information System (GSIS) is becoming more important because it can integrate the attribute, image, and written data of objects, including on-site data. The purpose of GSIS is to integrate geo-spatial data, such as the status of public facilities, land use, sketches, and statistics, that are scattered across administrative departments. Additionally, GSIS lets users obtain geo-spatial information easily by computer and makes that information more valuable. An integrated management system for transportation data is needed to improve the efficiency of road-related operations. This can be accomplished by analyzing the various road management operations and the utilization of base maps. Thus, in this study, an effective road facility information management system based on a GSIS database was developed for managing the various types of drawings and data used in various tasks.
