• Title/Summary/Keyword: Automated Data Collection

Estimation of Mass Rapid Transit Passenger's Train Choice Using a Mixture Distribution Analysis (통행시간 기반 혼합분포모형 분석을 통한 도시철도 승객의 급행 탑승 여부 추정 연구)

  • Jang, Jinwon; Yoon, Hosang; Park, Dongjoo
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.20 no.5 / pp.1-17 / 2021
  • Identifying the exact train, and the type of train, that each passenger boards is difficult in practice. Previous studies identified the train boarded by each passenger by matching Automated Fare Collection (AFC) data against the train schedule diagram. However, this approach is inefficient because the exact train cannot be determined for a considerable number of passengers. In this study, we demonstrate that the AFC data-diagram matching technique could not determine the train type in 28% of cases for passengers using Seoul Metro Line 9. To obtain more accurate results, we developed a two-step method for estimating the train type boarded by passengers: AFC data-diagram matching followed by a mixture distribution analysis. From this analysis, we derived reasonable classification points separating express-train users from non-users, based on 298 origin-destination pairs that satisfied the verification criteria of this study.
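The second step described in this abstract, a mixture distribution analysis of travel times, can be illustrated with a minimal sketch. It assumes a two-component Gaussian mixture fitted by EM on synthetic travel times for one origin-destination pair; the function names, the Gaussian form, and the data are illustrative assumptions and are not taken from the paper.

```python
import math
import random


def normal_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def fit_two_component_mixture(times, iterations=100):
    """Fit a two-component 1-D Gaussian mixture to travel times with EM."""
    ordered = sorted(times)
    half = len(ordered) // 2
    # Start from the faster and slower halves of the sorted travel times.
    means = [sum(ordered[:half]) / half,
             sum(ordered[half:]) / (len(ordered) - half)]
    grand_mean = sum(ordered) / len(ordered)
    spread = math.sqrt(sum((x - grand_mean) ** 2 for x in ordered) / len(ordered))
    stds = [spread, spread]
    weights = [0.5, 0.5]
    for _ in range(iterations):
        # E-step: probability that each trip belongs to the faster component.
        resp = []
        for x in times:
            p_fast = weights[0] * normal_pdf(x, means[0], stds[0])
            p_slow = weights[1] * normal_pdf(x, means[1], stds[1])
            total = p_fast + p_slow
            resp.append(p_fast / total if total > 0 else 0.5)
        # M-step: re-estimate weights, means and standard deviations.
        n_fast = sum(resp)
        n_slow = len(times) - n_fast
        means = [sum(r * x for r, x in zip(resp, times)) / n_fast,
                 sum((1 - r) * x for r, x in zip(resp, times)) / n_slow]
        var_fast = sum(r * (x - means[0]) ** 2 for r, x in zip(resp, times)) / n_fast
        var_slow = sum((1 - r) * (x - means[1]) ** 2 for r, x in zip(resp, times)) / n_slow
        stds = [max(math.sqrt(var_fast), 1e-6), max(math.sqrt(var_slow), 1e-6)]
        weights = [n_fast / len(times), n_slow / len(times)]
    return weights, means, stds


def classification_point(weights, means, stds, steps=60):
    """Bisect between the two means for the time where both components are equally likely."""
    lo, hi = means[0], means[1]
    for _ in range(steps):
        mid = (lo + hi) / 2.0
        fast = weights[0] * normal_pdf(mid, means[0], stds[0])
        slow = weights[1] * normal_pdf(mid, means[1], stds[1])
        if fast > slow:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Synthetic travel times in minutes (assumed): express trips near 20, all-stop trips near 30.
rng = random.Random(0)
times = ([rng.gauss(20.0, 1.5) for _ in range(300)]
         + [rng.gauss(30.0, 1.5) for _ in range(300)])
weights, means, stds = fit_two_component_mixture(times)
threshold = classification_point(weights, means, stds)
```

In this sketch, trips shorter than `threshold` would be labeled as express use and longer trips as non-use; a real analysis would fit one mixture per origin-destination pair and check the fit before trusting the split.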

Dynamic ontology construction algorithm from Wikipedia and its application toward real-time nation image analysis (국가이미지 분석을 위한 위키피디아 실시간 동적 온톨로지 구축 알고리즘 및 적용)

  • Lee, Youngwhan
    • Journal of the Korean Data and Information Science Society / v.27 no.4 / pp.979-991 / 2016
  • Measuring nation images was a challenging task when offline surveys were the only option. Such surveys were not only prohibitively expensive but also too time-consuming, and therefore ill-suited to a rapidly changing world. Although demand for real-time monitoring of nation images has been ever-increasing, an affordable and reliable way to measure nation images has not been available to date. The researcher in this study developed a semi-automatic ontology construction algorithm, named "double-crossing double keyword collection" (DCDKC), to measure nation images from Wikipedia in real time. The resulting ontology, WikiOnto, can be used to reflect dynamic image changes. In this study, an instance of WikiOnto was constructed by applying the algorithm to the big three exporting countries in East Asia: Korea, Japan, and China. Then, the numbers of page views for the words in the WikiOnto instance were counted. The page-view counts for each country were compared with one another to assess whether they could be used to track dynamic nation images. The results show how the images of the three countries changed over the period of the study and confirm that DCDKC can be used for a real-time nation-image monitoring system.

The Correlation between Library Users' Fields of Study and the Use of Translated Works in University Libraries (대학도서관에서 대출된 번역서와 대출자 전공과의 관계 연구)

  • Lee Hyun-Young
    • Journal of the Korean Society for Library and Information Science / v.32 no.1 / pp.155-167 / 1998
  • In a climate of increasing calls for academic assessment, the author undertook a study to ascertain the availability of original texts and their translations in the academic library. The objective of this study is to compare the frequency of use of original texts by users' academic major in university libraries. To this end, the author collected data stored in the online systems of 3 Korean university libraries from September 10th to 30th, 1995, and tested the hypotheses using the Minitab statistical package. Libraries with multilingual collections and automated systems will find the methodology presented here particularly valuable.

Design and Implementation of UDDI to Provide the User-Side Quality of Web Service (사용자 측면의 웹서비스의 품질데이터를 제공하기 위한 UDDI의 설계 및 구현)

  • Cho, Poong-Youn; Lee, Nam-Yong; Lee, Choul-Ki
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.8 no.4 / pp.102-112 / 2009
  • The quality of a web service is one of its most important elements: as various web services provide diverse functionality, it is increasingly difficult to satisfy customers' needs. Since the existing UDDI registry provides only basic information, such as the name and URL of a web service, users have a hard time choosing and customizing their web services. In this paper, we propose an extended UDDI architecture for providing quality data about a web service. In this architecture, we use an automated data collection technique for testing web services, which provides quality information to users, including response time, throughput, availability, reliability, and accessibility. With this new architecture, users and web developers can benefit from web services that customize information for users, which helps ensure the reliability of web-based applications.
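As a rough illustration of how automatically collected test data could be turned into quality values for a registry entry, the sketch below summarizes a list of probe results. The `Probe` record, the `quality_summary` function, and the metric definitions (availability as the share of successful probes, throughput as successful requests per second) are assumptions made for this example; the abstract does not give the paper's definitions, and reliability and accessibility are left out.

```python
from dataclasses import dataclass


@dataclass
class Probe:
    """One automated test call against a web service (hypothetical record)."""
    ok: bool            # whether the call returned a valid response
    response_ms: float  # measured response time in milliseconds


def quality_summary(probes, window_seconds):
    """Summarize probe results into three user-side quality values."""
    successes = [p for p in probes if p.ok]
    # Availability: fraction of probes that succeeded.
    availability = len(successes) / len(probes)
    # Mean response time over successful probes only.
    mean_response_ms = (
        sum(p.response_ms for p in successes) / len(successes) if successes else None
    )
    # Throughput: successful requests per second of the observation window.
    throughput = len(successes) / window_seconds
    return {
        "availability": availability,
        "mean_response_ms": mean_response_ms,
        "throughput_per_s": throughput,
    }


probes = [Probe(True, 100.0), Probe(True, 200.0), Probe(False, 0.0), Probe(True, 300.0)]
summary = quality_summary(probes, window_seconds=2.0)
```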

A Study on the Smart Healthcare health management System (스마트 헬스케어 건강관리 시스템에 관한 연구)

  • Han, Jeong-Ah;Na, Won-Shik
    • Journal of Convergence for Information Technology / v.10 no.6 / pp.8-13 / 2020
  • In this paper, we study smart healthcare devices that enable active health care by building a health care system with acquaintances or family members rather than managing health alone. The company develops health care services for families regardless of age and gender through intuitive UI design, targeting young users who care for elderly parents. Automated collection of health information and real-time feedback are available, and data can be aggregated and analyzed through repeaters. The system can also use structured databases in the form of big data. The services offered can be used to prevent diseases and reduce medical expenses through health care, while automatic management can maximize users' convenience and increase demand. By shortening the development period of products based on this technology and strengthening competitiveness, the company has the advantage of encouraging communication between generations in an era of nuclear families.

Analysis of Phenological Changes by Phenocams on Some Major Species Distributed in Wetland and Forest Ecosystems in Korea (Phenocam을 활용한 국내 습지 및 산림생태계 대표 수종의 계절적 변화 분석)

  • Minki Hong;Hyohyemi Lee;Jeong-Soo Park
    • Ecology and Resilient Infrastructure / v.10 no.4 / pp.226-236 / 2023
  • As climate change intensifies, the importance of studying plant phenology has increased, leading to a surge in research employing automated video recording devices such as Phenocams. In this study, using the Phenocams operated by the National Institute of Ecology, we examined trends in plant phenological change across diverse ecosystem types in South Korea and analyzed their correlations with climate factors. The patterns of plant phenological change varied by region and tree species. Pinus thunbergii and Pinus densiflora typically show an overall increase in their growth period, which correlates positively with winter temperature and precipitation. Uniquely, however, for Abies koreana on Mt. Hallasan, higher precipitation in August leads to an earlier end of season (EOS), and a correlation analysis with the recent dieback of A. koreana appears necessary. Beyond this analysis, solutions for handling missing data during the data collection process were proposed. Furthermore, to expand the scope of future research and encompass diverse ecosystem types, combining Phenocam research with satellite observations was suggested.

Speech Emotion Recognition in People at High Risk of Dementia

  • Dongseon Kim;Bongwon Yi;Yugwon Won
    • Dementia and Neurocognitive Disorders / v.23 no.3 / pp.146-160 / 2024
  • Background and Purpose: The emotions of people at various stages of dementia need to be effectively utilized for prevention, early intervention, and care planning. With technology available for understanding and addressing people's emotional needs, this study aims to develop speech emotion recognition (SER) technology to classify the emotions of people at high risk of dementia. Methods: Speech samples from people at high risk of dementia were categorized into distinct emotions via human auditory assessment, and the resulting annotations were used to guide a deep-learning method. The architecture incorporated a convolutional neural network, long short-term memory, attention layers, and Wav2Vec2, a novel feature extractor, to develop automated speech emotion recognition. Results: Twenty-seven kinds of emotions were found in the speech of the participants. These emotions were grouped into 6 detailed emotions (happiness, interest, sadness, frustration, anger, and neutrality) and further into 3 basic emotions (positive, negative, and neutral). To improve algorithmic performance, multiple learning approaches were applied using different data sources (voice and text) and varying the number of emotions. Ultimately, a 2-stage algorithm (initial text-based classification followed by voice-based analysis) achieved the highest accuracy, reaching 70%. Conclusions: The diverse emotions identified in this study were attributed to the characteristics of the participants and the method of data collection. The fact that the speech of people at high risk of dementia was addressed to companion robots also explains the relatively low performance of the SER algorithm. Accordingly, this study suggests the systematic and comprehensive construction of a dataset from people with dementia.

A Study on the Compression and Major Pattern Extraction Method of Origin-Destination Data with Principal Component Analysis (주성분분석을 이용한 기종점 데이터의 압축 및 주요 패턴 도출에 관한 연구)

  • Kim, Jeongyun; Tak, Sehyun; Yoon, Jinwon; Yeo, Hwasoo
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.19 no.4 / pp.81-99 / 2020
  • Origin-destination data have been collected and utilized for demand analysis and service design in various fields such as public transportation and traffic operation. As the use of big data becomes more important, there is an increasing need to store raw origin-destination data for big data analysis. However, it is not practical to store and analyze the raw data over a long period, since the size of the data grows as a power of the number of collection points. To overcome this storage limitation and enable long-period pattern analysis, this study proposes a methodology for compressing origin-destination data and analyzing the compressed data. The proposed methodology is applied to public transit data from Sejong and Seoul. We first measure the reconstruction error and the data size for each truncated matrix. Then, to determine a range of principal components for removing random data, we measure the level of regularity based on covariance coefficients of the demand data reconstructed with each range of principal components. Based on the distribution of the covariance coefficients, we find the range of principal components that covers the regular demand. The ranges are determined to be 1 to 60 for Sejong and 1 to 80 for Seoul.
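The first measurement mentioned in this abstract, reconstruction error and data size for each truncated matrix, can be sketched as follows. This is a minimal example assuming a days-by-OD-pairs demand matrix, PCA computed through NumPy's SVD, and synthetic data; the matrix layout and all function names are assumptions, and the covariance-based regularity step is not reproduced.

```python
import numpy as np


def compress(od_matrix, k):
    """Keep the first k principal components of a (days x OD pairs) matrix."""
    mean = od_matrix.mean(axis=0)
    u, s, vt = np.linalg.svd(od_matrix - mean, full_matrices=False)
    scores = u[:, :k] * s[:k]   # per-day coordinates on the k components
    components = vt[:k]         # k demand patterns over OD pairs
    return mean, scores, components


def reconstruct(mean, scores, components):
    """Rebuild an approximate OD matrix from the compressed pieces."""
    return mean + scores @ components


def relative_error(original, approx):
    """Frobenius-norm reconstruction error relative to the original."""
    return float(np.linalg.norm(original - approx) / np.linalg.norm(original))


# Synthetic demand (assumed): 3 regular patterns plus random noise.
rng = np.random.default_rng(0)
days, pairs = 60, 100
patterns = rng.random((3, pairs))
loadings = rng.random((days, 3))
od_matrix = (loadings @ patterns) * 100 + rng.normal(0.0, 1.0, (days, pairs))

report = {}
for k in (1, 3, 10):
    mean, scores, components = compress(od_matrix, k)
    stored_values = mean.size + scores.size + components.size
    error = relative_error(od_matrix, reconstruct(mean, scores, components))
    report[k] = (error, stored_values)
```

Each entry of `report` pairs the relative error with the number of stored values, which can be compared against `od_matrix.size` to see the trade-off between compression and fidelity.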

Automation of Sampling for Public Survey Performance Assessment (공공측량 성과심사 표본추출 자동화 가능성 분석)

  • Choi, Hyun; Jin, Cheol; Lee, Jung Il; Kim, Gi Hong
    • KSCE Journal of Civil and Environmental Engineering Research / v.44 no.1 / pp.95-100 / 2024
  • The public survey performance review carried out by the Spatial Information Quality Management Institute is conducted at the screening rate set by the regulations, and the examiner directly judges the overall trend of the submitted performance based on the extracted sample. However, the Ministry of Land, Infrastructure and Transport specifies that the evaluation trustee select the sample by random extraction (random collection). In this study, we analyzed the details of actual field practice using actual performance review data that we secured. In addition, we analyzed considerations arising from various field conditions and studied ways to apply a sampling algorithm to the public survey performance review. The results indicate that a detailed analysis of the sampling criteria used by performance reviewers is necessary. A relative comparison was made by comparing the data for which a real performance evaluation was performed with the outcomes of the Python automation program. This automation program is expected to serve as a foundation for the automated application of public survey performance evaluation sampling in the future.
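The random extraction at a fixed screening rate described in this abstract can be sketched in a few lines of Python. The function name, the sheet identifiers, the 10% rate, and the rounding-up rule are hypothetical choices for illustration and do not come from the regulations or from the paper's program.

```python
import random


def draw_review_sample(item_ids, rate_percent, seed):
    """Randomly pick items for review at a given screening rate.

    rate_percent is an integer percentage; the sample size is rounded up
    so that at least one item is drawn from any non-empty submission.
    """
    if not 0 < rate_percent <= 100:
        raise ValueError("rate_percent must be in the range 1..100")
    # Integer ceiling of len(item_ids) * rate_percent / 100.
    size = (len(item_ids) * rate_percent + 99) // 100
    # A seeded generator makes the draw reproducible for later audits.
    rng = random.Random(seed)
    return sorted(rng.sample(list(item_ids), size))


# Hypothetical submission of 25 map sheets reviewed at a 10% rate.
sheets = [f"sheet-{i:03d}" for i in range(1, 26)]
sample = draw_review_sample(sheets, rate_percent=10, seed=2024)
```

Recording the seed alongside the result is one way to let a second reviewer reproduce the same draw.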

Automatic Detection of Off-topic Documents using ConceptNet and Essay Prompt in Automated English Essay Scoring (영어 작문 자동채점에서 ConceptNet과 작문 프롬프트를 이용한 주제-이탈 문서의 자동 검출)

  • Lee, Kong Joo;Lee, Gyoung Ho
    • Journal of KIISE / v.42 no.12 / pp.1522-1534 / 2015
  • This work presents a new method that can predict, without the use of training data, whether an input essay is written on a given topic. ConceptNet is a common-sense knowledge base that is generated automatically from sentences extracted from a variety of document types. An essay prompt is the topic that an essay should be written about. The method proposed in this paper uses ConceptNet and an essay prompt to decide whether or not an input essay is off-topic. We introduce a way to find the shortest path between two nodes in ConceptNet, as well as a way to calculate the semantic similarity between two nodes. Both an essay prompt and a student's essay can be represented by concept nodes in ConceptNet. The semantic similarity between the concepts that represent an essay prompt and the concepts that represent a student's essay can be used to rank "on-topicness"; if the ranking is low, the essay is regarded as off-topic. We used eight different essay prompts and a collection of student essays for the performance evaluation, in which our proposed method outperforms those of previous studies. As ConceptNet enables simple text inference, our new method looks very promising for essay prompts that require a simple inference.
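The idea of scoring on-topicness from shortest paths between concept nodes can be illustrated with a small sketch. It assumes an unweighted, undirected toy graph with hand-written edges (not real ConceptNet data), a breadth-first search for path length, and a similarity of 1 / (1 + distance); the paper's actual path search and similarity formula are not given in the abstract, so all of these are illustrative assumptions.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map from (concept, concept) pairs."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path_length(graph, start, goal):
    """Number of edges on a shortest path, or None if no path exists."""
    if start not in graph or goal not in graph:
        return None
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbor in graph[node]:
            if neighbor == goal:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    return None


def similarity(graph, a, b):
    """Assumed similarity: closer concepts score higher, unreachable ones score 0."""
    dist = shortest_path_length(graph, a, b)
    return 0.0 if dist is None else 1.0 / (1.0 + dist)


def on_topic_score(graph, prompt_concepts, essay_concepts):
    """Average, over essay concepts, of the best similarity to any prompt concept."""
    best = [max(similarity(graph, e, p) for p in prompt_concepts) for e in essay_concepts]
    return sum(best) / len(best)


# Toy edges standing in for a knowledge base (hypothetical).
graph = build_graph([
    ("school", "teacher"), ("school", "student"), ("student", "homework"),
    ("teacher", "lesson"), ("student", "zoo"), ("zoo", "animal"),
    ("animal", "dog"), ("dog", "bone"),
])
prompt = ["school", "teacher"]
on_score = on_topic_score(graph, prompt, ["student", "homework", "lesson"])
off_score = on_topic_score(graph, prompt, ["dog", "bone", "animal"])
```

An essay would then be flagged as off-topic when its score ranks low relative to other essays written for the same prompt.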