• Title/Summary/Keyword: 탐색적 특징

Search Result 869, Processing Time 0.025 seconds

An Exploratory Study on Daily Activity Types based on Life-logging Data (라이프로그 기반 일상생활 활동유형에 대한 탐색적 연구)

  • Lim, Hoyeon;Chung, Seungeun;Jeong, Chi Yoon;Jeong, Hyun-Tae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.761-764
    • /
    • 2020
  • 본 논문에서는 라이프로그 데이터를 기반으로 한 행동인식 결과로부터 일상생활의 활동유형을 분석하는 기술에 대해 제안한다. 실제 일상생활 중에 수집한 가속도 센서 데이터만을 이용하여 분석한 행동인식 결과를 정적-동적 행동으로 분류된 특징 벡터로 나타내었고, 이를 클러스터링하여 6개의 대표 활동유형으로 분류하였다. 50명의 사용자 데이터를 분석하여 정적-동적 활동의 비율에 따른 활동유형을 분류함으로써 실제 라이프로그 데이터로부터 일상생활 활동유형을 확인하였다.

Feature Selection for Multiple K-Nearest Neighbor classifiers using GAVaPS (GAVaPS를 이용한 다수 K-Nearest Neighbor classifier들의 Feature 선택)

  • Lee, Hee-Sung;Lee, Jae-Hun;Kim, Eun-Tai
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.6
    • /
    • pp.871-875
    • /
    • 2008
  • This paper deals with the feature selection for multiple k-nearest neighbor (k-NN) classifiers using Genetic Algorithm with Varying reputation Size (GAVaPS). Because we use multiple k-NN classifiers, the feature selection problem for them is vary hard and has large search region. To solve this problem, we employ the GAVaPS which outperforms comparison with simple genetic algorithm (SGA). Further, we propose the efficient combining method for multiple k-NN classifiers using GAVaPS. Experiments are performed to demonstrate the efficiency of the proposed method.

Self-optimizing feature selection algorithm for enhancing campaign effectiveness (캠페인 효과 제고를 위한 자기 최적화 변수 선택 알고리즘)

  • Seo, Jeoung-soo;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.173-198
    • /
    • 2020
  • For a long time, many studies have been conducted on predicting the success of campaigns for customers in academia, and prediction models applying various techniques are still being studied. Recently, as campaign channels have been expanded in various ways due to the rapid revitalization of online, various types of campaigns are being carried out by companies at a level that cannot be compared to the past. However, customers tend to perceive it as spam as the fatigue of campaigns due to duplicate exposure increases. Also, from a corporate standpoint, there is a problem that the effectiveness of the campaign itself is decreasing, such as increasing the cost of investing in the campaign, which leads to the low actual campaign success rate. Accordingly, various studies are ongoing to improve the effectiveness of the campaign in practice. This campaign system has the ultimate purpose to increase the success rate of various campaigns by collecting and analyzing various data related to customers and using them for campaigns. In particular, recent attempts to make various predictions related to the response of campaigns using machine learning have been made. It is very important to select appropriate features due to the various features of campaign data. If all of the input data are used in the process of classifying a large amount of data, it takes a lot of learning time as the classification class expands, so the minimum input data set must be extracted and used from the entire data. In addition, when a trained model is generated by using too many features, prediction accuracy may be degraded due to overfitting or correlation between features. Therefore, in order to improve accuracy, a feature selection technique that removes features close to noise should be applied, and feature selection is a necessary process in order to analyze a high-dimensional data set. Among the greedy algorithms, SFS (Sequential Forward Selection), SBS (Sequential Backward Selection), SFFS (Sequential Floating Forward Selection), etc. are widely used as traditional feature selection techniques. It is also true that if there are many risks and many features, there is a limitation in that the performance for classification prediction is poor and it takes a lot of learning time. Therefore, in this study, we propose an improved feature selection algorithm to enhance the effectiveness of the existing campaign. The purpose of this study is to improve the existing SFFS sequential method in the process of searching for feature subsets that are the basis for improving machine learning model performance using statistical characteristics of the data to be processed in the campaign system. Through this, features that have a lot of influence on performance are first derived, features that have a negative effect are removed, and then the sequential method is applied to increase the efficiency for search performance and to apply an improved algorithm to enable generalized prediction. Through this, it was confirmed that the proposed model showed better search and prediction performance than the traditional greed algorithm. Compared with the original data set, greed algorithm, genetic algorithm (GA), and recursive feature elimination (RFE), the campaign success prediction was higher. In addition, when performing campaign success prediction, the improved feature selection algorithm was found to be helpful in analyzing and interpreting the prediction results by providing the importance of the derived features. This is important features such as age, customer rating, and sales, which were previously known statistically. Unlike the previous campaign planners, features such as the combined product name, average 3-month data consumption rate, and the last 3-month wireless data usage were unexpectedly selected as important features for the campaign response, which they rarely used to select campaign targets. It was confirmed that base attributes can also be very important features depending on the type of campaign. Through this, it is possible to analyze and understand the important characteristics of each campaign type.

Exploring Regional Decline Risk Areas and Factors Using Topic Modeling and Cluster Analysis (토픽모델링과 군집분석을 통한 지방 소멸 위험지역과 요인의 탐색)

  • Ji-Min Kim;Heeryon Cho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.349-350
    • /
    • 2023
  • 우리나라는 지속적인 저출산과 고령화로 인해 지방 소멸 위험지역이 점차 늘어나고 있다. 본 연구는 지방 소멸과 관련된 다양한 요인을 '인구 소멸'이라는 키워드를 포함하는 신문 기사에 대한 토픽모델링을 통해 발견하고, 추출된 토픽과 관련된 공공 데이터를 수집하여 비슷한 특징을 가지는 지역을 묶는 군집분석을 수행한다. 그리고 지방소멸위험지수로 분류된 소멸 위험지역과 군집분석 결과를 비교한다.

A Stochastic Transit Assignment Model for Intercity Rail Network (지역간 철도의 확률적 통행배정모형 구측 연구)

  • Kwon, Yong-Seok;Kim, Kyoung-Tae;Lim, Chong-Hoon
    • Journal of the Korean Society for Railway
    • /
    • v.12 no.4
    • /
    • pp.488-498
    • /
    • 2009
  • The characteristics of intercity rail network are different from those of public transit network in urban area. In this paper, we proposed a new transit assignment model which is generalized form of deterministic assignment model by introducing line selection probability on route section. This model consider various characteristics of intercity rail and simplify network expansion for appling search algorithms developed in road assignment model. We showed the model availability by comparing with existing models using virtual networks. The tests on a small scale network show that this model is superior to existing models for predicting intercity rail demand.

Feature Vector Extraction for Solar Energy Prediction through Data Visualization and Exploratory Data Analysis (데이터 시각화 및 탐색적 데이터 분석을 통한 태양광 에너지 예측용 특징벡터 추출)

  • Jung, Wonseok;Ham, Kyung-Sun;Park, Moon-Ghu;Jeong, Young-Hwa;Seo, Jeongwook
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2017.10a
    • /
    • pp.514-517
    • /
    • 2017
  • In solar photovoltaic systems, power generation is greatly affected by the weather conditions, so it is essential to predict solar energy for stable load operation. Therefore, data on weather conditions are needed as inputs to machine learning algorithms for solar energy prediction. In this paper, we use 15 kinds of weather data such as the precipitation accumulated during the 3 hours of the surface, upward and downward longwave radiation average, upward and downward shortwave radiation average, the temperature during the past 3 hours at 2 m above from the ground and temperature from the ground surface as input data to the algorithm. We analyzed the statistical characteristics and correlations of weather data and extracted the downward and upward shortwave radiation averages as a major elements of a feature vector with high correlation of 70% or more with solar energy.

  • PDF

A Study on Automatic Detection of The Face and Facial Features for Face Recognition System in Real Time (실시간 얼굴인식 시스템을 위한 얼굴의 위치 및 각 부위 자동 검출에 관한 연구)

  • 구자일;홍준표
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.4
    • /
    • pp.379-388
    • /
    • 2002
  • In this paper, the real-time algorithm is proposed for automatic detection of the face and facial features. In the face region, we extracted eyes, nose, mouth and so forth. There are two methods to extract them; one is the method of using the location information of them, other is the method of using Gaussian second derivatives filters. This system have high speed and accuracy because the facial feature extraction is processed only by detected face region, not by whole image. There are some kinds of good experimental result for the proposed algorithm; high face detection rate of 95%, high speed of lower than 1sec. the reduction of illumination effect, and the compensation of face tilt.

Discussions on Mechanisms, Features and Implications of the Digital Divide in Old Age (노년기 정보격차의 메카니즘, 특징 및 시사점에 관하여)

  • Kim, Myoung-Yong
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.4
    • /
    • pp.246-262
    • /
    • 2015
  • The digital divide has become an important social issue in the two waves of informatization and aging society. This study is to comprehensively examine various theoretical approaches and discussions on the mechanisms of the digital divide and to synthetically understand the features of the digital divide in old age. This study examines 1) four theoretical approaches on the mechanisms of digital divide among people in general, 2) three theoretical perspectives and six specific explanations on the digital divide in old age, and 3) five controversial features and concepts of the digital divide in old age. Consequently, this study suggests the need for a comprehensive approach to the multi-dimensional digital divide in old age and more attention to the digital divide in old age in terms of individual well-being and social inequalities. Further implications and limitations are discussed.

Exploring the Characteristics of Environmental Catalysts of the Disadvantaged Gifted in Music (사회적 배려대상 음악영재의 환경요인 특징 탐색)

  • Kim, Sunghye;Lee, Kyungjin
    • Journal of Gifted/Talented Education
    • /
    • v.24 no.4
    • /
    • pp.629-655
    • /
    • 2014
  • This study aims to explore the characteristics of environmental catalysts which have affected the development of music giftedness of the disadvantaged students. For this purpose, this study deals with nineteen disadvantaged gifted in music and examines their self-evaluation test, personal statement, and interview. Based on Gagn$\acute{e}$'s environmental catalysts of differentiated model of giftedness and talent(DMGT), the analysis of the interviews conveys the milieu of the disadvantaged gifted hardly exerts positive influences on their musical activities and studies. While concerning music and supporting their children financially and emotionally, parents unintentionally tend to exert negative influences on their children for their misapprehension of giftedness and incompetent advice. On the whole, the disadvantaged gifted hardly admit their teachers as experts in music. In relation to provisions, most students participate in extra school and local program and none of them participates in music gifted program. They are not satisfied with the quality in education. Despite the importance of the events such as crystallizing experience, award-winning, and performance, most students don't have enough events for inspiring their giftedness. As a conclusion, this study gives a proposition for a strategy to improve the environmental catalysts for the disadvantaged gifted in many different ways: the improvement of social recognition, the enhancement of parent consulting and teachers training programs, and the development and diffusions of more qualified gifted programs and so on.