• Title/Summary/Keyword: 인구통계학기반

Search Result 12, Processing Time 0.146 seconds

Personalized insurance product based on similarity (유사도를 활용한 맞춤형 보험 추천 시스템)

  • Kim, Joon-Sung;Cho, A-Ra;Oh, Hayong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.11
    • /
    • pp.1599-1607
    • /
    • 2022
  • The data mainly used for the model are as follows: the personal information, the information of insurance product, etc. With the data, we suggest three types of models: content-based filtering model, collaborative filtering model and classification models-based model. The content-based filtering model finds the cosine of the angle between the users and items, and recommends items based on the cosine similarity; however, before finding the cosine similarity, we divide into several groups by their features. Segmentation is executed by K-means clustering algorithm and manually operated algorithm. The collaborative filtering model uses interactions that users have with items. The classification models-based model uses decision tree and random forest classifier to recommend items. According to the results of the research, the contents-based filtering model provides the best result. Since the model recommends the item based on the demographic and user features, it indicates that demographic and user features are keys to offer more appropriate items.

Using Degree of Match to Improve Prediction Quality in Collaborative Filtering Systems (협업 필터링 시스템에서 Degree of Match를 이용한 성능향상)

  • Sohn, Jae-Bong;Suh, Yong-Moo
    • Information Systems Review
    • /
    • v.8 no.2
    • /
    • pp.139-154
    • /
    • 2006
  • Recommender systems help users find their interesting items more easily or provide users with meaningful items based on their preferences. Collaborative filtering(CF) recommender systems, the most successful recommender system, use opinions of users to recommend for an active user who needs recommendation. That is, ratings which users have voted on items to indicate preference on them are the source for making recommendation. Although CF systems are designed only to use users' preferences as the source of recommendation, use of some available information is believed to increase both the performance and the accuracy of CF systems. In this paper, we propose a CF recommender system which utilizes both degree of match and demographic information(e.g., occupation, gender, age) to increase the performance and the accuracy. Since more and more information is accumulated in CF systems, it is important to reduce the data volume while maintaining the same or the higher level of accuracy. We used both degree of match and demographic information as criteria for reducing the data volume, thereby naturally enhancing the performance. It is shown that using degree of match improves the prediction accuracy too in CF systems and also that using some demographic information also results in better accuracy.

Factors affecting regional population of Korea using Bayesian quantile regression (베이지안 분위회귀모형을 이용한 지역인구에 영향을 미치는 요인분석)

  • Kim, Minyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.823-835
    • /
    • 2021
  • Identification of factors influencing regional population is critical for establishing government's population policies as well as for improving residents' social, economic and cultural well-being in the region. In this study we analysed the data from 2019 Population Housing Survey in Korea to identify the factors affecting the population size in each of the three regions: Seoul, metropolitan cities, and provincial regions. We applied a Bayesian quantile regression to account for asymmetry and heteroscedasticity of data. The analysis results showed that the effects of factors vary greatly between the three regions of Seoul, metropolitan cities, and provincial regions as well as between sub regions within the same region. These results suggest that population-related variables have very heterogeneous characteristics from region to region and therefore it is important to establish customized population policies that suit regional characteristics rather than uniform population policies that apply to every region.

An Explorator Spatial Analysis of Shigellosis (세균성 이질의 탐색적 공간분석)

  • 박기호
    • Journal of the Korean Geographical Society
    • /
    • v.34 no.5
    • /
    • pp.473-491
    • /
    • 1999
  • 세균성 이질은 국내 제1종 법정 전염병으로 분류되어 관리되고 있는 질환으로서 1998년 이후 그 발병 사례가 급속히 증가하고 있다. 본 연구는 1999년 3월 부산시 사상구에서 집단 발병한 세균성 이질을 대상으로 하여, 각 환자들의 발병 시점과 장소의 분포패턴에 대한 지리학적 고찰을 목적으로 한다. 환자분포의 특징적 공간패턴과 그들의 시계열적 확산 양상 등을 탐색하기 위한 방법론은 보건지리학과 지도학 및 공간통계학에 기반을 둔 공간분석기법을 중심으로 설정하였다. 분석자료는 해당 지역의 수치지형도, 지적도, 인구 센서스 자료를 포함한 GIS 데이터베이스로 구축되었다. 인구분포를 감안한 밀도구분도를 바탕으로 개별환자의 위치자료와 동 단위로 집계된 자료를 자료의 형태에 따라 분석기법을 달리하였으며, 환자 발생 밀도, 상대적 위험지수 등을 지도화하여 역학자료의 시각적 통계적 분석을 수행하였다. 환자분포의 공간적 중심위치와 분산의 변화 등 기술적 통계분석과 함께 제1차 공간속성을 커널추정법으로 찾아보았다. 이와 더불어 ‘공간적 의존성’과 관련된 제2차 공간속성은 K-함수와 시뮬레이션을 통해 분석하여 군집성 등이 통계적으로 확인되었다. 본 연구를 통해 역학조사시 GIS의 활용사례가 제시되었으며, 모집단 인구를 고려한 확률지도 작성 기법과 다양한 데이터 가시화 방법, 그리고 시계열별 발생 환자들의 지리적 변이를 분석 하는데 따르는 문제들이 논의되었다.

  • PDF

Factors Affecting the Resistance of Innovation Technology based Smartphone Environment (스마트폰 혁신기술이 사용자 저항에 미치는 영향)

  • Nam, Soo-Tai;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.10a
    • /
    • pp.137-138
    • /
    • 2016
  • 본 연구에서는 기술수용모델과 혁신확산이론을 기반으로 스마트폰 혁신기술 저항에 관한 재사용 의도에 미치는 영향을 알아보고자 한다. 외부변수는 기술수용모델의 인지된 유용성과 인지된 사용 용이성 그리고 혁신확산이론의 적합성과 복잡성을 4개 변수를 두고자 한다. 예측변수는 혁신저항 변수를 두고 재사용의도에 미치는 영향으로 하여 개념모델을 완성하였다. 또한 혁신저항 변수가 인지된 위험 요인을 매개하여 재사용의도에 미치는 영향을 알아보고자 하였다. 연구대상은 부산 경남지역 및 전북지역에 거주하는 스마트폰 사용자를 대상으로 설문을 통해 자료를 수집하고자 한다. 인구통계학인 분석은 IBM SPSS Statistics 19로 하고 확인적 요인분석과 변수들 간의 인과관계에 대한 경로분석은 Smart PLS를 사용하여 분석하고자 한다. 분석결과를 바탕으로 이론적 실무적 시사점을 제시하고자 한다.

  • PDF

Utilization of Demographic Analysis with IMDB User Ratings on the Recommendation of Movies (IMDB 사용자평점에 대한 인구통계학적 분석의 활용)

  • Bae, Sung Moon;Lee, Sang Chun;Park, Jong Hun
    • The Journal of Society for e-Business Studies
    • /
    • v.19 no.3
    • /
    • pp.125-141
    • /
    • 2014
  • Nowadays, overflowing data produced every second from the internet make people to be difficult to search for the useful information. That's why people have invented and developed unique tools that they get some relevant information. In this paper, the recommender system, one of the effective tools, is used and it helps us to get the useful information that we want by using demographic information to predict new items of interest. The demographic recommender system in this paper computes users' similarity using demographic information, age and gender. So we performed demographic analysis on movie ratings on Internet Movie Database (IMDB) web site that movies are rated by thousands of people, where users submitted a movie rating after they watched a recent popular film. Meanwhile, we can understand that user's ratings, among various determinants of box office, is very essential factor in the study on recommendation of movie. This paper is aimed at analyzing movie average ratings directly given by film viewers, categorizing them into groups by sex and age, investigating the entire group and finding the representative group by examining it with F-test and T-test. This result is used to promote and recommend for the target group only. Therefore, this study is considerably significant as presenting utilization for movie business as well as showing how to analyze demographic information on movie ratings on the web.

Defining boundaries of urban centers and measuring the impact for diagnosing urban spatial structure (도시 공간구조 진단을 위한 도시 중심지의 경계 설정 및 영향력 측정에 관한 연구)

  • Ho-Yong Kim;Jisook Kim
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.27 no.1
    • /
    • pp.52-66
    • /
    • 2024
  • The purpose of this study is to identify the spatial system and characteristics of the urban center by deriving the boundaries of the urban center set in the urban basic plan for Busan Metropolitan City and diagnosing the role and status of the center. To this end, four indicators representing the characteristics of the center were selected through a review of previous studies, and the boundaries of the center were derived using spatial statistical techniques with strengths in geographical boundary analysis. Then, using the indicators of center characteristics and population potential functions, we diagnosed the influence and potential of each center in the spatial structure of Busan Metropolitan City. The analysis showed that the scale of the centers varies greatly, and the unutilized areas where commercial areas are not activated and the expansion areas that spread beyond commercial areas to residential and industrial areas are different for each urban center. The results of the potential measurement, which indicates the attractiveness of the center, also showed areas with strong and weak population potential. Therefore, systematic management and strategies based on the hierarchical characteristics and influence measurement results are needed to strengthen the function of urban centers. The results analyzed in this study can be used as a resource for responding to various urban planning needs and policy changes in the future, along with station area development plans and spatial innovation zones for building a sustainable urban growth system, balanced development, and strengthening the function of centers.

How can the post-war reconstruction project be carried out in a stable manner? - terrorism prediction using a Bayesian hierarchical model (전후 재건사업을 안정적으로 진행하려면? - 베이지안 계층모형을 이용한 테러 예측)

  • Eom, Seunghyun;Jang, Woncheol
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.5
    • /
    • pp.603-617
    • /
    • 2022
  • Following the September 11, 2001 terrorist attacks, the United States declared war on terror and invaded Afghanistan and Iraq, winning quickly. However, interest in analyzing terrorist activities has developed as a result of a significant amount of time being spent on the post-war stabilization effort, which failed to minimize the number of terrorist activities that occurred later. Based on terrorist data from 2003 to 2010, this study utilized a Bayesian hierarchical model to forecast the terrorist threat in 2011. The model depicts spatiotemporal dependence with predictors such as population and religion by autonomous district. The military commander in charge of the region can utilize the forecast value based on the our model to prevent terrorism by deploying forces efficiently.

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

A Comparative Pedagogical Approach to Lifelong Education: Possibilities and Limitations (평생교육의 비교교육학적 접근: 가능성과 한계)

  • Choi, DonMin
    • Korean Journal of Comparative Education
    • /
    • v.28 no.3
    • /
    • pp.291-307
    • /
    • 2018
  • As the value of lifelong learning becomes important, states are making efforts to build a system of lifelong learning. According to this tendency, this paper intends to compare the participation rate of lifelong learning, learning outcomes, learning support infrastructure, support of learning expenses, and recognition of lifelong learning. For the comparative pedagogical approach, Bray and Thomas' cubes such as geographical / regional level, non - geographical demographic statistics, social and educational aspects were utilized. The participation rate of lifelong learning in Korea is 34.4% in 2017, which is lower than the OECD average of 46%. The competency scores of Korean adults were lower than the OECD national averages of the PIAAC survey which measured adult competence, language ability, numeracy, and computer-based problem solving ability. In order to recognize prior learning, EU countries have developed EQFs to evaluate all non-formal and informal learning outcomes, while Korea recognizes qualification as a credit banking credit under the academic credit banking system. International comparisons of lifelong learning can be used as an important tool for diagnosing the actual conditions of lifelong learning in a country and establishing future lifelong learning policies. Therefore, it is necessary to maintain that the comparative pedagogical approach of lifelong learning differs according to the historical context, socioeconomic characteristics, and population dynamics, including the formation process and characteristics of modern countries.