• Title/Summary/Keyword: 평균회귀

Search Result 2,326, Processing Time 0.031 seconds

Study on BMI, Dietary Behavior, and Nutrient Intake Status According to Frequency of Breakfast Intake in Female College Students in Chuncheon Area (춘천지역 일부 여대생의 아침식사 빈도에 따른 BMI, 식행동 및 영양소 섭취상태)

  • Kim, Yoon-Sun;Kim, Bok-Ran
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.46 no.10
    • /
    • pp.1234-1242
    • /
    • 2017
  • The purpose of this study was to investigate BMI, dietary behavior, and nutrient intake status according to frequency of breakfast intake in female college students (n=253) in Chuncheon area. This study was conducted by employing a self-administered questionnaire. Dietary assessment was measured by the 24-h recall method. The subjects were divided into two groups by frequency of breakfast: Five to seven times per week (eating breakfast group, n=139) and none to four times per week (skipping breakfast group, n=114). The living with parents group showed significant high frequency of breakfast intake, whereas the self-boarding group showed significant low frequency of breakfast intake. The body image satisfaction score of the 5~7 times/week group was higher than that of the 0~4 times/week group. The average height and weight of the 5~7 times/week group were $161.0{\pm}0.1cm$ and $52.6{\pm}7.6kg$, respectively, whereas those of the 0~4 times/week group were $160.7{\pm}0.1cm$ and $57.1{\pm}11.8kg$, respectively. The average body mass index (BMI) values of the 5~7 times/week and 0~4 times/week groups were $19.8{\pm}1.9kg/m^2$ and $21.5{\pm}3.4kg/m^2$, respectively. The dietary behavior score of the 5~7 times/week group was higher than that of the 0~4 times/week group. The daily averages for energy, carbohydrate, and protein intakes in the 5~7 times/week group were significantly higher than those of the 0~4 times/week group. Intakes of vitamin A, vitamin $B_1$, vitamin $B_2$, niacin, vitamin $B_6$, P, Zn, and cholesterol in the 5~7 times/week group were significantly higher than those of the 0~4 times/week group. Multiple regression analysis revealed that resident type was the most significant variable associated with breakfast intake frequency. Therefore, strengthening dietary education programs that largely focus on resident type will greatly contribute to prevent skipping breakfast.

A Study on the Clustering Method of Row and Multiplex Housing in Seoul Using K-Means Clustering Algorithm and Hedonic Model (K-Means Clustering 알고리즘과 헤도닉 모형을 활용한 서울시 연립·다세대 군집분류 방법에 관한 연구)

  • Kwon, Soonjae;Kim, Seonghyeon;Tak, Onsik;Jeong, Hyeonhee
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.95-118
    • /
    • 2017
  • Recent centrally the downtown area, the transaction between the row housing and multiplex housing is activated and platform services such as Zigbang and Dabang are growing. The row housing and multiplex housing is a blind spot for real estate information. Because there is a social problem, due to the change in market size and information asymmetry due to changes in demand. Also, the 5 or 25 districts used by the Seoul Metropolitan Government or the Korean Appraisal Board(hereafter, KAB) were established within the administrative boundaries and used in existing real estate studies. This is not a district classification for real estate researches because it is zoned urban planning. Based on the existing study, this study found that the city needs to reset the Seoul Metropolitan Government's spatial structure in estimating future housing prices. So, This study attempted to classify the area without spatial heterogeneity by the reflected the property price characteristics of row housing and Multiplex housing. In other words, There has been a problem that an inefficient side has arisen due to the simple division by the existing administrative district. Therefore, this study aims to cluster Seoul as a new area for more efficient real estate analysis. This study was applied to the hedonic model based on the real transactions price data of row housing and multiplex housing. And the K-Means Clustering algorithm was used to cluster the spatial structure of Seoul. In this study, data onto real transactions price of the Seoul Row housing and Multiplex Housing from January 2014 to December 2016, and the official land value of 2016 was used and it provided by Ministry of Land, Infrastructure and Transport(hereafter, MOLIT). Data preprocessing was followed by the following processing procedures: Removal of underground transaction, Price standardization per area, Removal of Real transaction case(above 5 and below -5). In this study, we analyzed data from 132,707 cases to 126,759 data through data preprocessing. The data analysis tool used the R program. After data preprocessing, data model was constructed. Priority, the K-means Clustering was performed. In addition, a regression analysis was conducted using Hedonic model and it was conducted a cosine similarity analysis. Based on the constructed data model, we clustered on the basis of the longitude and latitude of Seoul and conducted comparative analysis of existing area. The results of this study indicated that the goodness of fit of the model was above 75 % and the variables used for the Hedonic model were significant. In other words, 5 or 25 districts that is the area of the existing administrative area are divided into 16 districts. So, this study derived a clustering method of row housing and multiplex housing in Seoul using K-Means Clustering algorithm and hedonic model by the reflected the property price characteristics. Moreover, they presented academic and practical implications and presented the limitations of this study and the direction of future research. Academic implication has clustered by reflecting the property price characteristics in order to improve the problems of the areas used in the Seoul Metropolitan Government, KAB, and Existing Real Estate Research. Another academic implications are that apartments were the main study of existing real estate research, and has proposed a method of classifying area in Seoul using public information(i.e., real-data of MOLIT) of government 3.0. Practical implication is that it can be used as a basic data for real estate related research on row housing and multiplex housing. Another practical implications are that is expected the activation of row housing and multiplex housing research and, that is expected to increase the accuracy of the model of the actual transaction. The future research direction of this study involves conducting various analyses to overcome the limitations of the threshold and indicates the need for deeper research.

Product Community Analysis Using Opinion Mining and Network Analysis: Movie Performance Prediction Case (오피니언 마이닝과 네트워크 분석을 활용한 상품 커뮤니티 분석: 영화 흥행성과 예측 사례)

  • Jin, Yu;Kim, Jungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.49-65
    • /
    • 2014
  • Word of Mouth (WOM) is a behavior used by consumers to transfer or communicate their product or service experience to other consumers. Due to the popularity of social media such as Facebook, Twitter, blogs, and online communities, electronic WOM (e-WOM) has become important to the success of products or services. As a result, most enterprises pay close attention to e-WOM for their products or services. This is especially important for movies, as these are experiential products. This paper aims to identify the network factors of an online movie community that impact box office revenue using social network analysis. In addition to traditional WOM factors (volume and valence of WOM), network centrality measures of the online community are included as influential factors in box office revenue. Based on previous research results, we develop five hypotheses on the relationships between potential influential factors (WOM volume, WOM valence, degree centrality, betweenness centrality, closeness centrality) and box office revenue. The first hypothesis is that the accumulated volume of WOM in online product communities is positively related to the total revenue of movies. The second hypothesis is that the accumulated valence of WOM in online product communities is positively related to the total revenue of movies. The third hypothesis is that the average of degree centralities of reviewers in online product communities is positively related to the total revenue of movies. The fourth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. The fifth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. To verify our research model, we collect movie review data from the Internet Movie Database (IMDb), which is a representative online movie community, and movie revenue data from the Box-Office-Mojo website. The movies in this analysis include weekly top-10 movies from September 1, 2012, to September 1, 2013, with in total. We collect movie metadata such as screening periods and user ratings; and community data in IMDb including reviewer identification, review content, review times, responder identification, reply content, reply times, and reply relationships. For the same period, the revenue data from Box-Office-Mojo is collected on a weekly basis. Movie community networks are constructed based on reply relationships between reviewers. Using a social network analysis tool, NodeXL, we calculate the averages of three centralities including degree, betweenness, and closeness centrality for each movie. Correlation analysis of focal variables and the dependent variable (final revenue) shows that three centrality measures are highly correlated, prompting us to perform multiple regressions separately with each centrality measure. Consistent with previous research results, our regression analysis results show that the volume and valence of WOM are positively related to the final box office revenue of movies. Moreover, the averages of betweenness centralities from initial community networks impact the final movie revenues. However, both of the averages of degree centralities and closeness centralities do not influence final movie performance. Based on the regression results, three hypotheses, 1, 2, and 4, are accepted, and two hypotheses, 3 and 5, are rejected. This study tries to link the network structure of e-WOM on online product communities with the product's performance. Based on the analysis of a real online movie community, the results show that online community network structures can work as a predictor of movie performance. The results show that the betweenness centralities of the reviewer community are critical for the prediction of movie performance. However, degree centralities and closeness centralities do not influence movie performance. As future research topics, similar analyses are required for other product categories such as electronic goods and online content to generalize the study results.

Stock-Index Invest Model Using News Big Data Opinion Mining (뉴스와 주가 : 빅데이터 감성분석을 통한 지능형 투자의사결정모형)

  • Kim, Yoo-Sin;Kim, Nam-Gyu;Jeong, Seung-Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.143-156
    • /
    • 2012
  • People easily believe that news and stock index are closely related. They think that securing news before anyone else can help them forecast the stock prices and enjoy great profit, or perhaps capture the investment opportunity. However, it is no easy feat to determine to what extent the two are related, come up with the investment decision based on news, or find out such investment information is valid. If the significance of news and its impact on the stock market are analyzed, it will be possible to extract the information that can assist the investment decisions. The reality however is that the world is inundated with a massive wave of news in real time. And news is not patterned text. This study suggests the stock-index invest model based on "News Big Data" opinion mining that systematically collects, categorizes and analyzes the news and creates investment information. To verify the validity of the model, the relationship between the result of news opinion mining and stock-index was empirically analyzed by using statistics. Steps in the mining that converts news into information for investment decision making, are as follows. First, it is indexing information of news after getting a supply of news from news provider that collects news on real-time basis. Not only contents of news but also various information such as media, time, and news type and so on are collected and classified, and then are reworked as variable from which investment decision making can be inferred. Next step is to derive word that can judge polarity by separating text of news contents into morpheme, and to tag positive/negative polarity of each word by comparing this with sentimental dictionary. Third, positive/negative polarity of news is judged by using indexed classification information and scoring rule, and then final investment decision making information is derived according to daily scoring criteria. For this study, KOSPI index and its fluctuation range has been collected for 63 days that stock market was open during 3 months from July 2011 to September in Korea Exchange, and news data was collected by parsing 766 articles of economic news media M company on web page among article carried on stock information>news>main news of portal site Naver.com. In change of the price index of stocks during 3 months, it rose on 33 days and fell on 30 days, and news contents included 197 news articles before opening of stock market, 385 news articles during the session, 184 news articles after closing of market. Results of mining of collected news contents and of comparison with stock price showed that positive/negative opinion of news contents had significant relation with stock price, and change of the price index of stocks could be better explained in case of applying news opinion by deriving in positive/negative ratio instead of judging between simplified positive and negative opinion. And in order to check whether news had an effect on fluctuation of stock price, or at least went ahead of fluctuation of stock price, in the results that change of stock price was compared only with news happening before opening of stock market, it was verified to be statistically significant as well. In addition, because news contained various type and information such as social, economic, and overseas news, and corporate earnings, the present condition of type of industry, market outlook, the present condition of market and so on, it was expected that influence on stock market or significance of the relation would be different according to the type of news, and therefore each type of news was compared with fluctuation of stock price, and the results showed that market condition, outlook, and overseas news was the most useful to explain fluctuation of news. On the contrary, news about individual company was not statistically significant, but opinion mining value showed tendency opposite to stock price, and the reason can be thought to be the appearance of promotional and planned news for preventing stock price from falling. Finally, multiple regression analysis and logistic regression analysis was carried out in order to derive function of investment decision making on the basis of relation between positive/negative opinion of news and stock price, and the results showed that regression equation using variable of market conditions, outlook, and overseas news before opening of stock market was statistically significant, and classification accuracy of logistic regression accuracy results was shown to be 70.0% in rise of stock price, 78.8% in fall of stock price, and 74.6% on average. This study first analyzed relation between news and stock price through analyzing and quantifying sensitivity of atypical news contents by using opinion mining among big data analysis techniques, and furthermore, proposed and verified smart investment decision making model that could systematically carry out opinion mining and derive and support investment information. This shows that news can be used as variable to predict the price index of stocks for investment, and it is expected the model can be used as real investment support system if it is implemented as system and verified in the future.

Soil Texture, Electrical Conductivity and Chemical Components of Soils under the Plastic Film House Cultivation in Northern Central Areas of Korea (중북부지역(中北部地域) 시설원예지(施設園藝地) 토양(土壤)의 토성(土性), 염농도(鹽濃度) 및 화학성분(化學成分)의 조성(組成))

  • Jung, Goo-Bok;Ryu, In-Soo;Kim, Bok-Young
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.27 no.1
    • /
    • pp.33-39
    • /
    • 1994
  • This survey was conducted to investigate the factors affecting on salt accumulation and chemical components of soils cultivated with horticulture crops in plastic film houses. The soil samples were taken from 40 sites in the northern central areas of Korea and were analyzed for the chemical properties and soil separates. The data were evaluated with soil texture and years of cultivation as major factors. The results are summarized as follows : 1. The chemical properties of surface soils in plastic film house were pH 5.80, EC $3.59mScm^{-1}$, O.M. 4.20%, Av. $P_2O_5$ 1,178ppm, $NO_3-N$ 180ppm, Av. $SO_4{^{2-}}$ 353ppm, $Cl^-$ 240ppm, Ex. Na 0.40me/100g. 2. Compared to the outside soil of plastic film house, the inside soil had 2.5~3 times higher contents of $NO_3-N$, Av. $SO_4{^{2-}}$ and $Cl^-$, 1.2~1.8 times higher exchangeable base elements, and 2.8 times higher electrical conductivity. But pH value of the inside soil was lower than the outside soil by 0.3 pH unit. 3. Soil texture classification showed that sandy loam, loam and silt loam were 32.5 %, 37.5 %, and 30.0 %, respectively. The contents of $NO_3-N$, Av. $SO_4{^{2-}}$, $NH_4-N$ and EC value were very high in silt loam soils. Av. $P_2O_5$ content and pH value of sandy loam soils were higher than those of silt loam and loam soils. 4. The contents of O.M. and Av. $P_2O_5$ were higher in long term cultivation, but the contents of $NO_3-N$, Av. $SO_4{^{2-}}$, $Cl^-$, Ex. Mg and Ex. Na including EC of the soil with 2~4 years cultivation were higher than those of the soil with above 5 years cultivation. 5. Multiple linear regression analysis showed that contribution degree of soil chemical properties to the EC was high in the order of $NO_3-N$ > Av. $SO_4{^{2-}}$ > Ex. Na > $Cl^-$ > Av. $P_2O_5$ > $NH_4-N$ > Ex. Mg>Ex. Ca. Among the soil chemical properties the contribution of anions was remarkably high. 6. EC value correlated with ${\sum}A$(total content of anions)as $r=0.932^{**}$ and with ${\sum}C$(total content of cations) as $r=0.452^{**}$.

  • PDF

A Study on the Differences in Breeding Call of Cicadas in Urban and Forest Areas (도시와 산림지역 매미과 번식울음 차이 연구)

  • Kim, Yoon-Jae;Ki, Kyong-Seok
    • Korean Journal of Environment and Ecology
    • /
    • v.32 no.6
    • /
    • pp.698-708
    • /
    • 2018
  • The purpose of this study was to investigate differences in the breeding call characteristics of cicada species found in urban and forest areas in the central region of Korea by examining the interspecific effects and environmental factors affecting the breeding calls and breeding call patterns. The selected research sites were Gyungnam Apartment in Bangbae-dong, Seoul for the urban area and Chiak Mountain National Park in Wonju for the forest area. The research method for both sites was to record cicada breeding calls for 24 hours with a recorder installed at the site and analyze the results. Data from the Korea Meteorological Administration were used for environmental factors. The research period was from June 19, 2017 to September 30, 2017. As a result of the study, there were differences in the emergence of species between the two research sites: while Platypleura kaempferi, Hyalessa fuscata, Meimuna opalifera, Graptopsaltria nigrofuscata, and Suisha coreana were observed at both sites, Cryptotympana atrata was observed in the urban area and Leptosemia takanonis in the forest area only. The emergence periods of cicadas at the two sites were also different. The activities of P. kaempferi and L. takanonis were noticeable in the forest area. In the urban area, however, L. takanonis was not observed and the duration of activity of P. kaempferi was short. In the urban area, C. atrata appeared and sang for a long period; H. fuscata, M. opalifera, and G. nigrofuscata appeared earlier than in the forest area. S. coreana appeared earlier in the forest area than in the urban area. According to the daily call cycle analysis, even cospecific cicada showed a wide variation in their daily cycle depending on the region and the interspecific effects between different cicadas, and the environmental differences between the urban and forest areas affected the calls of cicadas. The results of correlation analysis between each cicada breeding calls and environmental factors of each site showed positive correlation with average temperature of most cicadas except P. kaempferi and C. atrata. The same species of each site showed positive correlations with more diverse weather factors such as solar irradiance. Logistic regression analysis showed that cicadas with overlapping calling times had significant effects on each other's breeding calls. C. atrata, which appeared only in the urban area, had a positive effect on the calling frequency of H. fuscata, M. opalifera, and G. nigrofuscata, which called in the same period. Additionally, L. takanonis, which appeared only in the forest area, and P. kaempferi had a positive effect on each other, and M. opalifera had a positive effect on the calling frequency of H. fuscata and G. nigrofuscata in the forest area. For the environmental factors, the calling frequency of cicadas was affected by the average temperatures of the urban and forest areas, and cicadas that appeared in the forest area were also affected by the amount of solar radiation. According to the results of statistical analysis, urban cicadas with similar activity periods are influenced by species, especially with respect to urban dominant species, C. atrata. Forest cicadas were influenced by species, mainly M. opalifera, which is a forest dominant species. The results of the meteorological impact analysis were similar to those of the correlation analysis, and were influenced mainly by the temperature, and the influence of the insolation was more increased in the forests.