• Title/Summary/Keyword: Classification Variables

Search Result 920, Processing Time 0.028 seconds

Gender discrimination and multivariate analysis using deboning data

  • Shim, Joon-Yong;Kim, Ha-Yeong;Cho, Byoung-Kwan;Lee, Wang-Hee
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2017.04a
    • /
    • pp.23-23
    • /
    • 2017
  • Recent favor on high quality food and concern on food safety have demonstrated the superiority of Hanwoo (Korean native cattle). In general, the price of cow is higher than those of steer and bull, causing cheating issues in the market. Hence, this study is to discriminate genders of Hanwoo with identification of factors which highly influence gender discrimination based on the big-size deboning data. Totally, there were 31 variables in the deboning data, and we divided into them two categories: data obtained before and after deboning. Discriminant function analysis was then applied into the data to determined the accuracy of gender discrimination in Hanwoo. The result showed that Hanwoo could be classified by gender with 99.2% of accuracy when using all 31 variables. In detail, it was possible to identify 93 of 94 bulls (98.9%), 96 of 96 cows (100%) and 74 of 75 steers (98.7%). The most significant variables was chuck, sirloin, armbone shin, plates, retail and cuts percentage, sequentially. With variables obtainable before deboning, accuracies of classification were 91.5% for bulls, 92.7% for cows, and 89.3% for steers. The most significant variables was water, cold carcass weight and back-fat thickness. The discrimination accuracy was higher with data obtainable after deboning: bulls (98.9%), cows (99.0%) and steers (98.7%). In this case, chuck, sirloin and armbone shin were the factors determined the classification ability. This study showed that Hanwoo can be classified based on deboning data with appropriate statistics, further suggesting weight of cut of beef might be the standard for gender classification.

  • PDF

Performance Improvement of Polynomial Adaline by Using Dimension Reduction of Independent Variables (독립변수의 차원감소에 의한 Polynomial Adaline의 성능개선)

  • Cho, Yong-Hyun
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.5 no.1
    • /
    • pp.33-38
    • /
    • 2002
  • This paper proposes an efficient method for improving the performance of polynomial adaline using the dimension reduction of independent variables. The adaptive principal component analysis is applied for reducing the dimension by extracting efficiently the features of the given independent variables. It can be solved the problems due to high dimensional input data in the polynomial adaline that the principal component analysis converts input data into set of statistically independent features. The proposed polynomial adaline has been applied to classify the patterns. The simulation results shows that the proposed polynomial adaline has better performances of the classification for test patterns, in comparison with those using the conventional polynomial adaline. Also, it is affected less by the scope of the smoothing factor.

  • PDF

Development of Traffic Accident Models in Seoul Considering Land Use Characteristics (토지이용특성을 고려한 서울시 교통사고 발생 모형 개발)

  • Lim, Samjin;Park, Juntae
    • Journal of the Society of Disaster Information
    • /
    • v.9 no.1
    • /
    • pp.30-49
    • /
    • 2013
  • In this research we developed a new traffic accident forecasting model on the basis of land use. A new traffic accident forecasting model by type was developed based on market segmentation and further introduction of variables that may reflect characteristics of various regions using Classification and Regression Tree Method. From the results of analysis, activities variables such as the registered population, commuters as well as road size, traffic accidents causing facilities being the subjects of activities were derived as variables explaining traffic accidents.

A Study on the Variables of Clothing Consumer Behavior and Market: Literature Review (선행연구에 나타난 의복소비자 행동변인 및 시장 변인연구)

  • 박혜선
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.20 no.6
    • /
    • pp.1125-1137
    • /
    • 1996
  • The author reviewed seventy papers on social psychology of clothing and fashion marketing fields, which were published in the Journal of the Korean Society of Clothing and Textiles between 1983 and 1996. The market variables and consumer behavior variables were focused on. This review showed that the market variables had been divided into three groups of variables: 1) product variables (product image and product classification): 2) brand variables (brand image and brand positioning): and 3) store variables (store image, store type, and distribution system) Consumer behavior variables have been studied on the basis of EBM Consumer Behavior Model: 1) purchasing motivation as need recognition: 2) information using as search information: 3) evaluation criteria and choice criteria as alternative evaluatioin : 4) clothing purchase, brand choice and store choice as purchase: 5) degree of wear, satisfaction and dissatisfaction as outcome: and 6) clothing discard. Variables that influence on consumer behavior, including situation variables, clothing attitude variables, personal . social variables were added to develop a variable model of clothing consumer behavior using the EBM Consumer Behavior Model.

  • PDF

Convergence of weighted sums of linearly negative quadrant dependent random variables (선형 음의 사분 종속확률변수에서 가중합에 대한 수렴성 연구)

  • Lee, Seung-Woo;Baek, Jong-Il
    • Journal of Applied Reliability
    • /
    • v.12 no.4
    • /
    • pp.265-274
    • /
    • 2012
  • We in this paper discuss the strong law of large numbers for weighted sums of arrays of rowwise LNQD random variables by using a new exponential inequality of LNQD r.v.'s under suitable conditions and we obtain one of corollary.

Bioclimatic Classification and Characterization in South Korea (남한의 생물기후권역 구분과 특성 규명)

  • Choi, Yu-Young;Lim, Chul-Hee;Ryu, Ji-Eun;Piao, Dongfan;Kang, Jin-Young;Zhu, Weihong;Cui, Guishan;Lee, Woo-Kyun;Jeon, Seong-Woo
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.20 no.3
    • /
    • pp.1-18
    • /
    • 2017
  • This study constructed a high-resolution bioclimatic classification map of South Korea which classifies land into homogeneous zones by similar environment properties using advanced statistical techniques compared to existing ecological area classification studies. The climate data provided by WorldClim(1960-1990) were used to generate 27 bioclimatic variables affecting biological habitats, and key environmental variables were derived from Correlation Analysis and Principal Component Analysis. Clustering Analysis was performed using the ISODATA method to construct a 30'(~1km) resolution bioclimatic classification map. South Korea was divided into 21 regions and the results of classification were verified by correlation analysis with the Gross Primary Production(GPP), Actual Vegetation map made by the Ministry of Environment. Each zones' were described and named by its environmental characteristics and major vegetation distribution. This study could provide useful spatial frameworks to support ecosystem research, monitoring and policy decisions.

Predictive Analysis of Problematic Smartphone Use by Machine Learning Technique

  • Kim, Yu Jeong;Lee, Dong Su
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.2
    • /
    • pp.213-219
    • /
    • 2020
  • In this paper, we propose a classification analysis method for diagnosing and predicting problematic smartphone use in order to provide policy data on problematic smartphone use, which is getting worse year after year. Attempts have been made to identify key variables that affect the study. For this purpose, the classification rates of Decision Tree, Random Forest, and Support Vector Machine among machine learning analysis methods, which are artificial intelligence methods, were compared. The data were from 25,465 people who responded to the '2018 Problematic Smartphone Use Survey' provided by the Korea Information Society Agency and analyzed using the R statistical package (ver. 3.6.2). As a result, the three classification techniques showed similar classification rates, and there was no problem of overfitting the model. The classification rate of the Support Vector Machine was the highest among the three classification methods, followed by Decision Tree and Random Forest. The top three variables affecting the classification rate among smartphone use types were Life Service type, Information Seeking type, and Leisure Activity Seeking type.

Comparing Classification Accuracy of Ensemble and Clustering Algorithms Based on Taguchi Design (다구찌 디자인을 이용한 앙상블 및 군집분석 분류 성능 비교)

  • Shin, Hyung-Won;Sohn, So-Young
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.27 no.1
    • /
    • pp.47-53
    • /
    • 2001
  • In this paper, we compare the classification performances of both ensemble and clustering algorithms (Data Bagging, Variable Selection Bagging, Parameter Combining, Clustering) to logistic regression in consideration of various characteristics of input data. Four factors used to simulate the logistic model are (1) correlation among input variables (2) variance of observation (3) training data size and (4) input-output function. In view of the unknown relationship between input and output function, we use a Taguchi design to improve the practicality of our study results by letting it as a noise factor. Experimental study results indicate the following: When the level of the variance is medium, Bagging & Parameter Combining performs worse than Logistic Regression, Variable Selection Bagging and Clustering. However, classification performances of Logistic Regression, Variable Selection Bagging, Bagging and Clustering are not significantly different when the variance of input data is either small or large. When there is strong correlation in input variables, Variable Selection Bagging outperforms both Logistic Regression and Parameter combining. In general, Parameter Combining algorithm appears to be the worst at our disappointment.

  • PDF

A Study on the Classification of Variables Affecting Smartphone Addiction in Decision Tree Environment Using Python Program

  • Kim, Seung-Jae
    • International journal of advanced smart convergence
    • /
    • v.11 no.4
    • /
    • pp.68-80
    • /
    • 2022
  • Since the launch of AI, technology development to implement complete and sophisticated AI functions has continued. In efforts to develop technologies for complete automation, Machine Learning techniques and deep learning techniques are mainly used. These techniques deal with supervised learning, unsupervised learning, and reinforcement learning as internal technical elements, and use the Big-data Analysis method again to set the cornerstone for decision-making. In addition, established decision-making is being improved through subsequent repetition and renewal of decision-making standards. In other words, big data analysis, which enables data classification and recognition/recognition, is important enough to be called a key technical element of AI function. Therefore, big data analysis itself is important and requires sophisticated analysis. In this study, among various tools that can analyze big data, we will use a Python program to find out what variables can affect addiction according to smartphone use in a decision tree environment. We the Python program checks whether data classification by decision tree shows the same performance as other tools, and sees if it can give reliability to decision-making about the addictiveness of smartphone use. Through the results of this study, it can be seen that there is no problem in performing big data analysis using any of the various statistical tools such as Python and R when analyzing big data.

On EM Algorithm For Discrete Classification With Bahadur Model: Unknown Prior Case

  • Kim, Hea-Jung;Jung, Hun-Jo
    • Journal of the Korean Statistical Society
    • /
    • v.23 no.1
    • /
    • pp.63-78
    • /
    • 1994
  • For discrimination with binary variables, reformulated full and first order Bahadur model with incomplete observations are presented. This allows prior probabilities associated with multiple population to be estimated for the sample-based classification rule. The EM algorithm is adopted to provided the maximum likelihood estimates of the parameters of interest. Some experiences with the models are evaluated and discussed.

  • PDF