• Title/Summary/Keyword: probabilistic study

Search Result 1,439, Processing Time 0.028 seconds

Features of sample concepts in the probability and statistics chapters of Korean mathematics textbooks of grades 1-12 (초.중.고등학교 확률과 통계 단원에 나타난 표본개념에 대한 분석)

  • Lee, Young-Ha;Shin, Sou-Yeong
    • Journal of Educational Research in Mathematics
    • /
    • v.21 no.4
    • /
    • pp.327-344
    • /
    • 2011
  • This study is the first step for us toward improving high school students' capability of statistical inferences, such as obtaining and interpreting the confidence interval on the population mean that is currently learned in high school. We suggest 5 underlying concepts of 'discretion of contingency and inevitability', 'discretion of induction and deduction', 'likelihood principle', 'variability of a statistic' and 'statistical model', those are necessary to appreciate statistical inferences as a reliable arguing tools in spite of its occasional erroneous conclusions. We assume those 5 concepts above are to be gradually developing in their school periods and Korean mathematics textbooks of grades 1-12 were analyzed. Followings were found. For the right choice of solving methodology of the given problem, no elementary textbook but a few high school textbooks describe its difference between the contingent circumstance and the inevitable one. Formal definitions of population and sample are not introduced until high school grades, so that the developments of critical thoughts on the reliability of inductive reasoning could not be observed. On the contrary of it, strong emphasis lies on the calculation stuff of the sample data without any inference on the population prospective based upon the sample. Instead of the representative properties of a random sample, more emphasis lies on how to get a random sample. As a result of it, the fact that 'the random variability of the value of a statistic which is calculated from the sample ought to be inherited from the randomness of the sample' could neither be noticed nor be explained as well. No comparative descriptions on the statistical inferences against the mathematical(deductive) reasoning were found. Few explanations on the likelihood principle and its probabilistic applications in accordance with students' cognitive developmental growth were found. It was hard to find the explanation of a random variability of statistics and on the existence of its sampling distribution. It is worthwhile to explain it because, nevertheless obtaining the sampling distribution of a particular statistic, like a sample mean, is a very difficult job, mere noticing its existence may cause a drastic change of understanding in a statistical inference.

  • PDF

Microbial Risk Assessment of High Risk Vibrio Foodborne Illness Through Raw Oyster Consumption (생굴 섭취로 인한 고병원성 Vibrio균 식중독 위해평가)

  • Ha, Jimyeong;Lee, Jeeyeon;Oh, Hyemin;Shin, Il-Shik;Kim, Young-Mog;Park, Kwon-Sam;Yoon, Yohan
    • Journal of Food Hygiene and Safety
    • /
    • v.35 no.1
    • /
    • pp.37-44
    • /
    • 2020
  • This study investigated the probability of foodborne illness caused by raw oyster consumption contaminated with high risk Vibrio species such as V. vulnificus and V. cholerae. Eighty-eight raw oyster samples were collected from the south coast, west coast and Seoul areas, and examined for the prevalence of high risk Vibrio species. The growth patterns of V. vulnificus and V. cholerae in raw oysters were evaluated, and consumption frequency and amounts for raw oyster were investigated from a Korean National Health and Nutrition Examination Survey. With the collected data, a risk assessment simulation was conducted to estimate the probability of foodborne illness caused by intake of raw oysters, using @RISK. Of 88 raw oysters, there were no V. vulnificus- or V. cholerae-positive samples. Thus, initial contamination levels of Vibrio species in raw oysters were estimated by the statistical methods developed by Vose and Sanaa, and the estimated value for the both Vibrio spp. was -3.6 Log CFU/g. In raw oyster, cell counts of V. vulnificus and V. cholerae remained unchanged. The incidence of raw oyster consumers was 0.35%, and the appropriate probabilistic distribution for the consumption amounts was the exponential distribution. A risk assessment simulation model was developed with the collected data, and the probability of the foodborne illness caused by the consumption of raw oyster was 9.08×10-15 for V. vulnificus and 8.16×10-13 for V. cholerae. Consumption frequency was the first factor, influencing the probability of foodborne illness.

Quantitative Microbial Risk Assessment of Pathogenic Vibrio through Sea Squirt Consumption in Korea (우렁쉥이에 대한 병원성 비브리오균 정량적 미생물 위해평가)

  • Ha, Jimyeong;Lee, Jeeyeon;Oh, Hyemin;Shin, Il-Shik;Kim, Young-Mog;Park, Kwon-Sam;Yoon, Yohan
    • Journal of Food Hygiene and Safety
    • /
    • v.35 no.1
    • /
    • pp.51-59
    • /
    • 2020
  • This study evalutated the risk of foodborne illness from Vibrio spp. (Vibrio vulnificus and Vibrio cholerae) through sea squirt consumption. The prevalence of V. vulnificus and V. cholerae in sea squirt was evaluated, and the predictive models to describe the kinetic behavior of the Vibrio in sea squirt were developed. Distribution temperatures and times were collected, and they were fitted to probabilistic distributions to determine the appropriate distributions. The raw data from the Korea National Health and Nutrition Examination Survey 2016 were used to estimate the consumption rates and amount of sea squirt. In the hazard characterization, the Beta-Poisson model for V. vulnificus and V. cholerae infection was used. With the collected data, a simulation model was prepared and it was run with @RISK to estimate probabilities of foodborne illness by pathogenic Vibrio spp. through sea squirt consumption. Among 101 sea squirt samples, there were no V. vulnificus positive samples, but V. cholerae was detected in one sample. The developed predictive models described the fates of Vibrio spp. in sea squirt during distribution and storage, appropriately shown as 0.815-0.907 of R2 and 0.28 of RMSE. The consumption rate of sea squirt was 0.26%, and the daily consumption amount was 68.84 g per person. The Beta-Poisson model [P=1-(1+Dose/β)] was selected as a dose-response model. With these data, a simulation model was developed, and the risks of V. vulnificus and V. cholerae foodborne illness from sea squirt consumption were 2.66×10-15, and 1.02×10-12, respectively. These results suggest that the risk of pathogenic Vibrio spp. in sea squirt could be considered low in Korea.

Assessment of Hyperperfusion by Brain Perfusion SPECT in Transient Neurological Deterioration after Superficial Temporal Artery-Middle Cerebral Artery Anastomosis Surgery (천측두동맥-중대뇌동맥 문합술 후 발생한 일과성 신경학적 악화에서 뇌관류 SPECT를 이용한 과관류 평가)

  • Lee, Jeong-Won;Kim, Yu-Kyeong;Lee, Sang-Mi;Eo, Jae-Sun;Oh, Chang-Wan;Lee, Won-Woo;Paeng, Jin-Chul;Kim, Sang-Eun
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.42 no.4
    • /
    • pp.267-274
    • /
    • 2008
  • Purpose: Transient neurological deterioration (TND) is one of the complications after extracranial-intracranial bypass surgery, and it has been assumed to be caused by postoperative transient hyperperfusion. This study was performed to evaluate the relationship between TND and preoperative and postoperative cerebral perfusion status on brain perfusion SPECT following superficial temporal artery - middle cerebral artery (STA-MCA) anastomosis surgery. Materials and Methods: A total of 60 STA-MCA anastomosis surgeries of 56 patients (mean age: $50{\pm}16$ yrs; M:F=29:27; atherosclerotic disease: 33, moyamoya disease: 27) which were done between September 2003 and July 2006 were enrolled. The resting cerebral perfusion and cerebral vascular reserve (CVR) after acetazolamide challenge were measured before and 10 days after surgery using 99mTc-ethylcysteinate dimer (ECD) SPECT. Moreover, the cerebral perfusion was measured on the third postoperative day. With the use of the statistical parametric mapping and probabilistic brain atlas, the counts for the middle cerebral artery (MCA) territory were calculated for each image, and statistical analyses were performed. Results: In 6 of 60 cases (10%), TND occurred after surgery. In all patients, the preoperative cerebral perfusion of affected MCA territory was significantly lower than that of contralateral side (p=0.002). The cerebral perfusion on the third and tenth day after surgery was significantly higher than preoperative cerebral perfusion (p=0.001, p=0.02). In TND patients, basal cerebral perfusion and CVR on preoperative SPECT were significantly lower than those of non-TND patients (p=0.01, p=0.05). Further, the increases in cerebral perfusion on the third day after surgery were significant higher than those in other patients (p=0.008). In patients with TND, the cerebral perfusion ratio of affected side to contralateral side on third postoperative day was significantly higher than that of other patients (p=0.002). However, there was no significant difference of the cerebral perfusion ratio on preoperative and tenth postoperative day between patients with TND and other patients. Conclusion: In patients with TND, relative and moderate hyperperfusion was observed in affected side after bypass surgery. These finding may help to understand the pathophysiology of TND.

Estimation of freeze damage risk according to developmental stage of fruit flower buds in spring (봄철 과수 꽃눈 발육 수준에 따른 저온해 위험도 산정)

  • Kim, Jin-Hee;Kim, Dae-jun;Kim, Soo-ock;Yun, Eun-jeong;Ju, Okjung;Park, Jong Sun;Shin, Yong Soon
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.21 no.1
    • /
    • pp.55-64
    • /
    • 2019
  • The flowering seasons can be advanced due to climate change that would cause an abnormally warm winter. Such warm winter would increase the frequency of crop damages resulted from sudden occurrences of low temperature before and after the vegetative growth stages, e.g., the period from germination to flowering. The degree and pattern of freezing damage would differ by the development stage of each individual fruit tree even in an orchard. A critical temperature, e.g., killing temperature, has been used to predict freeze damage by low-temperature conditions under the assumption that such damage would be associated with the development stage of a fruit flower bud. However, it would be challenging to apply the critical temperature to a region where spatial variation in temperature would be considerably high. In the present study, a phenological model was used to estimate major bud development stages, which would be useful for prediction of regional risks for the freeze damages. We also derived a linear function to calculate a probabilistic freeze risk in spring, which can quantitatively evaluate the risk level based solely on forecasted weather data. We calculated the dates of freeze damage occurrences and spatial risk distribution according to main production areas by applying the spring freeze risk function to apple, peach, and pear crops in 2018. It was predicted that the most extensive low-temperature associated freeze damage could have occurred on April 8. It was also found that the risk function was useful to identify the main production areas where the greatest damage to a given crop could occur. These results suggest that the freezing damage associated with the occurrence of low-temperature events could decrease providing early warning for growers to respond abnormal weather conditions for their farm.

Surrogate Model-Based Global Sensitivity Analysis of an I-Shape Curved Steel Girder Bridge under Seismic Loads (지진하중을 받는 I형 곡선거더 단경간 교량의 대리모델 기반 전역 민감도 분석)

  • Jun-Tai, Jeon;Hoyoung Son;Bu-Seog, Ju
    • Journal of the Society of Disaster Information
    • /
    • v.19 no.4
    • /
    • pp.976-983
    • /
    • 2023
  • Purpose: The dynamic behavior of a bridge structure under seismic loading depends on many uncertainties, such as the nature of the seismic waves and the material and geometric properties. However, not all uncertainties have a significant impact on the dynamic behavior of a bridge structure. Since probabilistic seismic performance evaluation considering even low-impact uncertainties is computationally expensive, the uncertainties should be identified by considering their impact on the dynamic behavior of the bridge. Therefore, in this study, a global sensitivity analysis was performed to identify the main parameters affecting the dynamic behavior of bridges with I-curved girders. Method: Considering the uncertainty of the earthquake and the material and geometric uncertainty of the curved bridge, a finite element analysis was performed, and a surrogate model was developed based on the analysis results. The surrogate model was evaluated using performance metrics such as coefficient of determination, and finally, a global sensitivity analysis based on the surrogate model was performed. Result: The uncertainty factors that have the greatest influence on the stress response of the I-curved girder under seismic loading are the peak ground acceleration (PGA), the height of the bridge (h), and the yield stress of the steel (fy). The main effect sensitivity indices of PGA, h, and fy were found to be 0.7096, 0.0839, and 0.0352, respectively, and the total sensitivity indices were found to be 0.9459, 0.1297, and 0.0678, respectively. Conclusion: The stress response of the I-shaped curved girder is dominated by the uncertainty of the input motions and is strongly influenced by the interaction effect between each uncertainty factor. Therefore, additional sensitivity analysis of the uncertainty of the input motions, such as the number of input motions and the intensity measure(IM), and a global sensitivity analysis considering the structural uncertainty, such as the number and curvature of the curved girders, are required.

A Study on the Market Structure Analysis for Durable Goods Using Consideration Set:An Exploratory Approach for Automotive Market (고려상표군을 이용한 내구재 시장구조 분석에 관한 연구: 자동차 시장에 대한 탐색적 분석방법)

  • Lee, Seokoo
    • Asia Marketing Journal
    • /
    • v.14 no.2
    • /
    • pp.157-176
    • /
    • 2012
  • Brand switching data frequently used in market structure analysis is adequate to analyze non- durable goods, because it can capture competition between specific two brands. But brand switching data sometimes can not be used to analyze goods like automobiles having long term duration because one of main assumptions that consumer preference toward brand attributes is not changed against time can be violated. Therefore a new type of data which can precisely capture competition among durable goods is needed. Another problem of using brand switching data collected from actual purchase behavior is short of explanation why consumers consider different set of brands. Considering above problems, main purpose of this study is to analyze market structure for durable goods with consideration set. The author uses exploratory approach and latent class clustering to identify market structure based on heterogeneous consideration set among consumers. Then the relationship between some factors and consideration set formation is analyzed. Some benefits and two demographic variables - age and income - are selected as factors based on consumer behavior theory. The author analyzed USA automotive market with top 11 brands using exploratory approach and latent class clustering. 2,500 respondents are randomly selected from the total sample and used for analysis. Six models concerning market structure are established to test. Model 1 means non-structured market and model 6 means market structure composed of six sub-markets. It is exploratory approach because any hypothetical market structure is not defined. The result showed that model 1 is insufficient to fit data. It implies that USA automotive market is a structured market. Model 3 with three market structures is significant and identified as the optimal market structure in USA automotive market. Three sub markets are named as USA brands, Asian Brands, and European Brands. And it implies that country of origin effect may exist in USA automotive market. Comparison between modal classification by derived market structures and probabilistic classification by research model was conducted to test how model 3 can correctly classify respondents. The model classify 97% of respondents exactly. The result of this study is different from those of previous research. Previous research used confirmatory approach. Car type and price were chosen as criteria for market structuring and car type-price structure was revealed as the optimal structure for USA automotive market. But this research used exploratory approach without hypothetical market structures. It is not concluded yet which approach is superior. For confirmatory approach, hypothetical market structures should be established exhaustively, because the optimal market structure is selected among hypothetical structures. On the other hand, exploratory approach has a potential problem that validity for derived optimal market structure is somewhat difficult to verify. There also exist market boundary difference between this research and previous research. While previous research analyzed seven car brands, this research analyzed eleven car brands. Both researches seemed to represent entire car market, because cumulative market shares for analyzed brands exceeds 50%. But market boundary difference might affect the different results. Though both researches showed different results, it is obvious that country of origin effect among brands should be considered as important criteria to analyze USA automotive market structure. This research tried to explain heterogeneity of consideration sets among consumers using benefits and two demographic factors, sex and income. Benefit works as a key variable for consumer decision process, and also works as an important criterion in market segmentation. Three factors - trust/safety, image/fun to drive, and economy - are identified among nine benefit related measure. Then the relationship between market structures and independent variables is analyzed using multinomial regression. Independent variables are three benefit factors and two demographic factors. The result showed that all independent variables can be used to explain why there exist different market structures in USA automotive market. For example, a male consumer who perceives all benefits important and has lower income tends to consider domestic brands more than European brands. And the result also showed benefits, sex, and income have an effect to consideration set formation. Though it is generally perceived that a consumer who has higher income is likely to purchase a high priced car, it is notable that American consumers perceived benefits of domestic brands much positive regardless of income. Male consumers especially showed higher loyalty for domestic brands. Managerial implications of this research are as follow. Though implication may be confined to the USA automotive market, the effect of sex on automotive buying behavior should be analyzed. The automotive market is traditionally conceived as male consumers oriented market. But the proportion of female consumers has grown over the years in the automotive market. It is natural outcome that Volvo and Hyundai motors recently developed new cars which are targeted for women market. Secondly, the model used in this research can be applied easier than that of previous researches. Exploratory approach has many advantages except difficulty to apply for practice, because it tends to accompany with complicated model and to require various types of data. The data needed for the model in this research are a few items such as purchased brands, consideration set, some benefits, and some demographic factors and easy to collect from consumers.

  • PDF

A Study on Interactions of Competitive Promotions Between the New and Used Cars (신차와 중고차간 프로모션의 상호작용에 대한 연구)

  • Chang, Kwangpil
    • Asia Marketing Journal
    • /
    • v.14 no.1
    • /
    • pp.83-98
    • /
    • 2012
  • In a market where new and used cars are competing with each other, we would run the risk of obtaining biased estimates of cross elasticity between them if we focus on only new cars or on only used cars. Unfortunately, most of previous studies on the automobile industry have focused on only new car models without taking into account the effect of used cars' pricing policy on new cars' market shares and vice versa, resulting in inadequate prediction of reactive pricing in response to competitors' rebate or price discount. However, there are some exceptions. Purohit (1992) and Sullivan (1990) looked into both new and used car markets at the same time to examine the effect of new car model launching on the used car prices. But their studies have some limitations in that they employed the average used car prices reported in NADA Used Car Guide instead of actual transaction prices. Some of the conflicting results may be due to this problem in the data. Park (1998) recognized this problem and used the actual prices in his study. His work is notable in that he investigated the qualitative effect of new car model launching on the pricing policy of the used car in terms of reinforcement of brand equity. The current work also used the actual price like Park (1998) but the quantitative aspect of competitive price promotion between new and used cars of the same model was explored. In this study, I develop a model that assumes that the cross elasticity between new and used cars of the same model is higher than those amongst new cars and used cars of the different model. Specifically, I apply the nested logit model that assumes the car model choice at the first stage and the choice between new and used cars at the second stage. This proposed model is compared to the IIA (Independence of Irrelevant Alternatives) model that assumes that there is no decision hierarchy but that new and used cars of the different model are all substitutable at the first stage. The data for this study are drawn from Power Information Network (PIN), an affiliate of J.D. Power and Associates. PIN collects sales transaction data from a sample of dealerships in the major metropolitan areas in the U.S. These are retail transactions, i.e., sales or leases to final consumers, excluding fleet sales and including both new car and used car sales. Each observation in the PIN database contains the transaction date, the manufacturer, model year, make, model, trim and other car information, the transaction price, consumer rebates, the interest rate, term, amount financed (when the vehicle is financed or leased), etc. I used data for the compact cars sold during the period January 2009- June 2009. The new and used cars of the top nine selling models are included in the study: Mazda 3, Honda Civic, Chevrolet Cobalt, Toyota Corolla, Hyundai Elantra, Ford Focus, Volkswagen Jetta, Nissan Sentra, and Kia Spectra. These models in the study accounted for 87% of category unit sales. Empirical application of the nested logit model showed that the proposed model outperformed the IIA (Independence of Irrelevant Alternatives) model in both calibration and holdout samples. The other comparison model that assumes choice between new and used cars at the first stage and car model choice at the second stage turned out to be mis-specfied since the dissimilarity parameter (i.e., inclusive or categroy value parameter) was estimated to be greater than 1. Post hoc analysis based on estimated parameters was conducted employing the modified Lanczo's iterative method. This method is intuitively appealing. For example, suppose a new car offers a certain amount of rebate and gains market share at first. In response to this rebate, a used car of the same model keeps decreasing price until it regains the lost market share to maintain the status quo. The new car settle down to a lowered market share due to the used car's reaction. The method enables us to find the amount of price discount to main the status quo and equilibrium market shares of the new and used cars. In the first simulation, I used Jetta as a focal brand to see how its new and used cars set prices, rebates or APR interactively assuming that reactive cars respond to price promotion to maintain the status quo. The simulation results showed that the IIA model underestimates cross elasticities, resulting in suggesting less aggressive used car price discount in response to new cars' rebate than the proposed nested logit model. In the second simulation, I used Elantra to reconfirm the result for Jetta and came to the same conclusion. In the third simulation, I had Corolla offer $1,000 rebate to see what could be the best response for Elantra's new and used cars. Interestingly, Elantra's used car could maintain the status quo by offering lower price discount ($160) than the new car ($205). In the future research, we might want to explore the plausibility of the alternative nested logit model. For example, the NUB model that assumes choice between new and used cars at the first stage and brand choice at the second stage could be a possibility even though it was rejected in the current study because of mis-specification (A dissimilarity parameter turned out to be higher than 1). The NUB model may have been rejected due to true mis-specification or data structure transmitted from a typical car dealership. In a typical car dealership, both new and used cars of the same model are displayed. Because of this fact, the BNU model that assumes brand choice at the first stage and choice between new and used cars at the second stage may have been favored in the current study since customers first choose a dealership (brand) then choose between new and used cars given this market environment. However, suppose there are dealerships that carry both new and used cars of various models, then the NUB model might fit the data as well as the BNU model. Which model is a better description of the data is an empirical question. In addition, it would be interesting to test a probabilistic mixture model of the BNU and NUB on a new data set.

  • PDF

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.