• Title/Summary/Keyword: ranked set simple

Search Result 20, Processing Time 0.025 seconds

DISTRIBUTiON-FREE TWO-SAMPLE TEST ON RANKED-SET SAMPLES

  • DONG HEE KIM;YOUNG CHEOL KIM;MYUNG HWA CHO
    • Communications for Statistical Applications and Methods
    • /
    • v.5 no.1
    • /
    • pp.133-144
    • /
    • 1998
  • In this paper, we propose the two-sample test statistic using Wilcoxon signed rank test on ranked-set sampling(RSS) and obtain the asymptotic relative efficiencies(ARE) of the proposed test statistic with respect to Mann-Whitney-Wilcoxon statistic on simple random sampling(SRS), the Mann-Whitney-Wilcoxon statistic on RSS, sign statistic on RSS and Wilcoxon signed rank test on SRS. From the simulation works, we compare the powers of the proposed test statistic, Mann-Whitney-Wilcoxon statistic on RSS, the usual two-sample t statistic, sign statistic on RSS, where the underlying distributions are uniform, normal, double exponential, logistic and Cauchy distributions.

  • PDF

Estimation of P(X > Y) when X and Y are dependent random variables using different bivariate sampling schemes

  • Samawi, Hani M.;Helu, Amal;Rochani, Haresh D.;Yin, Jingjing;Linder, Daniel
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.5
    • /
    • pp.385-397
    • /
    • 2016
  • The stress-strength models have been intensively investigated in the literature in regards of estimating the reliability ${\theta}$ = P(X > Y) using parametric and nonparametric approaches under different sampling schemes when X and Y are independent random variables. In this paper, we consider the problem of estimating ${\theta}$ when (X, Y) are dependent random variables with a bivariate underlying distribution. The empirical and kernel estimates of ${\theta}$ = P(X > Y), based on bivariate ranked set sampling (BVRSS) are considered, when (X, Y) are paired dependent continuous random variables. The estimators obtained are compared to their counterpart, bivariate simple random sampling (BVSRS), via the bias and mean square error (MSE). We demonstrate that the suggested estimators based on BVRSS are more efficient than those based on BVSRS. A simulation study is conducted to gain insight into the performance of the proposed estimators. A real data example is provided to illustrate the process.

Evaluating the efficiency of treatment comparison in crossover design by allocating subjects based on ranked auxiliary variable

  • Huang, Yisong;Samawi, Hani M.;Vogel, Robert;Yin, Jingjing;Gato, Worlanyo Eric;Linder, Daniel F.
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.6
    • /
    • pp.543-553
    • /
    • 2016
  • The validity of statistical inference depends on proper randomization methods. However, even with proper randomization, we can have imbalanced with respect to important characteristics. In this paper, we introduce a method based on ranked auxiliary variables for treatment allocation in crossover designs using Latin squares models. We evaluate the improvement of the efficiency in treatment comparisons using the proposed method. Our simulation study reveals that our proposed method provides a more powerful test compared to simple randomization with the same sample size. The proposed method is illustrated by conducting an experiment to compare two different concentrations of titanium dioxide nanofiber (TDNF) on rats for the purpose of comparing weight gain.

Using ranked auxiliary covariate as a more efficient sampling design for ANCOVA model: analysis of a psychological intervention to buttress resilience

  • Jabrah, Rajai;Samawi, Hani M.;Vogel, Robert;Rochani, Haresh D.;Linder, Daniel F.;Klibert, Jeff
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.241-254
    • /
    • 2017
  • Drawing a sample can be costly or time consuming in some studies. However, it may be possible to rank the sampling units according to some baseline auxiliary covariates, which are easily obtainable, and/or cost efficient. Ranked set sampling (RSS) is a method to achieve this goal. In this paper, we propose a modified approach of the RSS method to allocate units into an experimental study that compares L groups. Computer simulation estimates the empirical nominal values and the empirical power values for the test procedure of comparing L different groups using modified RSS based on the regression approach in analysis of covariance (ANCOVA) models. A comparison to simple random sampling (SRS) is made to demonstrate efficiency. The results indicate that the required sample sizes for a given precision are smaller under RSS than under SRS. The modified RSS protocol was applied to an experimental study. The experimental study was designed to obtain a better understanding of the pathways by which positive experiences (i.e., goal completion) contribute to higher levels of happiness, well-being, and life satisfaction. The use of the RSS method resulted in a cost reduction associated with smaller sample size without losing the precision of the analysis.

SIMPLE RANKED SAMPLING SCHEME: MODIFICATION AND APPLICATION IN THE THEORY OF ESTIMATION OF ERLANG DISTRIBUTION

  • RAFIA GULZAR;IRSA SAJJAD;M. YOUNUS BHAT;SHAKEEL UL REHMAN
    • Journal of applied mathematics & informatics
    • /
    • v.41 no.2
    • /
    • pp.449-468
    • /
    • 2023
  • This paper deals in the study of the estimation of the parameters of Erlang distribution based on rank set sampling and some of its modifications. Here we considered Maximum Likelihood (ML) and the Bayesian technique to estimate the shape and scale parameter of Erlang distribution based on RSS and its some modifications such as ERSS, MRSS, and MRSSu. The derivation for unknown parameters of Erlang distribution is well presented using normal approximation to the asymptotic distribution of ML estimators. But due to the complexity involves in the integral, the Bayes estimator of unknown parameters is obtained using MCMC method. Further, we compared the MSE of estimation in different sampling schemes with different set sizes and cycle size. A real-life data application is also given to illustrate the efficiency of the proposed scheme.

An Economic Analysis of Flat Pricing for Unlimited Voice Calls : Necessary Conditions and MNO's Strategy (음성무제한 요금제경쟁의 경제적 분석 : 무제한요금제 도입 필요조건과 통신사의 선택)

  • Kim, Weonseek
    • Journal of Information Technology Services
    • /
    • v.12 no.3
    • /
    • pp.111-126
    • /
    • 2013
  • As the gaps become narrower in interconnection fee and volume rate, the MNOs began to introduce flat pricing for unlimited voice traffic competitively in Korea wireless telecommunication market : 'unlimited talks within intra-network' by the 1st operator, followed by the 3rd operator's 'unlimited talks over all networks'. As a result, subscribers tip in toward the third ranked operator and could bring a substantial change to steadfast market structure over the last decade in Korea. This paper aims to develop a simple economic model to analyze competition with flat pricing for unlimited voice traffic, and to check whether the pricing can be appropriate for the MNOs. The results show that MNOs already step in the necessary conditions to launch flat pricing for voice traffic. It also predicts that the MNOs compete with unlimited talk over all networks and set a single fee in an equilibrium. At present, the MNOs run virtually identical pricing for unlimited talk over all networks, considering their differentiation with respect to service quality, coverage and brand preference.

A Study on Improving the Effectiveness of Information Retrieval Through P-norm, RF, LCAF

  • Kim, Young-cheon;Lee, Sung-joo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.2 no.1
    • /
    • pp.9-14
    • /
    • 2002
  • Boolean retrieval is simple and elegant. However, since there is no provision for term weighting, no ranking of the answer set is generated. As a result, the size of the output might be too large or too small. Relevance feedback is the most popular query reformulation strategy. in a relevance feedback cycle, the user is presented with a list of the retrieved documents and, after examining them, marks those which are relevant. In practice, only the top 10(or 20) ranked documents need to be examined. The main idea consists of selecting important terms, or expressions, attached to the documents that have been identified as relevant by the user, and of enhancing the importance of these terms in a new query formulation. The expected effect is that the new query will be moved towards the relevant documents and away from the non-relevant ones. Local analysis techniques are interesting because they take advantage of the local context provided with the query. In this regard, they seem more appropriate than global analysis techniques. In a local strategy, the documents retrieved for a given query q are examined at query time to determine terms for query expansion. This is similar to a relevance feedback cycle but might be done without assistance from the user.

A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns (인터넷 검색기록 분석을 통한 쇼핑의도 포함 키워드 자동 추출 기법)

  • Kim, Mingyu;Kim, Namgyu;Jung, Inhwan
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.123-136
    • /
    • 2014
  • Recently, online shopping has further developed as the use of the Internet and a variety of smart mobile devices becomes more prevalent. The increase in the scale of such shopping has led to the creation of many Internet shopping malls. Consequently, there is a tendency for increasingly fierce competition among online retailers, and as a result, many Internet shopping malls are making significant attempts to attract online users to their sites. One such attempt is keyword marketing, whereby a retail site pays a fee to expose its link to potential customers when they insert a specific keyword on an Internet portal site. The price related to each keyword is generally estimated by the keyword's frequency of appearance. However, it is widely accepted that the price of keywords cannot be based solely on their frequency because many keywords may appear frequently but have little relationship to shopping. This implies that it is unreasonable for an online shopping mall to spend a great deal on some keywords simply because people frequently use them. Therefore, from the perspective of shopping malls, a specialized process is required to extract meaningful keywords. Further, the demand for automating this extraction process is increasing because of the drive to improve online sales performance. In this study, we propose a methodology that can automatically extract only shopping-related keywords from the entire set of search keywords used on portal sites. We define a shopping-related keyword as a keyword that is used directly before shopping behaviors. In other words, only search keywords that direct the search results page to shopping-related pages are extracted from among the entire set of search keywords. A comparison is then made between the extracted keywords' rankings and the rankings of the entire set of search keywords. Two types of data are used in our study's experiment: web browsing history from July 1, 2012 to June 30, 2013, and site information. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The original sample dataset contains 150 million transaction logs. First, portal sites are selected, and search keywords in those sites are extracted. Search keywords can be easily extracted by simple parsing. The extracted keywords are ranked according to their frequency. The experiment uses approximately 3.9 million search results from Korea's largest search portal site. As a result, a total of 344,822 search keywords were extracted. Next, by using web browsing history and site information, the shopping-related keywords were taken from the entire set of search keywords. As a result, we obtained 4,709 shopping-related keywords. For performance evaluation, we compared the hit ratios of all the search keywords with the shopping-related keywords. To achieve this, we extracted 80,298 search keywords from several Internet shopping malls and then chose the top 1,000 keywords as a set of true shopping keywords. We measured precision, recall, and F-scores of the entire amount of keywords and the shopping-related keywords. The F-Score was formulated by calculating the harmonic mean of precision and recall. The precision, recall, and F-score of shopping-related keywords derived by the proposed methodology were revealed to be higher than those of the entire number of keywords. This study proposes a scheme that is able to obtain shopping-related keywords in a relatively simple manner. We could easily extract shopping-related keywords simply by examining transactions whose next visit is a shopping mall. The resultant shopping-related keyword set is expected to be a useful asset for many shopping malls that participate in keyword marketing. Moreover, the proposed methodology can be easily applied to the construction of special area-related keywords as well as shopping-related ones.

Evaluation of Regional Rural Amenity Values on Living and Tourism Resource Characteristics (생활 및 관광자원으로서의 특성을 고려한 농촌어메니티의 지역별 수준평가)

  • Oh, Yun-Gyeong;Choi, Jin-Yong;Bae, Seung-Jong
    • Journal of Korean Society of Rural Planning
    • /
    • v.14 no.4
    • /
    • pp.21-32
    • /
    • 2008
  • The rural area has kept traditions and green open spaces highlighted in these days since the life quality elevated. Institute of Rural Resources Development has been conducting nation-wide survey project for rural amenity resources to construct the databases of rural amenity distribution and richness. Using surveyed data from the project, this study was implemented to evaluate rural amenity values based on SAW (Simple Additive Weighting) method considering two aspects including living and tourism amenity. For defining the set of evaluation criteria, the rural amenity resources were classified into almost intact nature resources(natural resources), interaction between nature and man resources(cultural resources) and man-made resources(social resources). The weighting values of the criteria were evaluated from the step wise pair-comparison results by AHP(Analytic Hierarchy Process) method. In the results of weighting values related to living amenity, social resources was the hightest ranked criterion (0.512), followed by cultural resources (0.245) and natural resources (0.243). On the other hand, the results related to tourism amenity was that weighting values of natural resources, cultural resources and social resources were 0.481, 0.340 and 0.179, respectively. The two aspects evaluation methods was applied to the selected 18 areas (Myeon administration level) in Chungcheongbuk Do. The results demonstrated the differences of amenity values for living conditions and tourism conditions and could be used for prioritizing rural amenity planning.

A Study on Text Pattern Analysis Applying Discrete Fourier Transform - Focusing on Sentence Plagiarism Detection - (이산 푸리에 변환을 적용한 텍스트 패턴 분석에 관한 연구 - 표절 문장 탐색 중심으로 -)

  • Lee, Jung-Song;Park, Soon-Cheol
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.22 no.2
    • /
    • pp.43-52
    • /
    • 2017
  • Pattern Analysis is One of the Most Important Techniques in the Signal and Image Processing and Text Mining Fields. Discrete Fourier Transform (DFT) is Generally Used to Analyzing the Pattern of Signals and Images. We thought DFT could also be used on the Analysis of Text Patterns. In this Paper, DFT is Firstly Adapted in the World to the Sentence Plagiarism Detection Which Detects if Text Patterns of a Document Exist in Other Documents. We Signalize the Texts Converting Texts to ASCII Codes and Apply the Cross-Correlation Method to Detect the Simple Text Plagiarisms such as Cut-and-paste, term Relocations and etc. WordNet is using to find Similarities to Detect the Plagiarism that uses Synonyms, Translations, Summarizations and etc. The Data set, 2013 Corpus, Provided by PAN Which is the One of Well-known Workshops for Text Plagiarism is used in our Experiments. Our Method are Fourth Ranked Among the Eleven most Outstanding Plagiarism Detection Methods.