• Title/Summary/Keyword: analyzing data

Search Result 10,075, Processing Time 0.039 seconds

A Proposal of Some Analysis Methods for Discovery of User Information from Web Data

  • Ahn, JeongYong;Han, Kyung Soo
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.1
    • /
    • pp.281-289
    • /
    • 2001
  • The continuous growth in the use of the World Wide Web is creating the data with very large scale and different types. Analyzing such data can help to determine the life time value of users, evaluate the effectiveness of web sites, and design marketing strategies and services. In this paper, we propose some analysis methods for web data and present an example of a prototypical web data analysis.

  • PDF

A Continuation-Ratio Logits Mixed Model for Structured Polytomous Data

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.187-193
    • /
    • 2006
  • This paper shows how to use continuation-ratio logits for the analysis of structured polytomous data. Here, response categories are considered to have a nested binary structure. Thus, conditionally nested binary random variables can be defined in each step. Two types of factors are considered as independent variables affecting response probabilities. For the purpose of analyzing categorical data with binary nested strutures a continuation-ratio mixed model is suggested. Estimation procedure for the unknown parameters in a suggested model is also discussed in detail by an example.

  • PDF

A Mixed Model for Oredered Response Categories

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.2
    • /
    • pp.339-345
    • /
    • 2004
  • This paper deals with a mixed logit model for ordered polytomous data. There are two types of factors affecting the response varable in this paper. One is a fixed factor with finite quantitative levels and the other is a random factor coming from an experimental structure such as a randomized complete block design. It is discussed how to set up the model for analyzing ordered polytomous data and illustrated how to estimate the paramers in the given model.

  • PDF

Development of Remote Vehicle Information Storage System Using Wireless Communication (무선통신을 이용한 원격 차량운행정보 저장시스템 개발)

  • 이중현;고국원;최병욱;고경철
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2004.10a
    • /
    • pp.576-579
    • /
    • 2004
  • Recently, using GPS and equipment that recognizes the position of the car such a computer system inside the car are very universalized. Specially, the technique that diagnoses troubles and prevents troubles through scanning engine ECU is very popularized also. However, because these data have to be directly transferred and received from the car, in cases of traffic accident such as serious damage or car theft, it is impossible to receive the data at the time of accident. In order to receive and preserve the data safely regardless of these situations, it is possible to provide data for analyzing reasons of accident and prevent accidents from occurring by using wireless communication to receive the transferred information of the car, then saving into a Database system DB, or grasping the situation of the car and the driving pattern of drivers through analyzing stored data. Moreover, due to developing some related services such as providing the information about the real time of the accident, diagnoses of the car and alarms, etc. It is expected to contribute to creating added values.

  • PDF

A Case Study of Six Sigma Project for Improving method of measuring pulse wave (6시그마 기법을 통한 안정된 맥파측정 프로세스 설계)

  • Lee, Jeon;Lee, Yu-Jung;Lee, Hae-Jung;Choi, Eun-Ji;Kim, Jong-Yeol
    • Korean Journal of Oriental Medicine
    • /
    • v.12 no.2 s.17
    • /
    • pp.85-92
    • /
    • 2006
  • Pulse is one of the basic diagnostic information of TKM(Traditional Korean Medicine). To quantify and standardize pulse diagnosis, we had collected an amount of clinical data from May 2005 by using newly developed pulse analyzer. But there were many noises in pulse wave according to measuring method, environment, operator and condition of patient. So some data can’t be included for analyzing diagnosis. To reduce noises from measuring pulse and to collect reliable pulse wave data, we made the process map of measuring method and applied six sigma project. With this we can improved the method of measuring pulse wave in collecting clinical data. The project follows a disciplined process of five macro phases: define, measure, analyze, improve and control (DMAIC). A process map and C-E diagram are used to identify process input and output variables. The major input variables are selected by using C&E matrix, and process map is developed by analyzing input variables. And the optimum process conditions are going to be controled to avoid in increasing loss of collecting pulse wave data.

  • PDF

An analytical approach of behavior change for concrete dam by panel data model

  • Gu, Hao;Yang, Meng;Gu, Chongshi;Cao, Wenhan;Huang, Xiaofei;Su, Huaizhi
    • Steel and Composite Structures
    • /
    • v.36 no.5
    • /
    • pp.521-531
    • /
    • 2020
  • The behavior variation of concrete dam is investigated, based on a new method for analyzing the data model of concrete dam in service process for the limitation of wavelet transform for solving concrete dam service process model. The study takes into account the time and position of behavior change during the process of concrete dam service. There is no dependence on the effect quantity for overcoming the shortcomings of the traditional identification method. The panel data model is firstly proposed for analyzing the behavior change of composite concrete dam. The change-point theory is used to identify whether the behavior of concrete dams changes during service. The phase space reconstruction technique is used to reconstruct the phase plane of the trend effect component. The time dimension method is used to solve the construction of multi-transformation model of composite panel data. An existing 76.3-m-high dam is used to investigate some key issues on the behavior change. Emphasis is placed on conversion time and location for three time periods consistent with the practical analysis report for evaluating the validity of the analysis method of the behavior variation of concrete dams presented in this paper.

Selection and Analysis of the Typical Somatotype for the Development of a Torso Dummy for the Chinese Adult Women

  • Chang, Hee-Kyung;Sohn, Hee-Soon
    • Journal of Fashion Business
    • /
    • v.15 no.6
    • /
    • pp.134-147
    • /
    • 2011
  • This research was motivated to provide the Korean apparel companies doing business in China with some basic data useful to the development of their apparel commodities. As a result of selecting the standard or 'A' somatotype based on the body measurement data of the Chinese women in their early 20's and then, analyzing the correlated distribution and the most frequent intervals, it was found that 'height 160,' 'bust circumference 84' and 'waist circumference 66' were most prevalent. It was found that their average body measurements almost coincided with the standard '160A-84/66.' As discussed above, the researcher selected 13 women corresponding to '160A-84/66' in reference to 2008 body measurement data, and chose 6 women among them secondarily. Then, the researcher comparatively analyzed the direct measurement data and the 3D measurement ones, while analyzing the vertical/horizontal sagittal and Median plane section drawings, it was found that Subject 4 showed the most common somatotype data, while her upper body bent backwards reflected the population most properly.

A Development of LDA Topic Association Systems Based on Spark-Hadoop Framework

  • Park, Kiejin;Peng, Limei
    • Journal of Information Processing Systems
    • /
    • v.14 no.1
    • /
    • pp.140-149
    • /
    • 2018
  • Social data such as users' comments are unstructured in nature and up-to-date technologies for analyzing such data are constrained by the available storage space and processing time when fast storing and processing is required. On the other hand, it is even difficult in using a huge amount of dynamically generated social data to analyze the user features in a high speed. To solve this problem, we design and implement a topic association analysis system based on the latent Dirichlet allocation (LDA) model. The LDA does not require the training process and thus can analyze the social users' hourly interests on different topics in an easy way. The proposed system is constructed based on the Spark framework that is located on top of Hadoop cluster. It is advantageous of high-speed processing owing to that minimized access to hard disk is required and all the intermediately generated data are processed in the main memory. In the performance evaluation, it requires about 5 hours to analyze the topics for about 1 TB test social data (SNS comments). Moreover, through analyzing the association among topics, we can track the hourly change of social users' interests on different topics.

A Study on Employment Strategy Based on Employment Information Filtering (취업정보 필터링 기반 취업전략에 관한 연구)

  • Yoon, Sunhee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.5 no.4
    • /
    • pp.251-258
    • /
    • 2019
  • This study proposed a system that can improve the employment rate and maintenance employment rate by filtering information related to employment in analyzing big data for students who want to find employment. The subject was a two-year female university, the existing employment strategy participated in the job search with simple information such as school grades and personality. As a result, the maintenance employment rate was relatively low due to the decrease in the satisfaction of students seeking employment and the incompatibility with the post-employment aptitude. In order to solve these problems, we propose a system that determines and filters whether the input data in the process of analyzing big data such as employment-related information to improve employment and maintenance employment rates.

AI-Based Project Similarity Evaluation Model Using Project Scope Statements

  • Ko, Taewoo;Jeong, H. David;Lee, JeeHee
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.284-291
    • /
    • 2022
  • Historical data from comparable projects can serve as benchmarking data for an ongoing project's planning during the project scoping phase. As project owners typically store substantial amounts of data generated throughout project life cycles in digitized databases, they can capture appropriate data to support various project planning activities by accessing digital databases. One of the most important work tasks in this process is identifying one or more past projects comparable to a new project. The uniqueness and complexity of construction projects along with unorganized data, impede the reliable identification of comparable past projects. A project scope document provides the preliminary overview of a project in terms of the extent of the project and project requirements. However, narratives and free-formatted descriptions of project scopes are a significant and time-consuming barrier if a human needs to review them and determine similar projects. This study proposes an Artificial Intelligence-driven model for analyzing project scope descriptions and evaluating project similarity using natural language processing (NLP) techniques. The proposed algorithm can intelligently a) extract major work activities from unstructured descriptions held in a database and b) quantify similarities by considering the semantic features of texts representing work activities. The proposed model enhances historical comparable project identification by systematically analyzing project scopes.

  • PDF