• 제목/요약/키워드: analysis data

검색결과 85,083건 처리시간 0.076초

제주지역 풍력발전량 실시간 감시 시스템 구축에 관한 연구 (A Study on the Real-Time Monitoring System of Wind Power in Jeju)

  • 김경보;양경부;박윤호;문창은;박정근;허종철
    • 한국태양에너지학회 논문집
    • /
    • 제30권3호
    • /
    • pp.25-32
    • /
    • 2010
  • A real-time monitoring system was developed for transfer, receive, backup and analysis of wind power data at three wind farm(Hang won, Hankyung and Sung san) in Jeju. For this monitoring system a communication system analysis, a collection of data and transmission module development, data base construction and data analysis and management module was developed, respectively. These modules deal with mechanical, electrical and environmental problem. Especially, time series graphic is supported by the data analysis and management module automatically. The time series graphic make easier to raw data analysis. Also, the real-time monitoring system is connected with wind power forecasting system through internet web for data transfer to wind power forecasting system's data base.

조선산업의 비용분석 데이터 웨어하우스 시스템 개발 (Development of Data Warehouse Systems to Support Cost Analysis in the Ship Production)

  • 황성룡;김재균;장길상
    • 산업공학
    • /
    • 제15권2호
    • /
    • pp.159-171
    • /
    • 2002
  • Data Warehouses integrate data from multiple heterogeneous information sources and transform them into a multidimensional representation for decision support applications. Data warehousing has emerged as one of the most powerful tools in delivering information to users. Most previous researches have focused on marketing, customer service, financing, and insurance industry. Further, relatively less research has been done on data warehouse systems in the complex manufacturing industry such as ship production, which is characterized complex product structures and production processes. In the ship production, data warehouse systems is a requisite for effective cost analysis because collecting and analysis of diverse and large of cost-related(material/production cost, productivity) data in its operational systems, was becoming increasingly cumbersome and time consuming. This paper proposes architecture of the data warehouse systems to support cost analysis in the ship production. Also, in order to illustrate the usefulness of the proposed architecture, the prototype system is designed and implemented with the object of the enterprise of producing a large-scale ship.

일변량 자료의 왜도와 첨도에서 특이점의 영향을 평가하기 위한 탐색적 자료분석 그림도구로서의 불꽃그림 (Firework plot as a graphical exploratory data analysis tool for evaluating the impact of outliers in skewness and kurtosis of univariate data)

  • 문승호
    • 응용통계연구
    • /
    • 제29권2호
    • /
    • pp.355-368
    • /
    • 2016
  • 특이점 및 영향점은 자료분석을 하는 데 사용되는 계량적이고 기술적인 많은 측도들을 왜곡한다. 각종 자료분석에 있어서의 특이점 검색을 위한 검정 통계량이나 그림도구에 관한 연구는 꾸준히 전개되어 왔다. Jang과 Anderson-Cook (2014)은 불꽃그림이란 이름을 붙인 그림도구를 발표하였는데 이상점이나 영향점이 일변량/이변량 자료분석 및 회귀분석에 어떠한 영향을 미치는지 알기 위하여 3-D 불꽃그림 및 불꽃그림 행렬을 제시하였다. 본 연구에서는 이러한 불꽃그림이 일변량 자료의 왜도와 첨도에서 특이점의 영향을 평가하기 위한 탐색적 자료분석 그림도구로서 사용될 수 있음을 보였다.

Comparison of the Performance of Clustering Analysis using Data Reduction Techniques to Identify Energy Use Patterns

  • Song, Kwonsik;Park, Moonseo;Lee, Hyun-Soo;Ahn, Joseph
    • 국제학술발표논문집
    • /
    • The 6th International Conference on Construction Engineering and Project Management
    • /
    • pp.559-563
    • /
    • 2015
  • Identification of energy use patterns in buildings has a great opportunity for energy saving. To find what energy use patterns exist, clustering analysis has been commonly used such as K-means and hierarchical clustering method. In case of high dimensional data such as energy use time-series, data reduction should be considered to avoid the curse of dimensionality. Principle Component Analysis, Autocorrelation Function, Discrete Fourier Transform and Discrete Wavelet Transform have been widely used to map the original data into the lower dimensional spaces. However, there still remains an ongoing issue since the performance of clustering analysis is dependent on data type, purpose and application. Therefore, we need to understand which data reduction techniques are suitable for energy use management. This research aims find the best clustering method using energy use data obtained from Seoul National University campus. The results of this research show that most experiments with data reduction techniques have a better performance. Also, the results obtained helps facility managers optimally control energy systems such as HVAC to reduce energy use in buildings.

  • PDF

탐색적 자료 분석 및 연관규칙 분석을 활용한 잔류농약 부적합 농업인 유형 분석 (Pattern Analysis of Nonconforming Farmers in Residual Pesticides using Exploratory Data Analysis and Association Rule Analysis)

  • 김상웅;박은수;조현정;홍성희;손병철;홍지화
    • 품질경영학회지
    • /
    • 제49권1호
    • /
    • pp.81-95
    • /
    • 2021
  • Purpose: The purpose of this study was to analysis pattern of nonconforming farmers who is one of the factors of unconformity in residual pesticides. Methods: Pattern analysis of nonconforming farmers were analyzed through convergence of safety data and farmer's DB data. Exploratory data analysis and association rule analysis were used for extracting factors related to unconformity. Results: The results of this study are as follows; regarding the exploratory data analysis, it was found that factors of farmers influencing unconformity in residual pesticides by total 9 factors; sampling time, gender, age, cultivation region, farming career, agricultural start form, type of agriculture, cultivation area, classification of agricultural products. Regarding the association rule analysis, non-conformity association rules were found over the past three years. There was a difference in the pattern of nonconforming farmers depending on the cultivation period. Conclusion: Exploratory data analysis and association rule analysis will be useful tools to establish more efficient and economical safety management plan for agricultural products.

2012년, 2014년과 2016년의 어린이급식관리지원센터에 대한 빅데이터와 오피니언 마이닝을 통한 비교 (Comparison of the Center for Children's Foodservice Management in 2012, 2014, and 2016 Using Big Data and Opinion Mining)

  • 정은진;장은재
    • 대한영양사협회학술지
    • /
    • 제23권2호
    • /
    • pp.192-201
    • /
    • 2017
  • This study compared the Center for Children's Foodservice Management in 2012, 2014, and 2016 using big data and opinion mining. The data on the Center for Children's Foodservice Management were collected from the portal site, Naver, from January 1 to December 31 in 2012, 2014, & 2016 and analyzed by keyword frequency analysis, influx route analysis of data, polarity analysis via opinion mining, and positive and negative keyword analysis by polarity analysis. The results showed that nursery had the highest rank every year and education supported by Center for Children's Foodservice Management has increased significantly. The influx of data has increased through the influx route analysis of data. Blog and $caf\acute{e}e$, which have a considerable amount of information by the mother should be helpful for use as public relations and participation recruitment paths. By polarity analysis using opinion mining, the positive image of the Center for Children's Foodservice Management was increased. Therefore, the Center for Children's Foodservice Management was well-suited to the purpose and the interests of the people has been increasing steadily. In the near future, the Center for Children's Foodservice Management is expected have good recognition if various programs to participate with family are developed and advertised.

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • 한국멀티미디어학회논문지
    • /
    • 제17권7호
    • /
    • pp.858-865
    • /
    • 2014
  • In the case of system like MES, various sensors collect the data in real time and save it as a big data to monitor the process. However, if there is big data mining in distributed computing system, whole processing process can be improved. In this paper, system to analyze the cause of operation deviation was built using the big data which has been collected from deasphalting process at the two different plants. By applying multivariate statistical analysis to the big data which has been collected through MES(Manufacturing Execution System), main cause of operation deviation was analyzed. We present the example of analyzing the operation deviation of deasphalting process using the big data which collected from MES by using multivariate statistics analysis method. As a result of regression analysis of the forward stepwise method, regression equation has been found which can explain 52% increase of performance compare to existing model. Through this suggested method, the existing petrochemical process can be replaced which is manual analysis method and has the risk of being subjective according to the tester. The new method can provide the objective analysis method based on numbers and statistic.

비정형 텍스트 테이터 분석을 위한 워드클라우드 기법에 관한 연구 (A Study on Word Cloud Techniques for Analysis of Unstructured Text Data)

  • 이원조
    • 문화기술의 융합
    • /
    • 제6권4호
    • /
    • pp.715-720
    • /
    • 2020
  • 빅데이터 분석에서 텍스트 데이터는 대부분 비정형이고 대용량으로 분석 기법이 정립되지 않아 분석에 어려움이 많았다. 따라서 텍스트 데이터 분석 기법의 하나인 빅데이터 워드클라우드 기법의 실무 적용시 문제점과 유용성 검증을 통한 상용화 가능성을 위해 본 연구를 수행하였다. 본 논문에서는 R 프로그램 워드클라우드 기법을 이용하여 "대통령 UN연설문"을 시각화 분석을 하고 이 기법의 한계와 문제점을 도출한다. 그리고 이를 해결하기 위한 개선된 모델을 제안하여 워드클라우드 기법의 실무 적용에 대한 효율적인 방안을 제시한다.

Speaker Verification with the Constraint of Limited Data

  • Kumari, Thyamagondlu Renukamurthy Jayanthi;Jayanna, Haradagere Siddaramaiah
    • Journal of Information Processing Systems
    • /
    • 제14권4호
    • /
    • pp.807-823
    • /
    • 2018
  • Speaker verification system performance depends on the utterance of each speaker. To verify the speaker, important information has to be captured from the utterance. Nowadays under the constraints of limited data, speaker verification has become a challenging task. The testing and training data are in terms of few seconds in limited data. The feature vectors extracted from single frame size and rate (SFSR) analysis is not sufficient for training and testing speakers in speaker verification. This leads to poor speaker modeling during training and may not provide good decision during testing. The problem is to be resolved by increasing feature vectors of training and testing data to the same duration. For that we are using multiple frame size (MFS), multiple frame rate (MFR), and multiple frame size and rate (MFSR) analysis techniques for speaker verification under limited data condition. These analysis techniques relatively extract more feature vector during training and testing and develop improved modeling and testing for limited data. To demonstrate this we have used mel-frequency cepstral coefficients (MFCC) and linear prediction cepstral coefficients (LPCC) as feature. Gaussian mixture model (GMM) and GMM-universal background model (GMM-UBM) are used for modeling the speaker. The database used is NIST-2003. The experimental results indicate that, improved performance of MFS, MFR, and MFSR analysis radically better compared with SFSR analysis. The experimental results show that LPCC based MFSR analysis perform better compared to other analysis techniques and feature extraction techniques.

신경망 분석을 활용한 하수처리장 데이터 분석 기법 연구 (Wastewater Treatment Plant Data Analysis Using Neural Network)

  • 서정식;김태욱;이해각;윤종호
    • 한국환경과학회지
    • /
    • 제31권7호
    • /
    • pp.555-567
    • /
    • 2022
  • With the introduction of the tele-monitoring system (TMS) in South Korea, monitoring of the concentration of pollutants discharged from nationwide water quality TMS attachments is possible. In addition, the Ministry of Environment is implementing a smart sewage system program that combines ICT technology with wastewater treatment plants. Thus, many institutions are adopting the automatic operation technique which uses process operation factors and TMS data of sewage treatment plants. As a part of the preliminary study, a multilayer perceptron (MLP) analysis method was applied to TMS data to identify predictability degree. TMS data were designated as independent variables, and each pollutant was considered as an independent variables. To verify the validity of the prediction, root mean square error analysis was conducted. TMS data from two public sewage treatment plants in Chungnam were used. The values of RMSE in SS, T-N, and COD predictions (excluding T-P) in treatment plant A showed an error range of 10%, and in the case of treatment plant B, all items showed an error exceeding 20%. If the total amount of data used MLP analysis increases, the predictability of MLP analysis is expected to increase further.