• 제목/요약/키워드: Data term

검색결과 7,506건 처리시간 0.038초

자동 문서분류에서의 정규화 용어빈도 가중치방법 (Normalized Term Frequency Weighting Method in Automatic Text Categorization)

  • 김수진;박혁로
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2003년도 컴퓨터소사이어티 추계학술대회논문집
    • /
    • pp.255-258
    • /
    • 2003
  • This paper defines Normalized Term Frequency Weighting method for automatic text categorization by using Box-Cox, and then it applies automatic text categorization. Box-Cox transformation is statistical transformation method which makes normalized data. This paper applies that and suggests new term frequency weighting method. Because Normalized Term Frequency is different from every term compared by existing term frequency weighting method, it is general method more than fixed weighting method such as log or root. Normalized term frequency weighting method's reasonability has been proved though experiments, used 8000 newspapers divided in 4 groups, which resulted high categorization correctness in all cases.

  • PDF

미계측 유역의 장기 물수지 분석에 관한 연구 (A Long-Term Water Budget Analysis for an Ungaged River Baisn)

  • 유금환;김태균;윤용남
    • 대한토목학회논문집
    • /
    • 제11권4호
    • /
    • pp.113-119
    • /
    • 1991
  • 본 연구에서는 월 강우량과 월 증발량 자료만 있는 하천유역에 대하여 장기 물수지 분석을 실시하는 방법론을 제시하고져 하였다. 단기간의 월 강우량 자료를 경혐공식에 의해 월 유출량 자료로 변환시킨 후 추계학적 모의발생 모형을 사용하여 이들 단기 유출자료로부터 일군의 장기 유출자료계열을 발생시켰고, 자료계열별로 갈수빈도해석에 의해 최대 갈수기간 및 월 강수량계열을 작성하였다. 계획년도별 각종 용수수요를 표준절차에 의해 추정하였으며 순 물소모량도 계산하였다. 유역내의 기존 저수지를 총괄하는 합성저수지를 통해 Deficit-Supply 방법으로 물 수지분석을 실시한 결과 물 부족량은 갈수재현기간이 커짐에 따라 급격하게 커지는 것으로 나타났다. 이는 하천 유역의 장기 물 수지분석을 통해 신뢰성있는 물 부족량을 계산하기 위해서는 추계학적 모의발생모형에 의한 장기간 유출량의 발생이 필수적이며 수자원 시스템의 적정 갈수재현기간의 선정이 대단히 중요함을 시사해 주는 것이다.

  • PDF

대단위발전소의 대기오염물질 확산에 관한 모델링 및 평가에 관한 연구 (Modeling and Evaluation on the Dispersion of Air Pollutants in the Large Scale Thermal Power Plant)

  • 전상기;이성철
    • 환경영향평가
    • /
    • 제6권2호
    • /
    • pp.81-92
    • /
    • 1997
  • This paper presents the results from the comparison analysis and evaluation between the air pollutant dispersion modeling results and the observation data in the area within a 10 km radius from the Boryong thermal power plants. The observation data used in this study were the air pollutant concentrations which had been continuously measured from 8 locations around the Boryong power plants by TMS(tele-monitoring system) for 3 months from September to November, 1996. The short-term and long-term predictions were carried out using ISC3 model and LPDM(Lagrangian Panicle Dispersion Model). The results of ISC3 modeling in a short-term showed highly as 0.7 in a correlation coefficient, but in a long-term showed just 0.54. On the other hand, LPDM showed 0.78 in a correlation coefficient for a long-term, but in a short-term showed highly value than the observation concentrations.

  • PDF

공격 메일 식별을 위한 비정형 데이터를 사용한 유전자 알고리즘 기반의 특징선택 알고리즘 (Feature-selection algorithm based on genetic algorithms using unstructured data for attack mail identification)

  • 홍성삼;김동욱;한명묵
    • 인터넷정보학회논문지
    • /
    • 제20권1호
    • /
    • pp.1-10
    • /
    • 2019
  • 빅 데이터에서 텍스트 마이닝은 많은 수의 데이터로부터 많은 특징 추출하기 때문에, 클러스터링 및 분류 과정의 계산 복잡도가 높고 분석결과의 신뢰성이 낮아질 수 있다. 특히 텍스트마이닝 과정을 통해 얻는 Term document matrix는 term과 문서간의 특징들을 표현하고 있지만, 희소행렬 형태를 보이게 된다. 본 논문에서는 탐지모델을 위해 텍스트마이닝에서 개선된 GA(Genetic Algorithm)을 이용한 특징 추출 방법을 설계하였다. TF-IDF는 특징 추출에서 문서와 용어간의 관계를 반영하는데 사용된다. 반복과정을 통해 사전에 미리 결정된 만큼의 특징을 선택한다. 또한 탐지모델의 성능 향상을 위해 sparsity score(희소성 점수)를 사용하였다. 스팸메일 세트의 희소성이 높으면 탐지모델의 성능이 낮아져 최적화된 탐지 모델을 찾기가 어렵다. 우리는 fitness function에서 s(F)를 사용하여 희소성이 낮고 TF-IDF 점수가 높은 탐지모델을 찾았다. 또한 제안된 알고리즘을 텍스트 분류 실험에 적용하여 성능을 검증하였다. 결과적으로, 제안한 알고리즘은 공격 메일 분류에서 좋은 성능(속도와 정확도)을 보여주었다.

Short-term Electric Load Forecasting Using Data Mining Technique

  • Kim, Cheol-Hong;Koo, Bon-Gil;Park, June-Ho
    • Journal of Electrical Engineering and Technology
    • /
    • 제7권6호
    • /
    • pp.807-813
    • /
    • 2012
  • In this paper, we introduce data mining techniques for short-term load forecasting (STLF). First, we use the K-mean algorithm to classify historical load data by season into four patterns. Second, we use the k-NN algorithm to divide the classified data into four patterns for Mondays, other weekdays, Saturdays, and Sundays. The classified data are used to develop a time series forecasting model. We then forecast the hourly load on weekdays and weekends, excluding special holidays. The historical load data are used as inputs for load forecasting. We compare our results with the KEPCO hourly record for 2008 and conclude that our approach is effective.

장기요양시설 요양보호사의 인권의식이 돌봄행위 이행에 미치는 영향 (The Effects of Awareness of Human Rights on Compliance of Caring Behavior of Long-term Care Workers)

  • 김진학;송민선
    • 가정간호학회지
    • /
    • 제27권1호
    • /
    • pp.5-15
    • /
    • 2020
  • Purpose: To identify the relationship between care worker's awareness of human rights and the compliance of caring behaviors among long-term care workers, and to identify factors affecting compliance with caring behaviors. Methods: Using self-report questionnaires, data were collected from 153 long-term care workers between October 4th and October 20th, 2019. Collected data were analyzed using the SPSS/WIN 26.0 program. Results: The data indicate a difference in awareness of human rights according to: the careers of care workers, the possession of other health care-related licenses, and the perceived needs of human rights education. The data also indicate a difference in the compliance of caring behaviors according to: gender, family care experience, and dementia care experience. The factors influencing compliance of caring behaviors, according to the study, are gender (β=.19, p=.009), family care experience (β=.19, p=.023), and human rights (β=.38, p<.001). It was found that 23% could explain the compliance of caring behaviors. Conclusion: Long term care workers were found to have a higher level of the compliance of caring behaviors as their awareness of human rights increased. In order to increase the compliance of caring behaviors among long-term care workers, more educational programs on human rights should be provided.

A Short-Term Prediction Method of the IGS RTS Clock Correction by using LSTM Network

  • Kim, Mingyu;Kim, Jeongrae
    • Journal of Positioning, Navigation, and Timing
    • /
    • 제8권4호
    • /
    • pp.209-214
    • /
    • 2019
  • Precise point positioning (PPP) requires precise orbit and clock products. International GNSS service (IGS) real-time service (RTS) data can be used in real-time for PPP, but it may not be possible to receive these corrections for a short time due to internet or hardware failure. In addition, the time required for IGS to combine RTS data from each analysis center results in a delay of about 30 seconds for the RTS data. Short-term orbit prediction can be possible because it includes the rate of correction, but the clock correction only provides bias. Thus, a short-term prediction model is needed to preidict RTS clock corrections. In this paper, we used a long short-term memory (LSTM) network to predict RTS clock correction for three minutes. The prediction accuracy of the LSTM was compared with that of the polynomial model. After applying the predicted clock corrections to the broadcast ephemeris, we performed PPP and analyzed the positioning accuracy. The LSTM network predicted the clock correction within 2 cm error, and the PPP accuracy is almost the same as received RTS data.

기록속도에 따른 BD-R의 데이터 장기보존 안정성 평가 (Long-Term Data Stability Evaluation of BD-R according to Recording Speed for Archival Application)

  • 이관용;박선주;조이형;김영주
    • 정보저장시스템학회논문집
    • /
    • 제8권2호
    • /
    • pp.50-55
    • /
    • 2012
  • Optical disks are widely used in libraries and archives as digital data media due to their long-term storage stability. Though archive-grade optical disks are already available on the market, there is a relative less focusing on the reliable recording conditions. Commercial BD-R media were recorded at various recording speeds with the maximum speed of 6X, and tested at the acceleration aging conditions. Through the evaluation of long-term storage features by the data stability test, lower recording speed of 2x resulted in better long-term storage stability. In addition, degradation aspects of unstable long-term storage feature at outer region of disk were discussed.

요양병원 노인 입원환자의 특성 및 ADL (일상생활수행능력) 관련 요인 : 환자조사 자료 (2013-2014)를 이용하여 (Characteristics and ADL (Activities of Daily Living) Associated Factors of Elderly Inpatients in Long-Term Care Hospitals : A Survey of Patients (2013-2014))

  • 박영희
    • 보건의료산업학회지
    • /
    • 제10권3호
    • /
    • pp.159-171
    • /
    • 2016
  • Objectives : This study was performed to investigate the characteristics and ADL(Activities of Daily Living) associated factors of elderly inpatients in long-term care hospitals. Methods : Data were collected from the nationwide data of 'Survey of Patients (2013-2014)' administerd by the Ministry of Health & Welfare. The data included in this study consisted of 27,606 cases of elderly inpatients in long-term care hospitals. Results : The survey scores for the elderly inpatients were as follows: 57.6% 'Needed much and total help' with ADL, followed by 26.6% who 'Needed much help', and 15.8% who 'needed minimal supervision' in long-term care hospitals. The ADL score was high in the following categories: women, old age, referred visit, health insurance type, not-recovered & death, transferred, corporate hospitals, small hospital size, low number of physicians per 100 beds, and high number of nursing staff per 100 beds. The inpatients with 'diseases of the nervous system', 'diseases of the circulatory system' and 'diseases of the genitourinary system' were more likely to have high ADL scores. Conclusions : The results of this study suggest that long-term care hospitals should provide active and proper care for patients with high ADL scores and improve medical personnel training as well provide more medical care.

노인장기요양 방문간호서비스의 소요시간별 방문당 원가 분석 (Estimation of Nursing Costs Based on Nurse Visit Time for Long-Term Care Services)

  • 김은경;김윤미;김명애
    • 대한간호학회지
    • /
    • 제40권3호
    • /
    • pp.349-358
    • /
    • 2010
  • Purpose: The purpose of this study was to estimate nursing costs and to establish appropriate nursing fees for long-term care services for community elders. Methods: Seven nurses participated in data collection related to visiting time by nurses for 1,100 elders. Data on material costs and management costs were collected from 5 visiting nursing agencies. The nursing costs were classified into 3 groups based on the nurse's visit time under the current reimbursement system of long-term care insurance. Results: The average nursing cost per minute was 246 won. The material costs were 3,214 won, management costs, 10,707 won, transportation costs, 7,605 won, and capital costs, 5,635 won per visit. As a result, the average cost of nursing services per visit by classification of nursing time were 41,036 won (care time <30 min), 46,005 won (care time 30-59 min), and 57,321 won (care time over 60 min). Conclusion: The results of the study indicate that the fees for nurse visits currently being charged for long-term care insurance should be increased. Also these results will contribute to baseline data for establishing appropriate nursing fees for long-term care services to maintain quality nursing and management in visiting nursing agencies.