• Title/Summary/Keyword: Official Statistics

Search Result 142, Processing Time 0.031 seconds

Analysis of Vehicle Selection Factors Using Energy Census (에너지총조사를 이용한 차량 선택 요인 분석)

  • Shin, Him Chul;Won, DooHwan
    • Environmental and Resource Economics Review
    • /
    • v.31 no.2
    • /
    • pp.291-317
    • /
    • 2022
  • This study tried to analyze the factors affecting consumers' vehicle selection for the spread of eco-friendly vehicles. We used the energy census data for this purpose, and although the energy census collects useful information from a large number of samples, it has been limitedly used to create simple statistics in many cases. Based on 2,771 transport sector microdata from the 2017 Energy Census, we collected vehicle price, fuel efficiency, and number of vehicle models, which are alternative characteristic variables that change according to consumers' choice, and converted and analyzed data to enable conjoint analysis. The analysis results in two-folds. First, it was confirmed that the official fuel efficiency of a vehicle and the fuel cost, which is affected by changes in the relative price of each fuel, are important variables in selecting an eco-friendly vehicle. In order to achieve the goal of spread of eco-friendly vehicles, it is necessary to develop technologies to improve fuel efficiency and set appropriate electric rates for charging electric vehicles. Second, an increase in the number of vehicle models through the expansion of the eco-friendly car industry and market also affects consumers' choice of eco-friendly vehicles, so efforts to expand the supply of eco-friendly vehicles will be an important factor. In addition, it is also significant that this study showed that the use of the energy census can be diversified by deriving meaningful policy implications using the results of the energy census periodically conducted in the country without a separate survey.

Research Trends in Record Management Using Unstructured Text Data Analysis (비정형 텍스트 데이터 분석을 활용한 기록관리 분야 연구동향)

  • Deokyong Hong;Junseok Heo
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.23 no.4
    • /
    • pp.73-89
    • /
    • 2023
  • This study aims to analyze the frequency of keywords used in Korean abstracts, which are unstructured text data in the domestic record management research field, using text mining techniques to identify domestic record management research trends through distance analysis between keywords. To this end, 1,157 keywords of 77,578 journals were visualized by extracting 1,157 articles from 7 journal types (28 types) searched by major category (complex study) and middle category (literature informatics) from the institutional statistics (registered site, candidate site) of the Korean Citation Index (KCI). Analysis of t-Distributed Stochastic Neighbor Embedding (t-SNE) and Scattertext using Word2vec was performed. As a result of the analysis, first, it was confirmed that keywords such as "record management" (889 times), "analysis" (888 times), "archive" (742 times), "record" (562 times), and "utilization" (449 times) were treated as significant topics by researchers. Second, Word2vec analysis generated vector representations between keywords, and similarity distances were investigated and visualized using t-SNE and Scattertext. In the visualization results, the research area for record management was divided into two groups, with keywords such as "archiving," "national record management," "standardization," "official documents," and "record management systems" occurring frequently in the first group (past). On the other hand, keywords such as "community," "data," "record information service," "online," and "digital archives" in the second group (current) were garnering substantial focus.

A Suggestion for Spatiotemporal Analysis Model of Complaints on Officially Assessed Land Price by Big Data Mining (빅데이터 마이닝에 의한 공시지가 민원의 시공간적 분석모델 제시)

  • Cho, Tae In;Choi, Byoung Gil;Na, Young Woo;Moon, Young Seob;Kim, Se Hun
    • Journal of Cadastre & Land InformatiX
    • /
    • v.48 no.2
    • /
    • pp.79-98
    • /
    • 2018
  • The purpose of this study is to suggest a model analysing spatio-temporal characteristics of the civil complaints for the officially assessed land price based on big data mining. Specifically, in this study, the underlying reasons for the civil complaints were found from the spatio-temporal perspectives, rather than the institutional factors, and a model was suggested monitoring a trend of the occurrence of such complaints. The official documents of 6,481 civil complaints for the officially assessed land price in the district of Jung-gu of Incheon Metropolitan City over the period from 2006 to 2015 along with their temporal and spatial poperties were collected and used for the analysis. Frequencies of major key words were examined by using a text mining method. Correlations among mafor key words were studied through the social network analysis. By calculating term frequency(TF) and term frequency-inverse document frequency(TF-IDF), which correspond to the weighted value of key words, I identified the major key words for the occurrence of the civil complaint for the officially assessed land price. Then the spatio-temporal characteristics of the civil complaints were examined by analysing hot spot based on the statistics of Getis-Ord $Gi^*$. It was found that the characteristic of civil complaints for the officially assessed land price were changing, forming a cluster that is linked spatio-temporally. Using text mining and social network analysis method, we could find out that the occurrence reason of civil complaints for the officially assessed land price could be identified quantitatively based on natural language. TF and TF-IDF, the weighted averages of key words, can be used as main explanatory variables to analyze spatio-temporal characteristics of civil complaints for the officially assessed land price since these statistics are different over time across different regions.

A study on the air pollutant emission trends in Gwangju (광주시 대기오염물질 배출량 변화추이에 관한 연구)

  • Seo, Gwang-Yeob;Shin, Dae-Yewn
    • Journal of environmental and Sanitary engineering
    • /
    • v.24 no.4
    • /
    • pp.1-26
    • /
    • 2009
  • We conclude the following with air pollution data measured from city measurement net administered and managed in Gwangju for the last 7 years from January in 2001 to December in 2007. In addition, some major statistics governed by Gwangju city and data administered by Gwangju as national official statistics obtained by estimating the amount of national air pollutant emission from National Institute of Environmental Research were used. The results are as follows ; 1. The distribution by main managements of air emission factory is the following ; Gwangju City Hall(67.8%) > Gwangsan District Office(13.6%) > Buk District Office(9.8%) > Seo District Office(5.5%) > Nam District Office(3.0%) > Dong District Office(0.3%) and the distribution by districts of air emission factory ; Buk District(32.8%) > Gwangsan District(22.4%) > Seo District(21.8%) > Nam District(14.9%) > Dong District(8.1%). That by types(Year 2004~2007 average) is also following ; Type 5(45.2%) > Type 4(40.7%) > Type 3(8.6%) > Type 2(3.2%) > Type 1(2.2%) and the most of them are small size of factory, Type 4 and 5. 2. The distribution by districts of the number of car registrations is the following ; Buk District(32.8%) > Gwangsan District(22.4%) > Seo District(21.8%) > Nam District(14.9%) > Dong District(8.1%) and the distribution by use of car fuel in 2001 ; Gasoline(56.3%) > Diesel(30.3%) > LPG(13.4%) > etc.(0.2%). In 2007, there was no ranking change ; Gasoline(47.8%) > Diesel(35.6%) > LPG(16.2%) >etc.(0.4%). The number of gasoline cars increased slightly, but that of diesel and LPG cars increased remarkably. 3. The distribution by items of the amount of air pollutant emission in Gwangju is the following; CO(36.7%) > NOx(32.7%) > VOC(26.7%) > SOx(2.3%) > PM-10(1.5%). The amount of CO and NOx, which are generally generated from cars, is very large percentage among them. 4. The distribution by mean of air pollutant emission(SOx, NOx, CO, VOC, PM-10) of each county for 5 years(2001~2005) is the following ; Buk District(31.0%) > Gwangsan District(28.2%) > Seo District(20.4%) > Nam District(12.5%) > Dong District(7.9%). The amount of air pollutant emission in Buk District, which has the most population, car registrations, and air pollutant emission businesses, was the highest. On the other hand, that of air pollutant emission in Dong District, which has the least population, car registrations, and air pollutant emission businesses, was the least. 5. The average rates of SOx for 5 years(2001~2005) in Gwangju is the following ; Non industrial combustion(59.5%) > Combustion in manufacturing industry(20.4%) > Road transportation(11.4%) > Non-road transportation(3.8%) > Waste disposal(3.7%) > Production process(1.1%). And the distribution of average amount of SOx emission of each county is shown as Gwangsan District(33.3%) > Buk District(28.0%) > Seo District(19.3%) > Nam District(10.2%) > Dong District(9.1%). 6. The distribution of the amount of NOx emission in Gwangju is shown as Road transportation(59.1%) > Non-road transportation(18.9%) > Non industrial combustion(13.3%) > Combustion in manufacturing industry(6.9%) > Waste disposal(1.6%) > Production process(0.1%). And the distribution of the amount of NOx emission from each county is the following ; Buk District(30.7%) > Gwangsan District(28.8%) > Seo District(20.5%) > Nam District(12.2%) > Dong District(7.8%). 7. The distribution of the amount of carbon monoxide emission in Gwangju is shown as Road transportation(82.0%) > Non industrial combustion(10.6%) > Non-road transportation(5.4%) > Combustion in manufacturing industry(1.7%) > Waste disposal(0.3%). And the distribution of the amount of carbon monoxide emission from each county is the following ; Buk District(33.0%) > Seo District(22.3%) > Gwangsan District(21.3%) > Nam District(14.3%) > Dong District(9.1%). 8. The distribution of the amount of Volatile Organic Compound emission in Gwangju is shown as Solvent utilization(69.5%) > Road transportation(19.8%) > Energy storage & transport(4.4%) > Non-road transportation(2.8%) > Waste disposal(2.4%) > Non industrial combustion(0.5%) > Production process(0.4%) > Combustion in manufacturing industry(0.3%). And the distribution of the amount of Volatile Organic Compound emission from each county is the following ; Gwangsan District(36.8%) > Buk District(28.7%) > Seo District(17.8%) > Nam District(10.4%) > Dong District(6.3%). 9. The distribution of the amount of minute dust emission in Gwangju is shown as Road transportation(76.7%) > Non-road transportation(16.3%) > Non industrial combustion(6.1%) > Combustion in manufacturing industry(0.7%) > Waste disposal(0.2%) > Production process(0.1%). And the distribution of the amount of minute dust emission from each county is the following ; Buk District(32.8%) > Gwangsan District(26.0%) > Seo District(19.5%) > Nam District(13.2%) > Dong District(8.5%). 10. According to the major source of emission of each items, that of oxides of sulfur is Non industrial combustion, heating of residence, business and agriculture and stockbreeding. And that of NOx, carbon monoxide, minute dust is Road transportation, emission of cars and two-wheeled vehicles. Also, that of VOC is Solvent utilization emission facilities due to Solvent utilization. 11. The concentration of sulfurous acid gas has been 0.004ppm since 2001 and there has not been no concentration change year by year. It is considered that the use of sulfurous acid gas is now reaching to the stabilization stage. This is found by the facts that the use of fuel is steadily changing from solid or liquid fuel to low sulfur liquid fuel containing very little amount of sulfur element or gas, so that nearly no change in concentration has been shown regularly. 12. Concerning changes of the concentration of throughout time, the concentration of NO has been shown relatively higher than that of $NO_2$ between 6AM~1PM and the concentration of $NO_2$ higher during the other time. The concentration of NOx(NO, $NO_2$) has been relatively high during weekday evenings. This result shows that there is correlation between the concentration of NOx and car traffics as we can see the Road transportation which accounts for 59.1% among the amount of NOx emission. 13. 49.1~61.2% of PM-10 shows PM-2.5 concerning the relationship between PM-10 and PM-2.5 and PM-2.5 among dust accounts for 45.4%~44.5% of PM-10 during March and April which is the lowest rates. This proves that particles of yellow sand that are bigger than the size $2.5\;{\mu}m$ are sent more than those that are smaller from China. This result shows that particles smaller than $2.5\;{\mu}m$ among dust exist much during July~August and December~January and 76.7% of minute dust is proved to be road transportation in Gwangju.

An Intelligent Intrusion Detection Model Based on Support Vector Machines and the Classification Threshold Optimization for Considering the Asymmetric Error Cost (비대칭 오류비용을 고려한 분류기준값 최적화와 SVM에 기반한 지능형 침입탐지모형)

  • Lee, Hyeon-Uk;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.157-173
    • /
    • 2011
  • As the Internet use explodes recently, the malicious attacks and hacking for a system connected to network occur frequently. This means the fatal damage can be caused by these intrusions in the government agency, public office, and company operating various systems. For such reasons, there are growing interests and demand about the intrusion detection systems (IDS)-the security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. The intrusion detection models that have been applied in conventional IDS are generally designed by modeling the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. These kinds of intrusion detection models perform well under the normal situations. However, they show poor performance when they meet a new or unknown pattern of the network attacks. For this reason, several recent studies try to adopt various artificial intelligence techniques, which can proactively respond to the unknown threats. Especially, artificial neural networks (ANNs) have popularly been applied in the prior studies because of its superior prediction accuracy. However, ANNs have some intrinsic limitations such as the risk of overfitting, the requirement of the large sample size, and the lack of understanding the prediction process (i.e. black box theory). As a result, the most recent studies on IDS have started to adopt support vector machine (SVM), the classification technique that is more stable and powerful compared to ANNs. SVM is known as a relatively high predictive power and generalization capability. Under this background, this study proposes a novel intelligent intrusion detection model that uses SVM as the classification model in order to improve the predictive ability of IDS. Also, our model is designed to consider the asymmetric error cost by optimizing the classification threshold. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, when considering total cost of misclassification in IDS, it is more reasonable to assign heavier weights on FNE rather than FPE. Therefore, we designed our proposed intrusion detection model to optimize the classification threshold in order to minimize the total misclassification cost. In this case, conventional SVM cannot be applied because it is designed to generate discrete output (i.e. a class). To resolve this problem, we used the revised SVM technique proposed by Platt(2000), which is able to generate the probability estimate. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 1,000 samples from them by using random sampling method. In addition, the SVM model was compared with the logistic regression (LOGIT), decision trees (DT), and ANN to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell 4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on SVM outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that our model reduced the total misclassification cost compared to the ANN-based intrusion detection model. As a result, it is expected that the intrusion detection model proposed in this paper would not only enhance the performance of IDS, but also lead to better management of FNE.

A Research Survey on the Characteristics of Burglaries: Focused on How to Utilize Private Security (침입절도 특성에 관한 조사연구: 민간경비 활용방안을 중심으로)

  • Kim, Dae-Kwon
    • Korean Security Journal
    • /
    • no.22
    • /
    • pp.15-35
    • /
    • 2010
  • A larceny means stealing others' properties, as one of crime types most closely connected with common people. Along with fraud, it is the mostly common property crime; in particular, the largest number of people are exposed to a burglary. This study aims to find the schemes to utilize private guards according to the characteristics of burglary. To do so, a questionnaire survey was conducted into an actual condition of official statistics of burglary and into the criminals of burglary, with a view to understanding the behavioral characteristics of burglary and suggesting defense mechanisms to prevent the crime. Burglary is not just a major crime to be dealt with by public guards like the police but also one to be handled increasingly more by private guards. It is why this study intends to identify how to utilize private guards in preventing the crime. Investigations were made into 208 burglars, who were inmates of 10 correctional institutions (prisons or detention houses) across the country. It is found that only about 24% of burglars committed the crime through rational choice, about 60.7% were feared of their arrest at the time of their crime, and a very high percentage (69.9%) of them were assured of their successful crime. Burglaries usually happened at night, mostly in a summer day when everybody goes away from home for vacation. Primarily, the crimes took place in a private house of urban residential quarters. What burglars considered mostly for target selection includes 'profitability,' followed by 'surveillance' and 'risk.' Most (42%) of them committed the crime for the first time ever. Generally, they were not inclined to commit the crime while under the influence of alcohol or drug, which might prevent them from making reasonable decisions. 73.9% of the criminals said that they committed the crime singly without any accomplices.

  • PDF

Research Trends in Driving Rehabilitation for the Disabled in South Korea since 2000 (국내 장애인 운전재활 연구동향: 2000년도 이후)

  • Jo, Eun-Ju;Noh, Dong-Hee;Kim, Kwang-Jae;Bae, Seon-Young;Kang, Seong-Ku;Moon, Seong-Bae;Kam, Kyung-Yoon
    • The Journal of Korean society of community based occupational therapy
    • /
    • v.8 no.1
    • /
    • pp.33-44
    • /
    • 2018
  • Objective : This study aims to review research trends in driving rehabilitation for the disabled in South Korea since 2000 suggesting research directions for clinicians and researchers. Methods : Fifty eight articles in 16 journals listed in accredited or candidate journal lists of National Research Foundation of Korea from January, 2000 to December, 2016 were reviewed. 'Driving rehabilitation' and 'driving for disabled' were used as search terms. Descriptive statistics were used to classify articles according to study methodology, levels of evidence, study participants, research topics, and academic associations or official journals. Results : Fifty percent of analyzed researches have been published since 2012. Twenty-two studies (37.9%) were published as group comparison and correlational research. Only seven studies (12.1%) were included in evidence level I. There were 19 studies (38.8%) conducted with brain-injured patients among 49 studies including participants. The Korean Society of Occupational Therapy Journal, having published 15 studies (25.9%) about driving rehabilitation, ranked first among the analyzed journals. In research topic, 15 (25.9%) studies were performed about clinical evaluation. Conclusion : The present study showed that the quality of driving rehabilitation-related studies has been increasing, but more intervention-based researches need to be carried out and it is also necessary to carry out various researches in related fields in order to establish efficient driving rehabilitation in Korea.

An Integrated Model based on Genetic Algorithms for Implementing Cost-Effective Intelligent Intrusion Detection Systems (비용효율적 지능형 침입탐지시스템 구현을 위한 유전자 알고리즘 기반 통합 모형)

  • Lee, Hyeon-Uk;Kim, Ji-Hun;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.1
    • /
    • pp.125-141
    • /
    • 2012
  • These days, the malicious attacks and hacks on the networked systems are dramatically increasing, and the patterns of them are changing rapidly. Consequently, it becomes more important to appropriately handle these malicious attacks and hacks, and there exist sufficient interests and demand in effective network security systems just like intrusion detection systems. Intrusion detection systems are the network security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. Conventional intrusion detection systems have generally been designed using the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. However, they cannot handle new or unknown patterns of the network attacks, although they perform very well under the normal situation. As a result, recent studies on intrusion detection systems use artificial intelligence techniques, which can proactively respond to the unknown threats. For a long time, researchers have adopted and tested various kinds of artificial intelligence techniques such as artificial neural networks, decision trees, and support vector machines to detect intrusions on the network. However, most of them have just applied these techniques singularly, even though combining the techniques may lead to better detection. With this reason, we propose a new integrated model for intrusion detection. Our model is designed to combine prediction results of four different binary classification models-logistic regression (LOGIT), decision trees (DT), artificial neural networks (ANN), and support vector machines (SVM), which may be complementary to each other. As a tool for finding optimal combining weights, genetic algorithms (GA) are used. Our proposed model is designed to be built in two steps. At the first step, the optimal integration model whose prediction error (i.e. erroneous classification rate) is the least is generated. After that, in the second step, it explores the optimal classification threshold for determining intrusions, which minimizes the total misclassification cost. To calculate the total misclassification cost of intrusion detection system, we need to understand its asymmetric error cost scheme. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, total misclassification cost is more affected by FNE rather than FPE. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 10,000 samples from them by using random sampling method. Also, we compared the results from our model with the results from single techniques to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell R4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on GA outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that the proposed model outperformed all the other comparative models in the total misclassification cost perspective. Consequently, it is expected that our study may contribute to build cost-effective intelligent intrusion detection systems.

The Present State and Curriculum Implementation Overview of the Nursing-Specialized Vocational High Schools (특성화고등학교 간호과 운영 현황 및 교육과정 운영실태 분석)

  • Yoon, In-Kyung;Jang, Myung-Hee;Lee, Hyun-Young
    • Journal of vocational education research
    • /
    • v.35 no.4
    • /
    • pp.19-46
    • /
    • 2016
  • The purpose of this study is to analyze the curriculum implementation of the Nursing-Specialized Vocational High School by researching on operation, organization and environment of the program of Nursing. This study aims to improve the curriculum of the Nursing-Specialized Vocational High School. This study has analyzed previous existing studies, Link of School info, Educational Statistics and data indicating establishment, operation and curriculum of the department of Nursing which have been collected from web sites of institutions and associations relevant to Nurse Education. The major results of this study are as follows: 1) As of the first semester of the year 2016, out of a total of thirty eight Specialized Vocational High Schools and Meister High Schools in the country, 6.4% of the schools have nursing educational programs. These schools have established the programs under various names, such as Health Nursing, Dental Health Nursing, Nursing, Nursing and Medical Tourism, Accounting in Nursing and Nursing Management, etc. Since 2012, enrollment rates have increased while post-graduation employment rates have decreased, with the average employment rate of Specialized Vocational High School graduates having reached up to 46% by 2015. 2) The Nursing-Specialized Vocational High School aims to create skilled Nurses Assistant such as Nurse Aide and Care giver. The program is successful in providing necessary courses to acquire required certification and proficient field experience but requires revisional changes in order to create a long-term program of sufficient qualification. The official requirement of 780 hours of field practice was completed during the three educational breaks from the first year of high school to the second year, while the curriculum was conducted separately in the field hospitals. 3) An average of two laboratory classrooms were available based on the facility requirement standard of Cities and Provinces Educational Policies. In order to secure proficient instructors of Nursing education, establishment of specific indicated subjects, regional placement, in-service education, research and supervision are essential for establishing excellence and continual improvement.

An Exploratory Study on the Status of and Demand for Higher Education Programs in Fashion in Myanmar (미얀마의 패션 고등교육 현황과 수요에 대한 탐색적 연구)

  • Kang, Min-Kyung;Jin, Byoungho Ellie;Cho, Ahra;Lee, Hyojeong;Lee, Jaeil;Lee, Yoon-Jung
    • Journal of Korean Home Economics Education Association
    • /
    • v.34 no.3
    • /
    • pp.1-23
    • /
    • 2022
  • This study examined the perceptions of Myanmar university students and professors regarding the status and necessity of higher education programs in fashion. Data were collected from professors in textile engineering at Yangon Technological University and Myanmar university students. Closed- and open-ended questions were asked either through interviews or by email. The responses were analyzed using keyword extraction and categorization, and descriptive statistics(closed questions). Generally, the professors perceived higher education, as well as the cultural industries including art and fashion, as important for Myanmar's social and economic development. According to the students interests in pursuing a degree in textile were limited, despite the high interest in fashion. Low wages in the apparel industry and lack of fashion degrees that meet the demand of students were cited as reasons. The demand was high for educational programs in fashion product development, fashion design, pattern-making, fashion marketing, branding, management, costume history, and cultural studies. Students expected to find their future career in textiles and clothing factories. Many students wanted to be hired by global fashion brands for higher salaries and training for advanced knowledge and technical skills. They perceived advanced fashion education programs will have various positive effects on Myanmar's national economy.