Search | Korea Science

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
- Journal of Intelligence and Information Systems
- /
- v.24 no.3
- /
- pp.21-44
- /
- 2018
In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.
https://doi.org/10.13088/jiis.2018.24.3.021 인용 PDF KSCI

Social Learning Values in the Justification Discourses for One Million-pyeong Park, Busan, South Korea (담론분석을 통한 100만평공원운동의 사회학습적 가치)

Lee, Sungkyung;Kim, Seung-Hwan
- Journal of the Korean Institute of Landscape Architecture
- /
- v.41 no.5
- /
- pp.19-27
- /
- 2013
This paper claims that the One Million-peyong Park(hereafter abbreviated as OMP) project is different from a typical citizen participatory park project by recognizing the exceptional leadership of the Civic Committee for the One Million-pyeong Park Construction(CCOMPC) in promoting and developing the OMP project. Since 2001 the CCOMPC has published a variety of written promotional materials to inform and educate the public about the project. In terms of approaching the promotional materials, this research focuses on the use of language on how the CCOMPC justifies the OMP project, namely the OMP justification discourse, and considers the discourse as a unique form of social document that represents the perspective of the CCOMPC in explaining the local environmental issues and values of urban parks to the public. Using a discourse analysis method, this research analyzes the justification discourses and investigates how they changed over the three main development phases of the OMP: the initiation and preliminary development phase(1999-2001.2), the development phase (2001.2-2008), and the time period after the greenbelt policy release on Dunchi Island(2008-present). In each discourse, the OMP project is rationalized as a citizen participation park project that (1) aims to enhance the quality of public green space in Busan, (2) is accompanied by various community engagement programs that emphasize the value of urban nature and environmental education to expand citizen participation, and (3) has contributed to the National Urban Park Bill. This research emphasizes the role of the discourses in helping the public gain a critical understanding about the local environment and values of urban parks. By analyzing the contents of the discourses, it explains the social learning values of the OMP expressed in the discourses.
https://doi.org/10.9715/KILA.2013.41.5.019 인용 PDF KSCI

Modeling and Intelligent Control for Activated Sludge Process (활성슬러지 공정을 위한 모델링과 지능제어의 적용)

Cheon, Seong-pyo;Kim, Bongchul;Kim, Sungshin;Kim, Chang-Won;Kim, Sanghyun;Woo, Hae-Jin
- Journal of Korean Society of Environmental Engineers
- /
- v.22 no.10
- /
- pp.1905-1919
- /
- 2000
The main motivation of this research is to develop an intelligent control strategy for Activated Sludge Process (ASP). ASP is a complex and nonlinear dynamic system because of the characteristic of wastewater, the change in influent flow rate, weather conditions, and etc. The mathematical model of ASP also includes uncertainties which are ignored or not considered by process engineer or controller designer. The ASP is generally controlled by a PID controller that consists of fixed proportional, integral, and derivative gain values. The PID gains are adjusted by the expert who has much experience in the ASP. The ASP model based on $Matlab^{(R)}5.3/Simulink^{(R)}3.0$ is developed in this paper. The performance of the model is tested by IWA(International Water Association) and COST(European Cooperation in the field of Scientific and Technical Research) data that include steady-state results during 14 days. The advantage of the developed model is that the user can easily modify or change the controller by the help of the graphical user interface. The ASP model as a typical nonlinear system can be used to simulate and test the proposed controller for an educational purpose. Various control methods are applied to the ASP model and the control results are compared to apply the proposed intelligent control strategy to a real ASP. Three control methods are designed and tested: conventional PID controller, fuzzy logic control approach to modify setpoints, and fuzzy-PID control method. The proposed setpoints changer based on the fuzzy logic shows a better performance and robustness under disturbances. The objective function can be defined and included in the proposed control strategy to improve the effluent water quality and to reduce the operating cost in a real ASP.
PDF

Opportunity Tree Framework Design For Optimization of Software Development Project Performance (소프트웨어 개발 프로젝트 성능의 최적화를 위한 Opportunity Tree 모델 설계)

Song Ki-Won;Lee Kyung-Whan
- The KIPS Transactions:PartD
- /
- v.12D no.3 s.99
- /
- pp.417-428
- /
- 2005
Today, IT organizations perform projects with vision related to marketing and financial profit. The objective of realizing the vision is to improve the project performing ability in terms of QCD. Organizations have made a lot of efforts to achieve this objective through process improvement. Large companies such as IBM, Ford, and GE have made over $80\%$ of success through business process re-engineering using information technology instead of business improvement effect by computers. It is important to collect, analyze and manage the data on performed projects to achieve the objective, but quantitative measurement is difficult as software is invisible and the effect and efficiency caused by process change are not visibly identified. Therefore, it is not easy to extract the strategy of improvement. This paper measures and analyzes the project performance, focusing on organizations' external effectiveness and internal efficiency (Qualify, Delivery, Cycle time, and Waste). Based on the measured project performance scores, an OT (Opportunity Tree) model was designed for optimizing the project performance. The process of design is as follows. First, meta data are derived from projects and analyzed by quantitative GQM(Goal-Question-Metric) questionnaire. Then, the project performance model is designed with the data obtained from the quantitative GQM questionnaire and organization's performance score for each area is calculated. The value is revised by integrating the measured scores by area vision weights from all stakeholders (CEO, middle-class managers, developer, investor, and custom). Through this, routes for improvement are presented and an optimized improvement method is suggested. Existing methods to improve software process have been highly effective in division of processes' but somewhat unsatisfactory in structural function to develop and systemically manage strategies by applying the processes to Projects. The proposed OT model provides a solution to this problem. The OT model is useful to provide an optimal improvement method in line with organization's goals and can reduce risks which may occur in the course of improving process if it is applied with proposed methods. In addition, satisfaction about the improvement strategy can be improved by obtaining input about vision weight from all stakeholders through the qualitative questionnaire and by reflecting it to the calculation. The OT is also useful to optimize the expansion of market and financial performance by controlling the ability of Quality, Delivery, Cycle time, and Waste.
https://doi.org/10.3745/KIPSTD.2005.12D.3.417 인용 PDF KSCI

IPv6 Migration, OSPFv3 Routing based on IPv6, and IPv4/IPv6 Dual-Stack Networks and IPv6 Network: Modeling, and Simulation (IPv6 이관, IPv6 기반의 OSPFv3 라우팅, IPv4/IPv6 듀얼 스택 네트워크와 IPv6 네트워크: 모델링, 시뮬레이션)

Kim, Jeong-Su
- The KIPS Transactions:PartC
- /
- v.18C no.5
- /
- pp.343-360
- /
- 2011
The objective of this paper is to analyze and characterize to simulate routing observations on end-to-end routing circuits and a ping experiment of a virtual network after modeling, such as IPv6 migration, an OSPFv3 routing experiment based on an IPv6 environment, and a ping experiment for IPv4/IPv6 dual-stack networks and IPv6 network for OSPFv3 routing using IPv6 planning and operations in an OPNET Modeler. IPv6 deployment based largely on the integrated wired and wireless network was one of the research tasks at hand. The previous studies' researchers recommended that future research work be done on the explicit features of both OSPFv3 and EIGRP protocols in the IPv4/IPv6 environment, and more research should be done to explore how to improve the end-to-end IPv6 performance. Also, most related work was performed with an IPv4 environment but lacked studies related to the OSPFv3 virtual network based on an end-to-end IPv6 environment. Hence, this research continues work in previous studies in analyzing IPv6 migration, an OSPFv3 routing experiment based on IPv6, and a ping experiment for IPv4/IPv6 dual-stack networks and IPv6 network for OSPFv3 routing. In the not too distant future, before enabling the default IPv6, it would help to understand network design and deployment based on an IPv6 environment through IPv6 planning and operations for the end-user perspective such as success or failure of connection on IPv6 migration, exploration of an OSPFv3 routing circuit based on an end-to-end IPv6 environment, and a ping experiment for IPv4/IPv6 dual-stack networks and IPv6 network for OSPFv3 routing. We were able to observe an optimal route for modeling of an end-to-end virtual network through simulation results as well as find what appeared to be a fast ping response time VC server to ensure Internet quality of service better than an HTTP server.
https://doi.org/10.3745/KIPSTC.2011.18C.5.343 인용 PDF KSCI

Studies on Wood Quality of Pinus koraiensis Sieb. et Zucc. (III) - On Annual Ring Width and Summer Wood Percentage - (잣나무의 재질(材質)에 관(關)한 연구(硏究) (제(第) III 보(報)) - 연륜폭(年輪幅)과 추재율(秋材率) -)

Lee, Won Yong
- Journal of Korean Society of Forest Science
- /
- v.24 no.1
- /
- pp.25-44
- /
- 1974
In the present paper I described the results of the observations made on the visual characteristics such as the annual ring width and summerwood percentage of Pinus koraiensis Sieb. et Zucc. grown at out university forest. The results of the study are as follows: Characteristics of annual ring width and summerwood percentage 1. The range of dispersion of annual ring width and summerwood percentage are respectively 0.5-6.5mm and 5-50% on the normal wood and its arithmetic mean values are each 3.0mm and 24% on all sample trees. 2. The values of annual ring width of heart wood are larger than that of sapwood but on the contrary the values of summerwood percentage of heartwood are smaller than that of sapwood. On the other hand variations of these values are distingushed on the heartwood. 3. The values of annual ring width due to the parts of stem with crown. with clear length and at bottom showed that the largest values are given at the parts of stem with crown. But on the contrary the summerwood percentage values are largest at the parts of stem at bottom on all sample trees. 4. The values of annual ring width and summerwood percentage depending on the stand sides are not obvious. Horizontal and vertical variations of annual ring width and summerwood percentage 5. It was recognized that horizontal (radial direction) variations of annual ring width and summerwood percentage indicated two different patterns (the region of large fluctuation and that of small fluctuation) in a tree stem. These boundaries are seemed to appear at the parts of 12-15 annual rings from pith. 6. According to the increase of height in tree the values of annual ring width increase but the values of summerwood percentage gradually decrease. 7. But vertical variations of annual ring width and summerwood percentage on the sapwood are divided into two different parts (region of increased or decreased upwards and that of remained constant in successive height) in a tree stem and these limits are seemed to appear at the 7m of height in trees. Relations between annual ring width and summerwood percentage 8. The modes of summerwood percentage related with annual ring width are seemed to appear almost in the definite range (10-25%). 9. The relations between annual ring width and summerwood percentage show a highly negatine correlation on all sample trees.
PDF

Analysis of Household Textbooks for MiddleㆍHigh School in Colonial Age (식민지 시대 '가사교과서'에 관한 연구: 1930년대를 중심으로)

Jun Mi-Kyung
- Journal of Korean Home Economics Education Association
- /
- v.16 no.3
- /
- pp.1-25
- /
- 2004
This study analyzes the external forms of the household textbooks and also the contents of them used at girls' middleㆍhigh schools during the period of Japanese ruling over Korea. To this end, 8 household textbooks published from 1928 to 1937 were analyzed. The results of the study are summarized as follows. 1. The household subject had become the one of the most important subjects to girl students as the practical uses were emphasized in educational area during the period. As a result. the classes of the household were the second in hours, following the class of Japanese (the national language) to girl students. 2. The contents of the household textbooks were intended to contain 'the modern' and 'the newest'. The students were also suggested to apply the contents of the textbooks to real home life. Many pictures, photos and illustrations were included in household textbooks to help students to understand the contents of the subject. 3. The purposes of the household class were the reformation of the living conditions and home economics. 4. The external characteristics of the household textbooks during the period were as follows. - Written in Japanese vertically and the size of the textbook was A5 (150/210) with pulp paper of good quality - The type style of the body of the textbooks was Ming-style type- The sequent order of the textbooks was the outer cover, the title page, pictorial, introduction, table of contents, the body, appendix and the back cover. 5. The household textbooks consisted of the first volume and the second volume. The first volume contained clothing and textiles, food and nutrition and housing. Taking care of the aged. nursing. child care, household economy and home management were included in the second volume. 6. The household textbooks were designed to make women the housewives.
PDF

Analysis of Relationship between Sanitary Knowledge and Sanitary Management Performance of School Foodservice Employees in Gyeongnam (경남 일부지역 학교급식 조리종사자의 위생지식과 위생관리 수행도의 관계 분석)

An, Jeong-Mi;Kim, Hyun-Ah
- Journal of the Korean Society of Food Science and Nutrition
- /
- v.42 no.7
- /
- pp.1139-1147
- /
- 2013
The purpose of this study was to analyze the relationship between sanitary knowledge score and sanitary management performance among school foodservice employees. For this purpose, a paper-based questionnaire was developed and distributed to 300 school foodservice employees in Jinhae-gu, Changwon from May 13 to June 10 in 2009. A total of 276 responses were received and analyzed. The results of this study were as follows. The sanitary knowledge score of school foodservice employees was 16.60 (total score: 20). Their sanitary management performance level was 4.77 (based on a 5-point Likert scale). We found that sanitary management performance level of high sanitary knowledge score group was significantly higher than that of low sanitary knowledge score group (P<0.001). There was a significant positive correlation between sanitary knowledge score and sanitary management performance of school foodservice employees (P<0.01). Regression analysis showed that sanitary knowledge score of school foodservice employees had a positive effect on sanitary management performance (P<0.001). It implies that as school foodservice employees' sanitary knowledge increased, their sanitary management performance increased. In conclusion, to improve the sanitary quality of school foodservice, school foodservice employees' sanitary management performance level should be increased by improving their sanitary knowledge. So, a systematic and consistent sanitary education program should be conducted for school foodservice employees.
https://doi.org/10.3746/jkfn.2013.42.7.1139 인용 PDF KSCI

Design and Full Size Flexural Test of Spliced I-type Prestressed Concrete Bridge Girders Having Holes in the Web (분절형 복부 중공 프리스트레스트 콘크리트 교량 거더의 설계 및 실물크기 휨 실험 분석)

Han, Man Yop;Choi, Sokhwan;Jeon, Yong-Sik
- KSCE Journal of Civil and Environmental Engineering Research
- /
- v.31 no.3A
- /
- pp.235-249
- /
- 2011
A new form of I-type PSC bridge girder, which has hole in the web, is proposed in this paper. Three different concepts were combined and implemented in the design. First of all, a girder was precast at a manufacturing plant as divided pieces and assembled at the construction site using post-tensioning method, and the construction period at the site will be reduced dramatically. In this way, the quality of concrete can be assured at the manufacturing factory and concrete curing can be well controlled, and the spliced girder segments can be moved to the construction site without a transportation problem. Secondly, a numerous number of holes was made in the web of the girder. This reduces the self-weight of the girder. But more important thing related to the holes is that about half of the total anchorages can be moved from the girder ends into individual holes. The magnitude of negative moment developed at girder ends will be reduced. Also, since the longitudinal compressive stresses are reduced at ends, thick end diaphragm is not necessary. Thirdly, Prestressing force was introduced into the member through multiple stages. This concept of multi-stage prestressing method overcomes the prestressing force limit restrained by the allowable stresses at each loading stage, and maximizes the magnitude of applicable prestressing force. It makes the girder longer and shallower. Two 50 meter long full scale girders were fabricated and tested. One of them was non-spliced, or monolithic girder, made as one piece from the beginning, and the other one was assembled using post-tensioning method from five pieces of segments. It was found from the result that monolithic and spliced girder show similar load-deflection relationships and crack patterns. Girders satisfied specific girder design specification in flexural strength, deflection, and live load deflection control limit. Both spliced and monolithic holed web post-tensioned girders can be used to achieve span lengths of more than 50m with the girder height of 2 m.
https://doi.org/10.12652/Ksce.2011.31.3A.235 인용 PDF KSCI

Novel LTE based Channel Estimation Scheme for V2V Environment (LTE 기반 V2V 환경에서 새로운 채널 추정 기법)

Chu, Myeonghun;Moon, Sangmi;Kwon, Soonho;Lee, Jihye;Bae, Sara;Kim, Hanjong;Kim, Cheolsung;Kim, Daejin;Hwang, Intae
- Journal of the Institute of Electronics and Information Engineers
- /
- v.54 no.3
- /
- pp.3-9
- /
- 2017
Recently, in 3rd Generation Partnership Project(3GPP), there is a study of the Long Term Evolution(LTE) based vehicle communication which has been actively conducted to provide a transport efficiency, telematics and infortainment. Because the vehicle communication is closely related to the safety, it requires a reliable communication. Because vehicle speed is very fast, unlike the movement of the user, radio channel is rapidly changed and generate a number of problems such as transmission quality degradation. Therefore, we have to continuously updates the channel estimates. There are five types of conventional channel estimation scheme. Least Square(LS) is obtained by pilot symbol which is known to transmitter and receiver. Decision Directed Channel Estimation(DDCE) scheme uses the data signal for channel estimation. Constructed Data Pilot(CDP) scheme uses the correlation characteristic between adjacent two data symbols. Spectral Temporal Averaging(STA) scheme uses the frequency-time domain average of the channel. Smoothing scheme reduces the peak error value of data decision. In this paper, we propose the novel channel estimation scheme in LTE based Vehicle-to-Vehicle(V2V) environment. In our Hybrid Reliable Channel Estimation(HRCE) scheme, DDCE and Smoothing schemes are combined and finally the Linear Minimum Mean Square Error(LMMSE) scheme is applied to minimize the channel estimation error. Therefore it is possible to detect the reliable data. In simulation results, overall performance can be improved in terms of Normalized Mean Square Error(NMSE) and Bit Error Rate(BER).
https://doi.org/10.5573/ieie.2017.54.3.3 인용 PDF KSCI

Search Result 18,826, Processing Time 0.049 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)