Search | Korea Science

Efficient Topic Modeling by Mapping Global and Local Topics (전역 토픽의 지역 매핑을 통한 효율적 토픽 모델링 방안)

Choi, Hochang;Kim, Namgyu
- Journal of Intelligence and Information Systems
- /
- v.23 no.3
- /
- pp.69-94
- /
- 2017
Recently, increase of demand for big data analysis has been driving the vigorous development of related technologies and tools. In addition, development of IT and increased penetration rate of smart devices are producing a large amount of data. According to this phenomenon, data analysis technology is rapidly becoming popular. Also, attempts to acquire insights through data analysis have been continuously increasing. It means that the big data analysis will be more important in various industries for the foreseeable future. Big data analysis is generally performed by a small number of experts and delivered to each demander of analysis. However, increase of interest about big data analysis arouses activation of computer programming education and development of many programs for data analysis. Accordingly, the entry barriers of big data analysis are gradually lowering and data analysis technology being spread out. As the result, big data analysis is expected to be performed by demanders of analysis themselves. Along with this, interest about various unstructured data is continually increasing. Especially, a lot of attention is focused on using text data. Emergence of new platforms and techniques using the web bring about mass production of text data and active attempt to analyze text data. Furthermore, result of text analysis has been utilized in various fields. Text mining is a concept that embraces various theories and techniques for text analysis. Many text mining techniques are utilized in this field for various research purposes, topic modeling is one of the most widely used and studied. Topic modeling is a technique that extracts the major issues from a lot of documents, identifies the documents that correspond to each issue and provides identified documents as a cluster. It is evaluated as a very useful technique in that reflect the semantic elements of the document. Traditional topic modeling is based on the distribution of key terms across the entire document. Thus, it is essential to analyze the entire document at once to identify topic of each document. This condition causes a long time in analysis process when topic modeling is applied to a lot of documents. In addition, it has a scalability problem that is an exponential increase in the processing time with the increase of analysis objects. This problem is particularly noticeable when the documents are distributed across multiple systems or regions. To overcome these problems, divide and conquer approach can be applied to topic modeling. It means dividing a large number of documents into sub-units and deriving topics through repetition of topic modeling to each unit. This method can be used for topic modeling on a large number of documents with limited system resources, and can improve processing speed of topic modeling. It also can significantly reduce analysis time and cost through ability to analyze documents in each location or place without combining analysis object documents. However, despite many advantages, this method has two major problems. First, the relationship between local topics derived from each unit and global topics derived from entire document is unclear. It means that in each document, local topics can be identified, but global topics cannot be identified. Second, a method for measuring the accuracy of the proposed methodology should be established. That is to say, assuming that global topic is ideal answer, the difference in a local topic on a global topic needs to be measured. By those difficulties, the study in this method is not performed sufficiently, compare with other studies dealing with topic modeling. In this paper, we propose a topic modeling approach to solve the above two problems. First of all, we divide the entire document cluster(Global set) into sub-clusters(Local set), and generate the reduced entire document cluster(RGS, Reduced global set) that consist of delegated documents extracted from each local set. We try to solve the first problem by mapping RGS topics and local topics. Along with this, we verify the accuracy of the proposed methodology by detecting documents, whether to be discerned as the same topic at result of global and local set. Using 24,000 news articles, we conduct experiments to evaluate practical applicability of the proposed methodology. In addition, through additional experiment, we confirmed that the proposed methodology can provide similar results to the entire topic modeling. We also proposed a reasonable method for comparing the result of both methods.
https://doi.org/10.13088/jiis.2017.23.3.069 인용 PDF KSCI

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

Yoo, So-yeon;Lim, Gyoo-gun
- Journal of Intelligence and Information Systems
- /
- v.28 no.1
- /
- pp.155-174
- /
- 2022

From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

) were the topic modeling results for each research topic (

) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.

https://doi.org/10.13088/jiis.2022.28.1.155 인용 PDF KSCI

MCC의 부유부상 효율에 미치는 MCC의 표면에너지와 액상의 표면장력의 영향에 대한 기초연구

Lee, Hak-Rae;Lee, Jin-Hui;Park, Il;Lee, Yong-Min;Han, Sin-Ho;Jo, Jung-Yeon
- Proceedings of the Korea Technical Association of the Pulp and Paper Industry Conference
- /
- 2001.11a
- /
- pp.20-20
- /
- 2001
우리나라 제지산업은 화학펼프의 80%를 수입에 의존하고 었으나 고지회수율 및 이용율이 세계적으로 볼 때 매우 높은 환경친화적 산업이다. 고지 재활용 공정 중에 서 가장 핵심적인 공정인 부유부상 공정은 고상계의 표면특성 차이를 이용하여 소수성 의 잉크업자를 기포에 부착시켜 부상을 통하여 제거하는 공정이다. 고지 사용의 고도화 를 위해서는 부유부상 공정의 효율 증대가 절실히 요구되고 있다. 또한 부유부상 공정 의 핵심적인 인자로 부유부상을 통하여 제거되는 고형물질의 표면 특성 특히 소수화도 가 중요하다는 것은 보고된 바 있으나 부유부상에 필요한 표면 특성의 존재 여부와 표 면 에너지와 부유부상 효율의 관계 등에 관한 기본적인 연구가 더욱 필요한 실정이다. 이에 본 연구에서는 부유부상 공정을 기초과학적 측면에서 규명하기 위해 마 이 크로 크리 스탈린 셀룰로오스(Microcrystalline cellulose: MCC)를 모델 물질로 사용하 고 이들의 표면특성을 접촉각 측정을 통하여 평가하였다. 친수성의 표면 특성을 지닌 M MCC의 표면 특성을 소수성으로 바꾸기 위하여 AKD(alkyl ketene dimer)의 함량별로 사이징 처리하여 소수성을 지닌 잉크를 모벨링 하고 친수성 MCC를 염색시약을 이용 하여 흑색으로 염색함으로써 소수화 된 MCC와의 색차를 두어 섬유를 모델링 하였다. 이렇게 제조된 MCC의 소수화 정도를 평가하기 위하여 분말상태인 MCC를 pellet으로 제조하여 각기 다른 표면장력과 표변특성을 지난 용액을 이용하여 Advancing Contact A Angle을 측정하고 다양한 방법으로 이를 분석하여 시료의 표면에너지를 평가하였다 그 리고 부유부상 셀내의 액상의 이온강도와 표면장력 등 화학적인 인자에 의한 부유부상 분리효과를 평가하였다.있었다 (그림 2). 칼렌다는 종이를 높은 전단력과 압축력으로 변형시키는데 비해 도침은 단순히 압축 압력만을 종이에 가하는 것이 다르다고 볼 수 있는데, 라 이너지와 백상지가 같은 조건하에서 왜 이러한 큰 차이를 보이는 이유를 아직 알수 없다.해 동일한 공정 데이터들올 이용하여 보편적으로 사용하는 통계기법 중의 하나인 주성분회귀분석을 실시하였다. 주성분 분석은 여러 개의 반응변수에 대하여 얻어진 다변량 자료의 다차원적인 변 수들을 축소, 요약하는 차원의 단순화와 더불어 서로 상관되어있는 반응변수들 상호간 의 복잡한 구조를 분석하는 기법이다. 본 발표에서는 공정 자료를 활용하여 인공신경망 과 주성분분석을 통해 공정 트러블의 발생에 영향 하는 인자들을 보다 현실적으로 추 정하고, 그 대책을 모색함으로써 이를 최소화할 수 있는 방안을 소개하고자 한다.금 빛 용사 둥과 같은 표면처리를 할 경우임의 소재 표면에 도금 및 용 사에 용이한 재료를 오버레이용접시킨 후 표면처리를 함으로써 보다 고품질의 표면층을 얻기위한 시도가 이루어지고 있다. 따라서 국내, 외의 오버레이 용접기술의 적용현황 및 대표적인 적용사례, 오버레이 용접기술 및 용접재료의 개발현황 둥을 중심으로 살펴봄으로서 아직 국내에서는 널리 알려지지 않은 본 기 술의 활용을 넓이고자 한다. within minimum time from beginning of the shutdown.및 12.36%, $101{\sim}200$일의 경우 12.78% 및 12.44%, 201일 이상의 경우 13.17% 및 11.30%로 201일 이상의 유기의 경우에만 대조구와 삭제 구간에 유의적인(p<0.05) 차이를 나타내었다.는 담수(淡水)에서 10%o의 해수(海水)로 이주된지 14일(日) 이후에 신장(腎臟)에서 수축된
PDF

The Inelastic Behavior of High Strength Reinforced Concrete Tall Walls (고강도 철근콘크리트 고층형 내력벽의 비탄성 거동에 관한 실험 연구)

윤현도;정학영;최창식;이리형
- Magazine of the Korea Concrete Institute
- /
- v.7 no.3
- /
- pp.139-148
- /
- 1995
The test results from three one fourth scale models using high strength Reinforced Concrete $f_x=704\;kg/cm^2,\;f_y=5.830\;kg/cm^2$ are presented. Such specimens are considered to represent the critical 3 storics of 60-story tall building of a structural wall system in area of high seismicity respectively. They are tested under inplane vertical and horizontal loading. The main varlable is the level of axial stress. The amounts of vertical and horizontal reinforcement are identical for the three walls testcd. The cross-section of all walls is barbell shape. The aspectratio($h_w/I_w$) of test specimen is 1.8. The aim of the study is to investigate the effects of levels of applied axial stresses on the inelastic behavior of high-strength R /C tall walls. Experimental results of high strength R /C tall walls subjected to axial load and simulated sels rnic loading show that it is possible to insure a ductlle dominant performance by promotmg flex ural yielding of vertical reinforcement and that axial stresses within $O.21f_x$ causes an increase in horizontal load-carrying capacity, initial secant st~ffness characteristics, but an decrease in displacement ductility. energy dissipation index and work damage index of high strength K /C tall walls
https://doi.org/10.22636/MKCI.1995.7.3.139 인용 PDF

Principles of Simulated Moving Bed Reactor(SMBR) (Simulated Moving Bed Reactor(SMBR)의 원리)

Song, Jae-Ryong;Kim, Jin-Il;Koo, Yoon-Mo
- Korean Chemical Engineering Research
- /
- v.49 no.2
- /
- pp.129-136
- /
- 2011
Simulated Moving Bed(SMB) process consists of multiple chromatographic columns, which are usually partitioned into four zones. Such a process characteristic allows a continuous binary separations those are impracticable in conventional batch chromatographic processes. Compared with batch chromatography, SMB has advantages of continuity, high purity and productivity. Various researches have been reported for the integration of reaction and recovery during process operation on the purpose of economics and effectiveness. Simulated Moving Bed Reactor(SMBR) is introduced to combine SMB as a continuous separation process and reactor. Several cases of SMBR have been reported for diverse reactions with catalytic, enzymatic and chemical reaction on ion exchange resin as main streams. With an early type of fixed bed using catalyst, SMBR has been developed as SMB using fluidized enzyme, SMB with immobilized enzyme and SMB with discrete reaction region. For simple modeling and optimization of SMBR, a method considering convection only is possible. A complex method considering axial dispersion and mass transfer resistance is needed to explain the real behavior of solutes in SMBR. By combining reaction and separation, SMBR has benefits of lower installation cost by minimizing equipment use, higher purity and yield by avoiding the equilibrium restriction in case of reversible reaction.
https://doi.org/10.9713/kcer.2011.49.2.129 인용 PDF KSCI

The Analysis of Informatics Gifted Elementary Students' Computational Problem Solving Approaches in Puzzle-Based Learning (퍼즐 기반 학습에서 초등정보영재의 컴퓨팅적 문제 해결 접근법 분석)

Lee, Eunkyoung;Choi, JeongWon;Lee, Youngjun
- Journal of the Korea Society of Computer and Information
- /
- v.19 no.1
- /
- pp.191-201
- /
- 2014
The purpose of this study is to propose strategies of puzzle-based learning for Informatics gifted education through analyzing Informatics gifted elementary students' computational problem solving approaches in puzzle-based learning contexts. Six types of educational puzzles, which are constraints, optimization, probability, statistically speaking, pattern recognition, and strategy, were used in teaching 14 Informatics gifted students for 8 sessions. The results of pre and post test and each students' answers were analyzed to identify why students were not able to solve the puzzles. We also analysed what essential computational strategies are needed to solve each type of puzzles, and what students did not know in solving puzzle problems. We identified some problems caused by puzzle representation methods, and various students' intuitions that disturb puzzle solving. Also, we identified essential computational strategies to solve puzzles: backtracking, dynamic programming, abstraction, modeling, and reduction of big problem. However, students had difficulties in applying these strategies to solve their puzzle problems. We proposed the revised puzzle-based learning strategies, which is based on the improved problem representation, just-in-time cognitive feedbacks, and web-based learning system.
https://doi.org/10.9708/jksci.2014.19.1.191 인용 PDF KSCI

Effects of Change in Heat Release Rate on Unsteady Fire Characteristics in a Semi-Closed Compartment (반밀폐된 구획에서 발열량 변화에 따른 비정상 화재특성)

Hwang, Cheol-Hong
- Fire Science and Engineering
- /
- v.26 no.2
- /
- pp.75-83
- /
- 2012
An experimental study was conducted to investigate the effects of change in heat release rate on unsteady fire characteristics of under-ventilated fire in a semi-closed compartment. A standard doorway width of the full-scale ISO 9705 room was modified to 0.1 m and the flow rate of heptane fuel was increased linearly with time using a spray nozzle located at the center of enclosure. Temperature, heat flux, species concentrations and heat release rate were continuously measured and then global equivalence ratio (GER) concept was adopted to represent the unsteady thermal and chemical characteristics inside the compartment. It was observed that there was a significant difference in unsteady behavior between global and local combustion efficiency, and the GERs predicted by ideal and measured heat release rate were also shown different results in time. The unsteady behaviors of temperature, heat flux and species concentrations were represented well using the GER concept. It was important to note that CO concentration was gradually decreased with the increase in GER after reaching its maximum value in the range of 2.0~3.0 of global equivalence ratio. In addition, the experimental data on unsteady thermal and chemical behaviors obtained in a semi-closed compartment will be usefully used to validate a realistic fire simulation.
https://doi.org/10.7731/KIFSE.2012.26.2.075 인용 PDF KSCI

Performance Evaluation of FDS for Predicting the Unsteady Fire Characteristics in a Semi-Closed ISO 9705 Room (반밀폐된 ISO 9705 화재실에서 비정상 화재특성 예측을 위한 FDS의 성능평가)

Mun, Sun-Yeo;Hwang, Cheol-Hong
- Fire Science and Engineering
- /
- v.26 no.3
- /
- pp.21-28
- /
- 2012
The objective of this study is to evaluate the prediction accuracy of FDS(Fire Dynamic Simulator) for the thermal and chemical characteristics of under-ventilated fire with unsteady fire growth in a semi-closed compartment. To this end, a standard doorway width of the full-scale ISO 9705 room was modified to 0.1 m and the flow rate of heptane fuel was increased linearly with time (until maximum 2.0 MW based on ideal heat release rate) using a spray nozzle located at the center of enclosure. To verify the capability of FDS, the predicted results were compared with a previous experimental data under the identical fire conditions. It was observed that with an appropriate grid system, the numerically predicted temperature and heat flux inside the compartment showed reasonable agreement with the experimental data. On the other hand, there were considerable limitations to predict accurately the unsteady behaviors of CO and $CO_2$ concentration under the condition of continuous fire growth. These results leaded to a discrepancy between the present evaluation of FDS and the previous evaluation conducted for steady-state under-ventilated fires. It was important to note that the prediction of transient CO production characteristics using FDS was approached carefully for the under-ventilated fire in a semi-closed compartment.
https://doi.org/10.7731/KIFSE.2012.26.3.021 인용 PDF KSCI

Optimal Sensor Allocation for Health Monitoring of Roller-Coaster Structure (롤러코스터의 모니터링을 위한 최적 센서 구성)

Heo, Gwang Hee;Jeon, Seung Gon;Park, In Joon
- Journal of the Korea institute for structural maintenance and inspection
- /
- v.15 no.4
- /
- pp.165-174
- /
- 2011
This research aims at the optimal constitution of sensors required to identify the structural shortcoming of roller-coaster. In this research we analyzed the dynamic characteristics of roller-coaster by three dimensional FE modelling, decided on the appropriate location and number of sensors through optimal transducer theory, abstracted the mathematical value of modal features before and after damage on the basis of optimally placed and numbered sensors. and then presented it as a primary information about the basic structure which would be applied to damage estimation. As a target structure, the roller-coater at Seoul Children's Grand Park was chosen and built as a model reduced by one twentieth in size. In order to consider the Kinetics features particular to the roller-coaster structure, we made an exact three-dimensional FE modelling for the model structure by means of Spline function. As for the proper location and number of sensors, it was done by applying EIM and EOT. We also estimated the damage from the combination of strength, flexibility, and model corelation after abstracting the value of modal features. Finally the optimal transducer theory presented here in this research was proved to be valid, and the structural damage was well identified through changes in strength and flexibility. As a result, we were able to present the optimal constitution of sensors needed for the analysis of dynamic characteristics and the development of techniques in dynamic characteristics, which would ultimately contribute to the development of health monitoring for roller-coaster.
https://doi.org/10.11112/jksmi.2011.15.4.165 인용 PDF KSCI

Aeroelastic Tailoring of a Forward-Swept Wing Using One-dimensional Beam Analysis (1차원 보 해석을 활용한 전진익 항공기의 복합적층 날개 공력탄성학적 테일러링)

Choi, JaeWon;Lim, ByeongUk;Lee, SiHun;Shin, SangJoon
- Journal of the Korean Society for Aeronautical & Space Sciences
- /
- v.48 no.8
- /
- pp.555-563
- /
- 2020
Foward-swept wings are known to possess superior aerodynamic performance compared to the conventional straight wings. However major concerns regarding forward-swept wings include divergence at lower airspeeds which require careful consideration at the design stage. As an endeavor to overcome such drawbacks, aeroelastic tailoring is attempted. In order to find an optimal ply sequence, recursive aeroelastic analyses is conducted and one-dimensional beam analysis coupled with simple aerodynamics is used for the improved computational efficiency and modelling convenience. The analysis used in this paper, DYMORE and analytic formula, both use one-dimensional beam model for the structure. Cross-sectional analysis for multi-cell NACA0015 airfoil section is conducted using VABS and oblique function is used for the sweep angle. Throughout the present aeroelastic tailoring, the maximum divergence speed of 290.2m/s is achieved which is increased by approximately 43% than that for the conventional ply configuration.
https://doi.org/10.5139/JKSAS.2020.48.8.555 인용 PDF KSCI

맨앞
이전
11
12
13
14
15현재
다음
맨뒤
15 / 17 pages

Search Result 164, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)