Search | Korea Science

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

Park, Jongin;Kim, Namgyu
- Journal of Intelligence and Information Systems
- /
- v.25 no.3
- /
- pp.19-41
- /
- 2019
According to the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected document is tokenized and structured to convert the original document into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of analysis. Until recently, text mining-related studies have been focused on the application of the second steps, such as document classification, clustering, and topic modeling. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have actively been studied to improve the quality of analysis results by preserving the meaning of words and documents in the process of representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, Unstructured text should be preceded by a structuring task that transforms the original document into a form that the computer can understand before analysis. It is called "Embedding" that arbitrary objects are mapped to a specific dimension space while maintaining algebraic properties for structuring the text data. Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents in various aspects. Particularly, with the demand for analysis of document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec which extends word2Vec and embeds each document into one vector is most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document. This causes a limit that the document vector is affected by not only core words but also miscellaneous words. Additionally, the traditional document embedding schemes usually map each document into a single corresponding vector. Therefore, it is difficult to represent a complex document with multiple subjects into a single vector accurately using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. In the case of a document without keywords, this method can be applied after extract keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. all text in a document is tokenized and each token is represented as a vector having N-dimensional real value through word embedding. After that, to overcome the limitations of the traditional document embedding method that is affected by not only the core word but also the miscellaneous words, vectors corresponding to the keywords of each document are extracted and make up sets of keyword vector for each document. Next, clustering is conducted on a set of keywords for each document to identify multiple subjects included in the document. Finally, a Multi-vector is generated from vectors of keywords constituting each cluster. The experiments for 3.147 academic papers revealed that the single vector-based traditional approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.
https://doi.org/10.13088/jiis.2019.25.3.019 인용 PDF KSCI

Call Admission Control Using Adaptive-MMOSPRED for Resource Prediction in Wireless Networks (무선망의 자원예측을 위한 Adaptive-MMOSPRED 기법을 사용한 호 수락제어)

Lee, Jin-Yi
- Journal of Advanced Navigation Technology
- /
- v.12 no.1
- /
- pp.22-27
- /
- 2008
This paper presents adaptive-MMOSPRED method for prediction of resource demands requested by multimedia calls, and shows the performance of the call admission control based on proposed resource prediction method in multimedia wireless networks. The proposed method determines (I-CDP) random variables of the standard normal distribution by using LMS algorithm that minimize errors of prediction in resource demands, while parameters in an existing method are constant all through the prediction time. Our simulation results show that prediction error in adaptive-MMOSPRED method is much smaller than in fixed-MMOSPRED method. Also we can see via simulation the CAC performance based on the proposed method improves the new call blocking performance compared with the existing method under the desired handoff dropping probability.
PDF

Development of Cloud Detection Method with Geostationary Ocean Color Imagery for Land Applications (GOCI 영상의 육상 활용을 위한 구름 탐지 기법 개발)

Lee, Hwa-Seon;Lee, Kyu-Sung
- Korean Journal of Remote Sensing
- /
- v.31 no.5
- /
- pp.371-384
- /
- 2015
Although GOCI has potential for land surface monitoring, there have been only a few cases for land applications. It might be due to the lack of reliable land products derived from GOCI data for end-users. To use for land applications, it is often essential to provide cloud-free composite over land surfaces. In this study, we proposed a cloud detection method that was very important to make cloud-free composite of GOCI reflectance and vegetation index. Since GOCI does not have SWIR and TIR spectral bands, which are very effective to separate clouds from other land cover types, we developed a multi-temporal approach to detect cloud. The proposed cloud detection method consists of three sequential steps of spectral tests. Firstly, band 1 reflectance threshold was applied to separate confident clear pixels. In second step, thick cloud was detected by the ratio (b1/b8) of band 1 and band 8 reflectance. In third step, average of b1/b8 ratio values during three consecutive days was used to detect thin cloud having mixed spectral characteristics of both cloud and land surfaces. The proposed method provides four classes of cloudiness (thick cloud, thin cloud, probably clear, confident clear). The cloud detection method was validated by the MODIS cloud mask products obtained during the same time as the GOCI data acquisition. The percentages of cloudy and cloud-free pixels between GOCI and MODIS are about the same with less than 10% RMSE. The spatial distributions of clouds detected from the GOCI images were also similar to the MODIS cloud mask products.
https://doi.org/10.7780/kjrs.2015.31.5.2 인용 PDF KSCI

Geometric nonlinear analysis of steel structures with external pretension using the multi-noded cable element (다절점 케이블요소를 이용한 외부 긴장된 강구조 시스템의 기하학적 비선형해석)

Lee, Jun Seok;Kim, Moon Young;Han, Man Yop;Kim, Sung Bo;Kim, Nak Kyung
- Journal of Korean Society of Steel Construction
- /
- v.18 no.6
- /
- pp.727-735
- /
- 2006
In this paper, a geometric nonlinear analysis procedure of the beam-column element including multi-noded cable element in extension of companion paper (Kim et al., 2005) is presented. First, a stiffness matrix was derived about the beam-column element that considers the second effect of the initial force supposing the curved shape at each time-step, with Hermitian polynomials as the shape function. Second, the multi-noded cable element was also subjected to the tangent stiffness matrix. To verify the geometric nonlinearity of this newly developed multi-noded cable-truss element, the Innovative Prestressed Support (IPS) system using this theory was analysed by geometric nonlinear method and the results were compared with those produced by linear analysis.
PDF KSCI

Investigation of NO Formation Characteristics in Multi Staged Air Combustor (공기 다단 연소기 화염의 NO 발생특성에 관한 연구)

Kim, Han-Seok;An, Guk-Yeong;Baek, Seung-Uk;Yu, Myeong-Jong
- Transactions of the Korean Society of Mechanical Engineers B
- /
- v.25 no.11
- /
- pp.1594-1605
- /
- 2001
In this study, a numerical simulation was developed which was capable of predicting the characteristics of NO formation in pilot scale combustor adopting the air-staged burner flame. The numerical calculation was constructed by means of establishing the mathematical models fur turbulence, turbulent combustion, radiation and turbulent nitric oxide chemistry. Turbulence was solved with standard k-$\xi$ model and the turbulent combustion model was incorporated using a two step reaction scheme together with an eddy dissipation model. The radiative transfer equation was calculated by means of the discrete ordinates method with the weighted sum of gray gases model for CO$_2$and H$_2$O. In the NO chemistry model, the chemical reaction rates for thermal and prompt NO were statistically averaged using the $\beta$ probability density function. The results were validated by comparison with measurements. For the experiment, a 0.2 MW pilot multi-air staged burner has been designed and fabricated. Only when the radiation was taken into account, the predicted gas temperature was in good agreement with the experimental one, which meant that the inclusion of radiation was indispensable for modeling multi-air staged gas flame. This was also true of the prediction of the NO formation, since it heavily depended on temperature. Subsequently, it was found that the multi-air staged combustion technique might be used as a practical tool in reducing the NO formation by controlling the peak flame temperature.
https://doi.org/10.22634/KSME-B.2001.25.11.1594 인용 PDF

Study on Nosing Method for Large Size Tube Formed Body (대형 튜브성형체의 노징 공법 연구)

Cho, C.Y.;Park, Z.S.;Lee, J.O.;Jeong, D.J.
- Proceedings of the Korean Society for Technology of Plasticity Conference
- /
- 2009.10a
- /
- pp.408-411
- /
- 2009
The plastic working process is a well-known molding method to produce products with good mechanical properties whilst reducing material loss and production time at the same time. Among those methods, the nosing process is commonly used for valves, tubes and ammunition which require high mechanical properties since it provides change in shape without additional mechanical process, minimum material loss during the post-process and superior properties. However, high manufacturing cost and time are required for the large-size tubes due to the multi-step nosing processes. In addition, there are some potential risks due to the buckling and property variation caused by the nosing process, too. Therefore, the shell nosing process is investigated and used in this study in order to resolve the problems described previously. Thus, we could obtain the process with lower cost and improved efficiency by means of the shell nosing process.
PDF

A Sensorless MPPT Control Using an Adaptive Neuro-Fuzzy Logic for PV Battery Chargers (태양광 배터리 충전기를 위한 적응형 신경회로망-퍼지로직 기반의 센서리스 MPPT 제어)

Kim, Jung-Hyun;Kim, Gwang-Seob;Lee, Kyo-Beum
- The Transactions of the Korean Institute of Power Electronics
- /
- v.18 no.4
- /
- pp.349-358
- /
- 2013
In this paper, the sensorless MPPT algorithm is proposed where the performance of varied duty ratio change has been improved using multi-layer neuro-fuzzy that aligns with neuro-fuzzy based optimized membership function. Since the change of duty ratio of sensorless MPPT is varied by using the neuro-fuzzy, the MPPT response speed is faster than the convectional method and is able to reduce the steady-state ripple. The neuro fuzzy controller has the response characteristics which is superior to the existing fuzzy controller, because of the usage of the optimal width of the fuzzy membership function. The effectiveness of the proposed method has been verified by simulations and experimental results.
https://doi.org/10.6113/TKPE.2013.18.4.349 인용 PDF KSCI

Reactive navigation of mobile robots using optmal via-point selection method (최적 경유점 선택 방법을 이용한 이동로봇의 반응적 주행)

김경훈;조형석
- 제어로봇시스템학회:학술대회논문집
- /
- 1997.10a
- /
- pp.227-230
- /
- 1997
In this paper, robot navigation experiments with a new navigation algorithm are carried out in real environments. The authors already proposed a reactive navigation algorithm for mobile robots using optimal via-point selection method. At each sampling time, a number of via-point candidates is constructed with various candidates of heading angles and velocities. The robot detects surrounding obstacles, and the proposed algorithm utilizes fuzzy multi-attribute decision making in selecting the optimal via-point the robot would proceed at next step. Fuzzy decision making allows the robot to choose the most qualified via-point even when the two navigation goals-obstacle avoidance and target point reaching-conflict each other. The experimental result shows the successful navigation can be achieved with the proposed navigation algorithm for real environments.
PDF

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices

Gerber, Christian;Chung, Mokdong
- Journal of Information Processing Systems
- /
- v.12 no.1
- /
- pp.100-108
- /
- 2016
In this paper, we propose a method to achieve improved number plate detection for mobile devices by applying a multiple convolutional neural network (CNN) approach. First, we processed supervised CNN-verified car detection and then we applied the detected car regions to the next supervised CNN-verifier for number plate detection. In the final step, the detected number plate regions were verified through optical character recognition by another CNN-verifier. Since mobile devices are limited in computation power, we are proposing a fast method to recognize number plates. We expect for it to be used in the field of intelligent transportation systems.
https://doi.org/10.3745/JIPS.04.0022 인용 PDF KSCI

Lateral p-n junction Diode with organic single crystal by direct printing

Park, Yoon kyoung;Sung, Myung Mo
- Proceedings of the Korean Vacuum Society Conference
- /
- 2016.02a
- /
- pp.144.1-144.1
- /
- 2016
We fabricate organic single crystal nanowire heterojunction p-n diode poly(3-hexylthiophene)(P3HT) and from Phenyl-C61-butyric acid methyl ester(PCBM) using by liquid-bridge mediated nanotransfer molding(LB-nTM) method. LB-nTM has been reported an one step direct printing method for making well-aligned nanowire arrays. Moreover, multi-patterning nanostructures can be fabricated with the consecutive printing process. As a result, it is possible to make simple and basic concept of heterojunction devices such as lateral organic p-n nanojunction diode. P3HT/PCBM nanowires heterojunction diode has rectifying behavior with on/off ratios of ~20.
PDF

Search Result 644, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)