Search | Korea Science

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

Jeong, Hanjo;Park, Byeonghwa
- Journal of Intelligence and Information Systems
- /
- v.21 no.1
- /
- pp.1-13
- /
- 2015
As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has made. Lots of social media on the Internet generate unstructured or semi-structured data every second and they are often made by natural or human languages we use in daily life. Many words in human languages have multiple meanings or senses. In this result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, resulting in incorrect search results which are far from users' intentions. Even though a lot of progress in enhancing the performance of search engines has made over the last years in order to provide users with appropriate results, there is still so much to improve it. Word sense disambiguation can play a very important role in dealing with natural language processing and is considered as one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-base, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method which automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries and avoids expensive sense tagging processes. It experiments the effectiveness of the method based on Naïve Bayes Model, which is one of supervised learning algorithms, by using Korean standard unabridged dictionary and Sejong Corpus. Korean standard unabridged dictionary has approximately 57,000 sentences. Sejong Corpus has about 790,000 sentences tagged with part-of-speech and senses all together. For the experiment of this study, Korean standard unabridged dictionary and Sejong Corpus were experimented as a combination and separate entities using cross validation. Only nouns, target subjects in word sense disambiguation, were selected. 93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. Sejong Corpus was easily merged with Korean standard unabridged dictionary because Sejong Corpus was tagged based on sense indices defined by Korean standard unabridged dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added in the named entity dictionary of Korean morphological analyzer. By using the extended named entity dictionary, term vectors were extracted from the input sentences and then term vectors for the sentences were created. Given the extracted term vector and the sense vector model made during the pre-processing stage, the sense-tagged terms were determined by the vector space model based word sense disambiguation. In addition, this study shows the effectiveness of merged corpus from examples in Korean standard unabridged dictionary and Sejong Corpus. The experiment shows the better results in precision and recall are found with the merged corpus. This study suggests it can practically enhance the performance of internet search engines and help us to understand more accurate meaning of a sentence in natural language processing pertinent to search engines, opinion mining, and text mining. Naïve Bayes classifier used in this study represents a supervised learning algorithm and uses Bayes theorem. Naïve Bayes classifier has an assumption that all senses are independent. Even though the assumption of Naïve Bayes classifier is not realistic and ignores the correlation between attributes, Naïve Bayes classifier is widely used because of its simplicity and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research need to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence. Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.
https://doi.org/10.13088/jiis.2015.21.1.01 인용 PDF KSCI

A Study on the Intelligent Quick Response System for Fast Fashion(IQRS-FF) (패스트 패션을 위한 지능형 신속대응시스템(IQRS-FF)에 관한 연구)

Park, Hyun-Sung;Park, Kwang-Ho
- Journal of Intelligence and Information Systems
- /
- v.16 no.3
- /
- pp.163-179
- /
- 2010
Recentlythe concept of fast fashion is drawing attention as customer needs are diversified and supply lead time is getting shorter in fashion industry. It is emphasized as one of the critical success factors in the fashion industry how quickly and efficiently to satisfy the customer needs as the competition has intensified. Because the fast fashion is inherently susceptible to trend, it is very important for fashion retailers to make quick decisions regarding items to launch, quantity based on demand prediction, and the time to respond. Also the planning decisions must be executed through the business processes of procurement, production, and logistics in real time. In order to adapt to this trend, the fashion industry urgently needs supports from intelligent quick response(QR) system. However, the traditional functions of QR systems have not been able to completely satisfy such demands of the fast fashion industry. This paper proposes an intelligent quick response system for the fast fashion(IQRS-FF). Presented are models for QR process, QR principles and execution, and QR quantity and timing computation. IQRS-FF models support the decision makers by providing useful information with automated and rule-based algorithms. If the predefined conditions of a rule are satisfied, the actions defined in the rule are automatically taken or informed to the decision makers. In IQRS-FF, QRdecisions are made in two stages: pre-season and in-season. In pre-season, firstly master demand prediction is performed based on the macro level analysis such as local and global economy, fashion trends and competitors. The prediction proceeds to the master production and procurement planning. Checking availability and delivery of materials for production, decision makers must make reservations or request procurements. For the outsourcing materials, they must check the availability and capacity of partners. By the master plans, the performance of the QR during the in-season is greatly enhanced and the decision to select the QR items is made fully considering the availability of materials in warehouse as well as partners' capacity. During in-season, the decision makers must find the right time to QR as the actual sales occur in stores. Then they are to decide items to QRbased not only on the qualitative criteria such as opinions from sales persons but also on the quantitative criteria such as sales volume, the recent sales trend, inventory level, the remaining period, the forecast for the remaining period, and competitors' performance. To calculate QR quantity in IQRS-FF, two calculation methods are designed: QR Index based calculation and attribute similarity based calculation using demographic cluster. In the early period of a new season, the attribute similarity based QR amount calculation is better used because there are not enough historical sales data. By analyzing sales trends of the categories or items that have similar attributes, QR quantity can be computed. On the other hand, in case of having enough information to analyze the sales trends or forecasting, the QR Index based calculation method can be used. Having defined the models for decision making for QR, we design KPIs(Key Performance Indicators) to test the reliability of the models in critical decision makings: the difference of sales volumebetween QR items and non-QR items; the accuracy rate of QR the lead-time spent on QR decision-making. To verify the effectiveness and practicality of the proposed models, a case study has been performed for a representative fashion company which recently developed and launched the IQRS-FF. The case study shows that the average sales rateof QR items increased by 15%, the differences in sales rate between QR items and non-QR items increased by 10%, the QR accuracy was 70%, the lead time for QR dramatically decreased from 120 hours to 8 hours.
PDF KSCI

Object Tracking Based on Exactly Reweighted Online Total-Error-Rate Minimization (정확히 재가중되는 온라인 전체 에러율 최소화 기반의 객체 추적)

JANG, Se-In;PARK, Choong-Shik
- Journal of Intelligence and Information Systems
- /
- v.25 no.4
- /
- pp.53-65
- /
- 2019
Object tracking is one of important steps to achieve video-based surveillance systems. Object tracking is considered as an essential task similar to object detection and recognition. In order to perform object tracking, various machine learning methods (e.g., least-squares, perceptron and support vector machine) can be applied for different designs of tracking systems. In general, generative methods (e.g., principal component analysis) were utilized due to its simplicity and effectiveness. However, the generative methods were only focused on modeling the target object. Due to this limitation, discriminative methods (e.g., binary classification) were adopted to distinguish the target object and the background. Among the machine learning methods for binary classification, total error rate minimization can be used as one of successful machine learning methods for binary classification. The total error rate minimization can achieve a global minimum due to a quadratic approximation to a step function while other methods (e.g., support vector machine) seek local minima using nonlinear functions (e.g., hinge loss function). Due to this quadratic approximation, the total error rate minimization could obtain appropriate properties in solving optimization problems for binary classification. However, this total error rate minimization was based on a batch mode setting. The batch mode setting can be limited to several applications under offline learning. Due to limited computing resources, offline learning could not handle large scale data sets. Compared to offline learning, online learning can update its solution without storing all training samples in learning process. Due to increment of large scale data sets, online learning becomes one of essential properties for various applications. Since object tracking needs to handle data samples in real time, online learning based total error rate minimization methods are necessary to efficiently address object tracking problems. Due to the need of the online learning, an online learning based total error rate minimization method was developed. However, an approximately reweighted technique was developed. Although the approximation technique is utilized, this online version of the total error rate minimization could achieve good performances in biometric applications. However, this method is assumed that the total error rate minimization can be asymptotically achieved when only the number of training samples is infinite. Although there is the assumption to achieve the total error rate minimization, the approximation issue can continuously accumulate learning errors according to increment of training samples. Due to this reason, the approximated online learning solution can then lead a wrong solution. The wrong solution can make significant errors when it is applied to surveillance systems. In this paper, we propose an exactly reweighted technique to recursively update the solution of the total error rate minimization in online learning manner. Compared to the approximately reweighted online total error rate minimization, an exactly reweighted online total error rate minimization is achieved. The proposed exact online learning method based on the total error rate minimization is then applied to object tracking problems. In our object tracking system, particle filtering is adopted. In particle filtering, our observation model is consisted of both generative and discriminative methods to leverage the advantages between generative and discriminative properties. In our experiments, our proposed object tracking system achieves promising performances on 8 public video sequences over competing object tracking systems. The paired t-test is also reported to evaluate its quality of the results. Our proposed online learning method can be extended under the deep learning architecture which can cover the shallow and deep networks. Moreover, online learning methods, that need the exact reweighting process, can use our proposed reweighting technique. In addition to object tracking, the proposed online learning method can be easily applied to object detection and recognition. Therefore, our proposed methods can contribute to online learning community and object tracking, detection and recognition communities.
https://doi.org/10.13088/jiis.2019.25.4.053 인용 PDF KSCI

Study of Web Services Interoperabiliy for Multiple Applications (다중 Application을 위한 Web Services 상호 운용성에 관한 연구)

유윤식;송종철;최일선;임산송;정회경
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2004.05b
- /
- pp.217-220
- /
- 2004
According as utilization for web increases rapidly, it is demanded that model about support interaction between web-based applications systematically and solutions can integrate new distributed platforms and existing environment effectively, accordingly, Web Services appeared by solution in reply. These days, a lot of software and hardware companies try to adoption of Web Services to their market, attenpt to construct their applications associationing components from various Web Services providers. However, to execute Web Services completely. it must have interoperability and need the standardization work that avoid thing which is subject to platform, application as well as service and programming language from other companies. WS-I (Web Services Interoperability organization) have established Basic Profile 1.0 based on XML, UDDI, WSDL and SOAP for web services interoperability and developed usage scenario Profile to apply Web Services in practice. In this paper, to verify suitability Web Services interoperability between heterogeneous two applications, have design and implements the Book Information Web Services that based on the Web Services Client of J2SE platform and the Web Services Server of .NET platform, so that analysis and verify the service by adaptation of WS-I Basic Profile.
PDF

The Performance Bottleneck of Subsequence Matching in Time-Series Databases: Observation, Solution, and Performance Evaluation (시계열 데이타베이스에서 서브시퀀스 매칭의 성능 병목 : 관찰, 해결 방안, 성능 평가)

김상욱
- Journal of KIISE:Databases
- /
- v.30 no.4
- /
- pp.381-396
- /
- 2003
Subsequence matching is an operation that finds subsequences whose changing patterns are similar to a given query sequence from time-series databases. This paper points out the performance bottleneck in subsequence matching, and then proposes an effective method that improves the performance of entire subsequence matching significantly by resolving the performance bottleneck. First, we analyze the disk access and CPU processing times required during the index searching and post processing steps through preliminary experiments. Based on their results, we show that the post processing step is the main performance bottleneck in subsequence matching, and them claim that its optimization is a crucial issue overlooked in previous approaches. In order to resolve the performance bottleneck, we propose a simple but quite effective method that processes the post processing step in the optimal way. By rearranging the order of candidate subsequences to be compared with a query sequence, our method completely eliminates the redundancy of disk accesses and CPU processing occurred in the post processing step. We formally prove that our method is optimal and also does not incur any false dismissal. We show the effectiveness of our method by extensive experiments. The results show that our method achieves significant speed-up in the post processing step 3.91 to 9.42 times when using a data set of real-world stock sequences and 4.97 to 5.61 times when using data sets of a large volume of synthetic sequences. Also, the results show that our method reduces the weight of the post processing step in entire subsequence matching from about 90% to less than 70%. This implies that our method successfully resolves th performance bottleneck in subsequence matching. As a result, our method provides excellent performance in entire subsequence matching. The experimental results reveal that it is 3.05 to 5.60 times faster when using a data set of real-world stock sequences and 3.68 to 4.21 times faster when using data sets of a large volume of synthetic sequences compared with the previous one.
PDF KSCI

Spatial analysis of financial activities in the Korean urban system (한국 금융의 공간적 특색에 관한 연구)

Choi, Jae Heon
- Journal of the Korean Geographical Society
- /
- v.28 no.4
- /
- pp.321-355
- /
- 1993
This paper focuses on the geographical pattern of financial activities in the Korean urban system during 1975-1990, based on the assumption that financial activities can reveal control points in Korea's urban economy. In terms of spatial evolution of financial insitutions, different locational characteristics are revealed among different types of financial institutions, implying the role of urban hierarchy. Financial resources are highly concentrated in the capital region, Seoul and Kyonggi Province. Both centralization trends into the large metropolitan cities and relative declines of medium and small cities within the Korean urban system, have been experienced over the study period. Financial activities sustain relatively stable hierarchical structure in the urban hierarchy. Regarding the financial flows, dominant flow zones centered on major metropolitan cities are identified, clearly showing a prominant role of Seoul in financial flows in the entire urban system.
PDF

The Determination Factor's Variation of Real Estate Price after Financial Crisis in Korea (2008년 금융위기 이후 부동산가격 결정요인 변화 분석)

Kim, Yong-Soon;Kwon, Chi-Hung;Lee, Kyung-Ae;Lee, Hyun-Rim
- Land and Housing Review
- /
- v.2 no.4
- /
- pp.367-377
- /
- 2011
This paper investigates the determination factors' variation of real estate price after sub-prime financial crisis, in korea, using a VAR model. The model includes land price, housing price, housing rent (Jensei) price, which time period is from 2000:1Q to 2011:2Q and uses interest rate, real GDP, consumer price index, KOSPI, the number of housing construction, the amount of land sales and practices to impulse response and variance decomposition analysis. Data cover two sub-periods and divided by 2008:3Q that occurred the sub-prime crisis; one is a period of 2000:1Q to 2008:3Q, the other is based a period of 2000:1Q to 2011:2Q. As a result, Comparing sub-prime crisis before and after, land price come out that the influence of real GDP is expanding, but current interest rate's variation is weaken due to the stagnation of current economic status and housing construction market. Housing price is few influenced to interest rate and real GDP, but it is influenced its own variation or Jensei price's variation. According to the Jensei price's rapidly increasing in nowadays, housing price might be increasing a rising possibility. Jensei price is also weaken the influence of all economic index, housing price, comparing before sub-prime financial crisis and it is influenced its own variation the same housing price. As you know, real estate price is weakened market basic value factors such as, interest rate, real GDP, because it is influenced exogenous economic factors such as population structural changes. Economic participators, economic officials, consumer, construction supplyers need to access an accurate observation about current real estate market and economic status.
https://doi.org/10.5804/LHIJ.2011.2.4.367 인용 PDF

A Integrated Model of Land/Transportation System

이상용
- Proceedings of the KOR-KST Conference
- /
- 1995.12a
- /
- pp.45-73
- /
- 1995
The current paper presents a system dynamics model which can generate the land use anq transportation system performance simultaneously is proposed. The model system consists of 7 submodels (population, migration of population, household, job growth-employment-land availability, housing development, travel demand, and traffic congestion level), and each of them is designed based on the causality functions and feedback loop structure between a large number of physical, socio-economic, and policy variables. The important advantages of the system dynamics model are as follows. First, the model can address the complex interactions between land use and transportation system performance dynamically. Therefore, it can be an effective tool for evaluating the time-by-time effect of a policy over time horizons. Secondly, the system dynamics model is not relied on the assumption of equilibrium state of urban systems as in conventional models since it determines the state of model components directly through dynamic system simulation. Thirdly, the system dynamics model is very flexible in reflecting new features, such as a policy, a new phenomenon which has not existed in the past, a special event, or a useful concept from other methodology, since it consists of a lots of separated equations. In Chapter I, II, and III, overall approach and structure of the model system are discussed with causal-loop diagrams and major equations. In Chapter V _, the performance of the developed model is applied to the analysis of the impact of highway capacity expansion on land use for the area of Montgomery County, MD. The year-by-year impacts of highway capacity expansion on congestion level and land use are analyzed with some possible scenarios for the highway capacity expansion. This is a first comprehensive attempt to use dynamic system simulation modeling in simultaneous treatment of land use and transportation system interactions. The model structure is not very elaborate mainly due to the problem of the availability of behavioral data, but the model performance results indicate that the proposed approach can be a promising one in dealing comprehensively with complicated urban land use/transportation system.
PDF

Experimental Comparison and Analysis of Measurement Results Using Various Flow Meters (유량측정 기기별 측정성과에 대한 실험적 비교분석)

Lee, Jae-Hyug;Lee, Suk-Ho;Jung, Sung-Won;Kim, Tae-Woong
- Journal of Wetlands Research
- /
- v.12 no.1
- /
- pp.95-103
- /
- 2010
Discharge data examine the process of hydrologic cycle and used significantly in water resource planning and irrigation and flood control planning. However, it needs lots of time and money to get the discharge data. So discharge rating curve is usually used in converting discharge data. Therefore reliability of discharge rating curve absolutely depends on quality of discharge data. Many engineers who study hydrologic engineering make high quality discharge data to develop reliable discharge rating curve. And they carry out research on standard and method of discharge measurement, and equipment improvement. Now various flow meters are utilized to make discharge data in Korea. However, accuracy of equipment and experimental research data from measurement are not enough. In this paper, constant discharge flowed through standard concrete channel, and the velocity is measured using various flow meters. Also Discharge is calculated by measured data to compare and analyze. The equipment for the experiment is Price AA(USGS Type AA Current meter), flow meter, ADC, C2 small current meter, flow tracker, Electromagnetic current meter. The discharge got form various flow meters which are widely used for discharge measurement. The various depths of water were examined and compared such as 0.30 m, 0.35 m, 0.40 m, 0.45 m, 0.50 m, 0.55 m. The experiment progresses a round-measurement on 6-case. Wading measurement(one point method : the 60 % height in surface of the water) was applied to improve creditability and accuracy among measurement methods. USGS Type AA current Meter, Flow Meter, ADC, C2 Small Current meter got the certificate of quality guaranteed. So the results of experiment were used to compare discharge. The Results showed the difference based on USGS Type AA current Meter at average discharge and velocity. Electromagnetic current meter made differences over $\pm$ 10 % and Flow Meter made differences under $\pm$ 10 %. Also ADC, Flow Meter, C2 Small Current meter made differences under $\pm$ 5 %.
PDF KSCI

Comparison of Raw versus Relative scores in the Assessment of Coping Patterns in Chronic Arthritis Patients (만성관절염 환자의 대응양상정도와 관련변수 분석 -원점수와 상대점수를 이용한 비교-)

Chun, Chung-Ja;Mun, Mi-Suk
- Journal of muscle and joint health
- /
- v.3 no.1
- /
- pp.90-103
- /
- 1996
The purpose of this paper is to compare two approach to assessment of coping patterns. The sampling method was a purposive sampling technique. The study participants were out patients 113 of rheumatoid arthritis center of one University hospitals in Seoul. Datas were collected from Nov. 13 to Nov. 24, 1995. The instruments used for this study were Graphic Rating Scales of pain, The Beck Depression Inventory and Coping Patterns tool. The collected data were analyzed for frequency, means, SD, factor analysis., Pearsons' correlations and ANOVA. The results were summerized as follows ; 1. When raw scores were used : there were not correlation in all three coping patterns. 2. When relative scores were used : there were significantly correlated in all three coping patterns. 1) Active coping and Positive-cognitive coping (r=-0.352, p< 0.0001) 2) Positive-cognitive coping and Negative-cognitive coping (r=-0.594, p< 0.0001) 3) Active coping and Negative-cognitive coping(r=-0.544, p< 0.0001) The results of this research with relative scales provided more insight into the correlation in all three coping patterns. 3. Pearsons' Correlations were computed for each coping pattern, age, pain level, duration of pain and BDI. 1) Using raw score : (1) Active coping was significantly related to pain level(sensory score ; r=0.268, p<0.05, affective score ; r=0.266, p< 0.05) (2) Positive-cognitive coping was significantly related to age (r=-0.252, p< 0.05), pain level (sensory score ; r= -0.244, p< 0.05) (3) Negative-cognitive coping was significantly related to depression level (r=0.312 p< 0.0001). 2) Using relative score (1) Active coping was significantly related to pain level(sensory score ; r=0.299, p<0.05, affective score ; r=0.246, p< 0.05) (2) Positive-cognitive coping was significantly related to age (r= -0.187, P< 0.05), pain level (sensory score ; r=-0.317, p<0.0001, affective score : r=-0.305, p<0.0001) and depression level(-0.339, p<0.0001)) (3) Negative-cognitive coping was significantly related to depression. level(r=0.313, p<0.0001). 4. When raw and realtive coping scores were compared to those of age groups, religious groups and BDI level(high, middle, low) ; 1) Using raw score : (1) Active coping : there were not significantly difference (2) Positive-cognitive coping ; 20-39 age group and 50-59age group had significantly higher scores than over 60age group. BDI-low level group had significantly higher scores than other groups. (3) Negative-cognitive coping : 20-39age group and over 60age group had significantly higher scores than 40-49age group. Non-religious group had significantly higher scores than christian group. BDI-high level group had significantly higher scores than other groups. 2) Using relative score : (1) Active coping : over 60 age group had significantly higher scores than 20-39 age group and 40-49age group had significantly higher scores than 20-39 age group (2) Positive-cognitive coping ; 40-49age group, 20-39age group and 50-59age group had significantly higher scores than over 60age group. Christian group had significantly higher scores than non-religious group. BDI-low level group had significantly higher scores than other groups. (3) Negative-cognitive coping ; Non-religious group had significantly higher scores than christian group and buddhistic group. BDI-high level group had significantly higher scores than other groups. The current data suggest that relative scores may yield a different perspective on coping patters than raw scores. The use of relative scores reveals the relation clearly, without its being blurred statistically by the effect of other coping strategies or being relegated to a partial correlation. The use of relative scores holds promise for delineating the relations between ways of coping and health-related behavior.
PDF

Search Result 70,427, Processing Time 0.093 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)