• Title/Summary/Keyword: Data-driven based Method

Search Result 297, Processing Time 0.03 seconds

Hi, KIA! Classifying Emotional States from Wake-up Words Using Machine Learning (Hi, KIA! 기계 학습을 이용한 기동어 기반 감성 분류)

  • Kim, Taesu;Kim, Yeongwoo;Kim, Keunhyeong;Kim, Chul Min;Jun, Hyung Seok;Suk, Hyeon-Jeong
    • Science of Emotion and Sensibility
    • /
    • v.24 no.1
    • /
    • pp.91-104
    • /
    • 2021
  • This study explored users' emotional states identified from the wake-up words -"Hi, KIA!"- using a machine learning algorithm considering the user interface of passenger cars' voice. We targeted four emotional states, namely, excited, angry, desperate, and neutral, and created a total of 12 emotional scenarios in the context of car driving. Nine college students participated and recorded sentences as guided in the visualized scenario. The wake-up words were extracted from whole sentences, resulting in two data sets. We used the soundgen package and svmRadial method of caret package in open source-based R code to collect acoustic features of the recorded voices and performed machine learning-based analysis to determine the predictability of the modeled algorithm. We compared the accuracy of wake-up words (60.19%: 22%~81%) with that of whole sentences (41.51%) for all nine participants in relation to the four emotional categories. Accuracy and sensitivity performance of individual differences were noticeable, while the selected features were relatively constant. This study provides empirical evidence regarding the potential application of the wake-up words in the practice of emotion-driven user experience in communication between users and the artificial intelligence system.

Research Trend and Futuristic Guideline of Platform-Based Business in Korea (플랫폼 기반 비즈니스에 대한 국내 연구동향 및 미래를 위한 가이드라인)

  • Namn, Su Hyeon
    • Management & Information Systems Review
    • /
    • v.39 no.1
    • /
    • pp.93-114
    • /
    • 2020
  • Platform is considered as an alternative strategy to the traditional linear pipeline based business. Moreover, in the 4th industrial revolution period, efficiency driven pipeline business model needs to be changed to platform business. We have such success stories about platform as Apple, Google, Amazon, Uber, and so on. However, for those smaller corporations, it is not easy to find out the transformation strategy. The essence of platform business is to leverage network effect in management. Thus platform based management can be rephrased as network management across the business functions. Research on platform business is popular and related to diverse facets. But few scholars cover what the research trend of the domain is. The main purpose of this paper is to identify the research trend on platform business in Korea. To do that we first propose the analytical model for platform architecture whose components are consumers, suppliers, artifacts, and IT platform system. We conjecture that mapping of the research work on platform to the components of the model will make us understand the hidden domain of platform research. We propose three hypotheses regarding the characteristics of research and one proposition for the transitional path from pipeline to platform business model. The mapping is based on the research articles filtered from the Korea Citation Index, using keyword search. Research papers are searched through the keywords provided by authors using the word of "platform". The filtered articles are summarized in terms of the attributes such as major component of platform considered, platform type, main purpose of the research, and research method. Using the filtered data, we test the hypotheses in exploratory ways. The contribution of our research is as follows: First, based on the findings, scholars can find the areas of research on the domain: areas where research has been matured and territory where future research is actively sought. Second, the proposition provided can give business practitioners the guideline for changing their strategy from pipeline to platform oriented. This research needs to be considered as exploratory not inferential since subjective judgments are involved in data collection, classification, and interpretation of research articles.

Target Word Selection Disambiguation using Untagged Text Data in English-Korean Machine Translation (영한 기계 번역에서 미가공 텍스트 데이터를 이용한 대역어 선택 중의성 해소)

  • Kim Yu-Seop;Chang Jeong-Ho
    • The KIPS Transactions:PartB
    • /
    • v.11B no.6
    • /
    • pp.749-758
    • /
    • 2004
  • In this paper, we propose a new method utilizing only raw corpus without additional human effort for disambiguation of target word selection in English-Korean machine translation. We use two data-driven techniques; one is the Latent Semantic Analysis(LSA) and the other the Probabilistic Latent Semantic Analysis(PLSA). These two techniques can represent complex semantic structures in given contexts like text passages. We construct linguistic semantic knowledge by using the two techniques and use the knowledge for target word selection in English-Korean machine translation. For target word selection, we utilize a grammatical relationship stored in a dictionary. We use k- nearest neighbor learning algorithm for the resolution of data sparseness Problem in target word selection and estimate the distance between instances based on these models. In experiments, we use TREC data of AP news for construction of latent semantic space and Wail Street Journal corpus for evaluation of target word selection. Through the Latent Semantic Analysis methods, the accuracy of target word selection has improved over 10% and PLSA has showed better accuracy than LSA method. finally we have showed the relatedness between the accuracy and two important factors ; one is dimensionality of latent space and k value of k-NT learning by using correlation calculation.

A Study on Land Acquisition Priority for Establishing Riparian Buffer Zones in Korea (수변녹지 조성을 위한 토지매수 우선순위 산정 방안 연구)

  • Hong, Jin-Pyo;Lee, Jae-Won;Choi, Ok-Hyun;Son, Ju-Dong;Cho, Dong-Gil;Ahn, Tong-Mahn
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.17 no.4
    • /
    • pp.29-41
    • /
    • 2014
  • The Korean government has purchased land properties alongside any significant water bodies before setting up the buffers to secure water qualities. Since the annual budgets are limited, however, there has always been the issue of which land parcels ought to be given the priority. Therefore, this study aims to develop efficient mechanism for land acquisition priorities in stream corridors that would ultimately be vegetated for riparian buffer zones. The criteria of land acquisition priority were driven through literary review along with experts' advice. The relative weights of their value and priorities for each criterion were computed using the Analytical Hierarchy Process(AHP) method. Major findings of the study are as follows: 1. The decision-making structural model for land acquisition priority focuses mainly on the reduction of non-point source pollutants(NSPs). This fact is highly associated with natural and physical conditions and land use types of surrounding areas. The criteria were classified into two categories-NSPs runoff areas and potential NSPs runoff areas. 2. Land acquisition priority weights derived for NSPs runoff areas and potential NSPs runoff areas were 0.862 and 0.138, respectively. This implicates that much higher priority should be given to the land parcels with NSPs runoff areas. 3. Weights and priorities of sub-criteria suggested from this study include: proximity to the streams(0.460), land cover(0.189), soil permeability(0.117), topographical slope(0.096), proximity to the roads(0.058), land-use types(0.036), visibility to the streams(0.032), and the land price(0.012). This order of importance suggests, as one can expect, that it is better to purchase land parcels that are adjacent to the streams. 4. A standard scoring system including the criteria and weights for land acquisition priority was developed which would likely to allow expedited decision making and easy quantification for priority evaluation due to the utilization of measurable spatial data. Further studies focusing on both point and non-point pollutants and GIS-based spatial analysis and mapping of land acquisition priority are needed.

Mega Flood Simulation Assuming Successive Extreme Rainfall Events (연속적인 극한호우사상의 발생을 가정한 거대홍수모의)

  • Choi, Changhyun;Han, Daegun;Kim, Jungwook;Jung, Jaewon;Kim, Duckhwan;Kim, Hung Soo
    • Journal of Wetlands Research
    • /
    • v.18 no.1
    • /
    • pp.76-83
    • /
    • 2016
  • In recent, the series of extreme storm events were occurred by those continuous typhoons and the severe flood damages due to the loss of life and the destruction of property were involved. In this study, we call Mega flood for the Extreme flood occurred by these successive storm events and so we can have a hypothetical Mega flood by assuming that a extreme event can be successively occurred with a certain time interval. Inter Event Time Definition (IETD) method was used to determine the time interval between continuous events in order to simulate Mega flood. Therefore, the continuous extreme rainfall events are determined with IETD then Mega flood is simulated by the consecutive events : (1) consecutive occurrence of two historical extreme events, (2) consecutive occurrence of two design events obtained by the frequency analysis based on the historical data. We have shown that Mega floods by continuous extreme rainfall events were increased by 6-17% when we compared to typical flood by a single event. We can expect that flood damage caused by Mega flood leads to much greater than damage driven by a single rainfall event. The second increase in the flood caused by heavy rain is not much compared to the first flood caused by heavy rain. But Continuous heavy rain brings the two times of flood damage. Therefore, flood damage caused by the virtual Mega flood of is judged to be very large. Here we used the hypothetical rainfall events which can occur Mega floods and this could be used for preparing for unexpected flood disaster by simulating Mega floods defined in this study.

Submarket Identification in Property Markets: Focusing on a Hedonic Price Model Improvement (부동산 하부시장 구획: 헤도닉 모형의 개선을 중심으로)

  • Lee, Chang Ro;Eum, Young Seob;Park, Key Ho
    • Journal of the Korean Geographical Society
    • /
    • v.49 no.3
    • /
    • pp.405-422
    • /
    • 2014
  • Two important issues in hedonic model are to specify accurate model and delineate submarkets. While the former has experienced much improvement over recent decades, the latter has received relatively little attention. However, the accuracy of estimates from hedonic model will be necessarily reduced when the analysis does not adequately address market segmentation which can capture the spatial scale of price formation process in real estate. Placing emphasis on improvement of performance in hedonic model, this paper tried to segment real estate markets in Gangnam-gu and Jungrang-gu, which correspond to most heterogeneous and homogeneous ones respectively in 25 autonomous districts of Seoul. First, we calculated variable coefficients from mixed geographically weighted regression model (mixed GWR model) as input for clustering, since the coefficient from hedonic model can be interpreted as shadow price of attributes constituting real estate. After that, we developed a spatially constrained data-driven methodology to preserve spatial contiguity by utilizing the SKATER algorithm based on a minimum spanning tree. Finally, the performance of this method was verified by applying a multi-level model. We concluded that submarket does not exist in Jungrang-gu and five submarkets centered on arterial roads would be reasonable in Gangnam-gu. Urban infrastructure such as arterial roads has not been considered an important factor for delineating submarkets until now, but it was found empirically that they play a key role in market segmentation.

  • PDF

Axial Load Capacity Prediction of Single Piles in Clay and Sand Layers Using Nonlinear Load Transfer Curves (비선형 하중전이법에 의한 점토 및 모래층에서 파일의 지지력 예측)

  • Kim, Hyeongjoo;Mission, Joseleo;Song, Youngsun;Ban, Jaehong;Baeg, Pilsoon
    • Journal of the Korean GEO-environmental Society
    • /
    • v.9 no.5
    • /
    • pp.45-52
    • /
    • 2008
  • The present study has extended OpenSees, which is an open-source software framework DOS program for developing applications to idealize geotechnical and structural problems, for the static analysis of axial load capacity and settlement of single piles in MS Windows environment. The Windows version of OpenSees as improved by this study has enhanced the DOS version from a general purpose software program to a special purpose program for driven and bored pile analysis with additional features of pre-processing and post-processing and a user friendly graphical interface. The method used in the load capacity analysis is the numerical methods based on load transfer functions combined with finite elements. The use of empirical nonlinear T-z and Q-z load transfer curves to model soil-pile interaction in skin friction and end bearing, respectively, has been shown to capture the nonlinear soil-pile response under settlement due to load. Validation studies have shown the static load capacity and settlement predictions implemented in this study are in fair agreement with reference data from the static loading tests.

  • PDF

Accuracy Analysis of Velocity and Water Depth Measurement in the Straight Channel using ADCP (ADCP를 이용한 직선 하천의 유속 및 수심 측정 정확도 분석)

  • Kim, Jongmin;Kim, Dongsu;Son, Geunsoo;Kim, Seojun
    • Journal of Korea Water Resources Association
    • /
    • v.48 no.5
    • /
    • pp.367-377
    • /
    • 2015
  • ADCPs have been highlighted so far for measuring steramflow discharge in terms of their high-order of accuracy, relatively low cost and less field operators driven by their easy in-situ operation. While ADCPs become increasingly dominant in hydrometric area, their actual measurement accuracy for velocity and bathymetry measurement has not been sufficiently validated due to the lack of reliable bench-mark data, and subsequently there are still many uncertain aspects for using ADCPs in the field. This research aimed at analyzing inter-comparison results between ADCP measurements with respect to the detailed ADV measurement in a specified field environment. Overall, 184 ADV points were collected for densely designed grids for the given cross-section that has 6 m of width, 1 m of depth, and 0.7 m/s of averaged mean flow velocity. Concurrently, ADCP fixed-points measurements were conducted for each 0.2m and 0.02m of horizontal and vertical spacing respectively. The inter-comparison results indicated that ADCP matched ADV velocity very accurately for 0.4~0.8 of relative depth (y/h), but noticeable deviation occurred between them in near surface and bottom region. For evaluating the capacity of measuring bathymetry of ADCPs, bottom tracking bathymetry based on oblique beams showed better performance than vertical beam approach, and similar results were shown for fixed and moving-boat method as well. Error analysis for velocity and bathymetry measurements of ADCP can be potentially able to be utilized for the more detailed uncertainty analysis of the ADCP discharge measurement.

A New Exploratory Research on Franchisor's Provision of Exclusive Territories (가맹본부의 배타적 영업지역보호에 대한 탐색적 연구)

  • Lim, Young-Kyun;Lee, Su-Dong;Kim, Ju-Young
    • Journal of Distribution Research
    • /
    • v.17 no.1
    • /
    • pp.37-63
    • /
    • 2012
  • In franchise business, exclusive sales territory (sometimes EST in table) protection is a very important issue from an economic, social and political point of view. It affects the growth and survival of both franchisor and franchisee and often raises issues of social and political conflicts. When franchisee is not familiar with related laws and regulations, franchisor has high chance to utilize it. Exclusive sales territory protection by the manufacturer and distributors (wholesalers or retailers) means sales area restriction by which only certain distributors have right to sell products or services. The distributor, who has been granted exclusive sales territories, can protect its own territory, whereas he may be prohibited from entering in other regions. Even though exclusive sales territory is a quite critical problem in franchise business, there is not much rigorous research about the reason, results, evaluation, and future direction based on empirical data. This paper tries to address this problem not only from logical and nomological validity, but from empirical validation. While we purse an empirical analysis, we take into account the difficulties of real data collection and statistical analysis techniques. We use a set of disclosure document data collected by Korea Fair Trade Commission, instead of conventional survey method which is usually criticized for its measurement error. Existing theories about exclusive sales territory can be summarized into two groups as shown in the table below. The first one is about the effectiveness of exclusive sales territory from both franchisor and franchisee point of view. In fact, output of exclusive sales territory can be positive for franchisors but negative for franchisees. Also, it can be positive in terms of sales but negative in terms of profit. Therefore, variables and viewpoints should be set properly. The other one is about the motive or reason why exclusive sales territory is protected. The reasons can be classified into four groups - industry characteristics, franchise systems characteristics, capability to maintain exclusive sales territory, and strategic decision. Within four groups of reasons, there are more specific variables and theories as below. Based on these theories, we develop nine hypotheses which are briefly shown in the last table below with the results. In order to validate the hypothesis, data is collected from government (FTC) homepage which is open source. The sample consists of 1,896 franchisors and it contains about three year operation data, from 2006 to 2008. Within the samples, 627 have exclusive sales territory protection policy and the one with exclusive sales territory policy is not evenly distributed over 19 representative industries. Additional data are also collected from another government agency homepage, like Statistics Korea. Also, we combine data from various secondary sources to create meaningful variables as shown in the table below. All variables are dichotomized by mean or median split if they are not inherently dichotomized by its definition, since each hypothesis is composed by multiple variables and there is no solid statistical technique to incorporate all these conditions to test the hypotheses. This paper uses a simple chi-square test because hypotheses and theories are built upon quite specific conditions such as industry type, economic condition, company history and various strategic purposes. It is almost impossible to find all those samples to satisfy them and it can't be manipulated in experimental settings. However, more advanced statistical techniques are very good on clean data without exogenous variables, but not good with real complex data. The chi-square test is applied in a way that samples are grouped into four with two criteria, whether they use exclusive sales territory protection or not, and whether they satisfy conditions of each hypothesis. So the proportion of sample franchisors which satisfy conditions and protect exclusive sales territory, does significantly exceed the proportion of samples that satisfy condition and do not protect. In fact, chi-square test is equivalent with the Poisson regression which allows more flexible application. As results, only three hypotheses are accepted. When attitude toward the risk is high so loyalty fee is determined according to sales performance, EST protection makes poor results as expected. And when franchisor protects EST in order to recruit franchisee easily, EST protection makes better results. Also, when EST protection is to improve the efficiency of franchise system as a whole, it shows better performances. High efficiency is achieved as EST prohibits the free riding of franchisee who exploits other's marketing efforts, and it encourages proper investments and distributes franchisee into multiple regions evenly. Other hypotheses are not supported in the results of significance testing. Exclusive sales territory should be protected from proper motives and administered for mutual benefits. Legal restrictions driven by the government agency like FTC could be misused and cause mis-understandings. So there need more careful monitoring on real practices and more rigorous studies by both academicians and practitioners.

  • PDF

Geochemistry of Total Gaseous Mercury in Nan-Ji-Do, Seoul, Korea (난지도 지역의 대기수은 지화학)

  • Kim, Min-Young;Lee, Gang-Woong;Shin, Jae-Young;Kim, Ki-Hyun
    • Journal of the Korean earth science society
    • /
    • v.21 no.5
    • /
    • pp.611-622
    • /
    • 2000
  • To investigate the exchange rates of mercury(Hg) across soil-air boundary, we undertook the measurements of Hg flux using gradient technique from a major waste reclamation site, Nan-Ji-Do. Based on these measurement data, we attempted to provide insights into various aspects of Hg exchange in a strongly polluted soil environment. According to our analysis, the study site turned out to be not only a major emission source area but also a major sink area. When these data were compared on hourly basis over a full day scale, large fluxes of emission and deposition centered on daytime periods relative to nighttime periods. However, when comparison of frequency with which emission or deposition occurs was made, there emerged a very contrasting pattern. While emission was dominant during nighttime periods, deposition was most favored during daytime periods. When similar comparison was made as a function of wind direction, it was noticed that there may be a major Hg source at easterly direction to bring out significant deposition of Hg in the study area. To account for the environmental conditions controlling the vertical direction of Hg exchange, we compared environmental conditions for both the whole data group and those observed from the wind direction of strong deposition events. Results of this analysis indicated that the concentrations of pollutant species varied sensitively enough to reflect the environmental conditions for each direction of exchange. When correlation analysis was applied to our data, results indicated that windspeed and ozone concentrations best reflected changes in the magnitudes of emission/deposition fluxes. The results of factor analysis also indicated the possibility that Hg emission of study area is temperature-driven process, while that of deposition is affected by a mixed effects of various factors including temperature, ozone, and non-methane HCs. If the computed emission rate is extrapolated to the whole study area we estimate that annual emission of Hg from the study area can amount to approximately 6kg.

  • PDF