• Title/Summary/Keyword: linear system model

Search Result 3,069, Processing Time 0.033 seconds

Associations between Characteristics of Green Spaces, Physical Activity and Health - Focusing on the Case Study of Changwon City - (공원녹지의 특성과 신체활동 및 건강의 상호관련성 - 창원시를 대상으로 -)

  • Baek, Su-Kyeongq;Park, Kyung-Hun
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.42 no.3
    • /
    • pp.1-12
    • /
    • 2014
  • Urban Green space takes charge of the important role for the physical activity and promotion of health to the residents. Therefore, this study is trying to examine the relationship between the various characteristics of green space and green space usage for physical activity and health promotion. A questionnaire survey was conducted to obtain the information about patterns of green space usage and perceived neighborhood environments for the residents living in Changwon-si, Gyeongsangnam-do(n=541). Geographic Information System(GIS) was used to construct spatial data about green space accessibility and physical neighborhood environments. A Multiple Linear Regression model was used to examine the association between the characteristics of green space and physical activity, perceived health status and BMI(Body Mass Index). The study results revealed that the residents' physical activities are positively and directly influenced by the number of available public parks and green spaces in the vicinity(${\leq}200m$). The frequency at which residents witness others exercising nearby or the perceived abundance of low-cost gym facilities also factor as positive influences. The closer to the park, the higher the number of parks and area of green spaces, the more comfortable the walk thereto and the denser the neighboring residential area distribution, the perceived health level was found to be the more positively influenced. Further, it was verified that BMI is correlated with the number of public parks and green spaces within 400 m of the resident's home as well as the safety of walkways, the density of neighboring residential areas, the ratio of road, and the density of crosswalk. The significant multiple regression models between the characteristics of green spaces and physical activities and perceived health level were extracted within the significance level of 10%. This study will contribute to provide better understanding the ways in which green space and neighborhood characteristics are associated with physical activity and health. The result of this research will be available in the landscape architecture plan aimed at improving the use of green space for physical activity and reducing obesity.

Prediction of commitment and persistence in heterosexual involvements according to the styles of loving using a datamining technique (데이터마이닝을 활용한 사랑의 형태에 따른 연인관계 몰입수준 및 관계 지속여부 예측)

  • Park, Yoon-Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.69-85
    • /
    • 2016
  • Successful relationship with loving partners is one of the most important factors in life. In psychology, there have been some previous researches studying the factors influencing romantic relationships. However, most of these researches were performed based on statistical analysis; thus they have limitations in analyzing complex non-linear relationships or rules based reasoning. This research analyzes commitment and persistence in heterosexual involvement according to styles of loving using a datamining technique as well as statistical methods. In this research, we consider six different styles of loving - 'eros', 'ludus', 'stroge', 'pragma', 'mania' and 'agape' which influence romantic relationships between lovers, besides the factors suggested by the previous researches. These six types of love are defined by Lee (1977) as follows: 'eros' is romantic, passionate love; 'ludus' is a game-playing or uncommitted love; 'storge' is a slow developing, friendship-based love; 'pragma' is a pragmatic, practical, mutually beneficial relationship; 'mania' is an obsessive or possessive love and, lastly, 'agape' is a gentle, caring, giving type of love, brotherly love, not concerned with the self. In order to do this research, data from 105 heterosexual couples were collected. Using the data, a linear regression method was first performed to find out the important factors associated with a commitment to partners. The result shows that 'satisfaction', 'eros' and 'agape' are significant factors associated with the commitment level for both male and female. Interestingly, in male cases, 'agape' has a greater effect on commitment than 'eros'. On the other hand, in female cases, 'eros' is a more significant factor than 'agape' to commitment. In addition to that, 'investment' of the male is also crucial factor for male commitment. Next, decision tree analysis was performed to find out the characteristics of high commitment couples and low commitment couples. In order to build decision tree models in this experiment, 'decision tree' operator in the datamining tool, Rapid Miner was used. The experimental result shows that males having a high satisfaction level in relationship show a high commitment level. However, even though a male may not have a high satisfaction level, if he has made a lot of financial or mental investment in relationship, and his partner shows him a certain amount of 'agape', then he also shows a high commitment level to the female. In the case of female, a women having a high 'eros' and 'satisfaction' level shows a high commitment level. Otherwise, even though a female may not have a high satisfaction level, if her partner shows a certain amount of 'mania' then the female also shows a high commitment level. Finally, this research built a prediction model to establish whether the relationship will persist or break up using a decision tree. The result shows that the most important factor influencing to the break up is a 'narcissistic tendency' of the male. In addition to that, 'satisfaction', 'investment' and 'mania' of both male and female also affect a break up. Interestingly, while the 'mania' level of a male works positively to maintain the relationship, that of a female has a negative influence. The contribution of this research is adopting a new technique of analysis using a datamining method for psychology. In addition, the results of this research can provide useful advice to couples for building a harmonious relationship with each other. This research has several limitations. First, the experimental data was sampled based on oversampling technique to balance the size of each classes. Thus, it has a limitation of evaluating performances of the predictive models objectively. Second, the result data, whether the relationship persists of not, was collected relatively in short periods - 6 months after the initial data collection. Lastly, most of the respondents of the survey is in their 20's. In order to get more general results, we would like to extend this research to general populations.

Label Embedding for Improving Classification Accuracy UsingAutoEncoderwithSkip-Connections (다중 레이블 분류의 정확도 향상을 위한 스킵 연결 오토인코더 기반 레이블 임베딩 방법론)

  • Kim, Museong;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.175-197
    • /
    • 2021
  • Recently, with the development of deep learning technology, research on unstructured data analysis is being actively conducted, and it is showing remarkable results in various fields such as classification, summary, and generation. Among various text analysis fields, text classification is the most widely used technology in academia and industry. Text classification includes binary class classification with one label among two classes, multi-class classification with one label among several classes, and multi-label classification with multiple labels among several classes. In particular, multi-label classification requires a different training method from binary class classification and multi-class classification because of the characteristic of having multiple labels. In addition, since the number of labels to be predicted increases as the number of labels and classes increases, there is a limitation in that performance improvement is difficult due to an increase in prediction difficulty. To overcome these limitations, (i) compressing the initially given high-dimensional label space into a low-dimensional latent label space, (ii) after performing training to predict the compressed label, (iii) restoring the predicted label to the high-dimensional original label space, research on label embedding is being actively conducted. Typical label embedding techniques include Principal Label Space Transformation (PLST), Multi-Label Classification via Boolean Matrix Decomposition (MLC-BMaD), and Bayesian Multi-Label Compressed Sensing (BML-CS). However, since these techniques consider only the linear relationship between labels or compress the labels by random transformation, it is difficult to understand the non-linear relationship between labels, so there is a limitation in that it is not possible to create a latent label space sufficiently containing the information of the original label. Recently, there have been increasing attempts to improve performance by applying deep learning technology to label embedding. Label embedding using an autoencoder, a deep learning model that is effective for data compression and restoration, is representative. However, the traditional autoencoder-based label embedding has a limitation in that a large amount of information loss occurs when compressing a high-dimensional label space having a myriad of classes into a low-dimensional latent label space. This can be found in the gradient loss problem that occurs in the backpropagation process of learning. To solve this problem, skip connection was devised, and by adding the input of the layer to the output to prevent gradient loss during backpropagation, efficient learning is possible even when the layer is deep. Skip connection is mainly used for image feature extraction in convolutional neural networks, but studies using skip connection in autoencoder or label embedding process are still lacking. Therefore, in this study, we propose an autoencoder-based label embedding methodology in which skip connections are added to each of the encoder and decoder to form a low-dimensional latent label space that reflects the information of the high-dimensional label space well. In addition, the proposed methodology was applied to actual paper keywords to derive the high-dimensional keyword label space and the low-dimensional latent label space. Using this, we conducted an experiment to predict the compressed keyword vector existing in the latent label space from the paper abstract and to evaluate the multi-label classification by restoring the predicted keyword vector back to the original label space. As a result, the accuracy, precision, recall, and F1 score used as performance indicators showed far superior performance in multi-label classification based on the proposed methodology compared to traditional multi-label classification methods. This can be seen that the low-dimensional latent label space derived through the proposed methodology well reflected the information of the high-dimensional label space, which ultimately led to the improvement of the performance of the multi-label classification itself. In addition, the utility of the proposed methodology was identified by comparing the performance of the proposed methodology according to the domain characteristics and the number of dimensions of the latent label space.

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.

Reliability Analysis on Firewater Supply Facilities based on the Probability Theory with Considering Common Cause Failures (소방수 공급설비에 대한 공통원인고장을 고려한 확률론적 신뢰도 분석)

  • Ko, Jae-Sun;Kim, Hyo
    • Fire Science and Engineering
    • /
    • v.17 no.4
    • /
    • pp.76-85
    • /
    • 2003
  • In this study, we write down the definitions, their causes and the techniques of analysis as a theoretical consideration of common cause failures, and investigate the limitation and the importance of the common cause failures by applying to the analysis on the fire protection as a representative safety facility. As you can know in the reliability analysis, most impressive cause is the malfunctions of pumping operations; especially the common cause failure of two pumps is dominant. In other words, it is possible to assess system-reliability as twice as actual without CCF From these, CCF is extraordinarily important and the results are highly dependent on the CCF factor. And although it would increase with multiple installations, the reliability are not defined as linear with those multiplications. In addition, the differences in results due to the models for analysis are not significant, whereas the various sources of data produce highly different results. Therefore, we conclude that the reliabilities are dependent on the quality of the usable data much better than the variety of models. As a result, the basic and engineering device for the preventions of CCF of the multiple facilities is to design it as reliably as to design the fire-water pump. That is to say, we must assess those reliabilities using PFD whether they are appropriate to SIL (Safety Integrity Level) which is required for the reliability in SIS (Safety Instrumented System). The result of the analysis on the reliability of the fire-water supply with CCF shows that PFD is 3.80E-3, so that it cannot be said to be designed as safely as in the level of SIL5. However, without CCF, PFD is 1.82E-3 which means that they are designed as unsafely as before.

Associations Between Heart Rate Variability and Symptom Severity in Patients With Somatic Symptom Disorder (신체 증상 장애 환자의 심박변이도와 증상 심각도의 연관성)

  • Eunhwan Kim;Hesun Kim;Jinsil Ham;Joonbeom Kim;Jooyoung Oh
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.31 no.2
    • /
    • pp.108-117
    • /
    • 2023
  • Objectives : Somatic symptom disorder (SSD) is characterized by the manifestation of a variety of physical symptoms, but little is known about differences in autonomic nervous system activity according to symptom severity, especially within patient groups. In this study, we examined differences in heart rate variability (HRV) across symptom severity in a group of SSD patients to analyze a representative marker of autonomic nervous system changes by symptoms severity. Methods : Medical records were retrospectively reviewed for patients who were diagnosed with SSD based on DSM-5 from September 18, 2020 to October 29, 2021. We applied inverse probability of treatment weighting (IPTW) methods to generate more homogeneous comparisons in HRV parameters by correcting for selection biases due to sociodemographic and clinical characteristic differences between groups. Results : There were statistically significant correlations between the somatic symptom severity and LF (nu), HF (nu), LF/HF, as well as SD1/SD2 and Alpha1/Alpha2. After IPTW estimation, the mild to moderate group was corrected to 27 (53.0%) and the severe group to 24 (47.0%), and homogeneity was achieved as the differences in demographic and clinical characteristics were not significant. The analysis of inverse probability weighted regression adjustment model showed that the severe group was associated with significantly lower RMSSD (β=-0.70, p=0.003) and pNN20 (β=-1.04, p=0.019) in the time domain and higher LF (nu) (β=0.29, p<0.001), lower HF (nu) (β=-0.29, p<0.001), higher LF/HF (β=1.41, p=0.001), and in the nonlinear domain, significant differences were tested for SampEn15 (β=-0.35, p=0.014), SD1/SD2 (β=-0.68, p<0.001), and Alpha1/Alpha2 (ß=0.43, p=0.001). Conclusions : These results suggest that differences in HRV parameters by SSD severity were showed in the time, frequency and nonlinear domains, specific parameters demonstrating significantly higher sympathetic nerve activity and reduced ability of the parasympathetic nervous system in SSD patients with severe symptoms.

The Effects of the Computer Aided Innovation Capabilities on the R&D Capabilities: Focusing on the SMEs of Korea (Computer Aided Innovation 역량이 연구개발역량에 미치는 효과: 국내 중소기업을 대상으로)

  • Shim, Jae Eok;Byeon, Moo Jang;Moon, Hyo Gon;Oh, Jay In
    • Asia pacific journal of information systems
    • /
    • v.23 no.3
    • /
    • pp.25-53
    • /
    • 2013
  • This study analyzes the effect of Computer Aided Innovation (CAI) to improve R&D Capabilities empirically. Survey was distributed by e-mail and Google Docs, targeting CTO of 235 SMEs. 142 surveys were returned back (rate of return 60.4%) from companies. Survey results from 119 companies (83.8%) which are effective samples except no-response, insincere response, estimated value, etc. were used for statistics analysis. Companies with less than 50billion KRW sales of entire researched companies occupy 76.5% in terms of sample traits. Companies with less than 300 employees occupy 83.2%. In terms of the type of company business Partners (called 'partners with big companies' hereunder) who work with big companies for business occupy 68.1%. SMEs based on their own business (called 'independent small companies') appear to occupy 31.9%. The present status of holding IT system according to traits of company business was classified into partners with big companies versus independent SMEs. The present status of ERP is 18.5% to 34.5%. QMS is 11.8% to 9.2%. And PLM (Product Life-cycle Management) is 6.7% to 2.5%. The holding of 3D CAD is 47.1% to 21%. IT system-holding and its application of independent SMEs seemed very vulnerable, compared with partner companies of big companies. This study is comprised of IT infra and IT Utilization as CAI capacity factors which are independent variables. factors of R&D capabilities which are independent variables are organization capability, process capability, HR capability, technology-accumulating capability, and internal/external collaboration capability. The highest average value of variables was 4.24 in organization capability 2. The lowest average value was 3.01 in IT infra which makes users access to data and information in other areas and use them with ease when required during new product development. It seems that the inferior environment of IT infra of general SMEs is reflected in CAI itself. In order to review the validity used to measure variables, Factors have been analyzed. 7 factors which have over 1.0 pure value of their dependent and independent variables were extracted. These factors appear to explain 71.167% in total of total variances. From the result of factor analysis about measurable variables in this study, reliability of each item was checked by Cronbach's Alpha coefficient. All measurable factors at least over 0.611 seemed to acquire reliability. Next, correlation has been done to explain certain phenomenon by correlation analysis between variables. As R&D capabilities factors which are arranged as dependent variables, organization capability, process capability, HR capability, technology-accumulating capability, and internal/external collaboration capability turned out that they acquire significant correlation at 99% reliability level in all variables of IT infra and IT Utilization which are independent variables. In addition, correlation coefficient between each factor is less than 0.8, which proves that the validity of this study judgement has been acquired. The pair with the highest coefficient had 0.628 for IT utilization and technology-accumulating capability. Regression model which can estimate independent variables was used in this study under the hypothesis that there is linear relation between independent variables and dependent variables so as to identify CAI capability's impact factors on R&D. The total explanations of IT infra among CAI capability for independent variables such as organization capability, process capability, human resources capability, technology-accumulating capability, and collaboration capability are 10.3%, 7%, 11.9%, 30.9%, and 10.5% respectively. IT Utilization exposes comprehensively low explanatory capability with 12.4%, 5.9%, 11.1%, 38.9%, and 13.4% for organization capability, process capability, human resources capability, technology-accumulating capability, and collaboration capability respectively. However, both factors of independent variables expose very high explanatory capability relatively for technology-accumulating capability among independent variable. Regression formula which is comprised of independent variables and dependent variables are all significant (P<0.005). The suitability of regression model seems high. When the results of test for dependent variables and independent variables are estimated, the hypothesis of 10 different factors appeared all significant in regression analysis model coefficient (P<0.01) which is estimated to affect in the hypothesis. As a result of liner regression analysis between two independent variables drawn by influence factor analysis for R&D capability and R&D capability. IT infra and IT Utilization which are CAI capability factors has positive correlation to organization capability, process capability, human resources capability, technology-accumulating capability, and collaboration capability with inside and outside which are dependent variables, R&D capability factors. It was identified as a significant factor which affects R&D capability. However, considering adjustable variables, a big gap is found, compared to entire company. First of all, in case of partner companies with big companies, in IT infra as CAI capability, organization capability, process capability, human resources capability, and technology capability out of R&D capacities seems to have positive correlation. However, collaboration capability appeared insignificance. IT utilization which is a CAI capability factor seemed to have positive relation to organization capability, process capability, human resources capability, and internal/external collaboration capability just as those of entire companies. Next, by analyzing independent types of SMEs as an adjustable variable, very different results were found from those of entire companies or partner companies with big companies. First of all, all factors in IT infra except technology-accumulating capability were rejected. IT utilization was rejected except technology-accumulating capability and collaboration capability. Comprehending the above adjustable variables, the following results were drawn in this study. First, in case of big companies or partner companies with big companies, IT infra and IT utilization affect improving R&D Capabilities positively. It was because most of big companies encourage innovation by using IT utilization and IT infra building over certain level to their partner companies. Second, in all companies, IT infra and IT utilization as CAI capability affect improving technology-accumulating capability positively at least as R&D capability factor. The most of factor explanation is low at around 10%. However, technology-accumulating capability is rather high around 25.6% to 38.4%. It was found that CAI capability contributes to technology-accumulating capability highly. Companies shouldn't consider IT infra and IT utilization as a simple product developing tool in R&D section. However, they have to consider to use them as a management innovating strategy tool which proceeds entire-company management innovation centered in new product development. Not only the improvement of technology-accumulating capability in department of R&D. Centered in new product development, it has to be used as original management innovative strategy which proceeds entire company management innovation. It suggests that it can be a method to improve technology-accumulating capability in R&D section and Dynamic capability to acquire sustainable competitive advantage.

A Study on the Cultivation Processes and Settlement Developments on the Mangyoung River Valley (만경강유역의 개간과정과 취락형성발달에 관한 연구)

  • NamGoong, Bong
    • Journal of the Korean association of regional geographers
    • /
    • v.3 no.2
    • /
    • pp.37-87
    • /
    • 1997
  • As a results of researches on the cultivation processes and settlement developments on the Mangyoung river valley as a whole could be have four 'Space-Time Continuity' through a [Origin-Destination] theory model. On a initial phases of cultivation, the cultivation process has been begun at mountain slopes and tributory plains in upper part of river-basin from Koryo Dynasty to early Chosun Dynasty. At first, indigenous peasants burned forests on the mountain slopes for making 'dryfield' for a cereal crops. Following population increase more stable food supply is necessary facets of life inducing a change production method into a 'wetfield' in tributory plains matching the population increase. First sedentary agriculture maybe initiated at this mountain slopes and tributory plains on upper part of river basin through a burning cultivation methods. Mountain slopes and tributory plains are become a Origin area in cultivation processes. It expanded from up to down through the valleys with 'a bits of land' fashion in a steady pace like a terraced fields expanded with bit by bit of land to downward. They expanded their land to the middle part of river basin in mid period of Chosun Dynasty with dike construction techniques on the river bank. Lower part of river cultivated with embankment building techniques in 1920s and then naturally expanded to the tidal marshes on the estuaries and river inlets of coastal areas. 'Pioneer fringes' are consolidated at there in modern times. Changes in landscapes are appeared it's own characters with each periods of time. Followings are results of study through the Mangyoung river valley as a whole. (1) Mountain slopes and tributory plains on the upper part of river are cultivated 'dryfields' by indigenous peasants with Burning cultivation methods at first and developed sedentary settlements at the edges of mountain slopes and on the river terrace near the fields. They formed a kind of 'periphery-located cluster type' of settlement. This type of settlement are become a prominant type in upper part of river basin. 'Dryfields' has been changed into a 'wetfields' at the narrow tributory plains by increasing population pressure in later time. These wetfields are supplied water by Weir and Ponds Irrigation System(제언수리방법). Streams on the tributory plains has been attracted wetfields besides of it and formed a [water+land] complex on it. 'Wetfields' are expanded from up to downward with a terraced land pattern(adder like pattern, 붕전) according to the gradient of valley. These periphery located settlements are formed a intimate ecological linkage with several sets of surroundings. Inner villages are expanded to Outer villages according to the expansion of arable lands into downward. (2) Mountain slopes and tributory plains expanded its territory to the alluvial deposited plains on the middle part of river valley with a urgent need of new land by population increase. This part of alluvial plains are cultivated mainly in mid period of Chosun Dynasty. Irrigation methods are changed into a Dike Construction Irrigation method(천방수리방법) for the control of floods. It has a trend to change the subjectives of cultivation from community-oriented one who constructed Bochang along tributories making rice paddies to local government authorities who could be gather large sums of capitals, techniques and labours for the big dike construction affairs. Settlements are advanced in the midst of plains avoiding friction of distances and formed a 'Centrallocated cluster type' of settlements. There occured a hierarchical structures of settlements in ranks and sizes according merits of water supply and transportation convenience at the broad plains. Big towns are developed at there. It strengthened a more prominant [water+land] complex along the canals. Ecological linkages between settlements and surroundings are shaded out into a tiny one in this area. (3) It is very necessary to get a modern technology of flood control at the rivers that have a large volume of water and broad width. The alluvial plains are remained in a wilderness phase until a technical level reached a large artificial levee construction ability that could protect the arable land from flood. Until that time on most of alluvial land at the lower part of river are remained a wilderness of overgrown with reeds in lacks of techniques to build a large-scale artificial levee along the riverbank. Cultivation processes are progressed in a large scale one by Japanese agricultural companies with [River Rennovation Project] of central government in 1920s. Large scale artificial levees are constructed along the riverbank. Subjectives of cultivation are changed from Korean peasants to Japanese agricultural companies and Korean peasants fell down as a tenant in a colonial situation of that time in Korea. They could not have any voices in planning of spatial structure and decreased their role in planning. Newly cultivated lands are reflected company's intensions, objectives and perspectives for achieving their goals for the sake of colonial power. Newly cultivated lands are planned into a regular Rectangular Block settings of rice paddies and implanted a large scale Bureaucratic-oriented Irrigation System on the cultivated plains. Every settlements are located in the midst of rice paddies with a Central located Cluster type of settlements. [water+land] complex along the canal system are more strengthened. Cultivated space has a characters of [I-IT] landscapes. (4) Artificial levees are connected into a coastal emnankment for a reclamation of broad tidal marshes on the estuaries and inlets of rivers in the colonial times. Subjectives of reclamation are enlarged into a big agricultural companies that could be acted a role as a big cultivator. After that time on most of reclamation project of tidal marshes are controlled by these agricultural companies formed by mostly Japanese capitalists. Reclaimed lands on the estuaries and river inlets are under hands of agricultural companies and all the spatial structures are formed by their intensions, objectives and perspectives. They constructed a Unit Farming Area for the sake of companies. Spatial structures are planned in a regular one with broad arable land for the rice production of rectangular blocks, regular canal systems and tank reservoir for the irrigation water supply into reclaimed lands. There developed a 'Central-located linear type' of settlements in midst of reclaimed land. These settlements are settled in a detail program upon this newly reclaimed land at once with a master plan and they have planned patterns in their distribution, building materials, location, and form. Ecological linkage between Newly settled settlemrnts and its surroundings are lost its colours and became a more artificial one by human-centred environment. [I-IT] landscapes are become more prominant. This region is a destination area of [Origin-Destination] theory model and formed a 'Pioneer Fringe'. It is a kind of pioneer front that could advance or retreat discontinously by physical conditions and socio-cultural conditions of that region.

  • PDF

Corporate Bond Rating Using Various Multiclass Support Vector Machines (다양한 다분류 SVM을 적용한 기업채권평가)

  • Ahn, Hyun-Chul;Kim, Kyoung-Jae
    • Asia pacific journal of information systems
    • /
    • v.19 no.2
    • /
    • pp.157-178
    • /
    • 2009
  • Corporate credit rating is a very important factor in the market for corporate debt. Information concerning corporate operations is often disseminated to market participants through the changes in credit ratings that are published by professional rating agencies, such as Standard and Poor's (S&P) and Moody's Investor Service. Since these agencies generally require a large fee for the service, and the periodically provided ratings sometimes do not reflect the default risk of the company at the time, it may be advantageous for bond-market participants to be able to classify credit ratings before the agencies actually publish them. As a result, it is very important for companies (especially, financial companies) to develop a proper model of credit rating. From a technical perspective, the credit rating constitutes a typical, multiclass, classification problem because rating agencies generally have ten or more categories of ratings. For example, S&P's ratings range from AAA for the highest-quality bonds to D for the lowest-quality bonds. The professional rating agencies emphasize the importance of analysts' subjective judgments in the determination of credit ratings. However, in practice, a mathematical model that uses the financial variables of companies plays an important role in determining credit ratings, since it is convenient to apply and cost efficient. These financial variables include the ratios that represent a company's leverage status, liquidity status, and profitability status. Several statistical and artificial intelligence (AI) techniques have been applied as tools for predicting credit ratings. Among them, artificial neural networks are most prevalent in the area of finance because of their broad applicability to many business problems and their preeminent ability to adapt. However, artificial neural networks also have many defects, including the difficulty in determining the values of the control parameters and the number of processing elements in the layer as well as the risk of over-fitting. Of late, because of their robustness and high accuracy, support vector machines (SVMs) have become popular as a solution for problems with generating accurate prediction. An SVM's solution may be globally optimal because SVMs seek to minimize structural risk. On the other hand, artificial neural network models may tend to find locally optimal solutions because they seek to minimize empirical risk. In addition, no parameters need to be tuned in SVMs, barring the upper bound for non-separable cases in linear SVMs. Since SVMs were originally devised for binary classification, however they are not intrinsically geared for multiclass classifications as in credit ratings. Thus, researchers have tried to extend the original SVM to multiclass classification. Hitherto, a variety of techniques to extend standard SVMs to multiclass SVMs (MSVMs) has been proposed in the literature Only a few types of MSVM are, however, tested using prior studies that apply MSVMs to credit ratings studies. In this study, we examined six different techniques of MSVMs: (1) One-Against-One, (2) One-Against-AIL (3) DAGSVM, (4) ECOC, (5) Method of Weston and Watkins, and (6) Method of Crammer and Singer. In addition, we examined the prediction accuracy of some modified version of conventional MSVM techniques. To find the most appropriate technique of MSVMs for corporate bond rating, we applied all the techniques of MSVMs to a real-world case of credit rating in Korea. The best application is in corporate bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. For our study the research data were collected from National Information and Credit Evaluation, Inc., a major bond-rating company in Korea. The data set is comprised of the bond-ratings for the year 2002 and various financial variables for 1,295 companies from the manufacturing industry in Korea. We compared the results of these techniques with one another, and with those of traditional methods for credit ratings, such as multiple discriminant analysis (MDA), multinomial logistic regression (MLOGIT), and artificial neural networks (ANNs). As a result, we found that DAGSVM with an ordered list was the best approach for the prediction of bond rating. In addition, we found that the modified version of ECOC approach can yield higher prediction accuracy for the cases showing clear patterns.

The PRISM-based Rainfall Mapping at an Enhanced Grid Cell Resolution in Complex Terrain (복잡지형 고해상도 격자망에서의 PRISM 기반 강수추정법)

  • Chung, U-Ran;Yun, Kyung-Dahm;Cho, Kyung-Sook;Yi, Jae-Hyun;Yun, Jin-I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.11 no.2
    • /
    • pp.72-78
    • /
    • 2009
  • The demand for rainfall data in gridded digital formats has increased in recent years due to the close linkage between hydrological models and decision support systems using the geographic information system. One of the most widely used tools for digital rainfall mapping is the PRISM (parameter-elevation regressions on independent slopes model) which uses point data (rain gauge stations), a digital elevation model (DEM), and other spatial datasets to generate repeatable estimates of monthly and annual precipitation. In the PRISM, rain gauge stations are assigned with weights that account for other climatically important factors besides elevation, and aspects and the topographic exposure are simulated by dividing the terrain into topographic facets. The size of facet or grid cell resolution is determined by the density of rain gauge stations and a $5{\times}5km$ grid cell is considered as the lowest limit under the situation in Korea. The PRISM algorithms using a 270m DEM for South Korea were implemented in a script language environment (Python) and relevant weights for each 270m grid cell were derived from the monthly data from 432 official rain gauge stations. Weighted monthly precipitation data from at least 5 nearby stations for each grid cell were regressed to the elevation and the selected linear regression equations with the 270m DEM were used to generate a digital precipitation map of South Korea at 270m resolution. Among 1.25 million grid cells, precipitation estimates at 166 cells, where the measurements were made by the Korea Water Corporation rain gauge network, were extracted and the monthly estimation errors were evaluated. An average of 10% reduction in the root mean square error (RMSE) was found for any months with more than 100mm monthly precipitation compared to the RMSE associated with the original 5km PRISM estimates. This modified PRISM may be used for rainfall mapping in rainy season (May to September) at much higher spatial resolution than the original PRISM without losing the data accuracy.