• Title/Summary/Keyword: flows

Search Result 6,596, Processing Time 0.03 seconds

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • v.16 no.3
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

A Study on the Consideration of the Locations of Gyeongju Oksan Gugok and Landscape Interpretation - Focusing on the Arbor of Lee, Jung-Eom's "Oksan Gugok" - (경주 옥산구곡(玉山九曲)의 위치비정과 경관해석 연구 - 이정엄의 「옥산구곡가」를 중심으로 -)

  • Peng, Hong-Xu;Kang, Tai-Ho
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.36 no.3
    • /
    • pp.26-36
    • /
    • 2018
  • This study aims to examine the characteristics of landscape through the analysis of location and the landscape of Gugok while also conducting the empirical study through the literature review, field study, and digital analysis of the Okgung Gugok. Oksan Gugok is a set of songs set in Ogsan Creek(玉山川)or Jagyese Creek(紫溪川, 紫玉山), which flows in front of the Oksan Memorial Hall(李彦迪), which is dedicated to the Lee Eong-jeok (李彦迪). We first ascertained the location and configuration of Oksan Gogok. Second, we confirmed the accurate location of Oksan Gogok by utilizing the digital topographic map of Oksan Gogok which was submitted by Google Earth Pro and Geographic Information Center as well as the length of the longitude of the gravel measured by the Trimble Juno SB GPS. Through the study of the literature and the field investigation, The results of the study are as follows. First, Yi Eonjeok was not a direct composer of Oksan Gugok, nor did he produce "Oksan Gugokha(Music)". Lee Ia-sung(李野淳), the ninth Youngest Son of Tweo-Kye, Hwang Lee, visited the "Oksan Gugokha" in the spring of 1823(Sunjo 23), which was the 270th years after the reign of Yi Eonjeok. At this time, receiving the proposal of Ian Sung, Lee Jung-eom(李鼎儼), Lee Jung-gi(李鼎基), and Lee Jung-byeong(李鼎秉), the descendants of Ian Sung set up a song and created Oksan Gugok Music. And the Essay of Oksan Travel Companions writted by Lee Jung-gi turns out being a crucial data to describe the situation when setting up the Ok-San Gugok. Second, In the majority of cases, Gogok Forest is a forest managed by a Confucian Scholar, not run by ordinary people. The creation of "Oksan Bugok Music" can be regarded as an expression of pride that the descendants of Yi Eonjeok and Lee Hwang, and next generation of several Confucian scholars had inherited traditional Neo-Confucian. Third, Lee Jung-eom's "Oksan Donghaengki" contains a detailed description of the "Oksan Gugokha" process and the process of creating a song. Fourth, We examined the location of one to nine Oksan songs again. In particular, eight songs and nine songs were located at irregular intervals, and eight songs were identified as $36^{\circ}01^{\prime}08.60^{{\prime}{\prime}}N$, $129^{\circ}09^{\prime}31.20^{{\prime}{\prime}}E$. Referring to the ancient kingdom of Taojam, the nine-stringed Sainam was unbiased as a lower rock where the two valleys of the East West congregate. The location was estimated at $36^{\circ}01^{\prime}19.79^{{\prime}{\prime}}N$, $129^{\circ}09^{\prime}30.26^{{\prime}{\prime}}E$. Fifth, The landscape elements and landscapes presented in Lee Jung-eom's "Oksan Gugokha" were divided into form, semantic and climatic elements. As a result, Lee Jung-eom's Cho Young-gwan was able to see the ideal of mountain water and the feeling of being idle in nature as well as the sense of freedom. Sixth, After examining the appearance of the elements and the frequency of the appearance of the landscape, 'water' and 'mountain' were the absolute factors that emphasized the original curved environment at the mouth of Lee Jung-eom. Therefore, there was gugokga can gauge the fresh ideas(神仙思想)and retreat ever(隱居思想). This inherent harmony between the landscape as well as through the mulah any ideas that one with nature and meditation, Confucian tube.

Machine learning-based corporate default risk prediction model verification and policy recommendation: Focusing on improvement through stacking ensemble model (머신러닝 기반 기업부도위험 예측모델 검증 및 정책적 제언: 스태킹 앙상블 모델을 통한 개선을 중심으로)

  • Eom, Haneul;Kim, Jaeseong;Choi, Sangok
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.105-129
    • /
    • 2020
  • This study uses corporate data from 2012 to 2018 when K-IFRS was applied in earnest to predict default risks. The data used in the analysis totaled 10,545 rows, consisting of 160 columns including 38 in the statement of financial position, 26 in the statement of comprehensive income, 11 in the statement of cash flows, and 76 in the index of financial ratios. Unlike most previous prior studies used the default event as the basis for learning about default risk, this study calculated default risk using the market capitalization and stock price volatility of each company based on the Merton model. Through this, it was able to solve the problem of data imbalance due to the scarcity of default events, which had been pointed out as the limitation of the existing methodology, and the problem of reflecting the difference in default risk that exists within ordinary companies. Because learning was conducted only by using corporate information available to unlisted companies, default risks of unlisted companies without stock price information can be appropriately derived. Through this, it can provide stable default risk assessment services to unlisted companies that are difficult to determine proper default risk with traditional credit rating models such as small and medium-sized companies and startups. Although there has been an active study of predicting corporate default risks using machine learning recently, model bias issues exist because most studies are making predictions based on a single model. Stable and reliable valuation methodology is required for the calculation of default risk, given that the entity's default risk information is very widely utilized in the market and the sensitivity to the difference in default risk is high. Also, Strict standards are also required for methods of calculation. The credit rating method stipulated by the Financial Services Commission in the Financial Investment Regulations calls for the preparation of evaluation methods, including verification of the adequacy of evaluation methods, in consideration of past statistical data and experiences on credit ratings and changes in future market conditions. This study allowed the reduction of individual models' bias by utilizing stacking ensemble techniques that synthesize various machine learning models. This allows us to capture complex nonlinear relationships between default risk and various corporate information and maximize the advantages of machine learning-based default risk prediction models that take less time to calculate. To calculate forecasts by sub model to be used as input data for the Stacking Ensemble model, training data were divided into seven pieces, and sub-models were trained in a divided set to produce forecasts. To compare the predictive power of the Stacking Ensemble model, Random Forest, MLP, and CNN models were trained with full training data, then the predictive power of each model was verified on the test set. The analysis showed that the Stacking Ensemble model exceeded the predictive power of the Random Forest model, which had the best performance on a single model. Next, to check for statistically significant differences between the Stacking Ensemble model and the forecasts for each individual model, the Pair between the Stacking Ensemble model and each individual model was constructed. Because the results of the Shapiro-wilk normality test also showed that all Pair did not follow normality, Using the nonparametric method wilcoxon rank sum test, we checked whether the two model forecasts that make up the Pair showed statistically significant differences. The analysis showed that the forecasts of the Staging Ensemble model showed statistically significant differences from those of the MLP model and CNN model. In addition, this study can provide a methodology that allows existing credit rating agencies to apply machine learning-based bankruptcy risk prediction methodologies, given that traditional credit rating models can also be reflected as sub-models to calculate the final default probability. Also, the Stacking Ensemble techniques proposed in this study can help design to meet the requirements of the Financial Investment Business Regulations through the combination of various sub-models. We hope that this research will be used as a resource to increase practical use by overcoming and improving the limitations of existing machine learning-based models.

Location and Construction Characteristics of Imdaejeong Wonlim based on Documentation (기문(記文)을 중심으로 고찰한 임대정원림(臨對亭園林)의 입지 및 조영 특성)

  • Rho, Jae-Hyun;Park, Tae-Hee;Shin, Sang-Sup;Kim, Hyoun-Wuk
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.29 no.4
    • /
    • pp.14-26
    • /
    • 2011
  • Imdaejeong Wonlim is located on the verge of Sangsa Village in Sapyeong-ri, Daepyeong-myeon, Hwasun-gun Gyeongsangnam-do toward Northwest. It was planned by Sa-ae, Minjuhyeon in 1862 on the basis of Gobanwon built by Nam Eongi in 16th century against the backdrop of Mt. Bongjeong and facing Sapyeong Stream. As water flows from west to east in the shape of crane, this area is a propitious site standing for prosperity and happiness. This area shows a distinct feature of Wonlim surrounding the Imdaejeong with multi layers as consisting of 5 districts - front yard where landmark stone with engraved letters of 'Janggujiso of Master Sa-ea' and junipers are harmoniously arranged, internal garden of upper pavilion ranging from a pavilion to square pond with a little island in the middle, Sugyeongwon of under pavilionu consisting of 2 ponds with a painting of three taoist hermits, forest of Mt. Bonggeong and external garden including Sapyeong Stream and farmland. According to documentation and the results of on-site investigation, it is certainly proved that Imdaejeong Wonlim was motivated by Byeoseo Wonlim which realized the idea of 'going back to hometown after resignation' following the motives of Janggujiso, a hideout aimed to accomplish the ideology, 'training mind and fostering innate nature,' on the peaceful site surrounded by water and mountain, as well as motives of Sesimcheo(洗心處) to be unified with morality of Mother Nature, etc. In addition, it implies various imaginary landscapes such as Pihangji, Eupcheongdang, square pond with an island and painting of three Taoist hermits based on a notion that 'the further scent flies away, the fresher it becomes,' which is originated from Aelyeonseol(愛蓮說). In terms of technique of natural landscape treatment, divers techniques are found in Imdaejeong Wonlim such as distant view of Mt. Bongjeong, pulling view with an intention of transparent beauty of moonlight, circle view of natural and cultural sceneries on every side, borrowed scenary of pastoral rural life adopted as an opposite view, looked view of Sulyundaero, over looked view of pond, static view in pavilion and paths, close view of water space such as stream and pond, mushroom-and-umbrella like view of Imdaejeong, vista of pond surrounded by willows, imaginary view of engraved letters meaning 'widen knowledge by studying objectives' and selected view to comprise sunrise and sunset at the same time. In the beginning of construction, various plants seemed to be planted, albeit different from now, such as Ginkgo biloba, Phyllostachys spp., Salix spp., Pinus densiflora, Abies holophylla, Morus bombycis, Juglans mandschurica, Paulownia coreana, Prunus mume, Nelumbo nucifera, etc. Generally, it reflected dignity of Confucianism or beared aspect of semantic landscape implying Taoist taste and idea of Phoenix wishing a prosperity in the future. Furthermore, a diversity of planting methods were pursued for such as liner planting for the periphery of pond, bosquet planting and circle planting adopted around the pavilion, spot planting using green trees, solitary planting of monumentally planted Paulownia coreana and opposite planting presenting the Abies holophylla into yin and yang.

Conjunction of Consciousness and The Unconscious·Individuation and Circumambulation of The Psyche: Focusing on the Hexagram Bi, Pi (比) and Hexagram Gon, Kun (坤) (의식과 무의식의 통합 및 개성화와 정신의 순환: 수지비괘(일양오음괘)와 중지곤괘를 중심으로)

  • Hyeon Gu Lee
    • Sim-seong Yeon-gu
    • /
    • v.38 no.1
    • /
    • pp.1-44
    • /
    • 2023
  • Hexagram Bi (比 ䷇ 8) is one of the hexagrams comprised of one-unbroken line and five-broken lines. The hexagrams of one-unbroken and five-broken lines symbolize the relationship and dynamics between one yang-consciousness and the five-yin unconsciousness. The hexagram of one-unbroken line and five-broken lines has six different images depending on the position of the one unbroken line from the beginning line to the top line. In terms of psychology, this means that the position change of one yang line in relation to five yin lines may symbolize the function of consciousness which clarifies and determines the content of the psyche. In addition, the flow of psychic energy can be examined through the process of one unbroken line's movement. In other words, the psychic contents of the beginning line of hexagram Bok (復 ䷗ 24), which is the beginning of the hexagram of one-unbroken line and five-broken lines, proceed sequentially, and then arrive at the process of the last sixth, hexagram Bak (剝 ䷖ 23) through the fifth, the hexagram Bi (8). That is, it can be said that the content of the hexagram and the line determined according to the position of one unbroken line show a certain psychic flow. As a result, the first hexagram Bok (復 ䷗ 24), after recovering and starting newly, means the beginning of consciousness. After that the process of proceeding with the second, third, and fourth lines represents the flow of consciousness. And in the fifth place, the fifth line of hexagram Bi, it reaches its peak and is placed in the optimal state of consciousness because of its right and centered position at this hexagram Bi. Like nature, the psyche gradually enters the path of decline from the highest state, which leads to the last sixth, the top line of hexagram Bak. However, the top line of the hexagram Bak, where everything falls off, contains the content of starting again in its top line. It is the beginning line of hexagram Bok to inherit this. This means the circumambulation of the psyche that changes from a psychologically difficult state of depression to a stage of recovery. There is a stage that must be passed in this circulation process, and that is the hexagram Gon (坤 ䷁ 2). October(tenth month)'s hexagram Gon is placed between hexagram Bak, the ninth month of the lunar calendar, and hexagram Bok, the eleventh month of the lunar calendar. This represents that the flow of recovery must go through a maternal process of hexagram Gon. The retreat to the psychological uterus is inevitable in regenerating the psyche. This process flows from the hexagram Bak and through hexagram Gon to the hexagram Bok. At this situation the hexagram Gon acts the absolutely necessary role. In addition, the main body of the hexagrams of one-unbroken and five-broken lines, including the Bi hexagram, is also the Gon hexagram composed of six-broken lines. In other words, all six hexagrams of one-unbroken and five-broken lines have a certain relationship with the Gon hexagram, and it would be meaningful to look at the correlation between the unbroken lines of the hexagrams of one-unbroken and five-broken lines and the corresponding broken lines of the hexagram Gon. This can be said to be the dynamics of the maternal unconscious connected to the state of consciousness in six forms. Therefore, each hexagram of one-unbroken and five-broken lines symbolizes the expression of the integration the mother archetype with the consciousness. Revealing this well is the meaning of the hexagram of one-unbroken and five-broken lines. Its hexagram image consists of a combination of Gon (☷), which symbolizes the mother, and the thunder (☳) the eldest son, the water (☵) the middle son and the mountain (☶) the third son. As a result, the hexagram Bok (復 ䷗ 24), Sa (師 ䷆ 7), Gyeom (謙 ䷠ 15), Ye (豫 ䷏ 16), Bi (比 ䷇ 8) and Bak (剝 ䷖ 23) are sequentially created in the order of the unbroken line. This is symbolically the evolutionary process of consciousness. In this way, the hexagrams of one-unbroken and five-broken lines, which mean the conjunction of mother and son, represent the advancing relationship between the maternal unconscious and consciousness. In addition, the relationship with the mother according to the position of the son is related to the dynamics of mother archetype to the attitude of consciousness. The psychological meaning can be deduced from the flow of six lines of hexagrams of one-unbroken and five-broken lines. And the state in which the activation of the consciousness is at its peak is the fifth line of the hexagram Bi, and comparing it with the contents of the corresponding fifth line of hexagram Gon not only can find the state and meaning of the conjunction of consciousness and the maternal unconscious, but the entire flow can be compared to the individuation process.

A Study on the Forest Yield Regulation by Systems Analysis (시스템분석(分析)에 의(依)한 삼림수확조절(森林收穫調節)에 관(關)한 연구(硏究))

  • Cho, Eung-hyouk
    • Korean Journal of Agricultural Science
    • /
    • v.4 no.2
    • /
    • pp.344-390
    • /
    • 1977
  • The purpose of this paper was to schedule optimum cutting strategy which could maximize the total yield under certain restrictions on periodic timber removals and harvest areas from an industrial forest, based on a linear programming technique. Sensitivity of the regulation model to variations in restrictions has also been analyzed to get information on the changes of total yield in the planning period. The regulation procedure has been made on the experimental forest of the Agricultural College of Seoul National University. The forest is composed of 219 cutting units, and characterized by younger age group which is very common in Korea. The planning period is devided into 10 cutting periods of five years each, and cutting is permissible only on the stands of age groups 5-9. It is also assumed in the study that the subsequent forests are established immediately after cutting existing forests, non-stocked forest lands are planted in first cutting period, and established forests are fully stocked until next harvest. All feasible cutting regimes have been defined to each unit depending on their age groups. Total yield (Vi, k) of each regime expected in the planning period has been projected using stand yield tables and forest inventory data, and the regime which gives highest Vi, k has been selected as a optimum cutting regime. After calculating periodic yields and cutting areas, and total yield from the optimum regimes selected without any restrictions, the upper and lower limits of periodic yields(Vj-max, Vj-min) and those of periodic cutting areas (Aj-max, Aj-min) have been decided. The optimum regimes under such restrictions have been selected by linear programming. The results of the study may be summarized as follows:- 1. The fluctuations of periodic harvest yields and areas under cutting regimes selected without restrictions were very great, because of irregular composition of age classes and growing stocks of existing stands. About 68.8 percent of total yield is expected in period 10, while none of yield in periods 6 and 7. 2. After inspection of the above solution, restricted optimum cutting regimes were obtained under the restrictions of Amin=150 ha, Amax=400ha, $Vmin=5,000m^3$ and $Vmax=50,000m^3$, using LP regulation model. As a result, about $50,000m^3$ of stable harvest yield per period and a relatively balanced age group distribution is expected from period 5. In this case, the loss in total yield was about 29 percent of that of unrestricted regimes. 3. Thinning schedule could be easily treated by the model presented in the study, and the thinnings made it possible to select optimum regimes which might be effective for smoothing the wood flows, not to speak of increasing total yield in the planning period. 4. It was known that the stronger the restrictions becomes in the optimum solution the earlier the period comes in which balanced harvest yields and age group distribution can be formed. There was also a tendency in this particular case that the periodic yields were strongly affected by constraints, and the fluctuations of harvest areas depended upon the amount of periodic yields. 5. Because the total yield was decreased at the increasing rate with imposing stronger restrictions, the Joss would be very great where strict sustained yield and normal age group distribution are required in the earlier periods. 6. Total yield under the same restrictions in a period was increased by lowering the felling age and extending the range of cutting age groups. Therefore, it seemed to be advantageous for producing maximum timber yield to adopt wider range of cutting age groups with the lower limit at which the smallest utilization size of timber could be produced. 7. The LP regulation model presented in the study seemed to be useful in the Korean situation from the following point of view: (1) The model can provide forest managers with the solution of where, when, and how much to cut in order to best fulfill the owners objective. (2) Planning is visualized as a continuous process where new strateges are automatically evolved as changes in the forest environment are recognized. (3) The cost (measured as decrease in total yield) of imposing restrictions can be easily evaluated. (4) Thinning schedule can be treated without difficulty. (5) The model can be applied to irregular forests. (6) Traditional regulation methods can be rainforced by the model.

  • PDF