• Title/Summary/Keyword: Representative

Search Result 10,979, Processing Time 0.043 seconds

A study of the inorganic element contents for the ginsengs of Keumsan, Chungnam

  • Song, Suck-Hwan;Sik, Chang-Gyu
    • Proceedings of the Ginseng society Conference
    • /
    • 2008.05a
    • /
    • pp.74-75
    • /
    • 2008
  • This study is for geochemical relationships between ginsengs and soils from three representative soil types from Keumsan, shale, phyllite and granite. For these study, ginsengs, with the field and weathered soils were collected from the three regions, and are analysed for the major and trace elements. In the weathered soils(avg.), the granite and phyllite areas are high in the most of elements while the shale area is low. In the correlation coefficients, negative correlations are shown in the $Al_2O_3$-MgO pair while positive correlations, are shown in the Ba-Sr, Zr, Sr-Zr and Cs-Ge pairs. In the field soils(avg.), the granite and phyllite areas are, generally, high in the most of elements while the shale area is low. In the shale area, the major elements are high in the 4 year soils, but low in the 2 year soils. The LFS(Ba, Sr, Cs) and transitional elements are high in the 2 year soils, but low in the 4 year soils. The HFS(Y, Zr) is high in the 4 year soils. In the correlation coefficients, most of the elements from the 4 year show positive relationships. Positive correlations are shown in the $Al_2O_3$-CaO, MnO-MgO, V-Tl, and Ba-Sr pairs in all localities. In the ginseng contents, clear chemical differences with the ages are shown in the shale and granite ares, but not clear in the phyllite area. In the shale area Mn, Mg, Ba, Sr, and Y contents, increase with ages but decrease in Al, Cs, Be and Cd. In the correlation coefficients, degrees of the correlations for the major elements become low with the ages. Positive correlations are shown in the Al-Mn, Ti, Mn-Ti, Mg-Ca, Ca-K, Ba-Cs, Y and Cs-Y pairs. Comparisons with ginsengs of the same ages from the different areas suggest that generally, the 2 years in the shale and 3 and 4 years in the granite area are distinctive. Relative ratios(granite/ shale area) of the ginsengs are below 1 in the major elements except Mn in the 2 year ginsengs and above 1 in the other elements except Mg and Na in the 4 year. Relative ratios(granite/ phyllite area) of the ginsengs are high in the 3 year from the phyllite area. In the relative ratios(weathered/field soils) of the soils, numbers of the elements showing the ratios of above 1 increase from the shale, to phyllite and granite in the case of the major elements, but decrease in the case of the trace elements. These results suggest that major elements are high in the granite while trace elements are high in the shale area. In the relative ratios between field soils and ginsengs(field soils/ginseng), the shale area, regardless of the ages, show differences of several hundred times in the $Al_2O_3$, $TiO_2$, Y and Tl, of several ten times in the MnO, MgO and Ba and of several times in the CaO contents. These results suggest that ginseng contents are significantly different from the field soils in the $Al_2O_3$, $TiO_2$, Y and Tl, but similar in the CaO contents. The phyllite area, regardless of the ages, show differences of several hundred times in the $Al_2O_3$, $TiO_2$, Y, Tl and Be, of several ten times in the MnO, MgO, $Na_2O$ and Ba, and of several times to ten times in the CaO, $K_2O$ and Sr contents. These results suggest that ginseng contents are significantly different from those of the field soils in the $Al_2O_3$, $TiO_2$, Y, Tl and Be, but similar in the CaO, $K_2O$ and Sr contents. The granite area, regardless of the ages, show differences of several hundred times in the $Al_2O_3$, $TiO_2$, Tl and Be, of several ten times in the Ba, and of several times to ten times in the MgO and CaO contents. Of the other elements, differences of several times to ten times are shown in the MnO, $K_2O$ and Sr contents. These results suggest that ginseng contents are significantly different from those of the field soils in the $Al_2O_3$, $TiO_2$, Tl and Be, but similar in the $K_2O$ and Sr contents. Comparisons among the different ages from the same area suggest that, in the case of shale area, differences of several hundred times in the $Al_2O_3$ and $TiO_2$, of the several ten times in the MnO, MgO and Ba and several times in the CaO and $K_2O$ are shown in the 2 year ginsengs. Differences of several hundred times in the $Al_2O_3$, $TiO_2$, Cs, Y, Tl and Be, of above several ten times in the MnO, MgO, $K_2O$ and Ba, and of several times in the CaO and Sr are shown in the 3 year ginsengs. Differences of several hundred to thousand times in the $Al_2O_3$, of above several hundred times in the $TiO_2$, Cs and Y, and of several ten times in the MnO, MgO, $K_2O$ and Ba, and of several times in the $Na_2O$ are shown in the 4 year ginsengs. These relationships suggest that, regardless of the localities in the shale area, $Al_2O_3$ contents of the soils show big differences from those of the ginsengs. Regardless of the ages of ginsengs, comparisons with the overall average contents of each area show differences of several hundred times in the $Al_2O_3$, $TiO_2$, Cs and Tl and of several ten times in the MnO. These overall relationships suggest that the $Al_2O_3$, $TiO_2$, Cs and Tl contents of the soils are higher than those of the ginsengs, show big differences between two and low different contents are found in the MnO. In detail, differences of several hundred times in the Y, and ten times in the MgO and Sr, and of several times in the CaO, $Na_2O$, $K_2O$ in the case of shale area, are shown. These results suggest that the soils are higher than the ginsengs in the Y and significantly differences in Y, and moderately differences in the MgO and Sr, and low differences in the CaO, $Na_2O$ and $K_2O$ are shown between soils and ginsengs.

  • PDF

Legal Study for the KSLV launching - Products & Third Party Liability - (KSLV발사에 따른 제작 및 제3자피해 책임에 대한 우주법적 소고)

  • Shin, Sung-Hwan
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.21 no.1
    • /
    • pp.169-189
    • /
    • 2006
  • In 2007, KSLV(Korea Small Launching Vehicle) that we made at Goheung National Space Center is going to launch and promotes of our space exploration systematically and 'Space Exploration Promotion Act' was enter into force. 'Space Exploration Promotion Act' article 3, section 1, as is prescribing "Korean government keeps the space treaties contracted with other countries and international organizations and pursues after peaceful uses of outer space." The representative international treaties are Outer Space Treaty (1967) and Liability Convention (1972) etc. In Liability convention article 2, "A launching State shall be absolutely liable to pay compensation for damage caused by its space object on the surface of the earth or to aircraft in flight. The important content of the art. 2 is the responsible entity is the 'State' not the 'Company'. According by Korean Space Exploration Act art. 14, person who launches space objects according to art. 8 and art. 11 must bear the liability for damages owing to space accidents of the space objects. Could Korean government apply the Products Liability Act which is enter into force from July 1, 2002 to space launching person? And what is the contact type between Korea Aerospace Research Institute(KARl) and Russia manufacturer. Is that a Co-Development contract or Licence Product contract? And there is no exemption clause to waive the Russia manufacturer's liability which we could find it from other similar contract condition. If there is no exemption clause to the Russia manufacturer, could we apply the Korean Products Liability Act to Russia one? The most important legal point is whether we could apply the Korean Products Liability Act to the main component company. According by the art. 17 of the contract between KARl and the company, KARl already apply the Products Liability Act to the main component company. For reference, we need to examine the Appalachian Insurance co. v. McDonnell Douglas case, this case is that long distance electricity communication satellite of Western Union Telegraph company possessions fails on track entry. In Western Union's insurance company supplied to Western Union with insurance of $ 105 millions, which has the satellite regard as entirely damage. Five insurance companies -Appalachian insurance company, Commonwealth insurance company, Industrial Indemnity, Mutual Marine Office, Northbrook Excess & Surplus insurance company- went to court against McDonnell Douglases, Morton Thiokol and Hitco company to inquire for fault and strict liability of product. By the Appalachian Insurance co. v. McDonnell Douglas case, KARl should waiver the main component's product liability burden. And we could study the possibility of the adapt 'Government Contractor Defense' theory to the main component company.

  • PDF

Export Control System based on Case Based Reasoning: Design and Evaluation (사례 기반 지능형 수출통제 시스템 : 설계와 평가)

  • Hong, Woneui;Kim, Uihyun;Cho, Sinhee;Kim, Sansung;Yi, Mun Yong;Shin, Donghoon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.3
    • /
    • pp.109-131
    • /
    • 2014
  • As the demand of nuclear power plant equipment is continuously growing worldwide, the importance of handling nuclear strategic materials is also increasing. While the number of cases submitted for the exports of nuclear-power commodity and technology is dramatically increasing, preadjudication (or prescreening to be simple) of strategic materials has been done so far by experts of a long-time experience and extensive field knowledge. However, there is severe shortage of experts in this domain, not to mention that it takes a long time to develop an expert. Because human experts must manually evaluate all the documents submitted for export permission, the current practice of nuclear material export is neither time-efficient nor cost-effective. Toward alleviating the problem of relying on costly human experts only, our research proposes a new system designed to help field experts make their decisions more effectively and efficiently. The proposed system is built upon case-based reasoning, which in essence extracts key features from the existing cases, compares the features with the features of a new case, and derives a solution for the new case by referencing similar cases and their solutions. Our research proposes a framework of case-based reasoning system, designs a case-based reasoning system for the control of nuclear material exports, and evaluates the performance of alternative keyword extraction methods (full automatic, full manual, and semi-automatic). A keyword extraction method is an essential component of the case-based reasoning system as it is used to extract key features of the cases. The full automatic method was conducted using TF-IDF, which is a widely used de facto standard method for representative keyword extraction in text mining. TF (Term Frequency) is based on the frequency count of the term within a document, showing how important the term is within a document while IDF (Inverted Document Frequency) is based on the infrequency of the term within a document set, showing how uniquely the term represents the document. The results show that the semi-automatic approach, which is based on the collaboration of machine and human, is the most effective solution regardless of whether the human is a field expert or a student who majors in nuclear engineering. Moreover, we propose a new approach of computing nuclear document similarity along with a new framework of document analysis. The proposed algorithm of nuclear document similarity considers both document-to-document similarity (${\alpha}$) and document-to-nuclear system similarity (${\beta}$), in order to derive the final score (${\gamma}$) for the decision of whether the presented case is of strategic material or not. The final score (${\gamma}$) represents a document similarity between the past cases and the new case. The score is induced by not only exploiting conventional TF-IDF, but utilizing a nuclear system similarity score, which takes the context of nuclear system domain into account. Finally, the system retrieves top-3 documents stored in the case base that are considered as the most similar cases with regard to the new case, and provides them with the degree of credibility. With this final score and the credibility score, it becomes easier for a user to see which documents in the case base are more worthy of looking up so that the user can make a proper decision with relatively lower cost. The evaluation of the system has been conducted by developing a prototype and testing with field data. The system workflows and outcomes have been verified by the field experts. This research is expected to contribute the growth of knowledge service industry by proposing a new system that can effectively reduce the burden of relying on costly human experts for the export control of nuclear materials and that can be considered as a meaningful example of knowledge service application.

Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)

  • Bae, Jung-Hwan;Son, Ji-Eun;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.141-156
    • /
    • 2013
  • Social media is a representative form of the Web 2.0 that shapes the change of a user's information behavior by allowing users to produce their own contents without any expert skills. In particular, as a new communication medium, it has a profound impact on the social change by enabling users to communicate with the masses and acquaintances their opinions and thoughts. Social media data plays a significant role in an emerging Big Data arena. A variety of research areas such as social network analysis, opinion mining, and so on, therefore, have paid attention to discover meaningful information from vast amounts of data buried in social media. Social media has recently become main foci to the field of Information Retrieval and Text Mining because not only it produces massive unstructured textual data in real-time but also it serves as an influential channel for opinion leading. But most of the previous studies have adopted broad-brush and limited approaches. These approaches have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system to capture the trend in real-time processing big stream datasets of Twitter. The system offers the functions of term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to keep track of changes of topical trend, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets which contain candidates' name and election on Twitter in Korea (http://www.twitter.com/) for one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects the trend of society effectively. The system also retrieves the list of terms co-occurred by given query terms. We compare the results of term co-occurrence retrieval by giving influential candidates' name, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn' as query terms. General terms which are related to presidential election such as 'Presidential Election', 'Proclamation in Support', Public opinion poll' appear frequently. Also the results show specific terms that differentiate each candidate's feature such as 'Park Jung Hee' and 'Yuk Young Su' from the query 'Guen Hae Park', 'a single candidacy agreement' and 'Time of voting extension' from the query 'Jae In Moon' and 'a single candidacy agreement' and 'down contract' from the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic can show one of two types of patterns-Rising tendency and Falling tendencydepending on the change of the probability distribution. To determine the relationship between topic trends in Twitter and social issues in the real world, we compare topic trends with related news articles. We are able to identify that Twitter can track the issue faster than the other media, newspapers. The user network in Twitter is different from those of other social media because of distinctive characteristics of making relationships in Twitter. Twitter users can make their relationships by exchanging mentions. We visualize and analyze mention based networks of 136,754 users. We put three candidates' name as query terms-Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn'. The results show that Twitter users mention all candidates' name regardless of their political tendencies. This case study discloses that Twitter could be an effective tool to detect and predict dynamic changes of social issues, and mention-based user networks could show different aspects of user behavior as a unique network that is uniquely found in Twitter.

Derivation of Digital Music's Ranking Change Through Time Series Clustering (시계열 군집분석을 통한 디지털 음원의 순위 변화 패턴 분류)

  • Yoo, In-Jin;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.171-191
    • /
    • 2020
  • This study focused on digital music, which is the most valuable cultural asset in the modern society and occupies a particularly important position in the flow of the Korean Wave. Digital music was collected based on the "Gaon Chart," a well-established music chart in Korea. Through this, the changes in the ranking of the music that entered the chart for 73 weeks were collected. Afterwards, patterns with similar characteristics were derived through time series cluster analysis. Then, a descriptive analysis was performed on the notable features of each pattern. The research process suggested by this study is as follows. First, in the data collection process, time series data was collected to check the ranking change of digital music. Subsequently, in the data processing stage, the collected data was matched with the rankings over time, and the music title and artist name were processed. Each analysis is then sequentially performed in two stages consisting of exploratory analysis and explanatory analysis. First, the data collection period was limited to the period before 'the music bulk buying phenomenon', a reliability issue related to music ranking in Korea. Specifically, it is 73 weeks starting from December 31, 2017 to January 06, 2018 as the first week, and from May 19, 2019 to May 25, 2019. And the analysis targets were limited to digital music released in Korea. In particular, digital music was collected based on the "Gaon Chart", a well-known music chart in Korea. Unlike private music charts that are being serviced in Korea, Gaon Charts are charts approved by government agencies and have basic reliability. Therefore, it can be considered that it has more public confidence than the ranking information provided by other services. The contents of the collected data are as follows. Data on the period and ranking, the name of the music, the name of the artist, the name of the album, the Gaon index, the production company, and the distribution company were collected for the music that entered the top 100 on the music chart within the collection period. Through data collection, 7,300 music, which were included in the top 100 on the music chart, were identified for a total of 73 weeks. On the other hand, in the case of digital music, since the cases included in the music chart for more than two weeks are frequent, the duplication of music is removed through the pre-processing process. For duplicate music, the number and location of the duplicated music were checked through the duplicate check function, and then deleted to form data for analysis. Through this, a list of 742 unique music for analysis among the 7,300-music data in advance was secured. A total of 742 songs were secured through previous data collection and pre-processing. In addition, a total of 16 patterns were derived through time series cluster analysis on the ranking change. Based on the patterns derived after that, two representative patterns were identified: 'Steady Seller' and 'One-Hit Wonder'. Furthermore, the two patterns were subdivided into five patterns in consideration of the survival period of the music and the music ranking. The important characteristics of each pattern are as follows. First, the artist's superstar effect and bandwagon effect were strong in the one-hit wonder-type pattern. Therefore, when consumers choose a digital music, they are strongly influenced by the superstar effect and the bandwagon effect. Second, through the Steady Seller pattern, we confirmed the music that have been chosen by consumers for a very long time. In addition, we checked the patterns of the most selected music through consumer needs. Contrary to popular belief, the steady seller: mid-term pattern, not the one-hit wonder pattern, received the most choices from consumers. Particularly noteworthy is that the 'Climbing the Chart' phenomenon, which is contrary to the existing pattern, was confirmed through the steady-seller pattern. This study focuses on the change in the ranking of music over time, a field that has been relatively alienated centering on digital music. In addition, a new approach to music research was attempted by subdividing the pattern of ranking change rather than predicting the success and ranking of music.

A Study on the Space Formation and Garden Characteristics of Garden Remains, Gao-Byeoleop for Restoration Design (가오별업(嘉梧別業)의 복원 설계를 위한 공간구성 및 정원 특성에 관한 연구)

  • Rho, Jae-Hyun;Kim, Soon-Ki
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.36 no.3
    • /
    • pp.58-74
    • /
    • 2018
  • This study aims to propose baseline data for designing restoration of Gaobyulup, researching space formation and characteristics of gardens of Gaobyulup, which located in the foot of Cheonmasan Mountain in Namyangju. Gaobyulup is a remain in retirement of Gyulsan Yu-Won Lee, a representative politician, administrator, and tea drinker in late Joseon Dynasty. The results of the research about the shape of Gaobyulup deducted through reference review, poetry and prose analysis, an on-the-spot survey and residents' interview are below: Lee, who used pseudonym as 'Gyulsan,' which menas Jongnamsan Mountain, yearned Mangcheonbyeoreop(輞川別業) by Yu Wang and retirement with a country house operation by Seogye Sedang Park. In the persuit of this ideal, he created and operated a country house in Gaogok of Yangju, which a family burial ground was located. Gaobyulup, which located in Gaogok in the lower part of Cheonmasan Mountain, was largely composed outer and inner gardens, and the area of house operation was started from a stone post of Gaobokji The inner garden of Gaobyulup was including major garden components like buildings, such as Sasihyanggwan, Obaekganjung, Imharyoe and Toesadam, and Chaewon near Haengrangchae, and Gwawon in an backyard. In addition, Younggwijung pavilion, which located 850m away from Gaobyulup, was the another country house inside the Byulup, thus Gaobyulup shows a duplex space formation. In the inner garden of Gaobyulup, there are Sasihyanggwan, which had functions of Sarangchae as library and depository of old paintings and calligraphic works, and Obaekganjung, a small Sarangchae which connected with Sasihyanggwan in the form of a transept. Yusanggoksuger located near Obaekganjung. Additionally, Imharyeo, a library with a tablet of Byeokryowon(??園), which located in the highest point in Byulup, has the functions of a reading room and a tea house. Many Taihu stones were located not only in Toesadam, a square-formed pond with lotus but also many places in the inner gardens. And rare garden plants were planted. These were closely related to the trend of horticulture for pleasure, wealth, and collecting old paintings and calligraphic works for pleasure of Lee. Meanwhile, the area of Younggwijung pavilion, located in Gaocheon stream fall from Byulup to Manhoiam, looks like Wooampok, a enjoying place of other personages, who use their pseudonym as "Oksan" or "Wooam" Lee identifies Wooampok as "Jesampok" and carved 'Gyulsan' s he declared this place is his operating area. Lee built Younggwijung pavilion and planted many peach trees for recreation of utopia. The stone letters of Byukpadongcheon, located in front of a bridge in the foreside of Younggwijung pavilion, seems another enchanted land created in Gaobokji inside. Lee carved Jeilsan in huge rock on the falls rear Manhoiam temple, which Lee did great role of foundation of the temple, so he identifies that this place was the end of the outer garden of Gaobyulup. This study tries to estimate traces of the country house in Gaogok through reference review and on-th-spot survey, and the results from this study are presumed based on site remains only conformed today. It needs to discover second scenary or stone carved letters between Jeilsan and Jesampok. Additionally, exact formation characteristics of Gaobyulup should be identified through excavation survey later. To do so, an interest and a major role of Namyangju-si must be equipped for future restoration of Gaobyulup.

Product Community Analysis Using Opinion Mining and Network Analysis: Movie Performance Prediction Case (오피니언 마이닝과 네트워크 분석을 활용한 상품 커뮤니티 분석: 영화 흥행성과 예측 사례)

  • Jin, Yu;Kim, Jungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.49-65
    • /
    • 2014
  • Word of Mouth (WOM) is a behavior used by consumers to transfer or communicate their product or service experience to other consumers. Due to the popularity of social media such as Facebook, Twitter, blogs, and online communities, electronic WOM (e-WOM) has become important to the success of products or services. As a result, most enterprises pay close attention to e-WOM for their products or services. This is especially important for movies, as these are experiential products. This paper aims to identify the network factors of an online movie community that impact box office revenue using social network analysis. In addition to traditional WOM factors (volume and valence of WOM), network centrality measures of the online community are included as influential factors in box office revenue. Based on previous research results, we develop five hypotheses on the relationships between potential influential factors (WOM volume, WOM valence, degree centrality, betweenness centrality, closeness centrality) and box office revenue. The first hypothesis is that the accumulated volume of WOM in online product communities is positively related to the total revenue of movies. The second hypothesis is that the accumulated valence of WOM in online product communities is positively related to the total revenue of movies. The third hypothesis is that the average of degree centralities of reviewers in online product communities is positively related to the total revenue of movies. The fourth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. The fifth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. To verify our research model, we collect movie review data from the Internet Movie Database (IMDb), which is a representative online movie community, and movie revenue data from the Box-Office-Mojo website. The movies in this analysis include weekly top-10 movies from September 1, 2012, to September 1, 2013, with in total. We collect movie metadata such as screening periods and user ratings; and community data in IMDb including reviewer identification, review content, review times, responder identification, reply content, reply times, and reply relationships. For the same period, the revenue data from Box-Office-Mojo is collected on a weekly basis. Movie community networks are constructed based on reply relationships between reviewers. Using a social network analysis tool, NodeXL, we calculate the averages of three centralities including degree, betweenness, and closeness centrality for each movie. Correlation analysis of focal variables and the dependent variable (final revenue) shows that three centrality measures are highly correlated, prompting us to perform multiple regressions separately with each centrality measure. Consistent with previous research results, our regression analysis results show that the volume and valence of WOM are positively related to the final box office revenue of movies. Moreover, the averages of betweenness centralities from initial community networks impact the final movie revenues. However, both of the averages of degree centralities and closeness centralities do not influence final movie performance. Based on the regression results, three hypotheses, 1, 2, and 4, are accepted, and two hypotheses, 3 and 5, are rejected. This study tries to link the network structure of e-WOM on online product communities with the product's performance. Based on the analysis of a real online movie community, the results show that online community network structures can work as a predictor of movie performance. The results show that the betweenness centralities of the reviewer community are critical for the prediction of movie performance. However, degree centralities and closeness centralities do not influence movie performance. As future research topics, similar analyses are required for other product categories such as electronic goods and online content to generalize the study results.

Ensemble Learning with Support Vector Machines for Bond Rating (회사채 신용등급 예측을 위한 SVM 앙상블학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.29-45
    • /
    • 2012
  • Bond rating is regarded as an important event for measuring financial risk of companies and for determining the investment returns of investors. As a result, it has been a popular research topic for researchers to predict companies' credit ratings by applying statistical and machine learning techniques. The statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have been traditionally used in bond rating. However, one major drawback is that it should be based on strict assumptions. Such strict assumptions include linearity, normality, independence among predictor variables and pre-existing functional forms relating the criterion variablesand the predictor variables. Those strict assumptions of traditional statistics have limited their application to the real world. Machine learning techniques also used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). Especially, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that can maximize the margin between two categories. SVM is simple enough to be analyzed mathematical, and leads to high performance in practical applications. SVM implements the structuralrisk minimization principle and searches to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum and thus, overfitting is unlikely to occur with SVM. In addition, SVM does not require too many data sample for training since it builds prediction models by only using some representative sample near the boundaries called support vectors. A number of experimental researches have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can be potential causes for degrading SVM's performance. First, SVM is originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification such as One-Against-One, One-Against-All have been proposed, but they do not improve the performance in multi-class classification problem as much as SVM for binary-class classification. Second, approximation algorithms (e.g. decomposition methods, sequential minimal optimization algorithm) could be used for effective multi-class computation to reduce computation time, but it could deteriorate classification performance. Third, the difficulty in multi-class prediction problems is in data imbalance problem that can occur when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. SVM ensemble learning is one of machine learning methods to cope with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing weight on the misclassified observations through iterations. The observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus Boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve multiclass prediction problem. Since MGM-Boost introduces the notion of geometric mean into AdaBoost, it can perform learning process considering the geometric mean-based accuracy and errors of multiclass. This study applies MGM-Boost to the real-world bond rating case for Korean companies to examine the feasibility of MGM-Boost. 10-fold cross validations for threetimes with different random seeds are performed in order to ensure that the comparison among three different classifiers does not happen by chance. For each of 10-fold cross validation, the entire data set is first partitioned into tenequal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we have obtained the results for classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows the higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%)in terms of geometric mean-based prediction accuracy. T-test is used to examine whether the performance of each classifiers for 30 folds is significantly different. The results indicate that performance of MGM-Boost is significantly different from AdaBoost and SVM classifiers at 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-classproblems such as bond rating.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

A Study on the Landscape Philosophy of Hageohwon Garden (별업 하거원(何去園) 원림에 투영된 조영사상 연구)

  • Shin, Sang-Sup;Kim, Hyun-Wuk;Kang, Hyun-Min
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.30 no.1
    • /
    • pp.46-56
    • /
    • 2012
  • The research results of tracing the Landscape Philosophy of Hageowon garden(何去園) in Musu-dong, Daejon of Youhwadang, Kwon, Iijin(權以鎭, 1668-1734) is as below. The ideological background of the protagonist reflected in Hageowon is the Hyoje Ideology(filial piety and brotherly love, 孝弟) of Sinjongchuwon(painstakingly caring for one's ancestors), Musil ideology(pursuing ethical diligence and truthful mind, 務實) based on sadistic tradition and ethical rationalism, Confucionist Eunil Ideology(ideology on seclusion, 隱逸) of Cheonghanjiyeon(quiet relaxation, 淸閒之燕), and the Pungryu ideology(appreciation for the arts, 風流) of Taoism in the Taoist style. Thus, by substituting these ideological values into a space called Hageowon, the Byulup gardens(別業) such as the Symbolic garden(象徵園), meaning gaeden(意園), and miniascape garden(縮景園) were able to be constructed. 2) The space organization system of Hageowon is generally classified into three phases considering the hierarchy. The first territory is the transitional space having residential features, which is an area to reach peach tree - road(Taoist world 桃經) from Youhwadang(有懷堂). The second territory is a monumental memorial space where the Yocheondae(繞千臺), Jangwoodam(丈藕潭), Hwagae(花階), and the ancestral graves take place, centering on the yards of Sumanheon(收漫軒), and the third territory is the secluded space in the eastern outer garden where the mountain stream flows from the north to south and which is the vein of the left-hand blue dragon(靑龍) of the guardian mountain of Hageowon. 3) Symbolically, the first phase has symbolized the space as a meaningful scenery by overlapping the Confucionist place of Youhwadang - Gosudae(孤秀臺) - Odeokdae(五德臺), and the mystic world of Jukcheondang(竹遷堂) - peach tree - road(桃徑). The second phase, which is the space of Sumanheon(收漫軒), Yocheondae, and Jangwoodam, the symbolical value of Sinjongchuwon(愼終追遠) and the remembrance and longing for one's parents are reflected. The third phase, which is the eastern outer garden of Hageowon and where the mountain stream flows from the north to south, is composed of the east valley(東溪) - Hwalsudam(活水潭) - Sumi Waterfall(修眉瀑布). More specifically, (1) Mongjeong symbolizes the life of gaining knowledge through studying to realize one's foolishness, (2) Hwalsudam symbolizes a transcending attitude in life refusing to pursue wealth and fame, and (3) Jangwoodam symbolizes the gateway to the fairyland to enter the world of mystic gods. 4) The rationale behind Hageowon is that the two algorithms of Confucionism and Taoist Theory appear repeatedly and in an overlapping way. The Napoji(納汚池) and Hwalsudam, which pertains to the prelude of space development, has symbolized Susimyangseong(修心養成, meditating one's mind and improving one's nature), which is based on ethical rationalism. Moreover, if the Monjeong sphere pertaining to the eastern outer garden of Hageowon takes the Confucionist value system as its theme, including moral training, studying, and researching, Jangwudam, Sumi Waterfalls, and Unwa can be understood as a taste of Cheokbyeon(滌煩, eliminating troubles) for the arts where the mystic world is substituted as a meaningful scenery. 5) The miniascape technique called artificial mountain was substituted to Hageowon to construct a mystic world like the 12 peaks of Mt. Mu(巫山). By borrowing the symbolic meaning expressed in old poems, it has been named 'Habang(1/何放), Hwabong(2, 3/和峯), Chulgun(4, 5, 6/出群), Sinwan(7/神浣), Chwhigyu(8, 9, 10/聚糾), Cheomyo(11/處杳), Giyung(12/氣融).' The representative poet reciting artificial mountain were Wangeui(汪醫), Nosamgang(魯三江), Dubo(杜甫), Hanyou(韓愈), Jeonheaseong(錢希聖), and Beomseokho(范石湖). They related themselves with literature by transcending time and space and attempted to sing about the richness of the mental world by putting the mystic world and culture of appreciating the arts they pursued in the vacation home called Hageowon.