• Title/Summary/Keyword: Classification Papers

Search Result 240, Processing Time 0.03 seconds

Machine Learning Algorithm Accuracy for Code-Switching Analytics in Detecting Mood

  • Latib, Latifah Abd;Subramaniam, Hema;Ramli, Siti Khadijah;Ali, Affezah;Yulia, Astri;Shahdan, Tengku Shahrom Tengku;Zulkefly, Nor Sheereen
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.334-342
    • /
    • 2022
  • Nowadays, as we can notice on social media, most users choose to use more than one language in their online postings. Thus, social media analytics needs reviewing as code-switching analytics instead of traditional analytics. This paper aims to present evidence comparable to the accuracy of code-switching analytics techniques in analysing the mood state of social media users. We conducted a systematic literature review (SLR) to study the social media analytics that examined the effectiveness of code-switching analytics techniques. One primary question and three sub-questions have been raised for this purpose. The study investigates the computational models used to detect and measures emotional well-being. The study primarily focuses on online postings text, including the extended text analysis, analysing and predicting using past experiences, and classifying the mood upon analysis. We used thirty-two (32) papers for our evidence synthesis and identified four main task classifications that can be used potentially in code-switching analytics. The tasks include determining analytics algorithms, classification techniques, mood classes, and analytics flow. Results showed that CNN-BiLSTM was the machine learning algorithm that affected code-switching analytics accuracy the most with 83.21%. In addition, the analytics accuracy when using the code-mixing emotion corpus could enhance by about 20% compared to when performing with one language. Our meta-analyses showed that code-mixing emotion corpus was effective in improving the mood analytics accuracy level. This SLR result has pointed to two apparent gaps in the research field: i) lack of studies that focus on Malay-English code-mixing analytics and ii) lack of studies investigating various mood classes via the code-mixing approach.

Perspectives of Breeding for High Protein Quantity and High Protein Quality of Soybeans (콩 양질ㆍ고단백 품종 육성방향)

  • Chung, Kil-Woong;Hong, Eun-Hi;Kim, Seok-Dong;Hwang, Young-Hyun;Lee, Yeong-Ho;Park, Rae-Kyeong
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.33 no.s01
    • /
    • pp.39-47
    • /
    • 1988
  • Soybean grain is most widely used and soybean crop produces most high protein per area among crops. To meet rapid increase of human population and supply protein in safety. soybean has considered more and more important crop. And it has been emphasizing that high quality and high protein soybean breeding must be made efforts in future. Many papers related to soybean breeding for high quality and protein and soybean protein composition have suggested the problems to do in future. Soybean germplasm collection. classification and conservation should be continuously performed, and it is emphasized that wild type of soybeans (G. soja) be considered to use in breeding for high protein varieties. Selections would be better emphasized in first yield and therefore high yield of protein per area. Selection for high protein would be secondary criterion. High protein lines with high yielding potential could be selection from certain populations, and breeders should consider this phenomenon in procedure of selection. Heritability of protein percent is relatively high and genetic gain of increment of protein percent is large. Soybean protein which is comprised 70 to 90% of globulin is constituted mostly 11S and 7S proteins. Sulfur-containing amino acids, methionine and cysteine, are identified to contain more in 11S protein than 7S protein. High 11S germplasm should be desirable to use in crossing plan, and selection of high ratio of 11S/7S lines be better in development of high quality varieties.

  • PDF

The Analysis on Trend of Articles about Retina and Optic Nerve Disease in Journal of Korean Medicine (국내 한의학 학술지에 게재된 망막과 시신경질환 관련 논문들의 경향성 분석)

  • Na-Yeon Choi;Hyung-Sik Seo;Tae-Gwon Kim;Kang Kwon
    • The Journal of Korean Medicine Ophthalmology and Otolaryngology and Dermatology
    • /
    • v.36 no.2
    • /
    • pp.26-44
    • /
    • 2023
  • Objectives : This study was designed to examine and analyze the recent trend of Korean medicine research on retinal and optic nerve diseases, which is increasing. Methods : This study examines papers related to diseases occurring in the retina and optic nerve that were published in Korean journals of Korean medicine, and analyzes the results of research so far by classifying them by year, journal, disease type and type of paper. Results : 1. Since it was first published in 1995, a total of 17 articles have been published until 2018, with 2 articles(11.1%) each in 1997, 2014 and 2018, and 1 article(5.6%) each in other years. 2. The number of searched journals was 17 paper; 4 review articles, 1 original articles, 12 case reports. 3. Distribution of journals; the percentage of Journal of Korean Medicine Ophthalmology and Otolaryngology and Dermatology ranked the highest(41.2%). 4. Classification of 12 case reports into three categories; 7 retinal diseases, 3 optic nerve diseases, 2 other diseases. Conclusions : Currently, researches on retinal and optic nerve involvement in the Korean medicine journals have been conducted mainly through case reports. In the future, more clinical research and case reports are necessary to give practical application to patients.

Predicting the Compressive Strength of Concrete Using a Maturity Concept (적산온도개념을 이용한 콘크리트 압축강도 예측)

  • Ko, Hune-Bum
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.229-234
    • /
    • 2022
  • The non-destructive method of easily evaluating concrete strength through the concept of maturity has been verified by many researchers. The current work introduced such a concept in concrete strength assessment that involved 843 variables and specific values that 11 papers used in experiments, including constant temperatures (5, 10, 20, 30, 40, 50℃) with a W/B range of 18 to 70% and different curing ages (0.5 to 182 days). The classification of concrete as being of normal-strength concrete (40MPa or less), high-strength concrete (40~70MPa), and Super high-strength concrete (70MPa or more) enabled this study to identify the relationship between maturity and concrete strength using the most convenient and easily applicable maturity model in the construction field. A regression formula of lowest guaranteed concrete strength on the basis of maturity was presented.

Research Trends and Datasets Review using Satellite Image (위성영상 이미지를 활용한 연구 동향 및 데이터셋 리뷰)

  • Kim, Se Hyoung;Chae, Jung Woo;Kang, Ju Young
    • Smart Media Journal
    • /
    • v.11 no.1
    • /
    • pp.17-30
    • /
    • 2022
  • Like other computer vision research trends, research using satellite images was able to achieve rapid growth with the development of GPU-based computer computing capabilities and deep learning methodologies related to image processing. As a result, satellite images are being used in various fields, and the number of studies on how to use satellite images is increasing. Therefore, in this paper, we will introduce the field of research and utilization of satellite images and datasets that can be used for research using satellite images. First, studies using satellite images were collected and classified according to the research method. It was largely classified into a Regression-based Approach and a Classification-based Approach, and the papers used by other methods were summarized. Next, the datasets used in studies using satellite images were summarized. This study proposes information on datasets and methods of use in research. In addition, it introduces how to organize and utilize domestic satellite image datasets that were recently opened by AI hub. In addition, I would like to briefly examine the limitations of satellite image-related research and future trends.

A Literature Review and Classification of Recommender Systems on Academic Journals (추천시스템관련 학술논문 분석 및 분류)

  • Park, Deuk-Hee;Kim, Hyea-Kyeong;Choi, Il-Young;Kim, Jae-Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.139-152
    • /
    • 2011
  • Recommender systems have become an important research field since the emergence of the first paper on collaborative filtering in the mid-1990s. In general, recommender systems are defined as the supporting systems which help users to find information, products, or services (such as books, movies, music, digital products, web sites, and TV programs) by aggregating and analyzing suggestions from other users, which mean reviews from various authorities, and user attributes. However, as academic researches on recommender systems have increased significantly over the last ten years, more researches are required to be applicable in the real world situation. Because research field on recommender systems is still wide and less mature than other research fields. Accordingly, the existing articles on recommender systems need to be reviewed toward the next generation of recommender systems. However, it would be not easy to confine the recommender system researches to specific disciplines, considering the nature of the recommender system researches. So, we reviewed all articles on recommender systems from 37 journals which were published from 2001 to 2010. The 37 journals are selected from top 125 journals of the MIS Journal Rankings. Also, the literature search was based on the descriptors "Recommender system", "Recommendation system", "Personalization system", "Collaborative filtering" and "Contents filtering". The full text of each article was reviewed to eliminate the article that was not actually related to recommender systems. Many of articles were excluded because the articles such as Conference papers, master's and doctoral dissertations, textbook, unpublished working papers, non-English publication papers and news were unfit for our research. We classified articles by year of publication, journals, recommendation fields, and data mining techniques. The recommendation fields and data mining techniques of 187 articles are reviewed and classified into eight recommendation fields (book, document, image, movie, music, shopping, TV program, and others) and eight data mining techniques (association rule, clustering, decision tree, k-nearest neighbor, link analysis, neural network, regression, and other heuristic methods). The results represented in this paper have several significant implications. First, based on previous publication rates, the interest in the recommender system related research will grow significantly in the future. Second, 49 articles are related to movie recommendation whereas image and TV program recommendation are identified in only 6 articles. This result has been caused by the easy use of MovieLens data set. So, it is necessary to prepare data set of other fields. Third, recently social network analysis has been used in the various applications. However studies on recommender systems using social network analysis are deficient. Henceforth, we expect that new recommendation approaches using social network analysis will be developed in the recommender systems. So, it will be an interesting and further research area to evaluate the recommendation system researches using social method analysis. This result provides trend of recommender system researches by examining the published literature, and provides practitioners and researchers with insight and future direction on recommender systems. We hope that this research helps anyone who is interested in recommender systems research to gain insight for future research.

Fifty years of economic geography in Korea:research trends and issues (한국경제지리학 반세기:연구성과와 과제)

  • ;Park, Sam Ock
    • Journal of the Korean Geographical Society
    • /
    • v.31 no.2
    • /
    • pp.160-197
    • /
    • 1996
  • The purpose of this study is to review research trends and issues of economic geography in Korea for the last fifty years by sub-fields of agricultural geography, industrial geography, commercial and service geography, and transportation geography. Research in Korean economic geography has progressed significantly in terms of the scope and the number of papers published during the last a half a century. Agricultural geography was a leading field of economic geography in Korea before mid-1970s. Since the mid-1970s, however, agricultural geography has turned over the leading role in economic geography to industrial geography. Classification and structure of agricultural region has been the most popular research theme in Korea, even though diverse topics has been dealt in the research of agricultulal geography in Korea during the last fifty years. In recent years, emphasis is given to study on the dynamics of agricultural region and regional differentiation of part-time farming. It is suggested that the future issues of research in agricultural geography in Korea are agricultural restructuring and changes in agricultural space under the WTO system, changes in rural area and agricultural region with the progress of informatization, changes in agricultural structures and rural society by the increase of part-time farming, governments agricultulal policy and its impacts, competitive advantages of Korean agricultulal products, and environmental impacts of agricultural restructuring. Research in industrial geography has remarkably progressed since the 1980s. Locational changes, regional industrial structure and formation of industrial region were the major topics of interest in the research of industrial geography in Korea before 1980. Since the early 1980s, in addition to the topics which were interested in before 1980, changes of industrial organization and industrial location, changes of production systems and industrial space development of high technology industries and science parks, industrial restructuring and regional economy, foreign direct investments, industrial linkages and industrial districts, and industrial policy and regional development have been the major research themes of industrial geography in Korea. Considerable number of papers has been published both in Korean journals and in foreign journals during this period. Considering global changes in the organization of industrial space, future research should be more focused on firms strategy for regaining competitive advantages, local and global perspectives of industry, industry and environmental changes, in addition to the topics which have been dealt in recent years. Research in commercial and service geography and transportation geography was negligible in Korea before the late 1970s. These two sub-fields in economic geography have begun to develop since 1980s. Periodic markets, structure of commercial area, and distribution of products were the major topics of interest in the 1980s in the commercial and service geography in Korea. In the 1990s, however reserch in producer services has been active with growth of producer services in Korean economy. It is suggested that regional changes with progress of informatization and technology, changes of international trade and regional changes, development of efficient distribution system, role of producer services in regional development, and network of producer services are the major issues to be studied in the future in the field of commercial and service geography in Korea. Commuting, distribution of products, and transportation networks have been the major topics of research in transportation geography in Korea. Diverse quantitative techniques have been applied in the most of the researches in transportation geography. It is required that future studies in transportation geography should also focus on societal and behavioral issues, policy issues regional impacts of new transportation facilities, an analysis of transportation system at the global or international level. Since the 1980s economic geography in Korea has considerably progressed with publication of papers and books. The progress can be regarded as successful in quantitative aspect, but not in quantitative aspects. For the development of Korean economic geography in both quantitative and qualitative aspects, it is necessary to promote international collaborative researches and interdisciplinary cooperations. Attention should also be given to the research on changes in competitive advantages and economic restructuring, changes of economic space with the development of high technology and the progress of informatization. economic development and culture. and foreign regional studies.

  • PDF

Predicting stock movements based on financial news with systematic group identification (시스템적인 군집 확인과 뉴스를 이용한 주가 예측)

  • Seong, NohYoon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.1-17
    • /
    • 2019
  • Because stock price forecasting is an important issue both academically and practically, research in stock price prediction has been actively conducted. The stock price forecasting research is classified into using structured data and using unstructured data. With structured data such as historical stock price and financial statements, past studies usually used technical analysis approach and fundamental analysis. In the big data era, the amount of information has rapidly increased, and the artificial intelligence methodology that can find meaning by quantifying string information, which is an unstructured data that takes up a large amount of information, has developed rapidly. With these developments, many attempts with unstructured data are being made to predict stock prices through online news by applying text mining to stock price forecasts. The stock price prediction methodology adopted in many papers is to forecast stock prices with the news of the target companies to be forecasted. However, according to previous research, not only news of a target company affects its stock price, but news of companies that are related to the company can also affect the stock price. However, finding a highly relevant company is not easy because of the market-wide impact and random signs. Thus, existing studies have found highly relevant companies based primarily on pre-determined international industry classification standards. However, according to recent research, global industry classification standard has different homogeneity within the sectors, and it leads to a limitation that forecasting stock prices by taking them all together without considering only relevant companies can adversely affect predictive performance. To overcome the limitation, we first used random matrix theory with text mining for stock prediction. Wherever the dimension of data is large, the classical limit theorems are no longer suitable, because the statistical efficiency will be reduced. Therefore, a simple correlation analysis in the financial market does not mean the true correlation. To solve the issue, we adopt random matrix theory, which is mainly used in econophysics, to remove market-wide effects and random signals and find a true correlation between companies. With the true correlation, we perform cluster analysis to find relevant companies. Also, based on the clustering analysis, we used multiple kernel learning algorithm, which is an ensemble of support vector machine to incorporate the effects of the target firm and its relevant firms simultaneously. Each kernel was assigned to predict stock prices with features of financial news of the target firm and its relevant firms. The results of this study are as follows. The results of this paper are as follows. (1) Following the existing research flow, we confirmed that it is an effective way to forecast stock prices using news from relevant companies. (2) When looking for a relevant company, looking for it in the wrong way can lower AI prediction performance. (3) The proposed approach with random matrix theory shows better performance than previous studies if cluster analysis is performed based on the true correlation by removing market-wide effects and random signals. The contribution of this study is as follows. First, this study shows that random matrix theory, which is used mainly in economic physics, can be combined with artificial intelligence to produce good methodologies. This suggests that it is important not only to develop AI algorithms but also to adopt physics theory. This extends the existing research that presented the methodology by integrating artificial intelligence with complex system theory through transfer entropy. Second, this study stressed that finding the right companies in the stock market is an important issue. This suggests that it is not only important to study artificial intelligence algorithms, but how to theoretically adjust the input values. Third, we confirmed that firms classified as Global Industrial Classification Standard (GICS) might have low relevance and suggested it is necessary to theoretically define the relevance rather than simply finding it in the GICS.

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful for them. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide the users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role for document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask the authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could not benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle; manually assigning keywords to all documents is a daunting task, or even impractical in that it is extremely tedious and time-consuming requiring a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are mainly two approaches to achieving this aim: keyword assignment approach and keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given set of vocabulary, and the aim is to match them to the texts. In other words, the keywords assignment approach seeks to select the words from a controlled vocabulary that best describes a document. Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. On the other hand, in the latter approach, the aim is to extract keywords with respect to their relevance in the text without prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted based on supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems such as Extractor and Kea were developed using keyword extraction approach. Most indicative words in a document are selected as keywords for that document and as a result, keywords extraction is limited to terms that appear in the document. Therefore, keywords extraction cannot generate implicit keywords that are not included in a document. According to the experiment results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Inversely, it also means that 10% to 36% of the keywords assigned by the authors do not appear in the article, which cannot be generated through keyword extraction algorithms. Our preliminary experiment result also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment namely IVSM(Inverse Vector Space Model). The model is based on a vector space model. which is a conventional information retrieval model that represents documents and queries by vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: IVSM system for Web-based community service and stand-alone IVSM system. Firstly, the IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and, indeed, it has been tested through a number of academic papers including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to Web-based community service and academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows comparable performance to Extractor that is a representative system of keyword extraction approach developed by Turney. As electronic documents increase, we expect that IVSM proposed in this paper can be applied to many electronic documents in Web-based community and digital library.

Intents of Acquisitions in Information Technology Industrie (정보기술 산업에서의 인수 유형별 인수 의도 분석)

  • Cho, Wooje;Chang, Young Bong;Kwon, Youngok
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.123-138
    • /
    • 2016
  • This study investigates intents of acquisitions in information technology industries. Mergers and acquisitions are a strategic decision at corporate-level and have been an important tool for a firm to grow. Plenty of firms in information technology industries have acquired startups to increase production efficiency, expand customer base, or improve quality over the last decades. For example, Google has made about 200 acquisitions since 2001, Cisco has acquired about 210 firms since 1993, Oracle has made about 125 acquisitions since 1994, and Microsoft has acquired about 200 firms since 1987. Although there have been many existing papers that theoretically study intents or motivations of acquisitions, there are limited papers that empirically investigate them mainly because it is challenging to measure and quantify intents of M&As. This study examines the intent of acquisitions by measuring specific intents for M&A transactions. Using our measures of acquisition intents, we compare the intents by four acquisition types: (1) the acquisition where a hardware firm acquires a hardware firm, (2) the acquisition where a hardware firm acquires a software/IT service firm, (3) the acquisition where a software/IT service firm acquires a hardware firm, and (4) the acquisition where a software /IT service firm acquires a software/IT service firm. We presume that there are difference in reasons why a hardware firm acquires another hardware firm, why a hardware firm acquires a software firm, why a software/IT service firm acquires a hardware firm, and why a software/IT service firm acquires another software/IT service firm. Using data of the M&As in US IT industries, we identified major intents of the M&As. The acquisition intents are identified based on the press release of M&A announcements and measured with four categories. First, an acquirer may have intents of cost saving in operations by sharing common resources between the acquirer and the target. The cost saving can accrue from economies of scope and scale. Second, an acquirer may have intents of product enhancement/development. Knowledge and skills transferred from the target may enable the acquirer to enhance the product quality or to expand product lines. Third, an acquirer may have intents of gain additional customer base to expand the market, to penetrate the market, or to enter a foreign market. Fourth, a firm may acquire a target with intents of expanding customer channels. By complementing existing channel to the customer, the firm can increase its revenue. Our results show that acquirers have had intents of cost saving more in acquisitions between hardware companies than in acquisitions between software companies. Hardware firms are more likely to acquire with intents of product enhancement or development than software firms. Overall, the intent of product enhancement/development is the most frequent intent in all of the four acquisition types, and the intent of customer base expansion is the second. We also analyze our data with the classification of production-side intents and customer-side intents, which is based on activities of the value chain of a firm. Intents of cost saving operations and those of product enhancement/development can be viewed as production-side intents and intents of customer base expansion and those of expanding customer channels can be viewed as customer-side intents. Our analysis shows that the ratio between the number of customer-side intents and that of production-side intents is higher in acquisitions where a software firm is an acquirer than in the acquisitions where a hardware firm is an acquirer. This study can contribute to IS literature. First, this study provides insights in understanding M&As in IT industries by answering for question of why an IT firm intends to another IT firm. Second, this study also provides distribution of acquisition intents for acquisition types.