• Title/Summary/Keyword search: 한국화


A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems / v.28 no.1 / pp.155-174 / 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (the disease caused by severe acute respiratory syndrome coronavirus 2, SARS-CoV-2) were published. This rapid growth in the number of COVID-19 papers places time and technical constraints on healthcare professionals and policy makers who need to find important research quickly. In this study, we therefore propose a method for extracting useful information from the text of an extensive body of literature using the LDA and Word2vec algorithms. Papers matching the keywords of interest were extracted from the COVID-19 papers, and their detailed topics were identified. The data came from the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House in response to the COVID-19 pandemic and updated weekly. The research method has two main parts. First, 41,062 articles were obtained by filtering and pre-processing the abstracts of 47,110 academic papers that included full text. The number of COVID-19 publications per year was analyzed through exploratory data analysis in Python, and the top 10 journals with the most active research were identified. LDA and Word2vec were then used to derive COVID-19 research topics, and similarity was measured after analyzing related words. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, yielding 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment'. For each of these collections, detailed topics were analyzed using LDA and Word2vec, and clustering after PCA dimension reduction was applied, with the t-SNE algorithm used to visualize groups of papers with similar themes. A noteworthy result is that topics that did not appear when topic modeling was applied to all COVID-19 papers did appear in the topic modeling results for the individual research topics. For example, topic modeling of the 'vaccine' papers extracted a new topic, Topic 05 'neutralizing antibodies'. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and it is said to play an important role in the production of therapeutic agents and in vaccine development. Likewise, topic extraction from the 'treatment' papers revealed a new topic, Topic 05 'cytokine'. A cytokine storm occurs when the body's immune cells attack normal cells instead of defending against the attack. Hidden topics that could not be found across the full set of papers were thus uncovered by classifying papers by keyword and performing topic modeling on each group. In this study, we proposed a method of extracting topics from a large body of literature using the LDA algorithm and of extracting similar words using Skip-gram, the Word2vec model that predicts surrounding words from the central word. Combining the LDA model with the Word2vec model was intended to improve performance by capturing both the relationship between documents and LDA topics and the relationships captured by Word2vec. In addition, as a clustering method following PCA dimension reduction, we presented a way to classify documents intuitively by using the t-SNE technique to group documents with similar themes into a structured organization. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of COVID-19 papers, we hope that this approach will save healthcare professionals and policy makers precious time and effort and help them gain new insights quickly. It is also expected to serve as basic data for researchers exploring new research directions.
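The abstract above mentions the Skip-gram variant of Word2vec. As a rough illustration only, the sketch below shows how (center, context) training pairs are typically generated for Skip-gram. The function name `skipgram_pairs`, the `window` parameter, and the sample sentence are assumptions made for this example and are not taken from the paper.

```python
from typing import List, Tuple


def skipgram_pairs(tokens: List[str], window: int = 2) -> List[Tuple[str, str]]:
    """Return (center, context) pairs: each word paired with its neighbours.

    Hypothetical helper for illustration; `window` is the number of words
    considered on each side of the center word.
    """
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is not its own context
                pairs.append((center, tokens[j]))
    return pairs


if __name__ == "__main__":
    # Made-up sentence, not drawn from the CORD-19 data set.
    sentence = "neutralizing antibodies protect cells from infection".split()
    for center, context in skipgram_pairs(sentence, window=1):
        print(center, "->", context)
```

A Skip-gram model is then trained to predict the context word from the center word for each pair; words that occur in similar contexts end up with similar vectors, which is what makes the similar-word lookup possible.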

  • Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

    • Thay, Setha;Ha, Inay;Jo, Geun-Sik
      • Journal of Intelligence and Information Systems / v.19 no.2 / pp.1-20 / 2013
    • Nowadays, social networks are a huge communication platform that lets people connect with one another and brings users together to share common interests, experiences, and daily activities. Users spend hours per day maintaining personal information and interacting with other people through posts, comments, messages, games, social events, and applications. As the amount of user information distributed across social networks grows, there is great potential to use social data to improve the quality of recommender systems. Some studies on social network analysis investigate how social networks can be used in the recommendation domain. Among these, we are interested in exploiting the interactions between a user and others in a social network, which can be identified and are known as social relationships. Furthermore, users' decisions before purchasing a product mostly depend on suggestions from people who have either the same preferences or a closer relationship. For this reason, we believe that users' relationships in a social network can provide an effective way to improve how well a recommender system predicts users' interests. Social relationships between users found in a social network are therefore a useful factor for improving the prediction of user preferences in the conventional approach. Recommender systems are rapidly increasing in popularity and are currently used by many e-commerce sites such as Amazon.com, Last.fm, and eBay.com. Collaborative filtering (CF) is one of the essential and powerful techniques in recommender systems for suggesting appropriate items to a user by learning the user's preferences. CF focuses on user data and automatically predicts a user's interests by gathering information from users who share a similar background and preferences.
Specifically, the intention of CF is to find users who have similar preferences and to suggest to the target user the items most preferred by those nearest-neighbor users. CF considers two basic units, the user and the item. Each user provides rating values for items (e.g., movies, products, books) to indicate interest in those items. CF then uses the user-rating matrix to find a group of users whose ratings are similar to the target user's, and predicts the unknown rating values of items that the target user has not rated. CF has been successfully implemented in both information filtering and e-commerce applications. However, important challenges remain, such as cold start, data sparsity, and scalability, which affect the quality and accuracy of prediction. To overcome these challenges, many researchers have proposed various kinds of CF, such as hybrid CF, trust-based CF, and social network-based CF. To improve the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method that integrates the traditional CF technique with social relationships between users discovered from users' behavior in a social network, namely Facebook. We identify users' relationships from behavior such as posts and comments exchanged with friends on Facebook. We believe that social relationships implicitly inferred from user behavior can compensate for the limitations of the conventional approach. We therefore extract the posts and comments of each user using the Facebook Graph API and calculate a feature score for each term to obtain a feature vector for computing user similarity. We then combine the result with the similarity value computed using the traditional CF technique. Finally, our system provides a list of recommended items based on the neighbor users who have the largest total similarity to the target user.
To verify and evaluate the proposed method, we performed an experiment on data collected from our Movies Rating System. Prediction accuracy is evaluated in terms of MAE to show how correct the recommendations given to users are. Performance is then evaluated in terms of precision, recall, and F1-measure to show the effectiveness of the method. Coverage is also evaluated to assess the ability to generate recommendations. The experimental results show that the proposed method outperforms the benchmark methods and suggests items to users more accurately. Using users' behavior in the social network improves recommendation accuracy by up to 6%. Moreover, the recommendation performance experiment shows that incorporating social relationships observed from users' behavior into CF is beneficial, improving performance by 7% compared with the benchmark methods. Finally, we confirm that interaction between users in a social network can enhance accuracy and yield better recommendations than the conventional approach.
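The idea of combining a rating-based similarity with a behavior-based similarity can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not say how the two similarities are combined or which similarity measure is used, so the cosine measure, the linear weight `alpha`, the neighbourhood size `k`, and all sample data are hypothetical.

```python
import math
from typing import Dict, Optional

Ratings = Dict[str, Dict[str, float]]  # user -> item -> rating
Vectors = Dict[str, Dict[str, float]]  # user -> term -> feature score


def cosine(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Cosine similarity of two sparse vectors; 0.0 if either is all zeros."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def total_similarity(u: str, v: str, ratings: Ratings, social: Vectors,
                     alpha: float = 0.5) -> float:
    """Blend rating similarity and social similarity (alpha is an assumption)."""
    return (alpha * cosine(ratings[u], ratings[v])
            + (1 - alpha) * cosine(social[u], social[v]))


def predict(user: str, item: str, ratings: Ratings, social: Vectors,
            alpha: float = 0.5, k: int = 2) -> Optional[float]:
    """Similarity-weighted average of the k most similar raters of `item`."""
    neighbours = [(total_similarity(user, v, ratings, social, alpha), v)
                  for v in ratings if v != user and item in ratings[v]]
    top = sorted(neighbours, reverse=True)[:k]
    weight = sum(sim for sim, _ in top)
    if weight == 0:
        return None  # no usable neighbour for this item
    return sum(sim * ratings[v][item] for sim, v in top) / weight


# Made-up toy data: movie ratings and term scores from posts and comments.
ratings = {"ann": {"m1": 5, "m2": 3},
           "bob": {"m1": 4, "m2": 3, "m3": 5},
           "cat": {"m1": 1, "m3": 2}}
social = {"ann": {"film": 1.0, "jazz": 0.5},
          "bob": {"film": 0.8},
          "cat": {"golf": 1.0}}

if __name__ == "__main__":
    print(predict("ann", "m3", ratings, social))
```

In this toy data, "bob" resembles "ann" in both ratings and posted terms, so his rating of `m3` dominates the prediction. Setting `alpha=1.0` falls back to rating similarity alone, which is one way to compare the blended score against plain CF.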

    Microbiological and Enzymological Studies on Takju Brewing (탁주(濁酒) 양조(釀造)에 관(關)한 미생물학적(微生物學的) 및 효소학적(酵素學的) 연구(硏究))

    • Kim, Chan-Jo
      • Applied Biological Chemistry / v.10 / pp.69-100 / 1968
    • 1. To investigate the microflora and enzyme activity of the mold wheat 'Nuruk', the major source of microorganisms for brewing Takju (a Korean Sake), two samples of Nuruk were studied: one prepared at the College of Agriculture, Chung Nam University (S) and the other purchased at a market (T). The molds, aerobic bacteria, lactic acid bacteria, and yeasts were examined and counted. The yeasts were classified by treatment with TTC (2,3,5-triphenyltetrazolium chloride) agar, which yields varied shades of color. The amylase and protease activities of the Nuruk were measured. The results were as follows. a) In Nuruk S were found: Aspergillus oryzae group, $204{\times}10^5$; black Aspergilli, $163{\times}10^5$; Rhizopus, $20{\times}10^5$; Penicillia, $134{\times}10^5$; aerobic bacteria, $9{\times}10^6-2{\times}10^7$; lactic acid bacteria, $3{\times}10^4$. In Nuruk T were found: Aspergillus oryzae group, $836{\times}10^5$; black Aspergilli, $286{\times}10^5$; Rhizopus, $623{\times}10^5$; Penicillia, $264{\times}10^5$; aerobic bacteria, $5{\times}10^6-9{\times}10^6$; lactic acid bacteria, $3{\times}10^4$. b) Eighty to ninety percent of the aerobic bacteria in Nuruk S appeared to belong to Bacillus subtilis, while about 70% of those in Nuruk T seemed to be spherical bacteria. In both Nuruks about 80% of the lactic acid bacteria were spherical. c) The yeast population in 1 g of Nuruk S was about $6{\times}10^5$, of which 56.5% were TTC pink yeasts, 16% TTC red pink yeasts, 8% TTC red yeasts, and 19.5% TTC white yeasts. In 1 g of Nuruk T the yeasts numbered $14{\times}10^4$, consisting of 42% TTC pink, 21% TTC red pink, 28% TTC red, and 9% TTC white. d) The enzyme activity of 1 g of Nuruk S was: liquefying type amylase, $D^{40}/_{30}=256$ W.V.; saccharifying type amylase, 43.32 A.U.; acid protease, 181 C.F.U.; alkaline protease, 240 C.F.U.
The enzyme activity of 1 g of Nuruk T was: liquefying type amylase, $D^{40}/_{30}=32$ W.V.; saccharifying type amylase, $^{30}34.92$ A.U.; acid protease, 138 C.F.U.; alkaline protease, 31 C.F.U. 2. During the fermentation of 'Takju' with Nuruks S and T, the microflora and enzyme activity were observed at 12-hour intervals throughout the brewing. TTC pink and red yeasts, considered to be the major yeasts, were isolated and cultured. The strains ($1{\times}10^6/ml$) were added to mashes S and T, in which the pH was adjusted to 4.2, and the change of microflora was examined during the fermentation. The results were: a) The molds disappeared from each sample plot 2 to 3 days after mashing, while the population of aerobic bacteria was $10{\times}10^7-35{\times}10^7/ml$ in the S plots and $8.2{\times}10^7-12{\times}10^7$ in the T plots. Among them, the cocci propagated substantially until some 30 hours had elapsed in the S and T plots treated with lactic acid, but decreased abruptly thereafter. In the plots SP, SR, TP, and TR the cocci did not appear from the beginning, while the bacilli fluctuated in number and had diminished by 1/5-1/10 of the original number at the end stage. b) The lactic acid bacteria in the S plot numbered about $7.4{\times}10^7$ per ml of mash at 24 hours and increased to around $2{\times}10^8$ over the following 3-4 days. After this period the population decreased rapidly and reached about $4{\times}10^5$ at the end. In the T plot the lactic acid bacteria numbered about $3{\times}10^8$ at 24 hours, about $3{\times}10$ at 3 days, and about $2{\times}10^5$ at the end. In the plots SP, SR, TP, and TR the lactic acid bacteria numbered as few as $4{\times}10^5$ at 24 hours, after which the organisms either remained unchanged in population or ceased to exist.
c) The majority of lactic acid bacteria found in each mash were spherical, and the change in their number tended to follow the amounts of lactic acid and alcohol produced in the mash. d) The yeasts propagated markedly from 24 hours, when the number was about $2{\times}10^8$ per ml of mash in the S plot; $4{\times}10^8$ at 48 hours and $5-7{\times}10^8$ at the end were observed. In the T plot the number was $4{\times}10^8$ at 24 hours and thereafter fluctuated within the range $2-5{\times}10^8$. e) Over 90% of the yeasts found in the mashes of the S and T plots were of the TTC pink type, while both the TTC red pink and TTC red types stayed in the range $2{\times}10-3{\times}10^7$ throughout the fermentation. f) The population of TTC pink yeasts in the SP plot was as much as $5{\times}10^8$ at 24 hours, that is, twice that of the S plot. This predominance in number continued through the middle and later stages, but the numbers became about the same at the end. g) The total number of yeasts in the SR plot differed little from that in the SP plot. The added TTC red yeasts appeared in considerable numbers in the early stage, but days later the change in number was about the same as in the S plot. In the TR plot the population of TTC red yeasts was larger than in the T plot in the early stage, but there was no difference between the two plots thereafter. Thus, even in the plots where TTC red yeasts were added, TTC pink yeasts were predominant. The TTC red yeasts observed in the present experiment continued to grow until the later stage, but at a low rate. h) In the TP plot the TTC pink yeasts numbered about $5{\times}10^8$ at 2 days and tended to decrease thereafter. Compared with the T plot, the number of TTC pink yeasts in the TP plot was predominant until the middle stage but became at the later stage. i) The productivity of alcohol in the mash was measured.
The plot to which TTC pink yeasts were added showed a somewhat better yield in the early stage, but from the middle stage onward no difference between the yeast-added and the intact mashes was recognizable. The production of alcohol was not proportional to the total number of yeasts present. j) The activity of the liquefying amylase was highest until 12 hours after mashing, dropped somewhat after that, and increased again around 36-48 hours after mashing. The activity then decreased continuously. The activity of the saccharifying amylase also decreased at 24 hours and then increased until 48 hours, when it reached its maximum. After that, the activity decreased gradually until 72 hours and rapidly thereafter. k) The activity of alkaline protease during the fermentation of the mash tended to decrease continuously, although somewhat irregularly. The activity of acid protease increased until hours at the maximum, then decreased rapidly, and increased again; the acid protease remained more active than the alkaline protease throughout. 3. The TTC pink yeasts that were predominant in number, two strains of TTC red pink yeasts that appeared throughout the brewing, and the TTC red yeasts were identified and their physiological characters examined. The results were as described below. a) The TTC pink yeasts (B-50P) and the two strains of TTC red pink yeasts (B-54 RP & B-60 RP) were identified as Saccharomyces cerevisiae type, and the TTC red yeasts (B-53 R) as Hansenula subpelliculosa type. b) The fermentability of the four strains mentioned above was measured. The two strains of TTC red pink yeasts had the highest fermentability and the TTC pink yeasts the lowest.
The former three strains were active in the early stage of fermentation and were found to be suitable for manufacturing 'Takju'. The TTC red yeasts were found to play an important role in Takju brewing because of their strong ability to produce esters, although their fermentability was low. c) The yeast strains showed marked tolerance to nitrous acid. Tolerance to lactic acid was only 3% in Koji extract, with the TTC red yeasts showing somewhat stronger resistance. The alcohol tolerance of the TTC pink and red pink yeasts was 7% in the Hayduck solution and 13% in the malt extract, whereas that of the TTC red yeasts was much weaker. None of the four yeast strains liquefied gelatin, even after 40 days. 4. During Takju brewing, 70-80% of the total fermentation took place in the first two days, and around 90% had proceeded in 3-4 days. The main fermentation appeared to be completed during this period. The productivity of alcohol during Takju brewing was approximately 65% of the total amount of starch put into the mash. 5. Saccharomyces coreanus, found by Saito in the mash of Takju, was not detected in the present experiment. This is considered to be because Aspergillus oryzae has been inoculated into the mold wheat (Nuruk) since around 1930 and because Koji has been used in Takju brewing, causing a complete change in the microflora of Takju brewing. This view is supported by the fact that the original flavor and taste have now changed remarkably.
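As a quick arithmetic check on item 1 c) of the abstract above, the yeast totals per gram and the TTC color percentages can be converted into approximate absolute counts. The sketch below uses only the figures quoted in the abstract; the names `NURUK` and `counts_per_gram` are hypothetical, chosen for this illustration.

```python
# Figures quoted in the abstract: total yeasts per gram of Nuruk and the
# share (%) of each TTC colour type. Names are illustrative only.
NURUK = {
    "S": (6e5, {"pink": 56.5, "red pink": 16.0, "red": 8.0, "white": 19.5}),
    "T": (14e4, {"pink": 42.0, "red pink": 21.0, "red": 28.0, "white": 9.0}),
}


def counts_per_gram(total, shares):
    """Convert percentage shares into approximate yeast counts per gram."""
    # The reported shares should account for the whole population.
    assert abs(sum(shares.values()) - 100.0) < 1e-9
    return {kind: round(total * pct / 100) for kind, pct in shares.items()}


if __name__ == "__main__":
    for sample, (total, shares) in NURUK.items():
        print(sample, counts_per_gram(total, shares))
```

The reported shares sum to 100% for both samples. On these figures Nuruk S carries more TTC pink yeasts per gram than Nuruk T (about 339,000 versus 58,800), even though TTC pink is the largest group in both.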

