• Title/Summary/Keyword: Web version

Search Result 196, Processing Time 0.032 seconds

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful for them. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide the users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role for document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask the authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could not benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle; manually assigning keywords to all documents is a daunting task, or even impractical in that it is extremely tedious and time-consuming requiring a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are mainly two approaches to achieving this aim: keyword assignment approach and keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given set of vocabulary, and the aim is to match them to the texts. In other words, the keywords assignment approach seeks to select the words from a controlled vocabulary that best describes a document. Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. On the other hand, in the latter approach, the aim is to extract keywords with respect to their relevance in the text without prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted based on supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems such as Extractor and Kea were developed using keyword extraction approach. Most indicative words in a document are selected as keywords for that document and as a result, keywords extraction is limited to terms that appear in the document. Therefore, keywords extraction cannot generate implicit keywords that are not included in a document. According to the experiment results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Inversely, it also means that 10% to 36% of the keywords assigned by the authors do not appear in the article, which cannot be generated through keyword extraction algorithms. Our preliminary experiment result also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment namely IVSM(Inverse Vector Space Model). The model is based on a vector space model. which is a conventional information retrieval model that represents documents and queries by vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: IVSM system for Web-based community service and stand-alone IVSM system. Firstly, the IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and, indeed, it has been tested through a number of academic papers including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to Web-based community service and academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows comparable performance to Extractor that is a representative system of keyword extraction approach developed by Turney. As electronic documents increase, we expect that IVSM proposed in this paper can be applied to many electronic documents in Web-based community and digital library.

Efficacy and Toxicity of Anti-VEGF Agents in Patients with Castration-Resistant Prostate Cancer: a Meta-analysis of Prospective Clinical Studies

  • Qi, Wei-Xiang;Fu, Shen;Zhang, Qing;Guo, Xiao-Mao
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.19
    • /
    • pp.8177-8182
    • /
    • 2014
  • Background: Blocking angiogenesis by targeting vascular endothelial growth factor (VEGF) signaling pathway to inhibit tumor growth has proven to be successful in treating a variety of different metastatic tumor types, including kidney, colon, ovarian, and lung cancers, but its role in castration-resistant prostate cancer (CRPC) is still unknown. We here aimed to determine the efficacy and toxicities of anti-VEGF agents in patients with CRPC. Materials and Methods: The databases of PubMed, Web of Science and abstracts presented at the American Society of Clinical Oncology up to March 31, 2014 were searched for relevant articles. Pooled estimates of the objective response rate (ORR) and prostate-specific antigen (PSA) response rate (decline ${\geq}50%$) were calculated using the Comprehensive Meta-Analysis (version 2.2.064) software. Median weighted progression-free survival (PFS) and overall survival (OS) time for anti-VEGF monotherapy and anti-VEGF-based doublets were compared by two-sided Student's t test. Results: A total of 3,841 patients from 19 prospective studies (4 randomized controlled trials and 15 prospective nonrandomized cohort studies) were included for analysis. The pooled ORR was 12.4% with a higher response rate of 26.4% (95%CI, 13.6-44.9%) for anti-VEGF-based combinations vs. 6.7% (95%CI, 3.5-12.7%) for anti-VEGF alone (p=0.004). Similarly, the pooled PSA response rate was 32.4% with a higher PSA response rate of 52.8% (95%CI: 40.2-65.1%) for anti-VEGF-based combinations vs. 7.3% (95%CI, 3.6-14.2%) for anti-VEGF alone (p<0.001). Median PFS and OS were 6.9 and 22.1 months with weighted median PFS of 5.6 vs. 6.9 months (p<0.001) and weighted median OS of 13.1 vs. 22.1 months (p<0.001) for anti-VEGF monotherapy vs. anti-VEGF-based doublets. Conclusions: With available evidence, this pooled analysis indicates that anti-VEGF monotherapy has a modest effect in patients with CRPC, and clinical benefits gained from anti-VEGF-based doublets appear greater than anti-VEGF monotherapy.

Analysis of Research Topics among Library, Archives and Museums using Topic Modeling (토픽 모델링을 활용한 도서관, 기록관, 박물관간의 연구 주제 분석)

  • Kim, Heesop;Kang, Bora
    • Journal of Korean Library and Information Science Society
    • /
    • v.50 no.4
    • /
    • pp.339-358
    • /
    • 2019
  • The purpose of this study is to understand the topics of the research for the establishment of cooperative platform between libraries, archives, and museums that carry out the common task of providing knowledge information in a broad sense. To achieve the purpose of this study, 637 bibliographic information on three institutions were collected from the Web version of Scopus database. Among the collected bibliographic information, 5,218 words were extracted through NetMiner V.4 and analysed topic modeling. The results are as follows: First, as a result of analyzing the frequency of word appearance according to the tf-idf weight 'Preservation' was the most hottest topic. Second, the topic modeling analysis through LDA(Latent Dirichlet Allocation) algorithm resulted in 13 topic areas. Third, as a result of expressing 13 topic areas as a network, repository construction was the central topic, and the research topics such as cooperation among institutions, conservation environment for collections, system and policy discovery, life cycle of collections, exhibition of information resources, and information retrieval were closely related to the central topic. Fourth, the trend of 13 topic areas by year 1998 is limited to the specific subjects such as system and policy discovery, information retrieval, and life cycle of collections, while the subsequent studies have been carried out after that year.

Development of Radar-Based Multi-Sensor Quantitative Precipitation Estimation Technique (레이더기반 다중센서활용 강수추정기술의 개발)

  • Lee, Jae-Kyoung;Kim, Ji-Hyeon;Park, Hye-Sook;Suk, Mi-Kyung
    • Atmosphere
    • /
    • v.24 no.3
    • /
    • pp.433-444
    • /
    • 2014
  • Although the Radar-AWS Rainrate (RAR) calculation system operated by Korea Meteorological Administration estimated precipitation using 2-dimensional composite components of single polarization radars, this system has several limitations in estimating the precipitation accurately. To to overcome limitations of the RAR system, the Korea Meteorological Administration developed and operated the RMQ (Radar-based Multi-sensor Quantitative Precipitation Estimation) system, the improved version of NMQ (National Mosaic and Multi-sensor Quantitative Precipitation Estimation) system of NSSL (National Severe Storms Laboratory) for the Korean Peninsula. This study introduced the RMQ system domestically for the first time and verified the precipitation estimation performance of the RMQ system. The RMQ system consists of 4 main parts as the process of handling the single radar data, merging 3D reflectivity, QPE, and displaying result images. The first process (handling of the single radar data) has the pre-process of a radar data (transformation of data format and quality control), the production of a vertical profile of reflectivity and the correction of bright-band, and the conduction of hydrid scan reflectivity. The next process (merger of 3D reflectivity) produces the 3D composite reflectivity field after correcting the quality controlled single radar reflectivity. The QPE process classifies the precipitation types using multi-sensor information and estimates quantitative precipitation using several Z-R relationships which are proper for precipitation types. This process also corrects the precipitation using the AWS position with local gauge correction technique. The last process displays the final results transformed into images in the web-site. This study also estimated the accuracy of the RMQ system with five events in 2012 summer season and compared the results of the RAR (Radar-AWS Rainrate) and RMQ systems. The RMQ system ($2.36mm\;hr^{-1}$ in RMSE on average) is superior to the RAR system ($8.33mm\;hr^{-1}$ in RMSE) and improved by 73.25% in RMSE and 25.56% in correlation coefficient on average. The precipitation composite field images produced by the RMQ system are almost identical to the AWS (Automatic Weather Statioin) images. Therefore, the RMQ system has contributed to improve the accuracy of precipitation estimation using weather radars and operation of the RMQ system in the work field in future enables to cope with the extreme weather conditions actively.

An Exploratory Study for Identifying Key Factors in Online Games Development Strategy Utilizing Web Community (온라인게임 개발전략에 관한 탐색적 연구 : 게임 커뮤니티 활용을 중심으로)

  • Jung, Jai-Jin;Chang, Chung-Moo;Kim, Tae-Ung
    • The KIPS Transactions:PartD
    • /
    • v.11D no.4
    • /
    • pp.991-1002
    • /
    • 2004
  • Online game business has emerged as the most lucrative entertainment industry, with over 20 million platers. The popularity of online games can be attributed to the presence of numerous PC Bangs around the country, which have pushed online games into the mainstream culture while broadband internet services facilitated online game play. The age distribution of online game players is expanding and a variety of new games are under development to target certain age groups. While the online game market continues to expand, with many new online game publishers entering the market, relatively little is known about which factors are strategically important for successful development of online games. A conceptual framework is proposed, and a structural equation modeling, for Identifying the factors affecting the market success of online games, is developed. The concept of online game community, idea generation, systematic development strategy, flexible development process, utilizing demo-version, outsourcing, etc, are ail introduced into the model, as the independent variables affecting the success level of online games directly and indirectly. Based on data collected from questionnaire survey, the validity of the model has been tested and interesting conclusions have been developed concerning the relationships between these variables. Statistical results show that utilizing online game community and system atic development strategy is the key for successful online game development. Other interesting results concerning game development strategy are also provided. It is hoped that this result might provide the useful guidelines for developing the successful online game contents. With a better understanding of key success factors, online game developers should be able to make adjustments in their development and marketing plans, providing them with a sustainable advantage over their competition.

Effect of Allergy Related Disease on Suicide Ideation among Adolescents in Korea (청소년 알레르기성 질환의 복합성과 중증도가 자살 생각에 미치는 영향)

  • Wang, Jin Woo;Kim, Eun Young;Park, Su Jin;Lee, Jun Hyup;Rhim, Kook Hwan
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.17 no.3
    • /
    • pp.11-25
    • /
    • 2016
  • Background & Objectives: There were increasing evidence about the relationship between allergy related disease such as asthma, atopic dermatitis and allergic rhinitis and suicide ideation. However little was known about the concrete relatedness between severity and comorbidity of allergy related disease with suicide ideation. The objective of this study was to investigate the cases of the prevalence of suicide ideation among adolescents with allergy related disease such as asthma, atopic dermatitis and allergic rhinitis, and examine the association between allergy related disease and suicidal ideation among adolescents in South Korea. Methods: Data was based on Korean Youth Risk Behavior Web-based Survey(2014) which was a cross-sectional study containing 34,874 Korean middle and high school students who diagnosed with allergy related disease. We used the weights, strata and primary sampling unit information provided by the public use dataset to compute descriptive statistics and logistic regressions. Computations were done with SPSS version 20.0. Results: 19.9%, 15.6%, 13.8% of adolescents who suffered from one, two and three of allergy related diseases respectively reported having been thought of suicide ideation. Socio-demographic factors were adjusted as control variables. Students with greater severity of disease were more likely to have suicide ideation. Odds ratio for students who were absent one to three days from school because of allergies was 1.96(95% CI 1.51-2.46), and odds ratio for those who were absent more than four days from school was 3.60(95% CI 2.46-5.28). Conclusions: Given that adolescents' severity and comorbidity of allergy related disease were clearly associated with suicide ideation, suicide prevention programs for adolescents with allergy related disease should be improved by strategic approaches towards the severity and comorbidity of disease.

Related factors of oral symptoms in adolescents from Korean multicultural families (우리나라 다문화가정 청소년의 구강질환증상과의 관련요인)

  • Han, Yeo-Jung;Park, Sin-Young;Ryu, So-Yeon
    • Journal of Korean society of Dental Hygiene
    • /
    • v.16 no.6
    • /
    • pp.893-907
    • /
    • 2016
  • Objectives: The purpose of this study was to identify the related factors of dental caries and periodontal disease in adolescents from Korean multicultural families, thereby helping to reduce the prevalence rate of oral disease. Methods: The subjects were 710 multicultural adolescents recruited using a web-based survey, National 2015 Korean Youth Risk Behavior, from the Korean Center for Disease Control. A multicultural family was defined in this study as one having an immigrant mother or father. Oral symptoms included dental caries and periodontal disease. Toothache was defined as a symptom of dental caries. Tender or bleeding gums were defined as symptom of periodontal disease. For statistical analysis, Statistical Package for Social Sciences (SPSS) Version 21.0 for Windows was used. Descriptive analysis and a Chi-square test were conducted to determine the factors associated with general characteristics, health behavior, and oral health behavior. Finally, to investigate the associations among oral disease symptoms, logistic regression analysis was performed. Results: Toothache was significantly higher in female 1.52 (95% CI; 1.45-1.60), high school 1.23 (95% CI; 1.18-1.28), women school 1.10 (95% CI; 1.05-1.16), individuals with poor economic status 1.45 (95% CI; 1.30-1.52), and participants who consumed alcohol 1.32 (95% CI; 1.27-1.37). Toothache related to perceived health status was significantly lower in the healthy group 0.69 (95% CI; 0.64-0.75), and was higher in usual stress group 1.65 (95% CI; 1.57-1.74). Gum bleeding was significantly higher in female 1.32 (95% CI; 1.27-1.37), high school 1.15 (95% CI; 1.10-1.19), and individuals with poor economic status 1.38 (95% CI; 1.27-1.50). Gum bleeding related to perceived health status was significantly lower in the healthy group 0.68 (95% CI; 0.63-0.74), and was higher in usual stress group 1.54 (95% CI; 1.46-1.62). Conclusions: Taking into account of social and economic levels, and dietary habits in the multicultural families adolescents, further education and support will be needed for oral disease prevention and early treatment.

Associations of Eating Habits with Obesity and Nutrition Knowledge for Middle and High School Adolescents in Shanghai and Heze China (중국 상하이·허쩌 중·고등학생의 식습관과 비만도 및 영양지식과의 관련성 연구)

  • Song, Yang;Ahn, Hyo-Jin;Choi, Ji-Hye;Oh, Se-Young
    • Journal of the Korean Society of Food Culture
    • /
    • v.29 no.6
    • /
    • pp.648-658
    • /
    • 2014
  • The aim of this study was to investigate the relationships between eating habits and health among adolescents in Shanghai and Heze, China. A cross-sectional study was conducted in 2013 on 2,089 adolescents; 1,089 students were from Shanghai and 999 students from Heze region. Eating habits, weight, height, and nutritional knowledge were assessed using a self-administered questionnaire. Eating habits score was classified into two categories: healthy eating habits and unhealthy eating habits, based on "Korean Youth Risk Behavior Web-based Survey", for statistical data analysis. Associations between eating habits, BMI, and nutritional knowledge were examined using a general linear model with adjustment of potential confounding factors such as region, gender, age, parents' education level, and pocket money. Statistical analyses were performed using the SAS (version 9.3) program. Proportions of healthy eating habits group were 90.0% for breakfast (3-7 times/wk), 29.1% for fruit (${\geq}once/d$), 12.5% for vegetable (${\geq}3times/d$), 7.3% for milk (${\geq}2times/d$), 90.0% for fast food (<3 times/wk) consumption, respectively. The average BMI score was 20.1 (Shanghai 20.5 Heze 19.6), which is in the range of normal weight. Rates of obesity and overweight were 16.5% and 8.3% in Shanghai and Heze, respectively. There were significant negative correlations between intake frequencies of breakfast, fast food, biscuits, sugar, chocolate, and BMI score. Eating habits and nutritional knowledge score showed a significant positive correlation. These results showed better eating habits regarding eating regularity and consumption of fruits and soft drinks in Chinese adolescents compared with Korean adolescents, although cultural differences were not fully considered. This study demonstrated significant associations of BMI and nutritional knowledge with dietary behavior in Chinese adolescents in two regions of China. Further studies on Chinese adolescents from other regions in China should be considered.

Device Virtualization Framework for Smart Home Cloud Service (스마트홈 클라우드 서비스를 위한 디바이스 가상화 프레임워크)

  • Kim, Kyungwon;Park, Jongbin;Kum, Seungwoo;Jung, Jongjin;Yang, Chang-Mo;Lim, Taebeom
    • Telecommunications review
    • /
    • v.24 no.5
    • /
    • pp.677-691
    • /
    • 2014
  • Connectivity is becoming more important keywords recently. For example, many devices are going to be connected to the internet. It is usually called as the IoT(internet of things). Many IoT devices can be evolved as a part of giant system of the world wide web. It is a great opportunity for us, because many new services can have emerged through this paradigm. In this paper, we propose a device virtualization framework for smart home service. The proposed framework connects the many home appliances devices and the internet using a dynamic protocol conversion. After our protocol conversion for device virtualization, our framework provides a RESTful API to access the resources of device through the internet. Therefore, the proposed framework can provide a variety of services, so it also can be developed into the ecosystem for smart home service. The current framework version only supports UPnP enabled devices of the home, but it can easily be extended to many other home middleware solutions. To verify the feasibility of the framework, we have implemented several service scenarios.

Validation of a physical activity classification table in Korean adults and elderly using a doubly labeled water method (한국 성인과 노인을 대상으로 이중표식수법을 이용한 신체활동분류표 타당도 평가)

  • Hye-Ji Han ;Ha-Yeon Jun;Jonghoon Park;Kazuko Ishikawa-Takata;Eun-Kyung Kim
    • Journal of Nutrition and Health
    • /
    • v.56 no.4
    • /
    • pp.391-403
    • /
    • 2023
  • Purpose: This study evaluated the validity of a physical activity classification table (PACT) based on total energy expenditure (TEE) and physical activity level (PAL) measured using the doubly labeled water (DLW) method in Korean adults and the elderly. Methods: A total of 141 (male 70, female 71) adults and elderly were included. The reference standards TEEDLW, PALDLW were measured over a 14-day period using DLW. A 24-hour physical activity diary was kept for three days (two days during the week and one day on the weekend). PALPACT was calculated by classifying the activity type and intensity using the PACT. PALPACT was multiplied by resting energy expenditure measured by indirect calorimetry to estimate TEEPACT. Results: The mean age of the study participants was 50.5 ± 18.8 years, and the mean body mass index was 23.4 ± 3.3 kg/m2. A comparison of TEEDLW and TEEPACT by sex and age showed no significant differences. The bias, the difference between TEEDLW and TEEPACT, was male 17.3 kcal/day and female -4.5 kcal/day. The percentage of accurate predictions (values within ± 10% of the TEEDLW) of TEEPACT was 58.6% in males and 54.9% in females, with the highest prediction values in the age group 40-64 years (70.9%) in males and over 65 years (73.9%) in females. The spearman correlation coefficient (r) between TEEPACT and TEEDLW was 0.769, indicating a significant positive correlation (p < 0.001). Conclusion: In this study, the use of a new PACT for calculating TEE and PAL was evaluated as valid. A web version of the software program and a smartphone application need to be developed using PACT to make it easier to apply for research purposes.