• Title/Summary/Keyword: Web News

Search Result 247, Processing Time 0.021 seconds

A Study of Main Contents Extraction from Web News Pages based on XPath Analysis

  • Sun, Bok-Keun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.7
    • /
    • pp.1-7
    • /
    • 2015
  • Although data on the internet can be used in various fields such as source of data of IR(Information Retrieval), Data mining and knowledge information servece, and contains a lot of unnecessary information. The removal of the unnecessary data is a problem to be solved prior to the study of the knowledge-based information service that is based on the data of the web page, in this paper, we solve the problem through the implementation of XTractor(XPath Extractor). Since XPath is used to navigate the attribute data and the data elements in the XML document, the XPath analysis to be carried out through the XTractor. XTractor Extracts main text by html parsing, XPath grouping and detecting the XPath contains the main data. The result, the recognition and precision rate are showed in 97.9%, 93.9%, except for a few cases in a large amount of experimental data and it was confirmed that it is possible to properly extract the main text of the news.

The Status of Constitutional Medical Industry Related to Metabolic Diseases by Web Search (웹 검색에 의한 대사성질환 관련 체질의학산업 현황)

  • Lee, Yeon-Joo;Kim, Jong-Yeol
    • Journal of Sasang Constitutional Medicine
    • /
    • v.27 no.4
    • /
    • pp.388-395
    • /
    • 2015
  • Objectives To grasp the trend of constitution medical industry related to the metabolic disorders by analyzing the web resource.Methods Web search with the search formula ("constitutional" or "spirit") and ("Metabolic" or "diabetes" or "high blood pressure" or "hyperlipidemia" or "obesity") for 20 years (1995.09.10 ~ 2015.09.09.) in the web portal address "Web search with the search formula ("constitutional" or "spirit") and ("Metabolic" or "diabetes" or "high blood pressure" or "hyperlipidemia" or "obesity") for 20 years (1995.09.10 ~ 2015.09.09.) in the web portal address "http://web.search.naver.com".Results In the search area of news, blogs, cafes and knowledge-in, the number of searched pages retrieved by the word "constitution" was about 1.78 million. In the news 9760 cases of "obesity", 4046 cases of "hypertension" and 3253 cases of "diabetes" were searched. In Naver Web search Korean medicine clinics related to "constitution" were 24.3%. If we multiple 25.3% to 1000, the actual number of herbal hospitals, The constitution related to Korean medicine clinics is estimated to be approximately 3160 places. Among metabolic disorders, "Overweight", "Diabetes" and "Hypertension" were most frequently searched.Conclusions Constitutional industry related to metabolic diseases is very actively created on the internet in various areas. Among metabolic diseases, obesity, diabetes, hypertension were found with high frequency.

A Study on the Implementation of the Mobile Web Contents Guideline for integrating Web and Mobile - Focus on the NewsSite- (웹과 모바일을 연동하기 위한 모바일웹 컨텐츠 가이드라인 구현에 관한 연구 -뉴스 사이트를 중심으로-)

  • Ko, Hee-Ae;Sim, Kun-Jung;Kim, Jong-Keun;Lim, Young-Hwan
    • Journal of Digital Contents Society
    • /
    • v.8 no.2
    • /
    • pp.141-148
    • /
    • 2007
  • As the environment of wireless mobile is developed, the number of users who want to searching the information on the mobile internet is increasing as same as the contents are increasing. However the contents is just offered limited contents which should be charged such as download sounds and images. Users cannot be satisfied with their needs for searching information. The other hands, the reason why cannot achieved the effective comparison with costs is that the costs of development contents is very expensive. So variable contents cannot produced as much as users want. On this papers will introduce the way of producing the mobile contents by low cost. The program 'Mobuilder' will be introduced, which is the program to transfer directly form web contents to mobile contents. And it will propose the guideline to design mobile contents and mobile web will be developed mobile news site by proposed guideline and build the mobile site.

  • PDF

Stock and News Application of Intelligent Agent System

  • Kim, Dae-Su
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.3 no.2
    • /
    • pp.239-243
    • /
    • 2003
  • Recently, there has been active research conducted on the intelligent agent in various fields. The results have been widely applied to intelligent user-friendly interfaces. In this system, we modeled, designed, and implemented an intelligent agent system that can be applied to stock and news. Some procedures such as login sequence to the web site, process to get stock information, setting stock in concern, intelligent news system module, news analysis module, and news learning module are modeled in detail and described in block diagram level. In our experiment on stock system, it showed quite a useful alarming screen avatar result and also on news system. it successfully rearranged the order of the news according to the user's preferences.

An Exploratory Study on Issues Related to chatGPT and Generative AI through News Big Data Analysis

  • Jee Young Lee
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.4
    • /
    • pp.378-384
    • /
    • 2023
  • In this study, we explore social awareness, interest, and acceptance of generative AI, including chatGPT, which has revolutionized web search, 30 years after web search was released. For this purpose, we performed a machine learning-based topic modeling analysis based on Korean news big data collected from November 30, 2022, when chatGPT was released, to August 31, 2023. As a result of our research, we have identified seven topics related to chatGPT and generative AI; (1)growth of the high-performance hardware market, (2)service contents using generative AI, (3)technology development competition, (4)human resource development, (5)instructions for use, (6)revitalizing the domestic ecosystem, (7)expectations and concerns. We also explored monthly frequency changes in topics to explore social interest related to chatGPT and Generative AI. Based on our exploration results, we discussed the high social interest and issues regarding generative AI. We expect that the results of this study can be used as a precursor to research that analyzes and predicts the diffusion of innovation in generative AI.

A Study on the Preemptive Measure for Fake News Eradication Using Data Mining Algorithms : Focused on the M Online Community Postings (데이터 마이닝을 활용한 가짜뉴스의 선제적 대응을 위한 연구 : M 온라인 커뮤니티 게시물을 중심으로)

  • Lim, Munyeong;Park, Sungbum
    • Journal of Information Technology Services
    • /
    • v.18 no.1
    • /
    • pp.219-234
    • /
    • 2019
  • Fake news threaten democratic elections and causes social conflicts, resulting in major damage. However, the concept of fake news is hard to define, as there is a saying, "News is not fake, fake is not news." Fake news, however, has irreversible characteristics that can not be recovered or reversed completely through post-punishment of economic and political benefits. It is also rapidly spreading in the early days. Therefore, it is very important to preemptively detect these types of articles and prevent their blind proliferation. The existing countermeasures are focused on reporting fake news, raising the level of punishment, and the media & academia to determine the authenticity of the news. Researchers are also trying to determine the authenticity by analyzing its contents. Apart from the contents of fake news, determining the behavioral characteristics of the promoters and its qualities can help identify the possibility of having fake news in advance. The online community has a fake news interception and response tradition through its long-standing community-based activities. As a result, I attempted to model the fake news by analyzing the affirmation-denial analysis and posting behavior by securing the web board crawl of the 'M community' bulletin board during the 2017 Korean presidential election period. Random forest algorithm deemed significant. The results of this research will help counteract fake news and focus on preemptive blocking through behavioral analysis rather than post-judgment after semantic analysis.

The Influence of the Introduction of Smart Phone on Using Portal Sites: An Exploratory Study by the Analysis on Smart Phone Users' Web Traffic (스마트폰 도입이 포털사이트 이용에 미친 영향: 스마트폰 이용자의 웹 트래픽 분석을 통한 탐색적 연구)

  • Kim, Wi-Geun
    • Korean journal of communication and information
    • /
    • v.64
    • /
    • pp.109-135
    • /
    • 2013
  • This study is for empirical verification of the influence of the introduction of smart phone on using the portal sites that were affected the most in the previous media environment. To achieve this, Web traffic data that are the result of smart phone users' practical Web uses have collected longitudinally and analyzed. The research results are the following: First, the use hours of portal sites have decreased about 15% and the page views have did about 35%, since using smart phones was diffused and habituated in earnest during the past two years. Using the community, news media, video, mobile, and game section of portal site sections have reduced. Second, the portal site portion of using smart phone Web is much more than that portion of using PC Web. More than two thirds of smart phone Web use traffic occurs in using portal sites, while more than one third of PC Web use traffic does in using that. Using the news media section is the most of using portal site sections on a smart phone. Third, since the introduction of smart phone, using the news media, communication, and life section of portal site sections have greatly increased, while the community, mobile, and game section have greatly decreased in the aggregate.

  • PDF

User Oriented clustering of news articles using Tweets Heterogeneous Information Network (트위트 이형 정보 망을 이용한 뉴스 기사의 사용자 지향적 클러스터링)

  • Shoaib, Muhammad;Song, Wang-Cheol
    • Journal of Internet Computing and Services
    • /
    • v.14 no.6
    • /
    • pp.85-94
    • /
    • 2013
  • With the emergence of world wide web, in particular web 2.0 the rapidly growing amount of news articles has created a problem for users in selection of news articles according to their requirements. To overcome this problem different clustering mechanism has been proposed to broadly categorize news articles. However these techniques are totally machine oriented techniques and lack users' participation in the process of decision making for membership of clustering. In order to overcome the issue of zero-participation in the process of clustering news articles in this paper we have proposed a framework for clustering news articles by combining users' judgments that they post on twitter with the news articles to cluster the objects. We have employed twitter hash-tags for this purpose. Furthermore we have computed the credibility of users' based on frequency of retweets for their tweets in order to enhance the accuracy of the clustering membership function. In order to test performance of proposed methodology, we performed experiments on tweets messages tweeted during general election 2013 in Pakistan. Our results proved over claim that using users' output better outcome can be achieved then ordinary clustering algorithms.

Controversy and Guideline Suggestion Surrounding Fake News in the Digital Media Age (가짜뉴스(Fake News) 현황분석을 통해 본 디지털매체 시대의 쟁점과 뉴스콘텐츠 제작 가이드라인)

  • Kwon, Mahnwoo;Jun, Yong Woo;Im, Hajin
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.11
    • /
    • pp.1419-1426
    • /
    • 2015
  • Distinguishing border between news and advertising is disappearing. Traditional journalism considered editorial part deals news and ad part handle commercial messages. But now this classification is meaningless. Current news consumers do not separate advertising content and non-advertising content. In Korea, making fake news or paid news pages is becoming social problem. Fake news uses various camouflages to pretend to be real news. This paper descriptively analyzed Korean fake news cases and suggested some guidelines for publishing news. We analyzed 3 major newspaper web sites from July to September, 2014. These three newspapers publish section pages everyday containing fake news or sponsored news. Totally more than one thousand articles were selected for content analysis. We coded the numbers of fake news, day of the week, the rate of sponsored news, average fake news publication number per pages, the conformity between news and advertising, and the type of fake news. We also coded the number of sponsored news article in day sections. We used method of comparing the advertising contents and news articles. As a result, 24.8% of news article were published for the advertising sponsors. Advertorial or fake news were sometimes arranged same pages the same day. We coded the conformity between same advertising and news content. More than 60 percent (60.9%) of fake news match with their sponsors. PR style of fake news is top and advertising type of fake news is the lowest.

The Third- and First-Person Effects of Election Polling News Through Emotions

  • Kim, Hyunjung
    • Asian Journal for Public Opinion Research
    • /
    • v.10 no.4
    • /
    • pp.262-276
    • /
    • 2022
  • In this study, we examine how the third- and first-person perceptions of election polling news are linked to voters' political behaviors through anxiety and pride. The results of two web-based surveys conducted before and after the 2022 local elections in South Korea demonstrate that the third-person perception of election polling news is directly and indirectly linked to support for restrictions on media reports of election poll results through anxiety. The first-person perception of polling news is positively associated with reinforcement of support for the preferred candidate. These results suggest that how voters perceive the effects of polling news may have actual impacts on their political behaviors.