• Title/Summary/Keyword: Topic Change Detection


Topic and Topic Change Detection in Instant Messaging (인스턴트 메시징에서의 대화 주제 및 주제 전환 탐지)

  • Choi, Yoon-Jung;Shin, Wook-Hyun;Jeong, Yoon-Jae;Myaeng, Sung-Hyon;Han, Kyoung-Soo
    • Journal of the Korea Society of Computer and Information, v.13 no.7, pp.59-66, 2008
  • This paper describes a novel method for identifying the main topic and detecting topic changes in a text-based dialogue such as Instant Messaging (IM). Compared to other forms of text, dialogues are characterized by short utterances with few words, two or more participants, and a history that affects the current utterance. Given these characteristics, our method detects the main topic of a dialogue by considering keywords not only in the user's utterances but also in the dialogue system's responses. Dialogue histories are also considered in the detection process to increase accuracy. For topic change detection, the similarity between the previous utterance's topic and the current utterance's topic is calculated. If the similarity is below a certain threshold, our system judges that the topic has changed at the current utterance. We obtained 88.2% and 87.4% accuracy in topic detection and topic change detection, respectively.

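The threshold rule in the abstract above can be illustrated with a minimal sketch. It assumes each utterance's topic is approximated by a bag of whitespace-separated words and that similarity is cosine similarity; the function names and the threshold value 0.2 are hypothetical, and the sketch ignores the dialogue history and system responses that the paper's method also uses.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def detect_topic_changes(utterances, threshold=0.2):
    """Return indices of utterances judged to start a new topic.

    An utterance is flagged when its similarity to the previous
    utterance falls below `threshold` (an assumed, untuned value).
    """
    changes = []
    previous = None
    for index, text in enumerate(utterances):
        current = Counter(text.lower().split())
        if previous is not None and cosine_similarity(previous, current) < threshold:
            changes.append(index)
        previous = current
    return changes
```

For example, calling `detect_topic_changes` on a short chat where the last message shares no words with the one before it flags only that last message.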

A Comparison of Scene Change Localization Methods over the Open Video Scene Detection Dataset

  • Panchenko, Taras;Bieda, Igor
    • International Journal of Computer Science & Network Security, v.22 no.6, pp.1-6, 2022
  • Scene change detection is an important topic because of its wide and growing range of applications. Streaming services from many providers are increasing their capacity, which drives industry growth. A method for scene change detection is described here and compared with state-of-the-art methods on the Open Video Scene Detection (OVSD) dataset, an open dataset of Creative Commons licensed videos freely available for download and use in evaluating video scene detection algorithms. The proposed method is based on scene analysis using threshold values and smooth scene changes. The results demonstrate the high efficiency of the scene cut localization method proposed by the authors: measured in terms of precision, recall, accuracy, and F-score, it exceeds the best previously known results.

Statistical Properties of News Coverage Data

  • Lim, Eunju;Hahn, Kyu S.;Lim, Johan;Kim, Myungsuk;Park, Jeongyeon;Yoon, Jihee
    • Communications for Statistical Applications and Methods, v.19 no.6, pp.771-780, 2012
  • In the current analysis, we examine news coverage data widely used in media studies. News coverage data are usually time series that capture the volume or the tone of the news media's coverage of a topic. We first describe the distributional properties of autoregressive conditionally heteroskedastic (ARCH) effects and compare two major American newspapers' coverage of U.S.-North Korea relations. Subsequently, we propose a change point detection model and apply it to the detection of major change points in the tone of American newspaper coverage of U.S.-North Korea relations.

Trend Properties and a Ranking Method for Automatic Trend Analysis (자동 트렌드 탐지를 위한 속성의 정의 및 트렌드 순위 결정 방법)

  • Oh, Heung-Seon;Choi, Yoon-Jung;Shin, Wook-Hyun;Jeong, Yoon-Jae;Myaeng, Sung-Hyon
    • Journal of KIISE:Software and Applications, v.36 no.3, pp.236-243, 2009
  • With advances in topic detection and tracking (TDT), automatic trend analysis from a collection of time-stamped documents, such as patents, newspapers, and blog pages, has become a challenging research problem. Past research in this area has mainly focused on showing a trend line over time for a given concept by measuring the strength of trend-associated term frequency information. For the detection of emerging trends, either a simple criterion such as frequency change was used, or an overall comparison was made against training data. We note that in order to show the most salient trends detected among many possibilities, it is critical to devise a ranking function. To this end, we define four properties (change, persistency, stability, and volume) of trend lines drawn from frequency information to quantify various aspects of trends, and propose a method by which trend lines can be ranked. The properties are examined individually and in combination in a series of experiments to test their validity using the ranking algorithm. The results show that a judicious combination of the four properties is a better indicator of salient trends than any single criterion used in the past for ranking or detecting emerging trends.

Research on Brand Value Dimensions of Employers: Based on Online Reviews by the Employees

  • XU, Meng
    • The Journal of Asian Finance, Economics and Business, v.9 no.10, pp.215-225, 2022
  • This study investigates employees' online reviews, conducts in-depth text topic mining, summarizes the dimensions of employer brand value, and seeks effective ways to build employer brands from a multi-dimensional perspective. The study takes samples of employer reviews, filters keywords according to term frequency-inverse document frequency (TF-IDF), builds a network of reviews that contain the same keywords, explores its communities, and summarizes the theme dimensions. It also makes a dynamic comparison and analysis of the employer brand value dimensions of different industries and enterprises. The study shows that the themes found through community exploration can be summarized into 11 dimensions of employer brand value, and that these dimensions differ significantly across industries and among enterprises within the same industry. Attention to the employer brand value dimensions changes significantly over time. Various industries pay increasing attention to the dimensions of work intensity and career development, while employers pay steady attention to the dimension of welfare benefits. The findings suggest that seeking heterogeneity of employer brand resources in these multi-dimensional differences and changes is an effective way to improve the competitiveness of enterprises in the human capital market.
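The keyword-filtering step mentioned in the abstract above can be sketched as follows. This is only an illustration of a common TF-IDF weighting (relative term frequency times the natural log of the inverse document frequency), not the study's actual pipeline; the function name, whitespace tokenization, and alphabetical tie-breaking are all assumptions.

```python
import math
from collections import Counter


def top_tfidf_keywords(documents, top_n=3):
    """Return the top_n highest-scoring TF-IDF terms for each document.

    Assumes whitespace tokenization; ties are broken alphabetically.
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    results = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in term_freq.items()
        }
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        results.append(ranked[:top_n])
    return results
```

Terms that occur in every review receive a score of zero under this weighting, which is why such a filter tends to keep words that distinguish one review from the rest.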

Graphene Coated Optical Fiber SPR Biosensor

  • Kim, Jang Ah;Hwang, Taehyun;Dugasani, Sreekantha Reddy;Kulkarni, Atul;Park, Sung Ha;Kim, Taesung
    • Proceedings of the Korean Vacuum Society Conference, 2014.02a, pp.401-401, 2014
  • In this study, graphene, the most attractive material today, has been applied to a wavelength-modulated surface plasmon resonance (SPR) sensor. Optical fiber sensor technology is a fascinating topic because of its several benefits. In addition, the SPR phenomenon enables the detection of biomaterials to be label-free, highly sensitive, and accurate. Therefore, the optical fiber SPR sensor has powerful advantages for detecting biomaterials. Meanwhile, graphene shows superior mechanical, electrical, and optical characteristics, so it has tremendous potential in many applications. In particular, graphene has tighter plasmon confinement and relatively long propagation distances, so it can enhance light-matter interactions (F. H. L. Koppens, et al., Nano Lett., 2011). Accordingly, we coated graphene on the optical fiber probe that we fabricated to compose the wavelength-modulated SPR sensor (Figure 1). The graphene film was synthesized via a thermal chemical vapor deposition (CVD) process. The synthesized graphene was transferred onto the core-exposed region of the optical fiber by the lift-off method. The detected analytes were a biotinylated double cross-over DNA structure (DXB) and streptavidin (SA) as the ligand-receptor binding model. The preliminary results showed SPR signal shifts for the DXB and SA binding rather than the concentration change.


Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)

  • Bae, Jung-Hwan;Son, Ji-Eun;Song, Min
    • Journal of Intelligence and Information Systems, v.19 no.3, pp.141-156, 2013
  • Social media is a representative form of Web 2.0 that shapes changes in users' information behavior by allowing users to produce their own content without any expert skills. In particular, as a new communication medium, it has a profound impact on social change by enabling users to communicate their opinions and thoughts to the masses and to acquaintances. Social media data plays a significant role in the emerging Big Data arena. A variety of research areas, such as social network analysis and opinion mining, have therefore paid attention to discovering meaningful information from the vast amounts of data buried in social media. Social media has recently become a main focus of the fields of Information Retrieval and Text Mining because it not only produces massive unstructured textual data in real time but also serves as an influential channel for opinion leading. However, most previous studies have adopted broad-brush and limited approaches, which have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system that captures trends by processing big stream datasets of Twitter in real time. The system offers the functions of term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to keep track of changes in topical trends, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets containing candidates' names and the word 'election' from Twitter in Korea (http://www.twitter.com/) for one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects societal trends effectively. The system also retrieves the list of terms that co-occur with given query terms.
We compare the results of term co-occurrence retrieval using the names of influential candidates, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. General terms related to the presidential election, such as 'Presidential Election', 'Proclamation in Support', and 'Public opinion poll', appear frequently. The results also show specific terms that differentiate each candidate, such as 'Park Jung Hee' and 'Yuk Young Su' for the query 'Geun Hae Park', 'a single candidacy agreement' and 'Time of voting extension' for the query 'Jae In Moon', and 'a single candidacy agreement' and 'down contract' for the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows the topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic shows one of two types of patterns, a rising tendency or a falling tendency, depending on the change in its probability distribution. To determine the relationship between topic trends on Twitter and social issues in the real world, we compare topic trends with related news articles. We find that Twitter can track an issue faster than other media such as newspapers. The user network on Twitter differs from those of other social media because of the distinctive way relationships are formed on Twitter: users can form relationships by exchanging mentions. We visualize and analyze mention-based networks of 136,754 users, using the three candidates' names, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. The results show that Twitter users mention all candidates' names regardless of their political tendencies. This case study shows that Twitter could be an effective tool for detecting and predicting dynamic changes in social issues, and that mention-based user networks, which are unique to Twitter, could reveal different aspects of user behavior.
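Term co-occurrence retrieval, one of the system functions listed in the abstract above, can be sketched in a few lines. This is an assumption-laden illustration rather than the system's implementation: the function name is hypothetical, tokens are split on whitespace (Korean tweets would in practice need a morphological analyzer), and co-occurrence is counted once per tweet.

```python
from collections import Counter


def cooccurring_terms(tweets, query, top_n=5):
    """Return the top_n terms that appear in the same tweets as `query`.

    Each tweet contributes at most one count per term.
    """
    query = query.lower()
    counts = Counter()
    for tweet in tweets:
        tokens = set(tweet.lower().split())
        if query in tokens:
            counts.update(tokens - {query})
    return counts.most_common(top_n)
```

Running the function once per candidate name and comparing the returned lists mirrors, in a simplified way, the comparison of query results described in the abstract.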

Epidemiology Characteristics and Trends of Incidence and Morphology of Stomach Cancer in Iran

  • Almasi, Zeinab;Rafiemanesh, Hosein;Salehiniya, Hamid
    • Asian Pacific Journal of Cancer Prevention, v.16 no.7, pp.2757-2761, 2015
  • Background: Stomach cancer is the fourth most common cancer and the second leading cause of cancer-related death worldwide. It is predicted that the number of new cancer cases will exceed 15 million by 2020. Given the lack of studies on this topic in the country, we thoroughly examined the patho-epidemiology of stomach cancer in Iran. Materials and Methods: In this cross-sectional study, data were collected by retrospectively reviewing all new stomach cancer patients in the Cancer Registry Center report of the health deputy for Iran during a 6-year period (2003-2008). The study also examined the morphology of common stomach cancers. Trends in incidence and morphology were analyzed by joinpoint regression. Results: During the six-year period, a total of 35,171 cases of stomach cancer were registered. The average age-standardized rates for females and males were 7.1 and 15.1 per 100,000 persons, respectively. The most common histological type was adenocarcinoma, NOS, with 21,980 cases (62.50%). The annual percentage change (APC) in age-standardized incidence rate (per 100,000) showed an increase in both females and males, at 11.1 (CI: 4.3 to 18.3) and 9.2 (CI: 5.2 to 13.4), respectively. Conclusions: According to our results, the incidence of gastric cancer is increasing in Iran, so further epidemiological studies into etiology and early detection are essential.
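The annual percentage change reported in the abstract above is conventionally derived from a log-linear fit: if ln(rate) = a + b·year, then APC = 100·(exp(b) − 1). The sketch below fits a single segment by ordinary least squares; it is a simplification of joinpoint regression, which also searches for points where the slope changes, and the function name is hypothetical.

```python
import math


def annual_percent_change(years, rates):
    """Estimate APC from a least-squares fit of ln(rate) against year.

    Single-segment simplification: no joinpoints are searched for.
    """
    n = len(years)
    log_rates = [math.log(rate) for rate in rates]
    mean_x = sum(years) / n
    mean_y = sum(log_rates) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, log_rates))
    slope = sxy / sxx
    return 100.0 * (math.exp(slope) - 1.0)
```

A series that grows by exactly 10% each year yields an APC of about 10 under this formula; the study's own rates are not reproduced here.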

Changes in public recognition of parabens on twitter and the research status of parabens related to toothpaste (트위터(twitter)에서의 파라벤(parabens) 관련 대중의 인식 변화와 치약내 파라벤에 대한 연구 현황)

  • Oh, Hyo-Jung;Jeon, Jae-Gyu
    • Journal of Korean Academy of Oral Health, v.41 no.2, pp.154-161, 2017
  • Objectives: The purpose of this study was to investigate changes in public recognition of parabens on Twitter and the research status of parabens related to toothpaste. Methods: Tweet information between 2010 and October 2016 was collected by an automatic web crawler and examined according to tweet frequency, key words (2012-October 2016), and issue tweet detection analyses to reveal changes in public recognition of parabens on Twitter. To investigate the research status of parabens related to toothpaste, queries such as "paraben," "paraben and toxicity," "paraben and (toothpastes or dentifrices)," and "paraben and (toothpastes or dentifrices) and toxicity" were used. Results: The number of tweets concerning parabens sharply increased when parabens in toothpaste emerged as a social issue (October 2014), and decreased from 2015 onward. However, toothpaste and its related terms were continuously included in the core key words extracted from tweets from 2015. They were not included in key words before 2014, indicating that the emergence of parabens in toothpaste as a social issue plays an important role in public recognition of parabens in toothpaste. The issue tweet analysis also confirmed the change in public recognition of parabens in toothpaste. Despite the expansion of public recognition of parabens in toothpaste, there are only seven research articles on the topic in PubMed. Conclusions: The general public clearly recognized parabens in toothpaste after emergence of parabens in toothpaste as a social issue. Nevertheless, the scientific information on parabens in toothpaste is very limited, suggesting that the efforts of dental scientists are required to expand scientific knowledge related to parabens in oral hygiene measures.

Extracting Beginning Boundaries for Efficient Management of Movie Storytelling Contents (스토리텔링 콘텐츠의 효과적인 관리를 위한 영화 스토리 발단부의 자동 경계 추출)

  • Park, Seung-Bo;You, Eun-Soon;Jung, Jason J.
    • Journal of Intelligence and Information Systems, v.17 no.4, pp.279-292, 2011
  • Movies are a representative medium for conveying stories to audiences. A story is essentially told through the characters in the movie. Unlike simple videos, movies use narrative structures to explain various conflicts or collaborations between characters. These narrative structures consist of three main acts: beginning, middle, and ending. The beginning act includes 1) the introduction of the main characters and backgrounds, and 2) implications of conflict and clues to incidents. The middle act describes events driven by both internal and external factors, and the dramatic tension of the story heightens. Finally, in the ending act, the events that have developed are resolved, and the topic of the story and the writer's message are conveyed. When story information is extracted from a movie, its weight should be considered to differ by narrative structure. Namely, extracted information influences story development differently depending on whether it is located in the beginning, middle, or ending act. The beginning act presents the audience with the information needed to set up the story, such as character settings and depictions of backgrounds. Thus, it is necessary to extract many kinds of information from the beginning act in order to summarize a movie or retrieve character information. This paper therefore proposes a novel method for extracting the beginning boundary. The method detects the boundary scene between the beginning act and the middle act using the accumulation graph of characters. The beginning act consists of the scenes that introduce important characters, imply the conflict relationships between them, and suggest clues for resolving troubles. First, the scene after which no new important characters appear should be detected in order to extract the scene that completes their introduction.
Important characters are the major and minor characters, which are treated as important because they lead the story's progression. Extras should be excluded from the accumulation graph of characters in order to extract the scene that completes the introduction of important characters. Extras are characters that appear in only a few scenes. Second, the inflection point is detected in the accumulation graph of characters. It is the point at which the increasing line changes to a horizontal line. Namely, when the slope of the line stays at zero over a long run of scenes, the starting point of this zero-slope line becomes the inflection point. The inflection point is detected in the accumulation graph of characters with extras excluded. Third, several additional scenes are counted as further story progression, such as conflict implication and clue suggestion. In practice, the movie reaches the scene between the beginning act and the middle act after several additional scenes have elapsed following the introduction of important characters. To detect this scene, we determine the ratio of additional scenes to total scenes by experiment; the ratio obtained is 7.67%. Adding this ratio to the inflection point of the graph gives the story inflection point at which the beginning act changes to the middle act. Our proposed method consists of these three steps. We selected 10 movies of various genres for experiment and evaluation. By measuring the accuracy of the boundary detection experiment, we have shown that the proposed method is more efficient.
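The three steps described in the abstract above can be sketched roughly as follows. The 7.67% ratio comes from the abstract; everything else is an assumption made for illustration: the function name, the `min_scenes` cutoff that separates extras from important characters, and the `flat_run` length that counts as a "long" run of zero slope.

```python
def beginning_boundary(scenes, min_scenes=3, flat_run=5, extra_ratio=0.0767):
    """Estimate the index of the scene that ends the beginning act.

    `scenes` is a list of character-name lists, one per scene.
    `min_scenes` and `flat_run` are hypothetical parameters.
    """
    # Step 1: drop extras, i.e. characters that appear in few scenes.
    appearances = {}
    for characters in scenes:
        for name in set(characters):
            appearances[name] = appearances.get(name, 0) + 1
    important = {name for name, n in appearances.items() if n >= min_scenes}

    # Accumulation graph: distinct important characters seen so far.
    seen = set()
    cumulative = []
    for characters in scenes:
        seen |= set(characters) & important
        cumulative.append(len(seen))

    # Step 2: inflection point = start of the first run in which the
    # graph stays flat for `flat_run` consecutive scenes.
    inflection = None
    for i in range(len(cumulative) - flat_run):
        if cumulative[i] == cumulative[i + flat_run]:
            inflection = i
            break
    if inflection is None:
        return None

    # Step 3: add the experimentally obtained share of additional scenes.
    boundary = inflection + round(extra_ratio * len(scenes))
    return min(boundary, len(scenes) - 1)
```

With 20 scenes, the added offset is `round(0.0767 * 20)`, i.e. 2 scenes past the inflection point. A movie in which new important characters keep appearing until near the end has no sufficiently long flat run, and the sketch returns `None`.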