• Title/Summary/Keyword: Text data

Search Result 2,953, Processing Time 0.033 seconds

Jointly Image Topic and Emotion Detection using Multi-Modal Hierarchical Latent Dirichlet Allocation

  • Ding, Wanying;Zhu, Junhuan;Guo, Lifan;Hu, Xiaohua;Luo, Jiebo;Wang, Haohong
    • Journal of Multimedia Information System
    • /
    • v.1 no.1
    • /
    • pp.55-67
    • /
    • 2014
  • Image topic and emotion analysis is an important component of online image retrieval, which nowadays has become very popular in the widely growing social media community. However, due to the gaps between images and texts, there is very limited work in literature to detect one image's Topics and Emotions in a unified framework, although topics and emotions are two levels of semantics that often work together to comprehensively describe one image. In this work, a unified model, Joint Topic/Emotion Multi-Modal Hierarchical Latent Dirichlet Allocation (JTE-MMHLDA) model, which extends previous LDA, mmLDA, and JST model to capture topic and emotion information at the same time from heterogeneous data, is proposed. Specifically, a two level graphical structured model is built to realize sharing topics and emotions among the whole document collection. The experimental results on a Flickr dataset indicate that the proposed model efficiently discovers images' topics and emotions, and significantly outperform the text-only system by 4.4%, vision-only system by 18.1% in topic detection, and outperforms the text-only system by 7.1%, vision-only system by 39.7% in emotion detection.

  • PDF

The Development of Text-oriented Mindmap Application Based-on Android (안드로이드 기반의 텍스트 중심 마인드맵 애플리케이션 개발)

  • Kim, Tae-Hun;Kim, Jong-Hoon
    • 한국정보교육학회:학술대회논문집
    • /
    • 2011.01a
    • /
    • pp.247-252
    • /
    • 2011
  • Smartphones which are a recent worldwide trend, have the advantage of minimizing the constraints of time and space as using a variety of resources and capabilities. Mindmap which is for organizing ideas more systematically, helps develop the creativity and increase thinking power. In this study, we developed on application that can create text-based mind mapping based on the Android smart phones. Anywhere and anytime, using a mobile phone capable of taking advantage of creating a mind map will be a big help with your organizing, keeping and managing creative thinking. In my proposal, we need to develop another application which can use a variety ofmultimedia data and share with others' mindmaps.

  • PDF

The Impacts of Media Symbol Variety on Performance in Virtual Teams

  • Shim, Sang-Min;Suh, Kil-Soo;Im, Kun-Shin
    • Journal of Information Technology Applications and Management
    • /
    • v.17 no.3
    • /
    • pp.83-97
    • /
    • 2010
  • The purpose of this study is to examine the impacts of media symbol variety on group performance in virtual teams. Symbol variety is defined as the number of ways in which information can be communicated and includes Daft and Lengel [1986]'s multiplicity of cues and language variety. According to media richness theory and media synchronicity theory, the use of media with high symbol variety is assumed to facilitate and promote communications among virtual team members. Therefore, it is expected that the media symbol variety is positively associated with group performance in virtual teams. Furthermore, online relationship building is expected to mediate the impacts of symbol variety on the performance. To confirm the suppositions, a controlled lab experiment was conducted with 60 undergraduate students as subjects. In the experimental virtual teams, subjects were allowed to communicate with other members using text-based messenger with emoticons. Subjects in the control virtual teams were allowed to communicate using only text-based messenger. The direct impact of symbol variety on group performance in virtual teams was found insignificant. However, the online relationship was found to completely mediate the positive impact of symbol variety on group performance. The implications and limitations of this study are also discussed for future research.

  • PDF

Predicting User Profile based on user behaviors (모바일 사용자 행태 기반 프로파일 예측)

  • Sim, Myo-Seop;Lim, Heui-Seok
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.7
    • /
    • pp.1-7
    • /
    • 2020
  • As the performance of mobile devices has dramatically improved, users can perform many tasks in a mobile environment. This means that the use of behavior information stored in the mobile device can tell a lot of users. For example, a user's text message and frequently used application information (behavioral information) can be utilized to create useful information, such as whether the user is interested in parenting(profile prediction). In this study, I investigate the behavior information of the user that can be collected in the mobile device and propose the item that can profile the user. And I also suggest ideas about how to utilize profiling information.

Fast, Flexible Text Search Using Genomic Short-Read Mapping Model

  • Kim, Sung-Hwan;Cho, Hwan-Gue
    • ETRI Journal
    • /
    • v.38 no.3
    • /
    • pp.518-528
    • /
    • 2016
  • The searching of an extensive document database for documents that are locally similar to a given query document, and the subsequent detection of similar regions between such documents, is considered as an essential task in the fields of information retrieval and data management. In this paper, we present a framework for such a task. The proposed framework employs the method of short-read mapping, which is used in bioinformatics to reveal similarities between genomic sequences. In this paper, documents are considered biological objects; consequently, edit operations between locally similar documents are viewed as an evolutionary process. Accordingly, we are able to apply the method of evolution tracing in the detection of similar regions between documents. In addition, we propose heuristic methods to address issues associated with the different stages of the proposed framework, for example, a frequency-based fragment ordering method and a locality-aware interval aggregation method. Extensive experiments covering various scenarios related to the search of an extensive document database for documents that are locally similar to a given query document are considered, and the results indicate that the proposed framework outperforms existing methods.

A Consistent Quality Bit Rate Control for the Line-Based Compression

  • Ham, Jung-Sik;Kim, Ho-Young;Lee, Seong-Won
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.5 no.5
    • /
    • pp.310-318
    • /
    • 2016
  • Emerging technologies such as the Internet of Things (IoT) and the Advanced Driver Assistant System (ADAS) often have image transmission functions with tough constraints, like low power and/or low delay, which require that they adopt line-based, low memory compression methods instead of existing frame-based image compression standards. Bit rate control in the conventional frame-based compression systems requires a lot of hardware resources when the scope of handled data falls at the frame level. On the other hand, attempts to reduce the heavy hardware resource requirement by focusing on line-level processing yield uneven image quality through the frame. In this paper, we propose a bit rate control that maintains consistency in image quality through the frame and improves the legibility of text regions. To find the line characteristics, the proposed bit rate control tests each line for ease of compression and the existence of text. Experiments on the proposed bit rate control show peak signal-to-noise ratios (PSNRs) similar to those of conventional bit rate controls, but with the use of significantly fewer hardware resources.

An Improvement Of Efficiency For kNN By Using A Heuristic (휴리스틱을 이용한 kNN의 효율성 개선)

  • Lee, Jae-Moon
    • The KIPS Transactions:PartB
    • /
    • v.10B no.6
    • /
    • pp.719-724
    • /
    • 2003
  • This paper proposed a heuristic to enhance the speed of kNN without loss of its accuracy. The proposed heuristic minimizes the computation of the similarity between two documents which is the dominant factor in kNN. To do this, the paper proposes a method to calculate the upper limit of the similarity and to sort the training documents. The proposed heuristic was implemented on the existing framework of the text categorization, so called, AI :: Categorizer and it was compared with the conventional kNN with the well-known data, Router-21578. The comparisons show that the proposed heuristic outperforms kNN about 30∼40% with respect to the execution time.

Authorship Attribution in Korean Using Frequency Profiles (빈도 정보를 이용한 한국어 저자 판별)

  • Han, Na-Rae
    • Korean Journal of Cognitive Science
    • /
    • v.20 no.2
    • /
    • pp.225-241
    • /
    • 2009
  • This paper presents an authorship attribution study in Korean conducted on a corpus of newspaper column texts. Based on the data set consisting of a total of 160 columns written by four columnists of Chosun Daily, the approach utilizes relative frequencies of various lexical units in Korean such as fully inflected words, morphemes, syllables and their bigrams in an attempt to establish authorship of a blind text selected from the set. Among these various lexical units, "the morpheme" is found to be most effective in predicting who among the four potential candidates authored a text, reporting accuracies of over 93%. The results indicate that quantitative and statistical techniques in authorship attribution and computational stylistics can be successfully applied to Korean texts.

  • PDF

Topic Extraction and Classification Method Based on Comment Sets

  • Tan, Xiaodong
    • Journal of Information Processing Systems
    • /
    • v.16 no.2
    • /
    • pp.329-342
    • /
    • 2020
  • In recent years, emotional text classification is one of the essential research contents in the field of natural language processing. It has been widely used in the sentiment analysis of commodities like hotels, and other commentary corpus. This paper proposes an improved W-LDA (weighted latent Dirichlet allocation) topic model to improve the shortcomings of traditional LDA topic models. In the process of the topic of word sampling and its word distribution expectation calculation of the Gibbs of the W-LDA topic model. An average weighted value is adopted to avoid topic-related words from being submerged by high-frequency words, to improve the distinction of the topic. It further integrates the highest classification of the algorithm of support vector machine based on the extracted high-quality document-topic distribution and topic-word vectors. Finally, an efficient integration method is constructed for the analysis and extraction of emotional words, topic distribution calculations, and sentiment classification. Through tests on real teaching evaluation data and test set of public comment set, the results show that the method proposed in the paper has distinct advantages compared with other two typical algorithms in terms of subject differentiation, classification precision, and F1-measure.

Usefulness of RDF/OWL Format in Pediatric and Oncologic Nuclear Medicine Imaging Reports (소아 및 종양 핵의학 영상판독에서 RDF/OWL 데이터의 유용성)

  • Hwang, Kyung Hoon;Lee, Haejun;Koh, Geon;Choi, Duckjoo;Sun, Yong Han
    • Journal of Biomedical Engineering Research
    • /
    • v.36 no.4
    • /
    • pp.128-134
    • /
    • 2015
  • Recently, the structured data format in RDF/OWL has played an increasingly vital role in the semantic web. We converted pediatric and oncologic nuclear medicine imaging reports in free text into RDF/OWL format and evaluated the usefulness of nuclear medicine imaging reports in RDF/OWL by comparing SPARQL query results with the manually retrieved results by physicians from the reports in free text. SPARQL query showed 95% recall for simple queries and 91% recall for dedicated queries. In total, SPARQL query retrieved 93% (51 lesions of 55) recall and 100% precision for 20 clinical query items. All query results missed by SPARQL query were of some inference. Nuclear medicine imaging reports in the format of RDF/OWL were very useful for retrieving simple and dedicated query results using SPARQL query. Further study using more number of cases and knowledge for inference is warranted.