• Title/Summary/Keyword: Text data

Search Result 2,953, Processing Time 0.033 seconds

An empirical evaluation of electronic annotation tools for Twitter data

  • Weissenbacher, Davy;O'Connor, Karen;Hiraki, Aiko T.;Kim, Jin-Dong;Gonzalez-Hernandez, Graciela
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.24.1-24.7
    • /
    • 2020
  • Despite a growing number of natural language processing shared-tasks dedicated to the use of Twitter data, there is currently no ad-hoc annotation tool for the purpose. During the 6th edition of Biomedical Linked Annotation Hackathon (BLAH), after a short review of 19 generic annotation tools, we adapted GATE and TextAE for annotating Twitter timelines. Although none of the tools reviewed allow the annotation of all information inherent of Twitter timelines, a few may be suitable provided the willingness by annotators to compromise on some functionality.

WTO, an ontology for wheat traits and phenotypes in scientific publications

  • Nedellec, Claire;Ibanescu, Liliana;Bossy, Robert;Sourdille, Pierre
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.14.1-14.11
    • /
    • 2020
  • Phenotyping is a major issue for wheat agriculture to meet the challenges of adaptation of wheat varieties to climate change and chemical input reduction in crop. The need to improve the reuse of observations and experimental data has led to the creation of reference ontologies to standardize descriptions of phenotypes and to facilitate their comparison. The scientific literature is largely under-exploited, although extremely rich in phenotype descriptions associated with cultivars and genetic information. In this paper we propose the Wheat Trait Ontology (WTO) that is suitable for the extraction and management of scientific information from scientific papers, and its combination with data from genomic and experimental databases. We describe the principles of WTO construction and show examples of WTO use for the extraction and management of phenotype descriptions obtained from scientific documents.

A Study on the Future Direction of the Digital Signage Industry in Korea: A Big Data Network Analysis from 2008 to 2019

  • Yoo, Seung-Chul;Piscarac, Diana
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.1
    • /
    • pp.120-127
    • /
    • 2020
  • The use of digital signage in the public and commercial communication areas has been increasing in recent years. By integrating cutting-edge information technologies such as 5G, artificial intelligence, and the Internet of Things, digital signage continues to break apart from traditional outdoor advertising media. This study identified the problems facing the domestic digital signage industry by exploring and analyzing major issues related to digital signage and derived future development measures. Specifically, online documents were collected based on the digital signage-related keywords created over the past 12 years to conduct big data network analysis, and key topics were derived through visualization of the results. This study has great policy implications in that it excluded biased interpretations based on the viewpoints of companies or the government and, more objectively, suggested the direction of the digital signage industry's development in the domestic media market.

Design and Implementation of an Integrated Multimedia Editor for Effective Link Creation (효율적인 링크 형성을 지원하는 멀티미디어 통합편집기의 설계 및 구현)

  • 김정현;고영곤;최윤철
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.3
    • /
    • pp.28-37
    • /
    • 1996
  • To reduce an authors burden in hypermedia system that allows non-sequential information the process of creating links must be easy. However, most of the conventional hypermedia systems possess two difficulties. First, the author must go through several troublesome process to create a single link. Secondly, it is not easy to create an anchor in text or other multimedia data. Therefore, in order to support effective construction of hypermedia system the editing environment must provide an easy method to create links. In this paper, to resolve the weaknesses of conventional hypermedia system as mentioned above, an editing tool is developed and implemented to easily create the links of multimedia data. There are three methods in creating links and a user can select a convenient method in given circumstances. And for teh efficient production of nodes composed of multimedia information, we provide an authoring environment to integrate and process those informations.

  • PDF

Development of an e-Learning Environment for Blended Learning

  • Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.345-353
    • /
    • 2006
  • Over the past few years, training professionals have become more pragmatic in their approach to technology-based media by using it to augment traditional forms of training delivery, such as classroom instruction and text-based materials. This trend has led to the rise of the term blended learning. Blended learning, an environment of e-learning, is a powerful learning solution created through a mixture of face-to-face and online learning delivered through a mix of media and superior learning experiences. In this article we design and implement an e-learning environment for blended learning. The environment focused on following factors: learning activity and participation of learners, and real time feedback of instructor.

  • PDF

ANALYTIC SMOOTHING EFFECT AND SINGLE POINT SINGULARITY FOR THE NONLINEAR SCHRODINGER EQUATIONS

  • Kato, Keiichi;Ogawa, Takayoshi
    • Journal of the Korean Mathematical Society
    • /
    • v.37 no.6
    • /
    • pp.1071-1084
    • /
    • 2000
  • We show that a weak solution of the Cauchy problem for he nonlinear Schrodinger equation, {i∂(sub)t u + ∂$^2$(sub)x u = f(u,u), t∈(-T,T), x∈R, u(0,x) = ø(x).} in the negative solbolev space H(sup)s has a smoothing effect up to real analyticity if the initial data only have a single point singularity such as the Dirac delta measure. It is shown that for H(sup)s (R)(s>-3/4) data satisfying the condition (※Equations, See Full-text) the solution is analytic in both space and time variable. The argument is based on the recent progress on the well-posedness result by Bourgain [2] and Kenig-Ponce-Vega [18] and previous work by Kato-Ogawa [12]. We give an improved new argument in the regularity argument.

  • PDF

Implementation of Korean TTS System based on Natural Language Processing (자연어 처리 기반 한국어 TTS 시스템 구현)

  • Kim Byeongchang;Lee Gary Geunbae
    • MALSORI
    • /
    • no.46
    • /
    • pp.51-64
    • /
    • 2003
  • In order to produce high quality synthesized speech, it is very important to get an accurate grapheme-to-phoneme conversion and prosody model from texts using natural language processing. Robust preprocessing for non-Korean characters should also be required. In this paper, we analyzed Korean texts using a morphological analyzer, part-of-speech tagger and syntactic chunker. We present a new grapheme-to-phoneme conversion method for Korean using a hybrid method with a phonetic pattern dictionary and CCV (consonant vowel) LTS (letter to sound) rules, for unlimited vocabulary Korean TTS. We constructed a prosody model using a probabilistic method and decision tree-based method. The probabilistic method atone usually suffers from performance degradation due to inherent data sparseness problems. So we adopted tree-based error correction to overcome these training data limitations.

  • PDF

User-Created Content Recommendation Using Tag Information and Content Metadata

  • Rhie, Byung-Woon;Kim, Jong-Woo;Lee, Hong-Joo
    • Management Science and Financial Engineering
    • /
    • v.16 no.2
    • /
    • pp.29-38
    • /
    • 2010
  • As the Internet is more embedded in people's lives, Internet users draw on new Internet applications to express themselves through "user-created content (UCC)." In addition, there is a noticeable shift from text-centered contents mainly posted on bulletin boards to multimedia contents such as images and videos on UCC web sites. The changes require different way of recommendations comparing to traditional products or contents recommendation on the Internet. This paper aims to design UCC recommendation methods with user behavior data and contents metadata such as tags and titles, and compare performances of the suggested methods. Real web logs data of a major Korean video UCC site was used to empirical experiments. The results of the experiments show that collaborative filtering technique based on similarity of UCC customers' preferences performs better than other content-based recommendation methods based on tag information and content metadata.

Analysis of Job Data on Media (미디어에 나타난 직업 관련 데이터의 분석)

  • Ban, ChaeHoon;Jung, YoonSeung;Jeong, DongMin
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.152-155
    • /
    • 2018
  • 과거와는 비교 할 수 없을 만큼 방대한 양의 데이터가 생산되는 정보화 시대에서 과거와 현재의 데이터를 비교 분석하는 것이 매우 중요하다. 이러한 데이터를 분석하는 도구인 R은 통계 기반의 정보 분석을 가능하게 하는 언어와 환경이다. 본 논문에서는 R을 이용하여 미디어에 나타난 직업 관련 빅데이터를 분석한다. 다양한 미디어에서 직업 관련 데이터를 수집하고 어떠한 텍스트가 분포되어 있는지 빈도 조사를 수행한다.

  • PDF

Hand Gesture based Manipulation of Meeting Data in Teleconference (핸드제스처를 이용한 원격미팅 자료 인터페이스)

  • Song, Je-Hoon;Choi, Ki-Ho;Kim, Jong-Won;Lee, Yong-Gu
    • Korean Journal of Computational Design and Engineering
    • /
    • v.12 no.2
    • /
    • pp.126-136
    • /
    • 2007
  • Teleconferences have been used in business sectors to reduce traveling costs. Traditionally, specialized telephones that enabled multiparty conversations were used. With the introduction of high speed networks, we now have high definition videos that add more realism in the presence of counterparts who could be thousands of miles away. This paper presents a new technology that adds even more realism by telecommunicating with hand gestures. This technology is part of a teleconference system named SMS (Smart Meeting Space). In SMS, a person can use hand gestures to manipulate meeting data that could be in the form of text, audio, video or 3D shapes. Fer detecting hand gestures, a machine learning algorithm called SVM (Support Vector Machine) has been used. For the prototype system, a 3D interaction environment has been implemented with $OpenGL^{TM}$, where a 3D human skull model can be grasped and moved in 6-DOF during a remote conversation between distant persons.