• 제목/요약/키워드: textual information

검색결과 240건 처리시간 0.019초

빅데이터를 활용한 정책분석의 방법론적 함의 : 기회형 창업 관련 소셜 빅데이터 분석 사례를 중심으로 (Methodological Implications of Employing Social Bigdata Analysis for Policy-Making : A Case of Social Media Buzz on the Startup Business)

  • 이영주;김도훈
    • 한국IT서비스학회지
    • /
    • 제15권1호
    • /
    • pp.97-111
    • /
    • 2016
  • In the creative economy paradigm, motivation of the opportunity based startup is a continuous concern to policy-makers. Recently, bigdata anlalytics challenge traditional methods by providing efficient ways to identify social trend and hidden issues in the public sector. In this study the authors introduce a case study using social bigdata analytics for conducting policy analysis. A semantic network analysis was employed using textual data from social media including online news, blog, and private bulletin board which create buzz on the startup business. Results indicates that each media has been forming different discourses regarding government's policy on the startup business. Furthermore, semantic network structures from private bulletin board reveal unexpected social burden that hiders opening a startup, which has not been found in the traditional survey nor experts interview. Based on these results, the authors found the feasibility of using social bigdata analysis for policy-making. Methodological and practical implications are discussed.

Consideration of Image Quality of Dithered Picture by Constrained Average Method Using Various Probability Distribution Models

  • Sato, Mitsuhiro;Hasegawa, Madoka;Kato, Shigeo
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -3
    • /
    • pp.1495-1498
    • /
    • 2002
  • The constrained average method is one of dither methods which combines edge emphasis and grayscale rendition to provide legibility of textual region and proper quality of continuous tone region. How-ever, image quality of continuous tone region is insufficient compared to other dither methods, such as ordered dither methods or the error diffusion method. The constrained average method uses a uniform distribution function to decide number of lit pixels related to the average intensity in a picture area. However, actual distribution of continuous tone region is closer to the Laplacian distribution or triangle distribution. In this paper, we introduce various probability distributions and the actual luminance distribution to decide the threshold value of the constrained average method in order to improve image quality of dithered image.

  • PDF

Towards cross-platform interoperability for machine-assisted text annotation

  • de Castilho, Richard Eckart;Ide, Nancy;Kim, Jin-Dong;Klie, Jan-Christoph;Suderman, Keith
    • Genomics & Informatics
    • /
    • 제17권2호
    • /
    • pp.19.1-19.10
    • /
    • 2019
  • In this paper, we investigate cross-platform interoperability for natural language processing (NLP) and, in particular, annotation of textual resources, with an eye toward identifying the design elements of annotation models and processes that are particularly problematic for, or amenable to, enabling seamless communication across different platforms. The study is conducted in the context of a specific annotation methodology, namely machine-assisted interactive annotation (also known as human-in-the-loop annotation). This methodology requires the ability to freely combine resources from different document repositories, access a wide array of NLP tools that automatically annotate corpora for various linguistic phenomena, and use a sophisticated annotation editor that enables interactive manual annotation coupled with on-the-fly machine learning. We consider three independently developed platforms, each of which utilizes a different model for representing annotations over text, and each of which performs a different role in the process.

Ontology Matching Method Based on Word Embedding and Structural Similarity

  • Hongzhou Duan;Yuxiang Sun;Yongju Lee
    • International journal of advanced smart convergence
    • /
    • 제12권3호
    • /
    • pp.75-88
    • /
    • 2023
  • In a specific domain, experts have different understanding of domain knowledge or different purpose of constructing ontology. These will lead to multiple different ontologies in the domain. This phenomenon is called the ontology heterogeneity. For research fields that require cross-ontology operations such as knowledge fusion and knowledge reasoning, the ontology heterogeneity has caused certain difficulties for research. In this paper, we propose a novel ontology matching model that combines word embedding and a concatenated continuous bag-of-words model. Our goal is to improve word vectors and distinguish the semantic similarity and descriptive associations. Moreover, we make the most of textual and structural information from the ontology and external resources. We represent the ontology as a graph and use the SimRank algorithm to calculate the structural similarity. Our approach employs a similarity queue to achieve one-to-many matching results which provide a wider range of insights for subsequent mining and analysis. This enhances and refines the methodology used in ontology matching.

Q&A Chatbot in Arabic Language about Prophet's Biography

  • Somaya Yassin Taher;Mohammad Zubair Khan
    • International Journal of Computer Science & Network Security
    • /
    • 제24권3호
    • /
    • pp.211-223
    • /
    • 2024
  • Chatbots have become very popular in our times and are used in several fields. The emergence of chatbots has created a new way of communicating between human and computer interaction. A Chatbot also called a "Chatter Robot," or conversational agent CA is a software application that mimics human conversations in its natural format, which contains textual material and oral communication with artificial intelligence AI techniques. Generally, there are two types of chatbots rule-based and smart machine-based. Over the years, several chatbots designed in many languages for serving various fields such as medicine, entertainment, and education. Unfortunately, in the Arabic chatbots area, little work has been done. In this paper, we developed a beneficial tool (chatBot) in the Arabic language which contributes to educating people about the Prophet's biography providing them with useful information by using Natural Language Processing.

인터네트상의 멀티미디어 전자우편 시스템의 설계 및 구현 (Design and Implementation of a Multimedia Mail System in Internet)

  • 나성주;한선영
    • 한국정보처리학회논문지
    • /
    • 제2권6호
    • /
    • pp.866-878
    • /
    • 1995
  • 요즈음 하드웨어의 급격한 발전과 함께 멀티미디어에 대해 많은 관심이 집중 되고 있다. 컴퓨터 네트워크를 통한 정보 전달 서비스에서도 멀티미디어를 보다 효율적이고 편리하게 이용하기 위한 관심이 폭발적으로 증대되고 있다. 본 논문에서는 RFC822를 수정 확정하여 기존의 전자우편 망인 SMTP(Simple Mail Transfer Protocol)를 통하 여 멀티미디어 데이타를 전송할 수 있도록 한 MIME (Multipurpose Intedent Mail Extensions)을 기반으로 멀티미디어 전자우편 시스템을 설계 구현하였다. 복수개의 멀티미디어 메세지를 하나의 전자우편안에 포함시켜 전송할 수 있으며, 오디오, 이미지, 그래픽스와 같은 멀티미디어 데이타를 전자우편을 통한 자료교환에서 기존의 텍스트와 같이 손쉽게 처리할 수 있도록 하여, 기존의 전자우편망을 이용한 멀티미디어 전송, 처리를 사용자가 손쉽게 할 수 있는 전자우편 시스템을 설계, 구현 하였다. 이에 보다 나은 멀티미디어 전자우편 시스템의 개발에 근간이 될 수 있도록 한다.

  • PDF

인터넷 정보서비스를 위한 메타데이터 프레임워크 개발에 관한 연구 (A Study on Development in Metadata Framework for Internet Information Service)

  • 황상규;윤세진;오경묵
    • 정보관리학회지
    • /
    • 제19권2호
    • /
    • pp.159-179
    • /
    • 2002
  • 웹 정보체계에서 여러 가지 다양한 종류의 정보자원을 보다 효과적으로 관리하기 위하여, 인터넷 정보를 기술하고 있는 서로 다른 메타데이터 표준들간의 상호운용성 문제가 점차 중요한 비중을 차지하게 된다. 이러한 문제점을 해결하기 위하여 IFLA의 FRBR, INDECS, ABC 모델 등의 상호연동 모델들을 비교·분석함으로서, 다양한 형태의 메타데이터간의 상호연동을 위한 새로운 방식의 상호운용성 모델을 개발, 제안하였다. 리소스모델의 기본·메타데이터 항목 요소들로부터 중심이 되는 개체(core entity)와 이와 연관된 주요 사건(core event)들을 식별하고. 이를 리소스중심 중간모델과 이벤트중심 중간모델로 변환(swapping)하는 과정을, 구체적인 방법을 제시함으로써, 이들 모델간 전환 방법을 명확하게 제시하였다.

Meeting Real Challenges in Eliciting Security Attributes for Mobile Application Development

  • Yusop, Noorrezam;Kamalrudin, Massila;Yusof, Mokhtar Mohd;Sidek, Safiah
    • 인터넷정보학회논문지
    • /
    • 제17권5호
    • /
    • pp.25-32
    • /
    • 2016
  • There has been a rapid growth in the development of mobile application resulting from its wide usage for online transaction, data storage and exchange of information. However, an important issue that has been overlooked is the lack of emphasis on the security issues at the early stage of the development. In fact, security issues have been kept until the later stage of the implementation of mobile apps. Requirements engineers frequently ignore and incorrectly elicit security related requirements at the early stage of mobile application development. This scenario has led to the failure of developing secure and safe mobile application based on the needs of the users. As such, this paper intends to provide further understanding of the real challenges in extracting security attributes for mobile application faced by novice requirements engineers. For this purpose, two experiments on eliciting security attributes requirements of textual requirements scenario were conducted. The performance related to the correctness and time taken to elicit the security attributes were measured and recorded. It was found that the process of eliciting correct security attributes for mobile application requires effort, knowledge and skills. The findings indicate that an automated tool for correct elicitation security attributes requirement could help to overcome the challenges in eliciting security attributes requirements, especially among novice requirements engineers.

A Reliability Verification of Screening Time Prediction Reporting of 'Cine-Hangeul'

  • Jeon, Byoung-Won
    • Journal of Multimedia Information System
    • /
    • 제7권2호
    • /
    • pp.141-146
    • /
    • 2020
  • Cine-Hangeul is a program that can predict the running time of a movie based on the screenplay before production. This paper seeks to verify the prediction reporting function of Cine-Hangeul, which is the standard Korean screenplay format. Moreover, this paper presents a method to increase the accuracy of the Cine-Hangeul reporting function. The objective of this paper is to offer a correction method based on scientific evidence because the current Cine-Hangeul reporting function has many errors. The verification process for five scenarios and movies confirmed that the default setting value of Cine- Hangeul's screening time prediction reporting was many errors. Cine-Hangeul analyzes the amount of textual information to predict the time of the scene and the time of the dialogue and helps predict the total time of the movie. Therefore, if a certain amount of text information is not available, the accuracy is unreliable. The current Cine-Hangeul prediction report confirms that the efficiency is high when the scenario volume is about 90 to 100 pages. As a result, prediction of screening time by Cine-Hangeul, a Korean scenario standard format program, confirmed the verification that it could secure the same level of reliability as the actual screening time by correcting the reporting settings. This verification also affirms that when applying about 50 percent of the basic set of screening time reporting, it is almost identical to the screening time.

고려말에서 조선중기까지의 구결자료에 관한 서지학적 연구 (A bibliographical study of the 'kukyeul system' in Korean language from Koryo to Chosun dynasty)

  • 남권희
    • 한국도서관정보학회지
    • /
    • 제27권
    • /
    • pp.485-572
    • /
    • 1997
  • The purpose of this study is to investigate the textual and physical bibliography of these books that were printed from Koryo to Chosun Dynasty and written by the Kukyul system. This study is concerned with the Kukyul written in the transformed Chinese characters which representing their sino-Korean sound values only. The Kukyul is the Korean function word inserted to a written Chinese sentence for an easier understanding of the meaning by the Koreans. Until the present, most of these studies on the Kukyul are mainly concerned with the Korean linguistic characters. But this mentions to present the basic bibliographical information in order to presume the written period of the Kukyul system. 2The analysis of each book is made in the respect of: 1) historical aspect of the book 2) physical form and publishing date 3) transcription period of the Kukyul 4) the category of presenting Kukyul 5) historical change of transcribing Kukyul system The results of the study are as follows : First, the Kukyul system was divided into Sokdok and Sundok Kukyul according to the translation and recording format. Second, the Sokdok Kukyul is a kind of writing system for translated Chinese into Korean. Third, the Sundok Kukyul was frequently used Buddhist publications from later Koryo Dynasty to Middle Chosun period. Fourth, through the analysis of physical bibliography for that books, we rearrange the chronological oder of Sokdok Kukyul system as Hwaum-kyung, Hwaum-kyungSo, Kumkwngmyu ngkyung, Kuyeukinwang-kyung, Yukasajiron. Fifth, the characters of Sundok Kukyul systems were gradually decreased from eighty numbers to fifty numbers. This change is caused by the unification trends of sound value in morphological aspect.

  • PDF