• Title/Summary/Keyword: Text features

Search Result 580, Processing Time 0.026 seconds

Automatic Title Detection by Spatial Feature and Projection Profile for Document Images (공간 정보와 투영 프로파일을 이용한 문서 영상에서의 타이틀 영역 추출)

  • Park, Hyo-Jin;Kim, Bo-Ram;Kim, Wook-Hyun
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.3
    • /
    • pp.209-214
    • /
    • 2010
  • This paper proposes an algorithm of segmentation and title detection for document image. The automated title detection method that we have developed is composed of two phases, segmentation and title area detection. In the first phase, we extract and segment the document image. To perform this operation, the binary map is segmented by combination of morphological operation and CCA(connected component algorithm). The first phase provides segmented regions that would be detected as title area for the second stage. Candidate title areas are detected using geometric information, then we can extract the title region that is performed by removing non-title regions. After classification step that removes non-text regions, projection is performed to detect a title region. From the fact that usually the largest font is used for the title in the document, horizontal projection is performed within text areas. In this paper, we proposed a method of segmentation and title detection for various forms of document images using geometric features and projection profile analysis. The proposed system is expected to have various applications, such as document title recognition, multimedia data searching, real-time image processing and so on.

A Study of T-Shirt Graphic Designs Shown in Fashion Collections (패션컬렉션에 나타난 티셔츠의 그래픽디자인에 관한 연구)

  • Kim, Sun-Young
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.36 no.7
    • /
    • pp.727-740
    • /
    • 2012
  • This study is for the enhancement and utilization of future graphic design for T-shirts and deals with the expression styles and features of T-shirt graphic designs that appear in modern fashion. A literature examination about graphic design and T-shirts was performed for the research method and the analysis followed 378 pieces of graphic design featured in four major international collections for females from 2001S/S to 2011S/S. The research results from the expression type of T-shirt graphic design in the modern fashion are as follows. Expression in graphic figure accounts for the largest portion of 40.8% that includes illustration or cartoon characters, personal figure or part of the physical body, object in daily life or landscape pictures, animals and plants, and others. Expression given in text with typography or logo accounted for 27.5%, expression combined with letter/text, graphics and geometric figures accounted for 24.3%, geometrical expression accounted for 7.4%; most of which are given in print. Characteristics found in modern fashion graphic design are as follows. First, role of sort of public relations marketing was accompanied with utilization of brand logo or symbol. Second, visual play was shown in a sense of humor with diverse graphic figures and playful texts, witty layout with graphic motives, and a free design formation. Third, it denoted a front burner issue delivering the message for various current events or arguments via the way of texts, slogans, and symbolic pictures. Fourth, it depicted artistry through the self-expressive creation of the designer.

Wine Label Recognition System using Image Similarity (이미지 유사도를 이용한 와인라벨 인식 시스템)

  • Jung, Jeong-Mun;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Kim, Sun-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.5
    • /
    • pp.125-137
    • /
    • 2011
  • Recently the research on the system using images taken from camera phones as input is actively conducted. This paper proposed a system that shows wine pictures which are similar to the input wine label in order. For the calculation of the similarity of images, the representative color of each cell of the image, the recognized text color, background color and distribution of feature points are used as the features. In order to calculate the difference of the colors, RGB is converted into CIE-Lab and the feature points are extracted by using Harris Corner Detection Algorithm. The weights of representative color of each cell of image, text color and background color are applied. The image similarity is calculated by normalizing the difference of color similarity and distribution of feature points. After calculating the similarity between the input image and the images in the database, the images in Database are shown in the descent order of the similarity so that the effort of users to search for similar wine labels again from the searched result is reduced.

Study on the Creation and Usage of Cultural Contents for 'Geotaji Folk Tale' ('거타지' 설화의 문화콘텐츠 창작과 활용)

  • Lee, Kyu-Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.8
    • /
    • pp.119-127
    • /
    • 2008
  • The object of this study is to find ways of a creation and an usage of cultural contents, based on the survey of the culture archetype facts, the culture elements and the narrative structure of 'Geotagi folk tale'. This story takes its place as our cultural archetype, because of its perfect narrative structure and the myth-like features in the story line. Additionally, various cultural elements implied in the text have the world wide universality as well as the unique traditional characters of Korea. Thus, for above reason, 'Geotagi folk tale' can be helpful in creating the new cultural contents. In creating and using the cultural contents of 'Geotagi folk tale', the first way is to build up the Digital contents including the data base of the original text and others, and the creation of graphics. The second way is to create the story bank utilizing hypertext. The third way is to make flash animations or games concerning the folk tale. and the fourth way is to to develop the symbols of the folk tale as animation characters.

Pan-Genomics of Lactobacillus plantarum Revealed Group-Specific Genomic Profiles without Habitat Association

  • Choi, Sukjung;Jin, Gwi-Deuk;Park, Jongbin;You, Inhwan;Kim, Eun Bae
    • Journal of Microbiology and Biotechnology
    • /
    • v.28 no.8
    • /
    • pp.1352-1359
    • /
    • 2018
  • Lactobacillus plantarum is a lactic acid bacterium that promotes animal intestinal health as a probiotic and is found in a wide variety of habitats. Here, we investigated the genomic features of different clusters of L. plantarum strains via pan-genomic analysis. We compared the genomes of 108 L. plantarum strains that were available from the NCBI GenBank database. These genomes were 2.9-3.7 Mbp in size and 44-45% in G+C content. A total of 8,847 orthologs were collected, and 1,709 genes were identified to be shared as core genes by all the strains analyzed. On the basis of SNPs from the core genes, 108 strains were clustered into five major groups (G1-G5) that are different from previous reports and are not clearly associated with habitats. Analysis of group-specific enriched or depleted genes revealed that G1 and G2 were rich in genes for carbohydrate utilization (${\text\tiny{L}}-arabinose$, ${\text\tiny{L}}-rhamnose$, and fructooligosaccharides) and that G3, G4, and G5 possessed more genes for the restriction-modification system and MazEF toxin-antitoxin. These results indicate that there are critical differences in gene content and survival strategies among genetically clustered L. plantarum strains, regardless of habitats.

Academic Conference Categorization According to Subjects Using Topical Information Extraction from Conference Websites (학회 웹사이트의 토픽 정보추출을 이용한 주제에 따른 학회 자동분류 기법)

  • Lee, Sue Kyoung;Kim, Kwanho
    • The Journal of Society for e-Business Studies
    • /
    • v.22 no.2
    • /
    • pp.61-77
    • /
    • 2017
  • Recently, the number of academic conference information on the Internet has rapidly increased, the automatic classification of academic conference information according to research subjects enables researchers to find the related academic conference efficiently. Information provided by most conference listing services is limited to title, date, location, and website URL. However, among these features, the only feature containing topical words is title, which causes information insufficiency problem. Therefore, we propose methods that aim to resolve information insufficiency problem by utilizing web contents. Specifically, the proposed methods the extract main contents from a HTML document collected by using a website URL. Based on the similarity between the title of a conference and its main contents, the topical keywords are selected to enforce the important keywords among the main contents. The experiment results conducted by using a real-world dataset showed that the use of additional information extracted from the conference websites is successful in improving the conference classification performances. We plan to further improve the accuracy of conference classification by considering the structure of websites.

A Study on the Use Pattern of Lee Yuk-sa in the media -Focused on the drama "Climax"(2011) (영상매체에 나타난 이육사 표상 연구 -드라마 <절정>(2011)을 중심으로)

  • Son, Mi-young
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.4
    • /
    • pp.31-37
    • /
    • 2020
  • This study examines the way poetry text is inserted in dramas and the way poets represent themselves through the drama "The climax" (2011). The drama features Lee Yuk-sa, a poet and independence activist, as a central figure and chooses a narrative structure that follows his life. The drama maximizes the lyricity and visual beauty of the drama by inserting his poems with fantastic images at the most dramatic moments of the poet's life. The image presented with the poem maximizes Lee Yuk-sa's intense hardship, while portraying the poem as a crystal of this hardship. Thus, the drama "The climax" uses Lee Yuk-sa's poetry to visualize the inner world of the central character Lee Yuk-sa. Lee Yuk-sa's poems are used in conjunction with his image to simultaneously represent the beauty of poetry and the upright spirit of the poet. This is the result of a balanced portrayal of Yi Yuk-sa, a poet and independence activist, as an intellectual who acts. The drama "The climax" is the main text that sincerely performed the representations of poems and poets through video.

An Experimental Study on the Relation Extraction from Biomedical Abstracts using Machine Learning (기계 학습을 이용한 바이오 분야 학술 문헌에서의 관계 추출에 대한 실험적 연구)

  • Choi, Sung-Pil
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.50 no.2
    • /
    • pp.309-336
    • /
    • 2016
  • This paper introduces a relation extraction system that can be used in identifying and classifying semantic relations between biomedical entities in scientific texts using machine learning methods such as Support Vector Machines (SVM). The suggested system includes many useful functions capable of extracting various linguistic features from sentences having a pair of biomedical entities and applying them into training relation extraction models for maximizing their performance. Three globally representative collections in biomedical domains were used in the experiments which demonstrate its superiority in various biomedical domains. As a result, it is most likely that the intensive experimental study conducted in this paper will provide meaningful foundations for research on bio-text analysis based on machine learning.

Convergence Study on Career Development Process and Influencing Factors (학령기 진로발달과정의 특성 및 영향 요인에 관한 융합연구)

  • Choi, Jung-Ah;Seo, Jun-Ho;Yang, Ji-Yeon
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.9
    • /
    • pp.203-217
    • /
    • 2020
  • The purpose of this study was to perform a convergence study for investigating main features and influencing factors in career development process, throughout the whole periods of education, that might influence their ultimate choice of majors. We collected data of career development process at the elementary, middle, high school, and college levels using career-o-grams, for the college students who majored in English Lang/Lit and Global Commerce, and we applied text mining techniques for qualitative data analysis. Two major factors influencing career goals were parents and teachers. In particular, teachers were most influential in the career decisions at the middle school level. Teachers, family situations, and peers showed a negative impact on career aspiration. The findings would serve as a guide for career consultants and education program developers.

Item-Based Collaborative Filtering Recommendation Technique Using Product Review Sentiment Analysis (상품 리뷰 감성분석을 이용한 아이템 기반 협업 필터링 추천 기법)

  • Yun, So-Young;Yoon, Sung-Dae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.8
    • /
    • pp.970-977
    • /
    • 2020
  • The collaborative filtering recommendation technique has been the most widely used since the beginning of e-commerce companies introducing the recommendation system. As the online purchase of products or contents became an ordinary thing, however, recommendation simply applying purchasers' ratings led to the problem of low accuracy in recommendation. To improve the accuracy of recommendation, in this paper suggests the method of collaborative filtering that analyses product reviews and uses them as a weighted value. The proposed method refines product reviews with text mining to extract features and conducts sentiment analysis to draw a sentiment score. In order to recommend better items to user, sentiment weight is used to calculate the predicted values. The experiment results show that higher accuracy can be gained in the proposed method than the traditional collaborative filtering.