• Title/Summary/Keyword: 문자영상 (character image)


Extracting curved text lines using the chain composition and the expanded grouping method (체인 정합과 확장된 그룹핑 방법을 사용한 곡선형 텍스트 라인 추출)

  • Bai, Nguyen Noi;Yoon, Jin-Seon;Song, Young-Jun;Kim, Nam;Kim, Yong-Gi
    • The KIPS Transactions:PartB / v.14B no.6 / pp.453-460 / 2007
  • In this paper, we present a method to extract text lines in poorly structured documents. The text lines may have different orientations and considerably curved shapes, and a line may contain a few wide inter-word gaps. Such text lines are found in posters, address blocks, and artistic documents. Our method is based on traditional perceptual grouping, but we develop novel solutions to overcome the problems of insufficient seed points and varied orientations within a single line. We assume that text lines consist of connected components, where each connected component is a set of black pixels within a letter or within several touching letters. In our scheme, connected components that are closer than an iteratively incremented threshold are joined into a chain. Elongated chains are identified as the seed chains of lines. The seed chains are then extended to the left and the right according to the local orientations, which are reevaluated at each end of a chain whenever it is extended. By this process, all text lines are finally constructed. In our experiments, the proposed method performed well at extracting considerably curved text lines from logos and slogans, achieving 98% for straight-line extraction and 94% for curved-line extraction.
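The chaining step described in this abstract can be illustrated with a minimal sketch. It assumes each connected component is reduced to a centroid, uses plain Euclidean distance, and records the chains after every threshold so that a caller could check for elongated seed chains between iterations. The function name `chain_snapshots`, the threshold values, and the use of centroids are assumptions made for illustration, not details taken from the paper.

```python
from math import hypot


def chain_snapshots(centroids, thresholds=(1.0, 2.0, 3.0)):
    """Join component centroids into chains under a growing distance threshold.

    Returns a list of (threshold, chains) pairs, one per threshold, where each
    chain is a list of indices into `centroids`. Hypothetical helper.
    """
    parent = list(range(len(centroids)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    snapshots = []
    for threshold in thresholds:
        # Merge every pair of components closer than the current threshold.
        for i in range(len(centroids)):
            for j in range(i + 1, len(centroids)):
                (x1, y1), (x2, y2) = centroids[i], centroids[j]
                if hypot(x2 - x1, y2 - y1) <= threshold:
                    parent[find(j)] = find(i)
        # Collect the chains that exist at this threshold.
        chains = {}
        for i in range(len(centroids)):
            chains.setdefault(find(i), []).append(i)
        snapshots.append((threshold, sorted(chains.values())))
    return snapshots


centroids = [(0, 0), (1, 0), (2, 0), (10, 0), (11, 0)]
for threshold, chains in chain_snapshots(centroids):
    print(threshold, chains)
```

Keeping a snapshot per threshold is a design choice for the sketch: it exposes the point at which a chain first becomes long enough to serve as a seed, which a single final clustering would hide.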

Robust Motorbike License Plate Detection and Recognition using Image Warping based on YOLOv2 (YOLOv2 기반의 영상워핑을 이용한 강인한 오토바이 번호판 검출 및 인식)

  • Dang, Xuan-Truong;Kim, Eung-Tae
    • Journal of Broadcast Engineering / v.24 no.5 / pp.713-725 / 2019
  • Automatic License Plate Recognition (ALPR) is a technology required for many applications such as Intelligent Transportation Systems and Video Surveillance Systems. Most studies have addressed the detection and recognition of license plates on cars, and very few have addressed license plates on motorbikes. On a car, the license plate is located at the front or rear center of the vehicle and is straight or only slightly sloped. In addition, the background of the license plate is mainly monochromatic, so the detection and recognition process is less complicated. A motorbike, however, is parked on a kickstand and therefore leans at various angles, which makes recognizing the characters on its license plate more complicated. In this paper, we develop a two-stage YOLOv2 algorithm that detects the license plate area after detecting the motorbike area, in order to improve license plate recognition accuracy on a data set of motorbikes parked at various angles. To increase the detection rate, the size and number of the anchor boxes were adjusted according to the characteristics of the motorbike and the license plate. Image warping algorithms were applied after tilted license plates were detected. In simulations of the license plate character recognition process, the proposed method achieved a recognition rate of 80.23%, compared with 47.74% for the conventional method (YOLOv2 without image warping). The proposed method can therefore improve character recognition on tilted motorbike license plates by using anchor boxes and image warping adapted to the motorbike license plate.
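The idea of straightening a tilted plate before character recognition can be sketched as follows. The abstract does not say which warping algorithm was used, so this is a deliberately simplified, rotation-only illustration on corner coordinates; a full image warp would more commonly be an affine or perspective transform applied to pixels. The function name `deskew_corners` and the corner ordering are assumptions.

```python
from math import atan2, cos, sin


def deskew_corners(corners):
    """Rotate plate corners about the top-left corner so the top edge is horizontal.

    corners: [top_left, top_right, bottom_right, bottom_left] as (x, y) pairs.
    Hypothetical helper; rotation-only simplification of image warping.
    """
    (x0, y0), (x1, y1) = corners[0], corners[1]
    angle = atan2(y1 - y0, x1 - x0)        # tilt of the top edge
    c, s = cos(-angle), sin(-angle)        # rotate by the opposite angle
    return [
        (x0 + c * (x - x0) - s * (y - y0),
         y0 + s * (x - x0) + c * (y - y0))
        for x, y in corners
    ]


# A plate whose top edge runs from (0, 0) to (3, 4), i.e. tilted and 5 units long.
tilted = [(0, 0), (3, 4), (1, 5.5), (-2, 1.5)]
for x, y in deskew_corners(tilted):
    print(round(x, 3), round(y, 3))
```

After the rotation the top-right corner lies on the same horizontal line as the top-left corner, which is the property a recognizer that expects upright characters relies on.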

Study On The Signal Radar Plan Position Indicator Scope Of The Data Expressed Scanning System Implemented As An Sticking Image On LCD Display (Plan Position Indicator Scope 주사방식의 Radar 영상신호를 LCD Display에 잔상영상으로 데이터 표출 구현에 관한 연구)

  • Shin, Hyun Jong;Yu, Hyeung Keun
    • Journal of Satellite, Information and Communications / v.10 no.3 / pp.94-101 / 2015
  • A display device is an important video information and communication device that connects humans and machines. It conveys information as characters, shapes, images, and patterns so that it can be recognized by eye, and it needs key functions for displaying information quickly. A PPI scope on a cathode-ray tube (CRT) display can fill this role by presenting information for analysis. This research proposes a radar display device that presents the received signal as information. The radar display work can be applied to large-capacity fixed-function graphics pipeline algorithms through the vertical blanking interval and buffer swap of the display unit. The algorithms can also be implemented in FPGA logic, without a high-performance graphics processing unit (GPU), by using synchronization to implement the display system. In this paper, we improved affordability and reliability through the proposed approach. We thus studied a radar display unit that replaces the CRT radar display with a flat-panel display.
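A rough sketch of how a rotating PPI sweep could be drawn with an afterimage on a raster display: each sweep converts echoes from polar coordinates (range, bearing) to pixel coordinates, and older pixels fade by a constant factor so that they linger as on a CRT phosphor. The abstract does not give the algorithm, so the decay factor, the bearing convention (clockwise from north), and the names `polar_to_pixel` and `sweep` are all assumptions.

```python
from math import cos, radians, sin


def polar_to_pixel(range_bin, bearing_deg, center):
    """Map a (range, bearing) echo to screen coordinates.

    Bearing is measured clockwise from north (up); screen y grows downward.
    """
    theta = radians(bearing_deg)
    x = center[0] + range_bin * sin(theta)
    y = center[1] - range_bin * cos(theta)
    return round(x), round(y)


def sweep(frame, echoes, bearing_deg, center, decay=0.9):
    """Fade the existing frame, then draw the echoes of one sweep line.

    frame: dict mapping (x, y) -> intensity; echoes: list of (range_bin, intensity).
    """
    for pixel in list(frame):
        frame[pixel] *= decay              # afterimage: old echoes dim gradually
    for range_bin, intensity in echoes:
        frame[polar_to_pixel(range_bin, bearing_deg, center)] = intensity
    return frame


frame = {}
sweep(frame, [(10, 1.0)], bearing_deg=0, center=(100, 100))
sweep(frame, [(10, 1.0)], bearing_deg=90, center=(100, 100))
print(frame)
```

In a real display pipeline the fade and draw steps would run on a back buffer and be swapped during the vertical blanking interval; the dictionary here only stands in for that buffer.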

A study of new business creation on digital contents industries (디지털콘텐츠 산업분석을 통한 기술사업화 기회창출 연구)

  • Park, Dong-Un;Kim, Eun-Sun;Park, Young-Seo
    • Proceedings of the Korea Contents Association Conference / 2006.11a / pp.759-762 / 2006
  • Historically, the internet has been the fastest-growing medium, developing together with ICT, and the key to that development is content. Digital contents are information covering voice, databases, games, publications, music, and so on, and these areas have been creating new technology business opportunities. The value chain of digital contents consists of production, collection, processing, services, connection, and navigation, and it is expected to be reorganized around business players in the production and distribution areas. This paper presents these changes in the business environment along with examples of business models, and offers industry and academia strategies for technology commercialization.

Design of Large-set Off-line Handwritten Hangul Database Construction (대용량 오프라인 한글 글씨 데이타베이스의 설계)

  • Lee, S.W.;Song, H.H.;Kim, J.S.;Lee, E.J.;Park, H.S.
    • Annual Conference on Human and Language Technology / 1995.10a / pp.131-136 / 1995
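  • Research on off-line handwritten Hangul recognition, which aims to automate information entry by recognizing naturally handwritten Hangul, has recently been very active. An essential part of the research environment for this work is a large-scale off-line handwritten Hangul database. This paper briefly introduces the current status of the off-line handwritten Hangul database being built as part of the Korean language information base development project of the Korean Language Engineering Center at the Systems Engineering Research Institute. Construction of the database consists of four broad stages: database design, handwriting data collection, form scanning and character-level segmentation, and database verification. In this work, collecting handwriting styles with diverse variations was treated as the most important consideration in building the database, and considerable time was devoted to the design and verification stages in order to build a high-quality, consistent database. Finally, a convenient user interface was implemented using HTML (Hyper Text Markup Language) on the WWW (World Wide Web), so that users can easily search the handwritten Hangul images and also obtain files in a form usable for developing recognition algorithms. At present, 1,000 sets of handwriting for the 520 most frequently used of the 2,350 characters in the KS C precomposed Hangul set have been collected and a gray-scale image database is under construction; handwriting data for the remaining 1,830 characters will be collected over the next two years to complete the database. The database will soon be provided to off-line handwritten Hangul recognition researchers in Korea as important experimental data for developing better recognition algorithms, and it is expected to contribute greatly to the objective performance evaluation of recognition systems and to stimulate research on off-line handwritten Hangul recognition in Korea.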

Digital Animation As a New Medium Taking a View of Bolz Media Theory (미디어미학에서 바라 본 뉴미디어로써 디지털 애니메이션 - 노르베르트 볼츠의 매체미학을 중심으로 -)

  • 이종한
    • Archives of design research / v.16 no.4 / pp.225-232 / 2003
  • The German philosopher Norbert Bolz predicted that digital media would fundamentally and completely change the human way of thinking, and that the 'Gutenberg-Galaxis' named by M. McLuhan, a symbol of modern reason, was doomed to end. He thought that up-to-date multimedia would overcome the limits of reason-centered European culture and revive communicative life. On this basis he emphasized emotional perception, 'aisthesis', which is the original meaning of aesthetics. That is, he argued for restoring communication media that enable the state of amusement of the five senses mentioned by Kant. This thesis asserts that digital animation, a representative hypermedia form, may play a key role in rehabilitating the human sensibility suppressed by reason-centered modernism. Digital animation has a unique artistic value: it deals with time and space, enables unlimited expression, and communicates effectively as a synthetic medium built on intensive computer techniques. Against this background, the thesis analyzes the potential of digital animation as a new medium, focusing on its relation to the hypermedia theory of Bolz, who is a media analyst and a professor at a design college.

A Study on the Communication System Design for Auditory Disabled (청각장애인을 위한 의사소통 시스템 디자인 연구)

  • Yang, Sung-Ho;Song, Ji-Won
    • 한국HCI학회:학술대회논문집 / 2009.02a / pp.1172-1175 / 2009
  • This study aims to develop communication devices and interfaces that address the communication needs of the hearing impaired. Through three focus group interviews (FGIs) with deaf persons and sign-language interpreters, we studied the communication methods, devices, and needs of the deaf. On the basis of our analysis, we propose a communication framework to improve their means of communication with hearing persons or other deaf persons. We have designed a communication system based on the proposed framework: the system suggests functions for remote sign-language interpretation services for conversations at close range and over the phone. Details are presented regarding the design of interfaces for video calls, text messages, and digital memos that address the conversation patterns of the deaf. The system includes hardware form factors for video phones that facilitate sign-language conversations and also mitigate other auditory problems in daily life, such as problems with door bells. The design concept has been verified through a test with six deaf users.

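The Impact of Technological Innovation on the Structure of the Information and Communications Business, and Its Outlook (기술혁신이 정보통신 사업구도에 미치는 영향 및 전망)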

  • 임명환;오길환
    • Proceedings of the Korea Technology Innovation Society Conference / 1998.05a / pp.21-21 / 1998
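  • Information and communications is changing daily through technological innovation; technology is leading the market, and new technologies are not only changing the market structure but also substantially reorganizing the structure of the information and communications business. Network technology is evolving toward transmitting and switching diverse multimedia information in large volumes and at high speed, and terminals are evolving into smaller, simpler all-in-one multimedia devices that can send and receive everything from wired and wireless telephone service to video information. In services, digitalization, higher speeds, and wider coverage are spreading multimedia services that integrate text, voice, and video; in particular, telephone and broadcasting services over the internet, commercialized since '7, are emerging as a new business area in information and communications. As a result, the established order of the telecommunications business, “wired versus wireless”, “basic versus advanced”, and “telecommunications versus broadcasting”, has collapsed, and technology and the market are converging to form a new business structure. The internet is the representative example: it is having a major impact, in the form of market erosion and structural change, not only on the existing telephone business but also on broadcasting businesses such as CATV. Domestic information and communications policy followed this global trend by changing policies and institutions last year, for example by creating special-category telecommunications operators, but it responded more slowly than the pace of technological innovation and is causing confusion in the early stage of these businesses. To respond to technology-driven changes in market structure, the government has also substantially revised the national high-speed network construction project: the original plan to connect every subscriber line by optical cable has given way to the selective use of existing telephone lines, with ADSL and similar technologies, and of WLL technology. Operators are likewise moving to build their wire-centered transmission and subscriber networks as wireless networks such as LMDS and WLL, and policy is turning away from treating wired and wireless communications separately toward encouraging combined operation or linkage. In this way, technological innovation and the introduction of new services are not only eroding existing service markets but also breaking down the boundaries between the wired and wireless domains and between fixed and wireless operators, greatly changing the operator landscape. Under the influence of this innovation, market competition intensifies and the financial performance of existing operators may deteriorate in the short term; however, new-technology services supplied at lower rates give users inexpensive, high-quality services, and in the long term increased usage is expected to enlarge the overall market so that both operators and users benefit.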

Remote Medical Information Service System based on RFID Technology and Mobile Terminal (RFID 기술과 이동 단말기를 이용한 원격 의료정보 서비스 시스템)

  • Kim, Jae-Joon;Kim, Jong-Wan;Cho, Kyu-Cheol
    • Journal of KIISE:Computing Practices and Letters / v.13 no.3 / pp.131-140 / 2007
  • General medical information services in hospitals have recently developed rapidly toward effective patient management as a result of digitalization. In addition, hospitals are working to support medical information services in a ubiquitous environment. A key requirement in such an environment is the ability to communicate between an image viewer system that uses the DICOM standard and a server system that supports the medical information service. This paper describes a remote networking system for the medical information service based on a mobile terminal with RFID technology. To realize the overall configuration, we first implemented the DICOM viewer system, configured a database to store patient information, and built a server/client networking system for the mobile terminal environment. In particular, this paper demonstrates medical image-based communication as well as text-based communication.

Recognition of Car License Plate by Using Dynamical Thresholding and Neural Network with Enhanced Learning Algorithm (동적인 임계화 방법과 개선된 학습 알고리즘의 신경망을 이용한 차량 번호판 인식)

  • Kim, Gwang-Baek;Kim, Yeong-Ju
    • The KIPS Transactions:PartB / v.9B no.1 / pp.119-128 / 2002
  • This paper proposes an efficient method for recognizing car license plates in car images, using both dynamical thresholding and a neural network with an enhanced learning algorithm. The license plate is extracted by dynamical thresholding based on structural features and density rates. Each character and number on the plate is then extracted by a contour tracking algorithm. An enhanced neural network, which combines the modified ART1 with a supervised learning method, is proposed for recognizing them. The proposed method was applied to real-world car images. The simulation results show that the proposed method has better extraction rates than methods that use gray-level brightness information and RGB information, respectively, and better recognition performance than the conventional backpropagation neural network.
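To give a feel for thresholding that adapts to image content, here is a minimal sketch of local-mean thresholding, in which each pixel is compared with the mean of its neighborhood rather than with one global value. This is a common stand-in technique and not the paper's dynamical thresholding, whose use of structural features and density rates is not detailed in the abstract. The function name `local_mean_threshold` and the parameters `window` and `offset` are assumptions.

```python
def local_mean_threshold(image, window=3, offset=0):
    """Binarize a gray-scale image by comparing each pixel to its local mean.

    image: list of rows of gray levels. Returns rows of 0/1 values, where 1
    marks a pixel brighter than its neighborhood mean plus `offset`.
    """
    height, width = len(image), len(image[0])
    half = window // 2
    result = []
    for r in range(height):
        row = []
        for c in range(width):
            # Gather the neighborhood, clipped at the image border.
            neighborhood = [
                image[rr][cc]
                for rr in range(max(0, r - half), min(height, r + half + 1))
                for cc in range(max(0, c - half), min(width, c + half + 1))
            ]
            mean = sum(neighborhood) / len(neighborhood)
            row.append(1 if image[r][c] > mean + offset else 0)
        result.append(row)
    return result


image = [
    [10, 10, 10],
    [10, 200, 10],
    [10, 10, 10],
]
for row in local_mean_threshold(image):
    print(row)
```

Because the threshold follows the local brightness, a bright stroke can be separated from its surroundings even when overall illumination varies across the image, which is the situation plate extraction in outdoor car images has to handle.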