• Title/Summary/Keyword: Image Processing Technology

Search Result 2,384, Processing Time 0.036 seconds

Optical Character Recognition for Hindi Language Using a Neural-network Approach

  • Yadav, Divakar;Sanchez-Cuadrado, Sonia;Morato, Jorge
    • Journal of Information Processing Systems
    • /
    • v.9 no.1
    • /
    • pp.117-140
    • /
    • 2013
  • Hindi is the most widely spoken language in India, with more than 300 million speakers. As there is no separation between the characters of texts written in Hindi as there is in English, the Optical Character Recognition (OCR) systems developed for the Hindi language carry a very poor recognition rate. In this paper we propose an OCR for printed Hindi text in Devanagari script, using Artificial Neural Network (ANN), which improves its efficiency. One of the major reasons for the poor recognition rate is error in character segmentation. The presence of touching characters in the scanned documents further complicates the segmentation process, creating a major problem when designing an effective character segmentation technique. Preprocessing, character segmentation, feature extraction, and finally, classification and recognition are the major steps which are followed by a general OCR. The preprocessing tasks considered in the paper are conversion of gray scaled images to binary images, image rectification, and segmentation of the document's textual contents into paragraphs, lines, words, and then at the level of basic symbols. The basic symbols, obtained as the fundamental unit from the segmentation process, are recognized by the neural classifier. In this work, three feature extraction techniques-: histogram of projection based on mean distance, histogram of projection based on pixel value, and vertical zero crossing, have been used to improve the rate of recognition. These feature extraction techniques are powerful enough to extract features of even distorted characters/symbols. For development of the neural classifier, a back-propagation neural network with two hidden layers is used. The classifier is trained and tested for printed Hindi texts. A performance of approximately 90% correct recognition rate is achieved.

Environmental IoT-Enabled Multimodal Mashup Service for Smart Forest Fires Monitoring

  • Elmisery, Ahmed M.;Sertovic, Mirela
    • Journal of Multimedia Information System
    • /
    • v.4 no.4
    • /
    • pp.163-170
    • /
    • 2017
  • Internet of things (IoT) is a new paradigm for collecting, processing and analyzing various contents in order to detect anomalies and to monitor particular patterns in a specific environment. The collected data can be used to discover new patterns and to offer new insights. IoT-enabled data mashup is a new technology to combine various types of information from multiple sources into a single web service. Mashup services create a new horizon for different applications. Environmental monitoring is a serious tool for the state and private organizations, which are located in regions with environmental hazards and seek to gain insights to detect hazards and locate them clearly. These organizations may utilize IoT - enabled data mashup service to merge different types of datasets from different IoT sensor networks in order to leverage their data analytics performance and the accuracy of the predictions. This paper presents an IoT - enabled data mashup service, where the multimedia data is collected from the various IoT platforms, then fed into an environmental cognition service which executes different image processing techniques such as noise removal, segmentation, and feature extraction, in order to detect interesting patterns in hazardous areas. The noise present in the captured images is eliminated with the help of a noise removal and background subtraction processes. Markov based approach was utilized to segment the possible regions of interest. The viable features within each region were extracted using a multiresolution wavelet transform, then fed into a discriminative classifier to extract various patterns. Experimental results have shown an accurate detection performance and adequate processing time for the proposed approach. We also provide a data mashup scenario for an IoT-enabled environmental hazard detection service and experimentation results.

A Fast SAD Algorithm for Area-based Stereo Matching Methods (영역기반 스테레오 영상 정합을 위한 고속 SAD 알고리즘)

  • Lee, Woo-Young;Kim, Cheong Ghil
    • Journal of Satellite, Information and Communications
    • /
    • v.7 no.2
    • /
    • pp.8-12
    • /
    • 2012
  • Area-based stereo matchng algorithms are widely used for image analysis for stereo vision. SAD (Sum of Absolute Difference) algorithm is one of well known area-based stereo matchng algorithms with the characteristics of data intensive computing application. Therefore, it requires very high computation capabilities and its processing speed becomes very slow with software realization. This paper proposes a fast SAD algorithm utilizing SSE (Streaming SIMD Extensions) instructions based on SIMD (Single Instruction Multiple Data) parallism. CPU supporing SSE instructions has 16 XMM registers with 128 bits. For the performance evaluation of the proposed scheme, we compare the processing speed between SAD with/without SSE instructions. The proposed scheme achieves four times performance improvement over the general SAD, which shows the possibility of the software realization of real time SAD algorithm.

Video Stabilization using Phase Correlation and Kalman Filter-Based Motion Prediction (위상상관과 칼만 필터 움직임 예측을 이용한 동영상 안정화)

  • Han, Hag-Yong;Jeong, Hyo-Won;Kang, Bong-Soon;Hur, Kang-In
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.2
    • /
    • pp.106-111
    • /
    • 2009
  • Real-time video stabilization technology is used in correction for the camera vibrations of the hand-held camera by hand or fixed camera by external condition. This paper is about the counterplan to cope with the vibration of the movie generated by the large external cause relatively. we use the movie stabilization parameters with the phase correlation method based the DFT to get the displacements of the current frame to the reference frame. we use the kalman filter for the efficient and stable searching works on the phase correlation map and present the proper conditions for the real-time processing through the experiments. We propose the measure to evaluate the capability of the video stabilizer which is the standard deviation of the brightness of the center block. and compare the capability for the video sequences randomly shifted and the jittered video sequences obtained from camera.

  • PDF

An Intelligent Chatbot Utilizing BERT Model and Knowledge Graph (BERT 모델과 지식 그래프를 활용한 지능형 챗봇)

  • Yoo, SoYeop;Jeong, OkRan
    • The Journal of Society for e-Business Studies
    • /
    • v.24 no.3
    • /
    • pp.87-98
    • /
    • 2019
  • As artificial intelligence is actively studied, it is being applied to various fields such as image, video and natural language processing. The natural language processing, in particular, is being studied to enable computers to understand the languages spoken and spoken by people and is considered one of the most important areas in artificial intelligence technology. In natural language processing, it is a complex, but important to make computers learn to understand a person's common sense and generate results based on the person's common sense. Knowledge graphs, which are linked using the relationship of words, have the advantage of being able to learn common sense easily from computers. However, the existing knowledge graphs are organized only by focusing on specific languages and fields and have limitations that cannot respond to neologisms. In this paper, we propose an intelligent chatbotsystem that collects and analyzed data in real time to build an automatically scalable knowledge graph and utilizes it as the base data. In particular, the fine-tuned BERT-based for relation extraction is to be applied to auto-growing graph to improve performance. And, we have developed a chatbot that can learn human common sense using auto-growing knowledge graph, it verifies the availability and performance of the knowledge graph.

Smoke Rendering Method in Post-processing for Safety-Training Contents (안전 훈련 콘텐츠에 적합한 포스트 프로세싱 단계에서의 연기 렌더링 방법)

  • Park, Sanghyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.11
    • /
    • pp.1644-1652
    • /
    • 2022
  • In the case of safety training where practical training is impossible due to risk problems, training contents using realistic media such as virtual reality or augmented reality are becoming a new alternative. In this paper, we propose a smoke modeling method that can be applied to safety-training contents implemented with realistic media technology. When an accident occurs in a hazardous area such as a petrochemical plant, visibility is not secured due to gas leakage and fire. In order to create such a situation, it is important to realistically express smoke. The proposed method is a smoke model implementation technique that can be effectively applied to the background of complex passages and devices such as petrochemical plants. In the proposed method, the smoke is expressed using volumetric rendering in the post-processing stage for the resulting image of scene rendering. Implementation results in the background of the factory show that the proposed method produces models that can express the smoke realistically.

Geospatial Data Display Technique for Non-Glasses Stereoscopic Monitor (무안경식 입체 모니터를 이용한 지형공간 데이터의 디스플레이 기법)

  • Lee, Seun-Geun;Lee, Dong-Cheon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.26 no.6
    • /
    • pp.599-609
    • /
    • 2008
  • Development of computer and electronic technology leads innovative progress in spatial informatics and successful commercialization. Geospatial information technology plays an important role in decision making in various applications. However, information display media are two-dimensional plane that limits visual perception. Understanding human visual processing mechanism to percept stereo vision makes possible to implement three-dimensional stereo image display. This paper proposes on-the-fly stereo image generation methods that are involved with various exterior and camera parameters including exposure station, viewing direction, image size, overlap and focal length. Collinearity equations and parameters related with stereo viewing conditions were solved to generate realisitc stereo imagery. In addition stereo flying simulation scenery was generated with different viewing locations and directions. The stereo viewing is based on the parallax principle of two veiwing locations. This study implemented anaglyphic stereogram, polarization and lenticular stereo display methods. Existing display technology has limitation to provide visual information of three-dimensional and dynamic nature of the real world because the 3D spatial information is projected into 2D plane. Therefore, stereo display methods developed in this study improves geospatial information and applications of GIS by realistic stereo visualization.

Development of the Algorithm for Traffic Accident Auto-Detection in Signalized Intersection (신호교차로 내 실시간 교통사고 자동검지 알고리즘 개발)

  • O, Ju-Taek;Im, Jae-Geuk;Hwang, Bo-Hui
    • Journal of Korean Society of Transportation
    • /
    • v.27 no.5
    • /
    • pp.97-111
    • /
    • 2009
  • Image-based traffic information collection systems have entered widespread adoption and use in many countries since these systems are not only capable of replacing existing loop-based detectors which have limitations in management and administration, but are also capable of providing and managing a wide variety of traffic related information. In addition, these systems are expanding rapidly in terms of purpose and scope of use. Currently, the utilization of image processing technology in the field of traffic accident management is limited to installing surveillance cameras on locations where traffic accidents are expected to occur and digitalizing of recorded data. Accurately recording the sequence of situations around a traffic accident in a signal intersection and then objectively and clearly analyzing how such accident occurred is more urgent and important than anything else in resolving a traffic accident. Therefore, in this research, we intend to present a technology capable of overcoming problems in which advanced existing technologies exhibited limitations in handling real-time due to large data capacity such as object separation of vehicles and tracking, which pose difficulties due to environmental diversities and changes at a signal intersection with complex traffic situations, as pointed out by many past researches while presenting and implementing an active and environmentally adaptive methodology capable of effectively reducing false detection situations which frequently occur even with the Gaussian complex model analytical method which has been considered the best among well-known environmental obstacle reduction methods. To prove that the technology developed by this research has performance advantage over existing automatic traffic accident recording systems, a test was performed by entering image data from an actually operating crossroad online in real-time. The test results were compared with the performance of other existing technologies.

Comparative Analysis of CNN Deep Learning Model Performance Based on Quantification Application for High-Speed Marine Object Classification (고속 해상 객체 분류를 위한 양자화 적용 기반 CNN 딥러닝 모델 성능 비교 분석)

  • Lee, Seong-Ju;Lee, Hyo-Chan;Song, Hyun-Hak;Jeon, Ho-Seok;Im, Tae-ho
    • Journal of Internet Computing and Services
    • /
    • v.22 no.2
    • /
    • pp.59-68
    • /
    • 2021
  • As artificial intelligence(AI) technologies, which have made rapid growth recently, began to be applied to the marine environment such as ships, there have been active researches on the application of CNN-based models specialized for digital videos. In E-Navigation service, which is combined with various technologies to detect floating objects of clash risk to reduce human errors and prevent fires inside ships, real-time processing is of huge importance. More functions added, however, mean a need for high-performance processes, which raises prices and poses a cost burden on shipowners. This study thus set out to propose a method capable of processing information at a high rate while maintaining the accuracy by applying Quantization techniques of a deep learning model. First, videos were pre-processed fit for the detection of floating matters in the sea to ensure the efficient transmission of video data to the deep learning entry. Secondly, the quantization technique, one of lightweight techniques for a deep learning model, was applied to reduce the usage rate of memory and increase the processing speed. Finally, the proposed deep learning model to which video pre-processing and quantization were applied was applied to various embedded boards to measure its accuracy and processing speed and test its performance. The proposed method was able to reduce the usage of memory capacity four times and improve the processing speed about four to five times while maintaining the old accuracy of recognition.

Study of KINECT based 3D Holographic and Gesture (KINECT 기반 3D 홀로그래픽과 제스처에 대한 연구)

  • Jiang, Zhou;Seo, Laiwon;Roh, Changbae
    • Journal of Digital Contents Society
    • /
    • v.14 no.4
    • /
    • pp.411-417
    • /
    • 2013
  • Two-dimensional image processing method and tools Rigi then developed a report prepared by a variety of video and three-dimensional images are increasing demands for navigation. The hard part to experience in the real world experience in the virtual environment, and has the purpose to take advantage of. This is a system that provides a simple 3D background, but everyday actions that can control the system with the needs of an instinctive interface technology means. The purpose of this study a variety of human behavior using the Kinect device in action close to the three-dimensional technology to develop a new navigation control is Kinect Holography and 3D images using the input data so that you have the linkage is to design the system.