• Title/Summary/Keyword: scene image

Search Result 947, Processing Time 0.031 seconds

Restoration of Landsat ETM+ SLC-off Gaps Using SPOT Image (SPOT 영상을 이용한 Landsat-7의 SLC-off 영상 복원)

  • Kim Hye-Jin;Yu Ki-Yun;Kim Yong-Il
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
    • /
    • 2006.04a
    • /
    • pp.229-234
    • /
    • 2006
  • On May 31, 2003. Landsat 7 experienced an anomaly causing the Scan Line Corrector(SLC) to stop functioning normally. The SLC-off causes individual scan lines to alternately overlap and then leave large gaps at the edge of the Image. A many scientists with ongoing experience using ETM+ data evaluated the scientific usability and validity of Landsat 7 products containing the SLC anomaly The best reference scene for gap-filling is the other SLC-on Landsat scene that provide same resolution, few changes, and similar data acquisition. But receiving of Landsat imagery is not stable in Korea. So SPOT image can be another alternative solution because it is a steady-state multispectral satellite image as Landsat image. In this study, we filled the SLC-off gap s of 2, 3, 4 bands using SPOT image by a local regression technique, and assigned the optimum spectral value to gaps of 1, 5, 7 bands based on a spectral adjacency. Through this process, we could restore Landsat SLC-off image and evaluated the accuracy of the results.

  • PDF

Color Correction Using Chromaticity of Highlight Region in Multi-Scaled Retinex

  • Jang, In-Su;Park, Kee-Hyon;Ha, Yeong-Ho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.59-62
    • /
    • 2009
  • In general, as a dynamic range of digital still camera is narrower than a real scene‘s, it is hard to represent the shadow region of scene. Thus, multi-scaled retinex algorithm is used to improve detail and local contrast of the shadow region in an image by dividing the image by its local average images through Gaussian filtering. However, if the chromatic distribution of the original image is not uniform and dominated by a certain chromaticity, the chromaticity of the local average image depends on the dominant chromaticity of original image, thereby the colors of the resulting image are shifted to a complement color to the dominant chromaticity. In this paper, a modified multi-scaled retinex method to reduce the influence of the dominant chromaticity is proposed. In multi-scaled retinex process, the local average images obtained by Gaussian filtering are divided by the average chromaticity values of the original image in order to reduce the influence of dominant chromaticity. Next, the chromaticity of illuminant is estimated in highlight region and the local average images are corrected by the estimated chromaticity of illuminant. In experiment, results show that the proposed method improved the local contrast and detail without color distortion.

  • PDF

A Study on the Expression of Symbolism in the Production of Animation for the Original Work 'Grave of the Fireflies(火垂 墓)' ('반딧불의 묘' 원작에 대한 애니메이션 연출의 상징성 표현 연구)

  • Kim Il-Tae;No Su-Ah
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.4
    • /
    • pp.111-121
    • /
    • 2005
  • The appearance of digital culture swiftly has changed the culture in domestic and international arenas before and after the year 2004 and the image and animation have become two of the most important expression media in contemporary age. Among the Japanese animations that have demonstrated the rapid development of cartoon and animation in the world, the director Dakahata Isao's 'Graves of the Fireflies' that has influenced many works has been evaluated as one of the noticeable works that has a unique method and scenario dramatization in terms of producing the original novel into an animation. This study investigates the metaphor and symbolism shown in this work according to each sequence, divides the production ability in the work into three elements and applies them to the important elements such as camera, colors and mise-en-scene when the original work is depicted into image. It can be summarized in more detail as in the following: firstly, I study the rhythm of camera corresponding to the symbolism of the angle that the camera has and production; secondly, I analyze the artistic elements appeared in the process of expressing the original work into the image, especially the production for the colors and symbolism contained in them and the composition of screen. Thirdly, I analyze how effectively the atmosphere for the situations for the original work is expressed in animation with the aid of one of the image elements, mis-en-scene. It is expected that the analyzed findings will be effective as a way of overcoming the limitation of expressions that the original work in text and the study on these processes will become good examples to the relevant workers and will be the good references to the producers who are interested in the creation of animation in Korea.

  • PDF

Modeling the Visual Target Search in Natural Scenes

  • Park, Daecheol;Myung, Rohae;Kim, Sang-Hyeob;Jang, Eun-Hye;Park, Byoung-Jun
    • Journal of the Ergonomics Society of Korea
    • /
    • v.31 no.6
    • /
    • pp.705-713
    • /
    • 2012
  • Objective: The aim of this study is to predict human visual target search using ACT-R cognitive architecture in real scene images. Background: Human uses both the method of bottom-up and top-down process at the same time using characteristics of image itself and knowledge about images. Modeling of human visual search also needs to include both processes. Method: In this study, visual target object search performance in real scene images was analyzed comparing experimental data and result of ACT-R model. 10 students participated in this experiment and the model was simulated ten times. This experiment was conducted in two conditions, indoor images and outdoor images. The ACT-R model considering the first saccade region through calculating the saliency map and spatial layout was established. Proposed model in this study used the guide of visual search and adopted visual search strategies according to the guide. Results: In the analysis results, no significant difference on performance time between model prediction and empirical data was found. Conclusion: The proposed ACT-R model is able to predict the human visual search process in real scene images using salience map and spatial layout. Application: This study is useful in conducting model-based evaluation in visual search, particularly in real images. Also, this study is able to adopt in diverse image processing program such as helper of the visually impaired.

A Study on the Recognition of Polyhedral Object using 3-D Information (3차원 정보를 이용한 다면체의 물제인식에 관한 연구)

  • 김영일;우동임;백남칠;우광방
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.38 no.6
    • /
    • pp.458-469
    • /
    • 1989
  • A measurement method is proposed which finds 3-D position and attitude of a known polyhedra utilizing shading information. Through the systematic interpretation of relations between polyhedra and its image as well as shadow image and also the determination of candidate position, 3-D information with respect to vertex of polyhedra is extracted. Following preprocessing of this information, the image of polyhedra is represented in terms of the scene with positioned object and the correspondence is searched by means of matching process between a scene description of the object and the correspondence is searched by means of matching process between a scene description of the object and a model description stored in data-base. In the experiments, initially 3-D information is employed to select several model regions, and objects are recognized through matching process with respect to scene regions. The results demonstrate that the recognition system performs with a high efficiency by proper selection of the threshold values.

Arabic Words Extraction and Character Recognition from Picturesque Image Macros with Enhanced VGG-16 based Model Functionality Using Neural Networks

  • Ayed Ahmad Hamdan Al-Radaideh;Mohd Shafry bin Mohd Rahim;Wad Ghaban;Majdi Bsoul;Shahid Kamal;Naveed Abbas
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.7
    • /
    • pp.1807-1822
    • /
    • 2023
  • Innovation and rapid increased functionality in user friendly smartphones has encouraged shutterbugs to have picturesque image macros while in work environment or during travel. Formal signboards are placed with marketing objectives and are enriched with text for attracting people. Extracting and recognition of the text from natural images is an emerging research issue and needs consideration. When compared to conventional optical character recognition (OCR), the complex background, implicit noise, lighting, and orientation of these scenic text photos make this problem more difficult. Arabic language text scene extraction and recognition adds a number of complications and difficulties. The method described in this paper uses a two-phase methodology to extract Arabic text and word boundaries awareness from scenic images with varying text orientations. The first stage uses a convolution autoencoder, and the second uses Arabic Character Segmentation (ACS), which is followed by traditional two-layer neural networks for recognition. This study presents the way that how can an Arabic training and synthetic dataset be created for exemplify the superimposed text in different scene images. For this purpose a dataset of size 10K of cropped images has been created in the detection phase wherein Arabic text was found and 127k Arabic character dataset for the recognition phase. The phase-1 labels were generated from an Arabic corpus of quotes and sentences, which consists of 15kquotes and sentences. This study ensures that Arabic Word Awareness Region Detection (AWARD) approach with high flexibility in identifying complex Arabic text scene images, such as texts that are arbitrarily oriented, curved, or deformed, is used to detect these texts. Our research after experimentations shows that the system has a 91.8% word segmentation accuracy and a 94.2% character recognition accuracy. We believe in the future that the researchers will excel in the field of image processing while treating text images to improve or reduce noise by processing scene images in any language by enhancing the functionality of VGG-16 based model using Neural Networks.

Tree-Based Static/Dynamic Image Mosaicing (트리 기반 정적/동적 영상 모자이크)

  • Kang, Oh-hyung;Rhee, Yang-won
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.4
    • /
    • pp.758-766
    • /
    • 2003
  • This paper proposes a tree-based hierarchical image mosaicing system using camera and object parameters for efficient video database construction. Gray level histogram difference and average intensity difference are proposed for scene change detection of input video. Camera parameter measured by utilizing least sum of square difference and affine model, and difference image is used for similarity measure of two input images. Also, dynamic objects are searched by through macro block setting and extracted by using region splitting and 4-split detection methods. Dynamic trajectory evaluation function is used for expression of dynamic objects, and blurring is performed for construction of soft and slow mosaic image.

Development of 3D Stereoscopic Image Generation System Using Real-time Preview Function in 3D Modeling Tools

  • Yun, Chang-Ok;Yun, Tae-Soo;Lee, Dong-Hoon
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.6
    • /
    • pp.746-754
    • /
    • 2008
  • A 3D stereoscopic image is generated by interdigitating every scene with video editing tools that are rendered by two cameras' views in 3D modeling tools, like Autodesk MAX(R) and Autodesk MAYA(R). However, the depth of object from a static scene and the continuous stereo effect in the view of transformation, are not represented in a natural method. This is because after choosing the settings of arbitrary angle of convergence and the distance between the modeling and those two cameras, the user needs to render the view from both cameras. So, the user needs a process of controlling the camera's interval and rendering repetitively, which takes too much time. Therefore, in this paper, we will propose the 3D stereoscopic image editing system for solving such problems as well as exposing the system's inherent limitations. We can generate the view of two cameras and can confirm the stereo effect in real-time on 3D modeling tools. Then, we can intuitively determine immersion of 3D stereoscopic image in real-time, by using the 3D stereoscopic image preview function.

  • PDF

Implementation of Multispectral Imaging System (멀티스펙트럼 영상 획득 시스템 구현)

  • Jin, Yoon-Jong;Lee, Moon-Hyun;Noh, Sung-Kyu;Park, Jong-Il
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.717-721
    • /
    • 2008
  • This paper proposes an image system that can efficiently measure the spectral reflectance of a scene using RGB cameras and LED light sources. Multispectral imaging system is composed of LED controllers, LED clusters and RGB cameras. It captures full-spectral images at real-time. The system adopts a simple, empirical linear model to estimate the full spectral reflectance at each pixel. Since the model is linear, the reconstruction is efficient and stable. We estimated the spectral reflectance of various scenes using the system and showed the effectiveness of the proposed system.

  • PDF

Coupled Line Cameras as a New Geometric Tool for Quadrilateral Reconstruction (사각형 복원을 위한 새로운 기하학적 도구로서의 선분 카메라 쌍)

  • Lee, Joo-Haeng
    • Korean Journal of Computational Design and Engineering
    • /
    • v.20 no.4
    • /
    • pp.357-366
    • /
    • 2015
  • We review recent research results on coupled line cameras (CLC) as a new geometric tool to reconstruct a scene quadrilateral from image quadrilaterals. Coupled line cameras were first developed as a camera calibration tool based on geometric insight on the perspective projection of a scene rectangle to an image plane. Since CLC comprehensively describes the relevant projective structure in a single image with a set of simple algebraic equations, it is also useful as a geometric reconstruction tool, which is an important topic in 3D computer vision. In this paper we first introduce fundamentals of CLC with reals examples. Then, we cover the related works to optimize the initial solution, to extend for the general quadrilaterals, and to apply for cuboidal reconstruction.