• Title/Summary/Keyword: single-image detection

Search Result 357, Processing Time 0.027 seconds

A Method for Structuring Digital Video

  • Lee, Jae-Yeon;Jeong, Se-Yoon;Yoon, Ho-Sub;Kim, Kyu-Heon;Bae, Younglae-J;Jang, Jong-whan
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1998.06b
    • /
    • pp.92-97
    • /
    • 1998
  • For the efficient searching and browsing of digital video, it is essential to extract the internal structure of the video contents. As an example, a news video consists of several sections such as politics, economics, sports and others, and also each section consists of individual topics. With this information in hand, users can ore easily access the required video frames. This paper addresses the problem of automatic shot boundary detection and selection of representative frames (R-frames), which are the essential step in recognizing the internal structure of video contents. In the shot boundary detection, a new algorithm that have dual detectors which are designed specifically for the abrupt boundaries (cuts) and gradually changing bounaries respectively is proposed. Compared to the existing 미algorithms that mostly have tried to detect both types by a single mechanism, the proposed algorithm is proved to be more robust and accurate. Also in the problem of R-frame selection, simple mechanical approaches such as selecting one frame every other second have been adopted. However this approach often selects too many R-frames in static short, while drops important frames in dynamic shots. To improve the selection mechanism, a new R-frame selection algorithm that uses motion information extracted from pixel difference is proposed.

  • PDF

Road Sign Recognition and Geo-content Creation Schemes for Utilizing Road Sign Information (도로표지 정보 활용을 위한 도로표지 인식 및 지오콘텐츠 생성 기법)

  • Seung, Teak-Young;Moon, Kwang-Seok;Lee, Suk-Hwan;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.2
    • /
    • pp.252-263
    • /
    • 2016
  • Road sign is an important street furniture that gives some information such as road conditions, driving direction and condition for a driver. Thus, road sign is a major target of image recognition for self-driving car, ADAS(autonomous vehicle and intelligent driver assistance systems), and ITS(intelligent transport systems). In this paper, an enhanced road sign recognition system is proposed for MMS(Mobile Mapping System) using the single camera and GPS. For the proposed system, first, a road sign recognition scheme is proposed. this scheme is composed of detection and classification step. In the detection step, object candidate regions are extracted in image frames using hybrid road sign detection scheme that is based on color and shape features of road signs. And, in the classification step, the area of candidate regions and road sign template are compared. Second, a Geo-marking scheme for geo-content that is consist of road sign image and coordinate value is proposed. If the serious situation such as car accident is happened, this scheme can protect geographical information of road sign against illegal users. By experiments with test video set, in the three parts that are road sign recognition, coordinate value estimation and geo-marking, it is confirmed that proposed schemes can be used for MMS in commercial area.

Crack Detection and Sorting of Eggs by Image Processing (영상처리에 의한 계란의 파란 검출 및 선별)

  • Cho, H.K.;Kwon, Y.;Cho, S.K.
    • Korean Journal of Poultry Science
    • /
    • v.22 no.4
    • /
    • pp.233-238
    • /
    • 1995
  • A computer vision system was built to generate images of a single, stationary egg. This system includes a CGD camera, a frame grabber, and incandescent back lighting system. Image processing algorithms were developed to inspect egg shell and to sort eggs. Those values of both gray level and area of dark spots in the egg image were used as criteria to detect holes in egg and those values of both area and roundness of dark spots in the egg image were used to detect cracks in egg. For a sample of 300 eggs, this system was able to correctly analyze an egg for the presence of a defect 97.5% of the time. The weights of eggs were found to be linear to both the projected area and the perimeter of eggs viewed from above. Those two values were used as criteria to sort eggs. The coefficients of determination(r$^2$) for the regression equations between weights and those two values were 0.967 and 0.972 in the two sets of experiment. Accuracies in grading were found to be 95.6% and 96.7% as compared with results from sizing by electronic weight scale.

  • PDF

Implementation of interactive 3D floating image pointing device (인터렉티브 3D 플로팅 영상 포인팅 장치의 구현)

  • Shin, Dong-Hak;Kim, Eun-Soo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.8
    • /
    • pp.1481-1487
    • /
    • 2008
  • In this paper, we propose a novel interactive 3D floating image pointing device for the use of 3D environment. The proposed system consists of the 3D floating image generation system by use of a floating lens array and the a user interface based on real-time finger detection. In the proposed system, a user selects single image among the floating images so that the interaction function are performed effectively by pressing the button event through the finger recognition using two cameras. To show the usefulness of the proposed system, we carry out the experiment and the preliminary results are presented.

Shadowing Area Detection in Image by HSI Color Model and Intensity Clustering (HSI 컬러모델 및 명도 군집화를 이용한 영상에서의 그림자영역 추출)

  • Choi, Yun-Woong;Jang, Young-Woon;Park, Jung-Nam;Cho, Gi-Sung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.26 no.5
    • /
    • pp.455-463
    • /
    • 2008
  • The shadows, which is generated when acquiring data using optical sensor, mutilates consistency of brightness for same objects in the images. Hence, it makes a trouble to interpret the ground information. This study is focused on detecting the shadowing area in the images. And only single image is used without any other data which is acquired from different source. Also, This study presents the method using HSI color model, especially, using I(intensity) information, and the intensity clustering algorithm. Then, we illuminate the effects of shadow by FFT(Fast Fourier Transform).

The navigation method of mobile robot using a omni-directional position detection system (전방향 위치검출 시스템을 이용한 이동로봇의 주행방법)

  • Ryu, Ji-Hyoung;Kim, Jee-Hong;Lee, Chang-Goo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.2
    • /
    • pp.237-242
    • /
    • 2009
  • Comparing with fixed-type Robots, Mobile Robots have the advantage of extending their workspaces. But this advantage need some sensors to detect mobile robot's position and find their goal point. This article describe the navigation teaching method of mobile robot using omni-directional position detection system. This system offers the brief position data to a processor with simple devices. In other words, when user points a goal point, this system revise the error by comparing its heading angle and position with the goal. For these processes, this system use a conic mirror and a single camera. As a result, this system reduce the image processing time to search the target for mobile robot navigation ordered by user.

Single-pixel Autofocus with Plasmonic Nanostructures

  • Seok, Godeun;Choi, Seunghwan;Kim, Yunkyung
    • Current Optics and Photonics
    • /
    • v.4 no.5
    • /
    • pp.428-433
    • /
    • 2020
  • Recently, the on-chip autofocus (AF) function has become essential to the CMOS image sensor. An auto-focus usually operates using phase detection of the photocurrent difference from a pair of AF pixels that have focused or defocused. However, the phase-detection method requires a pair of AF pixels for comparison of readout. Therefore, the pixel variation may reduce AF performance. In this paper, we propose a color-selective AF pixel with a plasmonic nanostructure in a 0.9 μ㎡ pixel. The suggested AF pixel requires one pixel for AF function. The plasmonic nanostructure uses metal-insulator-metal (MIM) stack arrays instead of a color filter (CF). The color filters are formed at the subwavelength, and they transmit the specific wavelength of light according to the stack period and incident angles. For the optical analysis of the pixel, a finite-difference time-domain (FDTD) simulation was conducted. The analysis showed that the MIM stack arrays in the pixels perform as an AF pixel. As the primary metric of AF performance, the resulting AF contrasts are 1.8 for the red pixels, 1.6 for green, and 1.5 blue. Based on the simulation results, we confirmed the autofocusing performance of the MIM stack arrays.

Experimental Results of Single Carrier Digital Modulation for Underwater Sensor Networks (수중 센서네트워크 구현을 위한 단일 반송파 디지털 변조기법의 실험적 고찰)

  • Kim, Se-Young;Han, Jeong-Woo;Kim, Ki-Man
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.1
    • /
    • pp.33-40
    • /
    • 2011
  • In this paper, underwater acoustic communication experiment was carried out to test a performance of single carrier digital modulation schemes. The communication experiment was performed at real sea and tested modulation schemes are ASK, FSK with non-coherent detection and QPSK with coherent detection. A modulated image data was transmitted with data rates of 600bps~3Kbps. From the results of BER of the demodulated signal, ASK and FSK show the achievable BER of $10^{-3}{\sim}10^{-4}$ without compensation techniques and QPSK show that of $10^{-4}$ with linear equalizer.

Object Dimension Estimation for Remote Visual Inspection in Borescope Systems

  • Kim, Hyun-Sik;Park, Yong-Suk
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.8
    • /
    • pp.4160-4173
    • /
    • 2019
  • Borescopes facilitate the inspection of areas inside machines and systems that are not directly accessible for visual inspection. They offer real-time, up-close access to confined and hard-to-access spaces without having to dismantle or destructure the object under inspection. Borescopes are ideal instruments for routine maintenance, quality inspection and monitoring of systems and structures. The main application being fault or defect detection, it is useful to have measuring capability to quantify object dimensions in a target area. High-end borescopes use multi-optic solutions to provide measurement information of viewed objects. Multi-optic solutions can provide accurate measurements at the expense of structural complexity and cost increase. Measuring functionality is often unavailable in low-end, single camera borescopes. In this paper, a single camera measurement solution that enables the size estimation of viewed objects is proposed. The proposed solution computes and overlays a scaled grid of known spacing value over the screen view, enabling the human inspector to estimate the size of the objects in view. The proposed method provides a simple means of measurement that is applicable to low-end borescopes with no built-in measurement capability.

Automatic assessment of post-earthquake buildings based on multi-task deep learning with auxiliary tasks

  • Zhihang Li;Huamei Zhu;Mengqi Huang;Pengxuan Ji;Hongyu Huang;Qianbing Zhang
    • Smart Structures and Systems
    • /
    • v.31 no.4
    • /
    • pp.383-392
    • /
    • 2023
  • Post-earthquake building condition assessment is crucial for subsequent rescue and remediation and can be automated by emerging computer vision and deep learning technologies. This study is based on an endeavour for the 2nd International Competition of Structural Health Monitoring (IC-SHM 2021). The task package includes five image segmentation objectives - defects (crack/spall/rebar exposure), structural component, and damage state. The structural component and damage state tasks are identified as the priority that can form actionable decisions. A multi-task Convolutional Neural Network (CNN) is proposed to conduct the two major tasks simultaneously. The rest 3 sub-tasks (spall/crack/rebar exposure) were incorporated as auxiliary tasks. By synchronously learning defect information (spall/crack/rebar exposure), the multi-task CNN model outperforms the counterpart single-task models in recognizing structural components and estimating damage states. Particularly, the pixel-level damage state estimation witnesses a mIoU (mean intersection over union) improvement from 0.5855 to 0.6374. For the defect detection tasks, rebar exposure is omitted due to the extremely biased sample distribution. The segmentations of crack and spall are automated by single-task U-Net but with extra efforts to resample the provided data. The segmentation of small objects (spall and crack) benefits from the resampling method, with a substantial IoU increment of nearly 10%.