• Title/Summary/Keyword: color images

Search Result 2,708, Processing Time 0.033 seconds

Content Based Image Retrieval using 8AB Representation of Spatial Relations between Objects (객체 위치 관계의 8AB 표현을 이용한 내용 기반 영상 검색 기법)

  • Joo, Chan-Hye;Chung, Chin-Wan;Park, Ho-Hyun;Lee, Seok-Lyong;Kim, Sang-Hee
    • Journal of KIISE:Databases
    • /
    • v.34 no.4
    • /
    • pp.304-314
    • /
    • 2007
  • Content Based Image Retrieval (CBIR) is to store and retrieve images using the feature description of image contents. In order to support more accurate image retrieval, it has become necessary to develop features that can effectively describe image contents. The commonly used low-level features, such as color, texture, and shape features may not be directly mapped to human visual perception. In addition, such features cannot effectively describe a single image that contains multiple objects of interest. As a result, the research on feature descriptions has shifted to focus on higher-level features, which support representations more similar to human visual perception like spatial relationships between objects. Nevertheless, the prior works on the representation of spatial relations still have shortcomings, particularly with respect to supporting rotational invariance, Rotational invariance is a key requirement for a feature description to provide robust and accurate retrieval of images. This paper proposes a high-level feature named 8AB (8 Angular Bin) that effectively describes the spatial relations of objects in an image while providing rotational invariance. With this representation, a similarity calculation and a retrieval technique are also proposed. In addition, this paper proposes a search-space pruning technique, which supports efficient image retrieval using the 8AB feature. The 8AB feature is incorporated into a CBIR system, and the experiments over both real and synthetic image sets show the effectiveness of 8AB as a high-level feature and the efficiency of the pruning technique.

Hand Gesture Recognition Algorithm Robust to Complex Image (복잡한 영상에 강인한 손동작 인식 방법)

  • Park, Sang-Yun;Lee, Eung-Joo
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.7
    • /
    • pp.1000-1015
    • /
    • 2010
  • In this paper, we propose a novel algorithm for hand gesture recognition. The hand detection method is based on human skin color, and we use the boundary energy information to locate the hand region accurately, then the moment method will be employed to locate the hand palm center. Hand gesture recognition can be separated into 2 step: firstly, the hand posture recognition: we employ the parallel NNs to deal with problem of hand posture recognition, pattern of a hand posture can be extracted by utilize the fitting ellipses method, which separates the detected hand region by 12 ellipses and calculates the white pixels rate in ellipse line. the pattern will be input to the NNs with 12 input nodes, the NNs contains 4 output nodes, each output node out a value within 0~1, the posture is then represented by composed of the 4 output codes. Secondly, the hand gesture tracking and recognition: we employed the Kalman filter to predict the position information of gesture to create the position sequence, distance relationship between positions will be used to confirm the gesture. The simulation have been performed on Windows XP to evaluate the efficiency of the algorithm, for recognizing the hand posture, we used 300 training images to train the recognizing machine and used 200 images to test the machine, the correct number is up to 194. And for testing the hand tracking recognition part, we make 1200 times gesture (each gesture 400 times), the total correct number is 1002 times. These results shows that the proposed gesture recognition algorithm can achieve an endurable job for detecting the hand and its' gesture.

Autonomous Mobile Robot System Using Adaptive Spatial Coordinates Detection Scheme based on Stereo Camera (스테레오 카메라 기반의 적응적인 공간좌표 검출 기법을 이용한 자율 이동로봇 시스템)

  • Ko Jung-Hwan;Kim Sung-Il;Kim Eun-Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.1C
    • /
    • pp.26-35
    • /
    • 2006
  • In this paper, an automatic mobile robot system for a intelligent path planning using the detection scheme of the spatial coordinates based on stereo camera is proposed. In the proposed system, face area of a moving person is detected from a left image among the stereo image pairs by using the YCbCr color model and its center coordinates are computed by using the centroid method and then using these data, the stereo camera embedded on the mobile robot can be controlled for tracking the moving target in real-time. Moreover, using the disparity map obtained from the left and right images captured by the tracking-controlled stereo camera system and the perspective transformation between a 3-D scene and an image plane, depth information can be detected. Finally, based-on the analysis of these calculated coordinates, a mobile robot system is derived as a intelligent path planning and a estimation. From some experiments on robot driving with 240 frames of the stereo images, it is analyzed that error ratio between the calculated and measured values of the distance between the mobile robot and the objects, and relative distance between the other objects is found to be very low value of $2.19\%$ and $1.52\%$ on average, respectably.

Application of Terahertz Spectroscopy and Imaging in the Diagnosis of Prostate Cancer

  • Zhang, Ping;Zhong, Shuncong;Zhang, Junxi;Ding, Jian;Liu, Zhenxiang;Huang, Yi;Zhou, Ning;Nsengiyumva, Walter;Zhang, Tianfu
    • Current Optics and Photonics
    • /
    • v.4 no.1
    • /
    • pp.31-43
    • /
    • 2020
  • The feasibility of the application of terahertz electromagnetic waves in the diagnosis of prostate cancer was examined. Four samples of incomplete cancerous prostatic paraffin-embedded tissues were examined using terahertz spectral imaging (TPI) system and the results obtained by comparing the absorption coefficient and refractive index of prostate tumor, normal prostate tissue and smooth muscle from one of the paraffin tissue masses examined were reported. Three hundred and sixty cases of absorption coefficients from one of the paraffin tissues examined were used as raw data to classify these three tissues using the Principal Component Analysis (PCA) and Least Squares Support Vector Machine (LS-SVM). An excellent classification with an accuracy of 92.22% in the prediction set was achieved. Using the distribution information of THz reflection signal intensity from sample surface and absorption coefficient of the sample, an attempt was made to use the TPI system to identify the boundaries of the different tissues involved (prostate tumors, normal and smooth muscles). The location of three identified regions in the terahertz images (frequency domain slice absorption coefficient imaging, 1.2 THz) were compared with those obtained from the histopathologic examination. The tissue tumor region had a distinctively visible color and could well be distinguished from other tissue regions in terahertz images. Results indicate that a THz spectroscopy imaging system can be efficiently used in conjunction with the proposed advanced computer-based mathematical analysis method to identify tumor regions in the paraffin tissue mass of prostate cancer.

Interpretation of Receiver Operating Characteristics (ROC) (ROC(receiver operating characteristics) 해석)

  • Kim Jae-Duk
    • Imaging Science in Dentistry
    • /
    • v.30 no.3
    • /
    • pp.155-158
    • /
    • 2000
  • The purpose of this paper is to explain the making procedure and the usage of receiver operating characteristic (ROC) curve for interpretation of radiographic images. The conventional radiograms obtained after the creation of the lesions in the acrylic plates and were enhanced in color. The observer were informed of which tooth to examine, the 'a priori' probability of a lesion present and the approximate diameter of the lesions. The two groups of films were interpreted separately by the same observer using the same rating scale. The following rating scale was used: A; definitely no lesion, B; probably no lesion, C; not sure, D; probably a lesion, and E; definitely a lesion. In analysis, for each observer the diagnostic results in terms of true positive (TP) and false positive (FP) decisions were plotted on a graph. The lowest point on the graph represents the TP and FP when only decisions designated as E according to the rating scale are included. The next point shows the TP and FP values when diagnoses designated as D are added and so forth. By connecting such plot points, a receiver operating characteristic (ROC) curves is obtained. The area under the curve represents the diagnostic accuracy resulting from a diagnostic performance at pure chance level and a value of 1.0 at perfect performance. This method has been known as an useful method to detect the minute difference for each radiographic technic, each observer and for the different lesion depths.

  • PDF

Media Research in Global Brand Timelapse Advertisement (글로벌 브랜드 타임랩스 광고에 나타난 영상 연구)

  • Yu, Jung-Sun;Chung, Jean-Hun
    • Journal of Digital Convergence
    • /
    • v.15 no.8
    • /
    • pp.333-340
    • /
    • 2017
  • Timelapse is an imaging technique that captures motion at regular intervals and then projects it at normal speed. We looked at Timelapse advertising images of global brands and presented a model for analyzing components and expression methods of Timelapse, a new image technique. In previous research, literature research, Internet data survey, and YouTube data were investigated. Continuous photography has been developed as an imaging technique, and we have examined the current production status applied to domestic and foreign documentary, domestic and foreign drama, film, and advertisement. In 2015-2016, I will analyze the techniques of iPhones (2016), Ralph Lauren Polo Ads (2015), and Canon EOS (2013) videos that use Timelapse techniques in their recent advertisements. The results show that the Timelapse component is a static element, the static motif is mainly an artificial structure, the place is outdoor, the color is taken at a time showing the characteristics of the place, and the layout is all centered. The dynamic motif is a moving object. The dynamic line consists of a story based on the object. The time is about 11-15 seconds, the longest is about 1 minute and 30 seconds, and the editing is mainly focused on the product with the brand logo emphasized. In conclusion, it is the role of the image to pay attention to the advertisement and catch the eye. In order to motivate the buyer's mind, it is necessary to direct and edit such as Timelapse, which stimulates the emotions inherent in the mind and stimulates the non-verbal symbols. Future research is likely to reveal various attempts at temporal editing of images.

A Quality-control Experiment Involving an Optical Televiewer Using a Fractured Borehole Model (균열모형시추공을 이용한 광학영상화검층 품질관리 시험)

  • Jeong, Seungho;Shin, Jehyun;Hwang, Seho;Kim, Ji-Soo
    • The Journal of Engineering Geology
    • /
    • v.30 no.1
    • /
    • pp.17-30
    • /
    • 2020
  • An optical televiewer is a geophysical logging device that produces continuous high-resolution full-azimuth images of a borehole wall using a light-emitting-diode and a complementary metal-oxide semiconductor image sensor to provide valuable information on subsurface discontinuities. Recently, borehole imaging logging has been applied in many fields, including ground subsidence monitoring, rock mass integrity evaluation, stress-induced fracture detection, and glacial annual-layer measurements in polar regions. Widely used commercial borehole imaging logging systems typically have limitations depending on equipment specifications, meaning that it is necessary to clearly verify the scope of applications while maintaining appropriate quality control for various borehole conditions. However, it is difficult to directly check the accuracy, implementation, and reliability for outcomes, as images derived from an optical televiewer constitute in situ data. In this study, we designed and constructed a modular fractured borehole model having similar conditions to a borehole environment to report unprecedented results regarding reliable data acquisition and processing. We investigate sonde magnetometer accuracy, color realization, and fracture resolution, and suggest data processing methods to obtain accurate aperture measurements. The experiment involving the fractured borehole model should enhance not only measurement quality but also interpretations of high-resolution and reliable optical imaging logs.

2D-to-3D Stereoscopic conversion: Depth estimation in monoscopic soccer videos (단일 시점 축구 비디오의 3차원 영상 변환을 위한 깊이지도 생성 방법)

  • Ko, Jae-Seung;Kim, Young-Woo;Jung, Young-Ju;Kim, Chang-Ick
    • Journal of Broadcast Engineering
    • /
    • v.13 no.4
    • /
    • pp.427-439
    • /
    • 2008
  • This paper proposes a novel method to convert monoscopic soccer videos to stereoscopic videos. Through the soccer video analysis process, we detect shot boundaries and classify soccer frames into long shot or non-long shot. In the long shot case, the depth mapis generated relying on the size of the extracted ground region. For the non-long shot case, the shot is further partitioned into three types by considering the number of ground blocks and skin blocks which is obtained by a simple skin-color detection method. Then three different depth assignment methods are applied to each non-long shot types: 1) Depth estimation by object region extraction, 2) Foreground estimation by using the skin block and depth value computation by Gaussian function, and 3)the depth map generation for shots not containing the skin blocks. This depth assignment is followed by stereoscopic image generation. Subjective evaluation comparing generated depth maps and corresponding stereoscopic images indicate that the proposed algorithm can yield the sense of depth from a single view images.

Separate Factor Caching Scheme for Mobile Web Service (모바일 웹 서비스를 위한 요소분할 캐싱 기법)

  • Sim, Kun-Jung;Kang, Eui-Sun;Kim, Jong-Keun;Ko, Hee-Ae;Lim, Young-Hwan
    • The KIPS Transactions:PartD
    • /
    • v.14D no.4 s.114
    • /
    • pp.447-458
    • /
    • 2007
  • The objective of this study is to provide faster mobile web service by improving performance of Contents Cache used for mobile web service in the existing Mobile Gate System. It was found that two elements existed in Mark-Up page transcoded by Contents Generator. One of the elements was dependent only on the requested DIDL page and Mark-Up type. The other was dependent on each of the requested DIDL page, Mark-Up type, size of mobile display 모바일 장치 to request service, type of images available and color depth count of the images available. The conventional Contents Cache saved the entire Mark-Up page to hold both of the two elements. This caused the problem where storage space was not effectively used because reusable elements were repetitively saved in cache memory domain due to change in one of the elements even though all the other elements were the same. As a result, a larger number of transcoded Mark-Up pages could not be saved in the same cache memory size. Therefore, in this study, Mark-Up pages transcoded by Contents Generator were divided into two elements and were separately saved. Also, in order to respond to the demand for replacing data in cache with new data, this study applied two algorithms of LFU and LRU. This study proposed the method to implement cache performance of faster speed by enabling to save more number of the transcoded Mark-Up pages in the same cache storage space.

Application of Image Processing Method to Evaluate Ultimate Strain of Rebar (철근의 한계상태변형률 평가를 위한 이미지 프로세싱의 적용)

  • Kim, Seong-Do;Jung, Chi-Young;Woo, Tae-Ryeon;Cheung, Jin-Hwan
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.20 no.3
    • /
    • pp.111-121
    • /
    • 2016
  • In this study, measurements were conducted by image processing to do an in-depth evaluation of strain of rebar in a uniaxial tension test. The distribution of strain and the necking region were evaluated. The image processing is used to analyze the color information of a colored image, so that the parts consistent with desired targets can be distinguished from the other parts. After this process, the image was converted to a binary one. Centroids of each target region are obtained in the binary images. After repeating such process on the images from starting point to the finishing point of the test, elongation between targets is calculated based on the centroid of each target. The tensile test were conducted on grade 60 #7(D22) and #9(D29) rebars fabricated in accordance with ASTM A615 standards. Strain results from image processing were compared to the results from a conventional strain gauge, in order to see the validity of the image processing. With the image processing, the measuring was possible in not only the initial elastic region but also the necking region of more than 0.5(50%) strain. The image processing can remove the measuring limits as long as the targets can be video recorded. It also can measure strain at various spots because the targets can easily be attached and detached. Thus it is concluded that the image processing helps overcome limits in strain measuring and will be used in various ways.