• Title/Summary/Keyword: 시각객체

Search Result 494, Processing Time 0.022 seconds

Visualization System for Dance Movement Feedback using MediaPipe (MediaPipe를 활용한 춤동작 피드백 시각화 시스템)

  • Hyeon-Seo Kim;Jae-Yeung Jeong;Bong-Jun Choi;Mi-Kyeong Moon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.1
    • /
    • pp.217-224
    • /
    • 2024
  • With the rapid growth of K-POP, the dance content industry is spreading. With the recent increase in the spread of SNS, they also shoot and share their dance videos. However, it is not easy for dance beginners who are new to dancing to learn dance moves because it is difficult to receive objective feedback when dancing alone while watching videos. This paper describes a system that uses MediaPipe to compare choreography videos and dance videos of users and detect whether they are following the movement correctly. This study proposes a method of giving feedback based on Color Map to users by calculating the similarity of dance movements between user images taken with webcam or camera and choreography images using cosine similarity and COCO OKS. Through this system, objective feedback on users' dance movements can be visually received, and beginners are expected to be able to learn accurate dance movements.

Video Scene Detection using Shot Clustering based on Visual Features (시각적 특징을 기반한 샷 클러스터링을 통한 비디오 씬 탐지 기법)

  • Shin, Dong-Wook;Kim, Tae-Hwan;Choi, Joong-Min
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.47-60
    • /
    • 2012
  • Video data comes in the form of the unstructured and the complex structure. As the importance of efficient management and retrieval for video data increases, studies on the video parsing based on the visual features contained in the video contents are researched to reconstruct video data as the meaningful structure. The early studies on video parsing are focused on splitting video data into shots, but detecting the shot boundary defined with the physical boundary does not cosider the semantic association of video data. Recently, studies on structuralizing video shots having the semantic association to the video scene defined with the semantic boundary by utilizing clustering methods are actively progressed. Previous studies on detecting the video scene try to detect video scenes by utilizing clustering algorithms based on the similarity measure between video shots mainly depended on color features. However, the correct identification of a video shot or scene and the detection of the gradual transitions such as dissolve, fade and wipe are difficult because color features of video data contain a noise and are abruptly changed due to the intervention of an unexpected object. In this paper, to solve these problems, we propose the Scene Detector by using Color histogram, corner Edge and Object color histogram (SDCEO) that clusters similar shots organizing same event based on visual features including the color histogram, the corner edge and the object color histogram to detect video scenes. The SDCEO is worthy of notice in a sense that it uses the edge feature with the color feature, and as a result, it effectively detects the gradual transitions as well as the abrupt transitions. The SDCEO consists of the Shot Bound Identifier and the Video Scene Detector. The Shot Bound Identifier is comprised of the Color Histogram Analysis step and the Corner Edge Analysis step. In the Color Histogram Analysis step, SDCEO uses the color histogram feature to organizing shot boundaries. The color histogram, recording the percentage of each quantized color among all pixels in a frame, are chosen for their good performance, as also reported in other work of content-based image and video analysis. To organize shot boundaries, SDCEO joins associated sequential frames into shot boundaries by measuring the similarity of the color histogram between frames. In the Corner Edge Analysis step, SDCEO identifies the final shot boundaries by using the corner edge feature. SDCEO detect associated shot boundaries comparing the corner edge feature between the last frame of previous shot boundary and the first frame of next shot boundary. In the Key-frame Extraction step, SDCEO compares each frame with all frames and measures the similarity by using histogram euclidean distance, and then select the frame the most similar with all frames contained in same shot boundary as the key-frame. Video Scene Detector clusters associated shots organizing same event by utilizing the hierarchical agglomerative clustering method based on the visual features including the color histogram and the object color histogram. After detecting video scenes, SDCEO organizes final video scene by repetitive clustering until the simiarity distance between shot boundaries less than the threshold h. In this paper, we construct the prototype of SDCEO and experiments are carried out with the baseline data that are manually constructed, and the experimental results that the precision of shot boundary detection is 93.3% and the precision of video scene detection is 83.3% are satisfactory.

Region-based Building Extraction of High Resolution Satellite Images Using Color Invariant Features (색상 불변 특징을 이용한 고해상도 위성영상의 영역기반 건물 추출)

  • Ko, A-Reum;Byun, Young-Gi;Park, Woo-Jin;Kim, Yong-Il
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.2
    • /
    • pp.75-87
    • /
    • 2011
  • This paper presents a method for region-based building extraction from high resolution satellite images(HRSI) using integrated information of spectral and color invariant features without user intervention such as selecting training data sets. The purpose of this study is also to evaluate the effectiveness of the proposed method by applying to IKONOS and QuickBird images. Firstly, the image is segmented by the MSRG method. The vegetation and shadow regions are automatically detected and masked to facilitate the building extraction. Secondly, the region merging is performed for the masked image, which the integrated information of the spectral and color invariant features is used. Finally, the building regions are extracted using the shape feature for the merged regions. The boundaries of the extracted buildings are simplified using the generalization techniques to improve the completeness of the building extraction. The experimental results showed more than 80% accuracy for two study areas and the visually satisfactory results obtained. In conclusion, the proposed method has shown great potential for the building extraction from HRSI.

A Real-Time Stereoscopic Image Conversion Method Based on A Single Frame (단일 프레임 기반의 실시간 입체 영상 변환 방법)

  • Jung Jae-Sung;Cho Hwa-Hyun;Choi Myung-Ryul
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.1 s.307
    • /
    • pp.45-52
    • /
    • 2006
  • In this paper, a real-time stereoscopic image conversion method using a single frame from a 2-D image is proposed. The Stereoscopic image is generated by creating depth map using vortical position information and parallax processing. For a real-time processing of stereoscopic conversion and reduction of hardware complexity, it uses image sampling, object segmentation by standardizing luminance and depth map generation by boundary scan. The proposed method offers realistic 3-D effect regardless of the direction, velocity and scene conversion of the 2-D image. It offers effective stereoscopic conversion using images suitable conditions assumed in this paper such as recorded image at long distance, landscape and panorama photo because it creates different depth sense using vertical position information from a single frame. The proposed method can be applied to still image because it uses a single frame from a 2-D image. The proposed method has been evaluated using visual test and APD for comparing the stereoscopic image of the proposed method with that of MTD. It is confirmed that stereoscopic images conversed by the proposed method offers 3-D effect regardless of the direction and velocity of the 2-D image.

A Multi-dimensional Range Query Processing using Space Filling Curves (공간 순서화 곡선을 이용한 다차원 영역 질의 처리)

  • Back, Hyun;Won, Jung-Im;Yoon, Jee-Hee
    • Journal of Korea Spatial Information System Society
    • /
    • v.8 no.2 s.17
    • /
    • pp.13-38
    • /
    • 2006
  • Range query is one of the most important operations for spatial objects, it retrieves all spatial objects that overlap a given query region in multi-dimensional space. The DOT(DOuble Transformation) is known as an efficient indexing methods, it transforms the MBR of a spatial object into a single numeric value using a space filling curve, and stores the value in a $B^+$-tree. The DOT index is possible to be employed as a primary index for spatial objects. However, the range query processing based on the DOT index requires much overhead for spatial transformations to get the query region in the final space. Also, the detailed range query processing method for 2-dimensional spatial objects has not been studied yet in this paper, we propose an efficient multi-dimensional range query processing technique based on the DOT index. The proposed technique exploits the regularities in the moving patterns of space filling curves to divide a query region into a set of maximal sub-legions within which space filling curves traverse without interruption. Such division reduces the number of spatial transformations required to perform the range query and thus improves the performance of range query processing. A visual simulator is developed to show the evaluation method and the performance of our technique.

  • PDF

A Study on Land Cover Map of UAV Imagery using an Object-based Classification Method (객체기반 분류기법을 이용한 UAV 영상의 토지피복도 제작 연구)

  • Shin, Ji Sun;Lee, Tae Ho;Jung, Pil Mo;Kwon, Hyuk Soo
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.23 no.4
    • /
    • pp.25-33
    • /
    • 2015
  • The study of ecosystem assessment(ES) is based on land cover information, and primarily it is performed at the global scale. However, these results as data for decision making have a limitation at the aspects of range and scale to solve the regional issue. Although the Ministry of Environment provides available land cover data at the regional scale, it is also restricted in use due to the intrinsic limitation of on screen digitizing method and temporal and spatial difference. This study of objective is to generate UAV land cover map. In order to classify the imagery, we have performed resampling at 5m resolution using UAV imagery. The results of object-based image segmentation showed that scale 20 and merge 34 were the optimum weight values for UAV imagery. In the case of RapidEye imagery;we found that the weight values;scale 30 and merge 30 were the most appropriate at the level of land cover classes for sub-category. We generated land cover imagery using example-based classification method and analyzed the accuracy using stratified random sampling. The results show that the overall accuracies of RapidEye and UAV classification imagery are each 90% and 91%.

A Study on the Design and Implementation of Multi-Disaster Drone System Using Deep Learning-Based Object Recognition and Optimal Path Planning (딥러닝 기반 객체 인식과 최적 경로 탐색을 통한 멀티 재난 드론 시스템 설계 및 구현에 대한 연구)

  • Kim, Jin-Hyeok;Lee, Tae-Hui;Han, Yamin;Byun, Heejung
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.10 no.4
    • /
    • pp.117-122
    • /
    • 2021
  • In recent years, human damage and loss of money due to various disasters such as typhoons, earthquakes, forest fires, landslides, and wars are steadily occurring, and a lot of manpower and funds are required to prevent and recover them. In this paper, we designed and developed a disaster drone system based on artificial intelligence in order to monitor these various disaster situations in advance and to quickly recognize and respond to disaster occurrence. In this study, multiple disaster drones are used in areas where it is difficult for humans to monitor, and each drone performs an efficient search with an optimal path by applying a deep learning-based optimal path algorithm. In addition, in order to solve the problem of insufficient battery capacity, which is a fundamental problem of drones, the optimal route of each drone is determined using Ant Colony Optimization (ACO) technology. In order to implement the proposed system, it was applied to a forest fire situation among various disaster situations, and a forest fire map was created based on the transmitted data, and a forest fire map was visually shown to the fire fighters dispatched by a drone equipped with a beam projector. In the proposed system, multiple drones can detect a disaster situation in a short time by simultaneously performing optimal path search and object recognition. Based on this research, it can be used to build disaster drone infrastructure, search for victims (sea, mountain, jungle), self-extinguishing fire using drones, and security drones.

YOLO-based Traffic Signal Detection for Identifying the Violation of Motorbike Riders (YOLO 기반의 교통 신호등 인식을 통한 오토바이 운전자의 신호 위반 여부 확인)

  • Wahyutama, Aria Bisma;Hwang, Mintae
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.141-143
    • /
    • 2022
  • This paper presented a new technology to identify traffic violations of motorbike riders by detecting the traffic signal using You Only Look Once (YOLO) object detection. The hardware module that is mounted on the front of the motorbike consists of Raspberry Pi with a camera to run the YOLO object detection, a GPS module to acquire the motorcycle's coordinate, and a LoRa communication module to send the data to a cloud DB. The main goal of the software is to determine whether a motorbike has violated a traffic signal. This paper proposes a function to recognize the red traffic signal colour with its movement inside the camera angle and determine that the traffic signal violation happens if the traffic signal is moving to the right direction (the rider turns left) or moving to the top direction (the riders goes straight). Furthermore, if a motorbike rider is violated the signal, the rider's personal information (name, mobile phone number, etc), the snapshot of the violation situation, rider's location, and date/time will be sent to a cloud DB. The violation information will be delivered to the driver's smartphone as a push notification and the local police station to be used for issuing violation tickets, which is expected to prevent motorbike riders from violating traffic signals.

  • PDF

Deduction of R-PLM technology development consideration (R-PLM 기술 개발 시 고려사항 도출에 관한 연구)

  • Kang, Tae-Wook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.7
    • /
    • pp.4618-4625
    • /
    • 2014
  • The aim of this study was to develop a R-PLM (Railway BIM-based Product Lifecycle Management) technology. The railway engineering productivity can be maximized if the railway product is created using the object modeling method and the railway lifecycle process is managed effectively. Recently, technology known as BIM and PLM was applied to the construction industry to improve the productivity. To define the R-PLM functions, the expert interview and a practitioner interview were conducted to identify the expected outcome and obstacles to R-PLM technology. As a result, the differences between the occupation and the priority of those were derived. Finally, R-PLM technology development consideration was suggested based on the interview results.

A study on the three dimensional paper doll development with augmented reality technology (증강현실기술이 적용된 3D 인형놀이 개발에 대한 연구)

  • Kim, Tae-Eun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.9 no.3
    • /
    • pp.297-302
    • /
    • 2014
  • The AR(Augmented Reality) is the technique that make the fusion between real world and virtual object. The augmented object on real world can provide the visualization of digital information to user. Also, the real-time AR system makes the interaction between user and computers. Therefore, the immersion of user be increasing and help the information transfer to user with AR system. In this paper, we implement the AR system with the modeled VRML as a three-dimensional object using programming language. The application technique is proposed that augment the various virtual clothes to three-dimensional avatar. Moreover, we propose the novel interface by using marker that can be increase the immersion of user.