• Title/Summary/Keyword: Video representation

Search Result 195, Processing Time 0.027 seconds

Effective Method to Change Multimedia Scene Configuration Information Using DOM Update (DOM update를 이용한 효율적인 멀티미디어 장면 구성 정보 변경 방안)

  • Kim, Kyuheon;Park, JungWook;Kim, Byungchul
    • Journal of Broadcast Engineering
    • /
    • v.18 no.1
    • /
    • pp.43-58
    • /
    • 2013
  • Richmedia Service means that interactive media service can provide view with various multimedia elements(such as Video, Audio, Text) at same time. Various Multimedia elements can be serviced by Scene Description technology standards like BIFS(Binary Format for Scenes) and LASeR(Light Application Scene Representation). By providing Scene Component information, richmedia service is available to various multimedia services. so users is available to personalized services fitting temporal and spatial options. In conventional technology, when the scene is changed by user or service, mobile deletes the scene of configuration information and makes new scene of configuration information. this is a very inefficient way. In this paper, Propoesed that by using DOM(Document Object Model) method, to pass only the dynamic configuration part, changes scene method.

Human Motion Tracking based on 3D Depth Point Matching with Superellipsoid Body Model (타원체 모델과 깊이값 포인트 매칭 기법을 활용한 사람 움직임 추적 기술)

  • Kim, Nam-Gyu
    • Journal of Digital Contents Society
    • /
    • v.13 no.2
    • /
    • pp.255-262
    • /
    • 2012
  • Human motion tracking algorithm is receiving attention from many research areas, such as human computer interaction, video conference, surveillance analysis, and game or entertainment applications. Over the last decade, various tracking technologies for each application have been demonstrated and refined among them such of real time computer vision and image processing, advanced man-machine interface, and so on. In this paper, we introduce cost-effective and real-time human motion tracking algorithms based on depth image 3D point matching with a given superellipsoid body representation. The body representative model is made by using parametric volume modeling method based on superellipsoid and consists of 18 articulated joints. For more accurate estimation, we exploit initial inverse kinematic solution with classified body parts' information, and then, the initial pose is modified to more accurate pose by using 3D point matching algorithm.

A Study on the Characteristics of Onomatopoeia Subtitle in Korean and Chinese Variety TV Shows Based on Writing System (문자 체계에 따른 한중 예능 프로그램의 의성어 자막 특성 연구)

  • Wen Liang;Yoojin Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.3
    • /
    • pp.243-251
    • /
    • 2024
  • As digital video communication technology advances and global interactions become more frequent, cultural barriers between countries are gradually diminishing. Subtitles in TV content reflect the writing systems and cultural contexts of different countries, aiding in the comprehension of program content. However, when comparing subtitles between countries with different writing systems, variations in format and the representation of onomatopoeic expressions become apparent. Therefore, this study focuses on analyzing the differences and peculiarities in the onomatopoeic subtitles of Korean and Chinese variety shows, which are based on distinct writing systems. Through this analysis, the study aims to understand how differences in writing systems influence the representation of onomatopoeic subtitles and viewer experience. This investigation is expected to provide creative inspiration for variety show producers and facilitate cross-cultural communication.

Spatial-Temporal Scale-Invariant Human Action Recognition using Motion Gradient Histogram (모션 그래디언트 히스토그램 기반의 시공간 크기 변화에 강인한 동작 인식)

  • Kim, Kwang-Soo;Kim, Tae-Hyoung;Kwak, Soo-Yeong;Byun, Hye-Ran
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.12
    • /
    • pp.1075-1082
    • /
    • 2007
  • In this paper, we propose the method of multiple human action recognition on video clip. For being invariant to the change of speed or size of actions, Spatial-Temporal Pyramid method is applied. Proposed method can minimize the complexity of the procedures owing to select Motion Gradient Histogram (MGH) based on statistical approach for action representation feature. For multiple action detection, Motion Energy Image (MEI) of binary frame difference accumulations is adapted and then we detect each action of which area is represented by MGH. The action MGH should be compared with pre-learning MGH having pyramid method. As a result, recognition can be done by the analyze between action MGH and pre-learning MGH. Ten video clips are used for evaluating the proposed method. We have various experiments such as mono action, multiple action, speed and site scale-changes, comparison with previous method. As a result, we can see that proposed method is simple and efficient to recognize multiple human action with stale variations.

Channel Estimation and Prediction in Cross-Layer Design Using Side-information (크로스레이어 디자인에서 사이드 인포메이션을 활용한 채널 추정 및 예측)

  • Cho, Yong-Ju;Cha, Ji-Hun;Kim, Wook-Joong
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.797-800
    • /
    • 2011
  • The objective of MPEG Media Transport (MMT), which is on going standard, is to develop efficient delivery of media over packet based networks in an adaptive, progressive, download/streaming fashion over various IP based networks, including terrestrial, satellite and cable broadcast networks. In this paper we introduce utilization of signal strength information based on Cross Layer Design(CLD) to efficient multimedia delivery over wireless network in which in practice the wireless conditions can vary significantly. Many recent studies have shown that a significant improvement in wireless video throughput can be achieved by utilizing signal strength information on CLD [1][2]. Despite of its usefulness, however, it was difficult to employ signal strength information in rate adaptation applications due to different representation of signal strength information for each underlying wireless network. To that end, we proposed syntax and semantics of signal strength information in such a way that the information can be interpreted in the unified way. The proposed signal strength information was proposed for the MMT standardization.

A Study on Voluptuous Beauty of Females Found in Music Videos by Popular Music Genre (대중음악 장르별 뮤직비디오 의상에 나타난 여성 관능미에 관한 연구)

  • Seo, Eun-Hee;Choi, Jeong-Wook
    • Journal of the Korean Society of Costume
    • /
    • v.59 no.2
    • /
    • pp.154-168
    • /
    • 2009
  • This study aims on providing a design technique that expresses aesthetical elements by arranging the analysis of sensual beauty into detailed elements of design from music video outfits by genre of pop music, by observing music videos of female vocalists chosen from each genre of pop music focusing on their fashion. The results of this study are the following. 1. In the genre rock, the sensual beauty of female were expressed with a boyish and neutral style using texture such as leather or denim, and such style had the effect of emphasizing their feminine side even more. 2. In the genre dance music, exposure is extensive compared to other genre using sexy or lingerie look, and I found an ambivalent style of feminism with clothes in the form of drapery using textures such as chiffon and silk, and femme fatale style with textures adhering to the body such as leggings, leotard, and bodysuit. 3. In the genre of rap and hip-hop, clothes from casual and costume-play style were found using training jersey, t-shirt, and denim pants, and emphasized the sensual beauty of women by showing a silhouette with short length and fitting style using shiny textures. 4. In the genre of R&B, there were diverse outfits that suits the characteristics of characters appearing in the stories, or the situation of the story since there are many dramatic representation in the form of story Especially in case of female characters, the feminine side was emphasized staging a feminine style by wearing dresses with the texture of chiffon and silk. Exposure was restrained compared to other genre.

Object Detection and Classification Using Extended Descriptors for Video Surveillance Applications (비디오 감시 응용에서 확장된 기술자를 이용한 물체 검출과 분류)

  • Islam, Mohammad Khairul;Jahan, Farah;Min, Jae-Hong;Baek, Joong-Hwan
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.4
    • /
    • pp.12-20
    • /
    • 2011
  • In this paper, we propose an efficient object detection and classification algorithm for video surveillance applications. Previous researches mainly concentrated either on object detection or classification using particular type of feature e.g., Scale Invariant Feature Transform (SIFT) or Speeded Up Robust Feature (SURF) etc. In this paper we propose an algorithm that mutually performs object detection and classification. We combinedly use heterogeneous types of features such as texture and color distribution from local patches to increase object detection and classification rates. We perform object detection using spatial clustering on interest points, and use Bag of Words model and Naive Bayes classifier respectively for image representation and classification. Experimental results show that our combined feature is better than the individual local descriptor in object classification rate.

Towards the Generation of Language-based Sound Summaries Using Electroencephalogram Measurements (뇌파측정기술을 활용한 언어 기반 사운드 요약의 생성 방안 연구)

  • Kim, Hyun-Hee;Kim, Yong-Ho
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.3
    • /
    • pp.131-148
    • /
    • 2019
  • This study constructed a cognitive model of information processing to understand the topic of a sound material and its characteristics. It then proposed methods to generate sound summaries, by incorporating anterior-posterior N400/P600 components of event-related potential (ERP) response, into the language representation of the cognitive model of information processing. For this end, research hypotheses were established and verified them through ERP experiments, finding that P600 is crucial in screening topic-relevant shots from topic-irrelevant shots. The results of this study can be applied to the design of classification algorithm, which can then be used to generate the content-based metadata, such as generic or personalized sound summaries and video skims.

A Study on the Data Compression Algorithm for Just-in-Time Rendering of Concentric Mosaic (동심원 모자이크의 실시간 표현을 위한 데이터 압축 알고리즘에 관한 연구)

  • Jee, Inn-Ho;Ahn, Hong-Yeoung
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.10 no.1
    • /
    • pp.91-96
    • /
    • 2010
  • Concentric mosaics are made with arranging and summing of video frames by using common spacial standards. Compared with previous works on 3-D wavelet transform coding, we have made important design considerations to enable flexible partial decoding and bit-stream random access. A just-in-time(JIT) rendering engine of the compressed concentric mosaic is developed. However, computationally, it is still demanding to accomplish the real-time rendering. Only the contents for specific scene representation are need to be decoded by maintaining compressed data. Thus our proposed algorithm is able to render real concentric mosaic by using lifting scheme instead of wavelet transform.

Comparison of "The Cabinet of Dr. Caligari" and Deconstructive Architecture in the Expressionist Characteristics (칼리가리 박사의 밀실과 해체주의 건축의 표현주의 특성 비교)

  • Choi, Hyo-Sik
    • Korean Institute of Interior Design Journal
    • /
    • v.25 no.1
    • /
    • pp.35-46
    • /
    • 2016
  • The purpose of this study was to identify the characteristics of expressionism in the space of deconstructive architecture by comparing the spaces of "The Cabinet of Dr. Caligari" video with the expressionist characteristics of film narrative structure and expressionist architecture and making an expansion based on the results. The findings were as follows: first, the "The Cabinet of Dr. Caligari" is divided into two set spaces: one has the perspective representation distorted in the viewpoint of a mad person applied to it, and the other reflects the viewpoint of a normal person from medieval paintings with no perspective. Second, the expressionist buildings did not reflect the expressionist characteristics in the interior spaces as fully as in the exterior ones. Third, the incomplete combination of Signifiant and $Signifi{\acute{e}}$, which were the theoretical basis of deconstructive architecture, showed a tendency of binary opposition like the double narrative structure of "The Cabinet of Dr. Caligari." Fourth, deconstructive architecture seems to embody the exterior form and interior space of "The Cabinet of Dr. Caligari" and its set spaces in the phenomenal aspect but exhibits its limitations with the realization of dynamics, one of the characteristics of expressionism. Finally, the Seattle Public Library presents the best embodiment of expressionist characteristics found in the set spaces of "The Cabinet of Dr. Caligari" while seeking after the combination of horizontal and vertical paths of action through the spiral ramps and inclined slabs.