• Title/Summary/Keyword: video metadata

Search Result 115, Processing Time 0.021 seconds

MPEG-A PART 9 DIGITAL MULTIMEDIA BROADCASTING APPLICATION FORMAT

  • Sabirin, Muhammad Syah Houari;Lee, Jung-Soo;Kim, Hui-Yong;Kim, Mun-Churl;Kim, Yong-Han
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.346-350
    • /
    • 2009
  • Digital Multimedia Broadcasting (DMB) is the mobile TV service based on a digital radio transmission system that provides high quality audio/video and other auxiliary data services. As users want to store the DMB content in their device to be consumed later or to be shared among users, a standardized format needs to be specified to guarantee the interoperability for the DMB contents for various devices. DMB AF (Application Format) specification defines a file format for DMB contents and services. It specifies how to combine the variety of DMB contents with associated information for a presentation in a well-defined format that facilitates storage, interchange, management, editing, and presentation of the DMB contents in protected, governed, and interoperable ways. In this paper we present our implementation of DMB AF as part of the development of DMB AF reference software. Our implementation of DMB AF is developed as the reference software for the standard specification that consists of a three applications: packager, media player, metadata browser and collection of supporting libraries used by the applications.

  • PDF

A Study on Immersive 360-degree Video Application Metadata and Operating System for Interworking with UCI Standard Identification System (UCI 표준식별체계 연동을 위한 실감형 360도 영상 응용 메타데이터 및 운영 시스템에 관한 연구)

  • Park, Byeongchan;Jang, Seyoung;Ruziev, Ulugbek;Kim, Youngmo;Kim, Seok-Yoon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2020.07a
    • /
    • pp.433-435
    • /
    • 2020
  • 본 논문에서 저작권 보호 기술 적용을 위해 실감형 360도 영상의 강인성 정보를 이용하여 UCI 운영을 위한 응용 메타데이터 요소를 제안한다. 오늘날 멀티미디어 콘텐츠의 산업의 규모가 비약적으로 커지고 있음에 따라 이를 효과적으로 관리 및 유통할 수 있는 콘텐츠 고유의 식별체계가 요구되고 있다. 현재 국내에서 운용 중인 대표 식별자는 정통부가 개발한 UCI가 활용되고 있다. UCI는 다양한 멀티미디어 콘텐츠를 효과적으로 관리 및 유통할 수 있으나 저작권 보호 기술에 직접적으로 연관이 되어 있지 않아 이를 보완할 수 있는 기술이 요구된다. 본 논문에서는 UCI와 직접으로 연동할 수 있는 실감형 360도 동영상 응용 메타데이터 요소 및 운영 방법을 제안하여 저작권 보호 기술을 적용할 수 있도록 한다.

  • PDF

Video Analysis System for Action and Emotion Detection by Object with Hierarchical Clustering based Re-ID (계층적 군집화 기반 Re-ID를 활용한 객체별 행동 및 표정 검출용 영상 분석 시스템)

  • Lee, Sang-Hyun;Yang, Seong-Hun;Oh, Seung-Jin;Kang, Jinbeom
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.89-106
    • /
    • 2022
  • Recently, the amount of video data collected from smartphones, CCTVs, black boxes, and high-definition cameras has increased rapidly. According to the increasing video data, the requirements for analysis and utilization are increasing. Due to the lack of skilled manpower to analyze videos in many industries, machine learning and artificial intelligence are actively used to assist manpower. In this situation, the demand for various computer vision technologies such as object detection and tracking, action detection, emotion detection, and Re-ID also increased rapidly. However, the object detection and tracking technology has many difficulties that degrade performance, such as re-appearance after the object's departure from the video recording location, and occlusion. Accordingly, action and emotion detection models based on object detection and tracking models also have difficulties in extracting data for each object. In addition, deep learning architectures consist of various models suffer from performance degradation due to bottlenects and lack of optimization. In this study, we propose an video analysis system consists of YOLOv5 based DeepSORT object tracking model, SlowFast based action recognition model, Torchreid based Re-ID model, and AWS Rekognition which is emotion recognition service. Proposed model uses single-linkage hierarchical clustering based Re-ID and some processing method which maximize hardware throughput. It has higher accuracy than the performance of the re-identification model using simple metrics, near real-time processing performance, and prevents tracking failure due to object departure and re-emergence, occlusion, etc. By continuously linking the action and facial emotion detection results of each object to the same object, it is possible to efficiently analyze videos. The re-identification model extracts a feature vector from the bounding box of object image detected by the object tracking model for each frame, and applies the single-linkage hierarchical clustering from the past frame using the extracted feature vectors to identify the same object that failed to track. Through the above process, it is possible to re-track the same object that has failed to tracking in the case of re-appearance or occlusion after leaving the video location. As a result, action and facial emotion detection results of the newly recognized object due to the tracking fails can be linked to those of the object that appeared in the past. On the other hand, as a way to improve processing performance, we introduce Bounding Box Queue by Object and Feature Queue method that can reduce RAM memory requirements while maximizing GPU memory throughput. Also we introduce the IoF(Intersection over Face) algorithm that allows facial emotion recognized through AWS Rekognition to be linked with object tracking information. The academic significance of this study is that the two-stage re-identification model can have real-time performance even in a high-cost environment that performs action and facial emotion detection according to processing techniques without reducing the accuracy by using simple metrics to achieve real-time performance. The practical implication of this study is that in various industrial fields that require action and facial emotion detection but have many difficulties due to the fails in object tracking can analyze videos effectively through proposed model. Proposed model which has high accuracy of retrace and processing performance can be used in various fields such as intelligent monitoring, observation services and behavioral or psychological analysis services where the integration of tracking information and extracted metadata creates greate industrial and business value. In the future, in order to measure the object tracking performance more precisely, there is a need to conduct an experiment using the MOT Challenge dataset, which is data used by many international conferences. We will investigate the problem that the IoF algorithm cannot solve to develop an additional complementary algorithm. In addition, we plan to conduct additional research to apply this model to various fields' dataset related to intelligent video analysis.

Service Provider Ranking Based on Visual Media Ontology (시각 미디어 온톨로지에 기반한 서비스 제공자 랭킹)

  • Min, Young-Kun;Lee, Bog-Ju
    • The KIPS Transactions:PartB
    • /
    • v.15B no.4
    • /
    • pp.315-322
    • /
    • 2008
  • It is important to retrieve effectively the visual media such as pictures and video in the internet, especially to the application areas such as electronic art museum, e-commerce, and internet shopping malls. It is also needed in these areas to have content-based or even semantic-based multimedia retrieval instead of simple keyword-based retrieval. In our earlier research, we proposed a semantic-based visual media retrieval framework for the effective retrieval of the visual media from the internet. It uses visual media metadata and ontology based on the web service to achieve the semantic-based retrieval. In this research, there are more than one visual media service providers and one central service broker. As a preliminary step to the visual media data retrieval, a method is proposed to retrieve the service providers effectively. The method uses the structure of the ontology tree to obtain the providers and their rankings. It also uses the size of sub nodes and child nodes in the tree. It measures the rankings of providers more effectively than previous method. The experimental results show the accuracy of the method while keeping compatible speed against the existing method.

Multimedia Information Retrieval Using Semantic Relevancy (의미적 연관성을 이용한 멀티미디어 정보 검색)

  • Park, Chang-Sup
    • Journal of Internet Computing and Services
    • /
    • v.8 no.5
    • /
    • pp.67-79
    • /
    • 2007
  • As the Web technologies and wired/wireless network are improved and various new multimedia services are introduced recently, need for searching multimedia including video data has been much increasing, The previous approaches for multimedia retrieval, however, do not make use of the relationships among semantic concepts contained in multimedia contents in an efficient way and provide only restricted search results, This paper proposes a multimedia retrieval system exploiting semantic relevancy of multimedia contents based on a domain ontology, We show the effectiveness of the proposed system by experiments on a prototype system we have developed. The proposed multimedia retrieval system can extend a given search keyword based on the relationships among the semantic concepts in the ontology and can find a wide range of multimedia contents having semantic relevancy to the input keyword. It also presents the results categorized by the semantic meaning and relevancy to the keyword derived from the ontology. Independency of domain ontology with respect to metadata on the multimedia contents is preserved in the proposed system architecture.

  • PDF

PSIP Converter based on PMCP for Terrestrial/Cable Data Broadcasting Retransmission Service (지상파/케이블 데이터방송 재전송 서비스를 위한 PMCP 기반 PSIP 변환기)

  • Choi Ji Hoon;Kim Yong Ho;Choi Jin Soo;Hong Jin Woo
    • The KIPS Transactions:PartB
    • /
    • v.12B no.6 s.102
    • /
    • pp.647-654
    • /
    • 2005
  • In this paper, we implemented a terrestrial/cable PSIP converting system, so-called a PSIP converter, which is converting a terrestrial PSIP into a cable PSIP for a data broadcasting service in the interoperable network of terrestrial and cable, and define an interface between the PSIP converter and the OOB SI generator by using PMCP messages compliant to ATSC T3/Sl. The exiting PSIP converter just converts a terrestrial PSIP into a cable PSIP compliant to ATSC and OCAP standard and transmits by a MPEG-2 TS format. That is to say, it is not for the digital data broadcasting but for the digital broadcasting. In addition, the PSIP converter can support various types of PSIP information to the OOB SI generator by using PMCP messages defined by a hierarchical structure as per each channel, audio/video event, data event and so on.

A Case Study of the Audio-Visual Archives System Development and Management (시청각(사진/동영상) 기록물 관리를 위한 시스템 구축과 운영 사례 연구)

  • Shin, Dong-Hyeon;Jung, Se-Young;Kim, Seon-Heon
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.9 no.1
    • /
    • pp.33-50
    • /
    • 2009
  • ADD(Agency for Defense Development) has developed digital audio-visual archives management system to ensure easy access and long-term preservation for digital audio-visual archives. This paper covers total process of the system development and database management in the aspect of preservation and utilization by users' easy search through digitization of audio-visual archives. In detail, it contains system design for images and video data handling, standard workflow establishment, data quality, and metadata settings for database by converting an analog data into digital format. Also, this study emphasizes the importance of audio-visual archives management system through cost-effectiveness analysis.

LMS-based Edutech Teaching and Learning Platform Model Design Study (LMS 기반 에듀테크 교수학습 플랫폼 모형 설계 연구)

  • Yoon, Seung­-Bae;Yang, Seung Hyuk;Park, Hyunsoon
    • Journal of Digital Convergence
    • /
    • v.19 no.10
    • /
    • pp.29-38
    • /
    • 2021
  • Purpose: This is a study to design an optimal Edutech teaching-learning platform model that can be linked with various types of LMS to activate e-learning. Methods: For this purpose, the contents of e-learning systems that can be used in the 4th industrial technology of cyber universities and general universities were cross-sectionally analyzed. Results: Cyber universities relied entirely on LMS, and general universities supplemented and utilized different Edutech methods for each professor such as Google Classroom, Zoom video communication, and YouTube in addition to LMS. It was considered that it would be meaningful to provide a minimal algorithm mapping to LMS to share metadata such as Google and YouTube for the Edutech teaching and learning platform model. Conclusion: Therefore, this study is expected to contribute to the improvement of teaching methods and academic achievement through the LMS-based Edutech teaching and learning platform model.

Considerations for Applying Korean Natural Language Processing Technology in Records Management (기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항)

  • Haklae, Kim
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.22 no.4
    • /
    • pp.129-149
    • /
    • 2022
  • Records have temporal characteristics, including the past and present; linguistic characteristics not limited to a specific language; and various types categorized in a complex way. Processing records such as text, video, and audio in the life cycle of records' creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and analog digitization. In particular, Korean deep learning-based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The process of using NLP technologies, such as machine translation and optical character recognition for digital conversion of records, is introduced as an example implemented in the Python environment. In contrast, a plan to improve environmental factors and record digitization guidelines for applying NLP technology in the records management field is proposed for utilizing NLP technology.

A OTT content data analysis technique on a PC environment (PC 환경에서의 OTT 콘텐츠 데이터 분석 방법)

  • Chanwoo Lee;Junyoung Heo
    • Smart Media Journal
    • /
    • v.13 no.2
    • /
    • pp.62-67
    • /
    • 2024
  • Due to technological advancements in viewing devices and the COVID-19 pandemic, a lot of OTT-only content is being produced and distributed as people shift from traditional movie theater viewing and broadcasters' fixed TV viewing to free-form OTT viewing using wired and wireless internet. As a result, the ability to leverage data from OTT audiences has become critical to the competitiveness of the industry. However, third parties other than OTT content providers are facing difficulties in acquiring OTT viewer data. In this paper, as a way to overcome the shortcomings of existing viewer data acquisition, we propose a method to extract audio and video data by developing an OTT viewing data acquisition agent using Web APIs by adopting a web browser environment that does not affect the performance of OS and viewing devices, so that third-party companies that need viewing data can utilize it.