• Title/Summary/Keyword: Media synchronization

Search Result 226, Processing Time 0.023 seconds

Synchronization of VOD Content and Captions Using Speech Recognition and Modified Dynamic Programming (음성인식과 변경된 동적계획법을 이용한 VOD 콘텐트와 자막의 동기화)

  • Oh, Juhyun
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2021.06a
    • /
    • pp.131-134
    • /
    • 2021
  • 지상파 방송에서는 청각장애인을 위해 폐쇄자막(closed caption) 서비스가 제공되고 있지만, 이를 저장하여 VOD 서비스 등에 제공하고자 할 때는 영상과의 비동기화(desynchronization) 문제로 인해 활용할 수 없는 문제가 있다. 본 논문에서는 이를 해결하기 위해 자동 음성인식(automatic speech recognition)과, 자막 동기화 문제에 맞게 변경된 동적계획법(modified dynamic programming)을 이용하는 방법을 제안한다. 문자열 정렬에서 삽입과 삭제 등 간격(gap)의 발생을 제어하는 제약조건과 그에 따른 점수 구조를 적용함으로써 문자열 정렬 성능을 개선한다. 또한 정렬된 폐쇄자막과 음성인식 문자열로부터 시간 동기정보를 복원하고 동기화된 자막을 생성하는 방법을 제안한다. 실제 TV 프로그램과 자막에 적용하여 기존 방법에 비해 성능의 향상이 있음을 확인하였다.

  • PDF

Closed Caption Synchronization Using Dynamic Programming (동적계획법을 이용한 장애인방송 폐쇄자막 동기화)

  • Oh, Juhyun
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.461-464
    • /
    • 2020
  • 지상파 방송에서는 청각장애인을 위해 폐쇄자막(closed caption) 서비스가 제공되고 있다. 현재의 폐쇄자막 방송은 속기사가 실시간으로 방송을 보면서 입력하기 때문에 지연이 있다. 또한 이렇게 입력된 폐쇄자막은 TV 프로그램 영상과 별도로 저장되기 때문에 영상과 그 시작점이 맞지 않는 경우가 대부분이다. 폐쇄자막을 온라인 서비스 등에 제공하고자 할 때 이러한 문제로 인해 영상과의 동기가 맞지 않아 사용이 어렵다. 본 논문에서는 TV 프로그램의 음성을 인식하여 동기화된 텍스트를 추출하고, 이를 기 저장된 폐쇄자막과 정렬하여 동기화하는 방법을 제안한다. 실제 TV 프로그램과 자막에 적용하였을 때 대부분의 음절과 라인에서 동기화가 정확히 이루어짐을 확인하였다.

  • PDF

An Integrated Synchronization Method for a Hyperpresentation in a distributed Computing Environment (분산 컴퓨팅환경에서 하이퍼 프리젠테이션을 위한 통합 동기화 기법)

  • Lim, Young-Hwan;Kim, Doo-Hyun;Kung, Sang-Hwan
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.6
    • /
    • pp.1441-1456
    • /
    • 1998
  • The concept of a hyperpresmtation, as an extension of a hypermedia, is the presentation in which time-varying multimedia presentations are dynamically linked together and a hyperlink's context can be changed over time at any time during a continuous presentation. Problems caused by integrating the hyperpresentation into an existing multimedia system which handles a sequential presentation only are, how to describe the hyperprcsentation, how to set up a hyperlink on a continuous media, and how to check the consistency of the synchronized presentations. In this paper. a new synchronization description method for the hyperpresentation and a method for setting a hyper link on a continuous media during" presentation are proposed after havin!; SHrvey of existing methods, The proposed method deals with only the DC value in a stream ut a DCT based compressed data for checking a condition of te link. Finally, the method for checking the consistency of mixed presentations before actual play of the hnlerpresentation is described. Proposed methods are implemented on MuX(Multimedia IO Server) where a sample scenario is tested.

  • PDF

Cell ID Detection Schemes Using PSS/SSS for 5G NR System (5G NR 시스템에서 PSS/SSS를 이용한 Cell ID 검출 방법)

  • Ahn, Haesung;Kim, Hyeongseok;Cha, Eunyoung;Kim, Jeongchang
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.870-881
    • /
    • 2020
  • This paper presents cell ID (cell identity) detection schemes using PSS/SSS (primary synchronization signal/secondary synchronization signal) for 5G NR (new radio) system and evaluates the detection performance. In this paper, we consider two cell ID detection schemes, i.e. two-stage detection and joint detection schemes. The two-stage detection scheme consists of two stages which estimate a channel gain between a transmitter and receiver and detect the PSS and SSS sequences. The joint detection scheme jointly detects the PSS and SSS sequences. In addition, this paper presents coherent and non-coherent combining schemes. The coherent scheme calculates the correlation value for the total length of the given PSS and SSS sequences, and the non-coherent combining scheme calculates the correlation within each group by dividing the total length of the sequence into several groups and then combines them non-coherently. For the detection schemes considered in this paper, the detection error rates of PSS, SSS and overall cell ID are evaluated and compared through computer simulations. The simulation results show that the joint detection scheme outperforms the two-stage detection scheme for both coherent and non-coherent combining schemes, but the two-stage detection scheme can greatly reduce the computational complexity compared to the joint detection scheme. In addition, the non-coherent combining detection scheme shows better performance under the additive white Gaussian noise (AWGN), fixed, and mobile environments.

Performance Improvement of Frequency Synchronization in ATSC DTV System using Signal Power at Both Edges of Spectrum (ATSC DTV 시스템에서 스펙트럼 양끝의 신호전력을 이용한 주파수 동기 성능 개선)

  • Song Hyun Keun;Lee Joo Hyung;Kim Jae Moung;Eum Ho Min;Kim Seung Won
    • Journal of Broadcast Engineering
    • /
    • v.10 no.1 s.26
    • /
    • pp.31-42
    • /
    • 2005
  • ATSC DTV system uses FPLL block for acquiring the frequency synchronization. Because the FPLL uses only the pilot signal, the frequency convergence range becomes narrower and it takes a more time to acquire the frequency synchronization as the pilot is distorted. And the spectrum shape around the pilot makes an asymmetric convergence range between the positive frequency offset and the negative frequency offset. This paper proposes the algorithm that requires the Installation of the fitters at the both edges of a VSB spectrum and uses the signal power that passes these filters. The proposed algorithm complements the problems of the asymmetric convergence range and overcomes the performance degradation due to the distortion of a pilot level.

A Framework and Synchronization Mechanism for Real-time Multimedia Streaming Services based on the Time-triggered Message-triggered Object (실시간 멀티미디어 스트리밍 서비스를 위한 Time-triggered Message-triggered Object 기반의 프레임워크 및 동기화 메커니즘)

  • Jo, Eun-Hwan;Kim, Moon-Hae
    • The KIPS Transactions:PartC
    • /
    • v.13C no.6 s.109
    • /
    • pp.669-676
    • /
    • 2006
  • In this paper, we present a new framework and stream synchronization mechanism to effectively support developing real-time multimedia streaming services by using a real-time object model named TMO (Time-triggered Message-triggered Object). The purpose of the framework is twofold. Firstly, the framework helps developers to design complex distributed real-time multimedia streaming services. Secondly, it supports timely streaming facilities. In order to achieve these goals, our framework is consist of Multimedia Streaming TMO, MMStreaming TMO Support Library and TMO Support Middleware. The time-triggered spontaneous feature of the MMStream TMO and a global-time based synchronization scheme is used as a regulator against the irregular deliveries and processing of media units caused by QoS non-guaranteed systems and communication channels. In conclusion, timely service capability of our framework is expected to contributed to overall enhancement of the real-time multimedia streaming.

SMIL Authoring System for Multi-media synchronization and representation (멀티미디어 동기화 및 표현을 위한 SMIL 저작 시스템)

  • Ham, Jong-Wan;Jin, Du-Seok;Choi, Bong-Kyu;Cao, Ke-Rang;Park, Man-Seob;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.653-656
    • /
    • 2009
  • Currently with development of development and the hardware of the superhigh speed network about increase is spreading out at the rapid pace the many multimedia contents quite from internet. The production environment is growing about the multimedia contents because of such as circumstance, as well as multimedia contents will increase. However, Numerous voice, the picture, with text etc. the time of the same multimedia contents and problem of spatial synchronization occur, started. W3C(World Wide Web Consortium) solves like this problem point presented the method for. Does so, SMIL(Synchronized Multimedia Integration Language) where puts a base in XML(Extensible Markup Language) will be able to compose the expression of the multimedia contents which is various standard was proposed. SMIL the individual multimedia object of chain with time will be able to integrate with the multimedia presentation which is synchronized spatial in order. In this paper a variety of multimedia content and synchronization of the time and space, to be represented by integrating the design and implementation of SMIL authoring system.

  • PDF

Scalable Random Access for SVC-based DASH Service (SVC 기반의 DASH 서비스를 위한 스케일러블 임의접근 지원 방법)

  • Seo, Kwang-Deok;Lee, Hong-Rae;Kim, Jae-Gon;Jung, Soon-Heung;Yoo, Jeong-Ju;Jeong, Young-Ho
    • Journal of Broadcast Engineering
    • /
    • v.16 no.6
    • /
    • pp.1073-1076
    • /
    • 2011
  • In this paper, we propose a scalable random access scheme in SVC based DASH service that enables random access support not only for base layer of SVC but also for enhancement layers. The proposed method includes extension of segment index box ('sidx') from DASH standard, as well as new RAP Synchronization Box ('raps'). Since the proposed scheme provides random access service for movie fragments with SVC encoded video layers, adaptive scalable random access service is possible.

Synchronization of Common Channel in Distributed Push Servers (분산 푸시서버에서 공통채널 연동)

  • 연승호;김영헌;한군희;백순화;전병민
    • Journal of Broadcast Engineering
    • /
    • v.5 no.1
    • /
    • pp.130-138
    • /
    • 2000
  • The paper describes a push system, a broadcasting system in the internet, which is developed for internet and intranet use. In this paper, we carried out research on the method to support the large number of users in the intranet environment. Particularly the paper analyzes the effects on the network traffic according to the number of the user connected to the push system and the frequency of the connections when push servers are distributed over the intranet. Push system described here uses two different kinds of channels, common channel and local channel. Common channel is the channel to be replicated among the push servers in the intranet. This paper shows that the method using the common channel synchronization is efficient in supporting the large number of intranet users. We introduce an algorithm to make the interconnections between channels efficiently amongpush servers distributed over the intranet.

  • PDF

TRSG 모델을 기반으로 한 멀티미디어 프리젠테이션 및 저작 도구 개발

  • Na, In-Ho
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.6 no.1
    • /
    • pp.36-44
    • /
    • 2000
  • In this paper, we describe the developing of a tool which supports both multimedia presentation with user's participation through a high speed network and authoring of various media using a single authoring tool. To support real-time synchronous multimedia presentation, we adopt dynamic synchronization method and adaptive transmission algorithm for synchronizing data transfer rate between sender and receiver using buffer management algorithm based on QoS parameters. And we also allow user's participation in the presentation using TRSG(Temporal Relationship Specification Graph) model. Finally, the proposed tool supports the minimal level of QoS and its continuous play-out using event auditing threads which control the current state of a multimedia presentation continuously by monitoring of negative factors effecting on QoS, and synchronization.

  • PDF