• Title/Summary/Keyword: Visual Browsing


Reversible Multipurpose Watermarking Algorithm Using ResNet and Perceptual Hashing

  • Mingfang Jiang;Hengfu Yang
    • Journal of Information Processing Systems / v.19 no.6 / pp.756-766 / 2023
  • To effectively track the illegal use of digital images and maintain the security of digital image communication on the Internet, this paper proposes a reversible multipurpose image watermarking algorithm based on a deep residual network (ResNet) and perceptual hashing (also called MWR). The algorithm first combines perceptual image hashing to generate a digital fingerprint that depends on the user's identity information and image characteristics. Then it embeds the removable visible watermark and digital fingerprint in two different regions of the orthogonal separation of the image. The embedding strength of the digital fingerprint is computed using ResNet. Because of the embedding of the removable visible watermark, the conflict between the copyright notice and the user's browsing is balanced. Moreover, image authentication and traitor tracking are realized through digital fingerprint insertion. The experiments show that the scheme has good visual transparency and watermark visibility. The use of chaotic mapping in the visible watermark insertion process enhances the security of the multipurpose watermark scheme, and unauthorized users without correct keys cannot effectively remove the visible watermark.
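The fingerprint step described above can be illustrated with a minimal sketch. The abstract does not specify which perceptual hash or mixing rule the authors use, so everything here is an assumption: a simple average hash stands in for the perceptual hash, and `user_fingerprint` is a hypothetical construction that XORs the image bits with a SHA-256 digest of the user ID.

```python
import hashlib


def average_hash(pixels):
    """Return one bit per pixel: 1 if the pixel is at least the image mean, else 0.

    `pixels` is a small grayscale image given as a list of rows (assumed input format).
    """
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return [1 if p >= mean else 0 for p in flat]


def user_fingerprint(pixels, user_id):
    """Combine image bits with user-ID bits (hypothetical mixing, not the paper's)."""
    image_bits = average_hash(pixels)
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    # Take one bit of the digest per image bit, wrapping around if needed.
    user_bits = [
        (digest[(i // 8) % len(digest)] >> (7 - i % 8)) & 1
        for i in range(len(image_bits))
    ]
    return [a ^ b for a, b in zip(image_bits, user_bits)]


image = [[10, 200], [30, 220]]
print(average_hash(image))  # [0, 1, 0, 1]
print(user_fingerprint(image, "alice"))
```

Because the average hash compares each pixel with the image mean, a uniform brightness shift leaves the image bits unchanged, which is the kind of robustness a perceptual hash is meant to provide.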

Salient Region Detection Algorithm for Music Video Browsing (뮤직비디오 브라우징을 위한 중요 구간 검출 알고리즘)

  • Kim, Hyoung-Gook;Shin, Dong
    • The Journal of the Acoustical Society of Korea / v.28 no.2 / pp.112-118 / 2009
  • This paper proposes a fast salient-region detection algorithm for a music video browsing system that can be applied to mobile devices and digital video recorders (DVRs). The input music video is decomposed into its music and video tracks. For the music track, the music highlight, which includes the chorus, is detected through structure analysis using energy-based peak position detection. Using emotional models generated by an SVM-AdaBoost learning algorithm, the music signal is automatically classified into one of several predefined emotional classes. For the video track, face scenes containing the singer or an actor/actress are detected using a boosted cascade of simple features. Finally, the salient region is generated by aligning the boundaries of the music highlight and the visual face scene. Users first select their favorite music videos on the mobile device or DVR, guided by each video's emotion label, and can then quickly browse the 30-second salient region produced by the proposed algorithm. A mean opinion score (MOS) test on a database of 200 music videos compares the detected salient region with a predefined, manually chosen part. The MOS results show that the salient region detected by the proposed method performed much better than the predefined manual part chosen without audiovisual processing.
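Energy-based peak position detection can be sketched roughly as follows. This is not the authors' implementation: the frame size, the non-overlapping framing, and the simple "larger than both neighbours" peak rule are all assumptions made for illustration, and the function names are hypothetical.

```python
def frame_energies(samples, frame_size):
    """Sum of squared amplitudes for each non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_size])
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]


def energy_peaks(energies):
    """Indices of frames whose energy exceeds that of both neighbouring frames."""
    return [
        i
        for i in range(1, len(energies) - 1)
        if energies[i] > energies[i - 1] and energies[i] > energies[i + 1]
    ]


# Toy signal: two loud passages separated by quiet ones.
samples = [0, 0, 1, 1, 3, 3, 1, 1, 0, 0, 2, 2, 0, 0]
energies = frame_energies(samples, frame_size=2)
print(energies)                # [0, 2, 18, 2, 0, 8, 0]
print(energy_peaks(energies))  # [2, 5]
```

In a real system the peak positions would be only one input to the structure analysis; the sketch shows just the energy and peak-picking steps.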

Implementations of Geographic Information Systems on Sewage Management for Water Resources Protection

  • Wu, Mu-Lin;Chen, Chiou-Hsiung;Chou, Wen-Shang;Huang, Hsiu-Lan
    • Proceedings of the KSRS Conference / 2003.11a / pp.1188-1190 / 2003
  • The Taipei Watershed Management Bureau (WRATB) is a government agency responsible for protecting water resources in two major watersheds in order to provide drinking water to about four million people in Taipei on a sustainable basis. At WRATB, two major public sewage treatment facilities treat the sewage in each watershed to an acceptable state before it is discharged into rivers. More than 82% of household wastewater is collected and treated by the two public sewage systems. However, households in remote areas still need more effective sewage management measures. The objective of this paper is to implement geographic information systems (GIS) that make sewage management easier and more cost-effective. ArcIMS was implemented as the map server and for Internet browsing of the sewage facilities on personal and laptop computers. In the field, ArcPad was implemented on personal digital assistants (PDAs) so that CompactFlash global positioning system (GPS) receivers and digital cameras could be used with the PDA. All digital files for the sewage facilities were converted into ArcMap format. MapObjects and Visual Basic were used to create sewage application modules tailored to each technician's preferences. ASP.NET was implemented for Internet-based manipulation of all sewage databases. Mobile GIS was the key component of the field applications, supporting sewage management on a house-by-house basis. Houses in remote areas, which are not covered by the two public sewage systems, were managed using PDAs and laptop computers with GPS and digital cameras. As a result, sewage management at the Taipei Watershed Management Bureau is easier both in the field and in the office. The integration of GPS, GIS, and PDAs makes sewage management in the field much easier, while ArcIMS, MapObjects, ASP.NET, and Visual Basic allow sewage management to be carried out in the office and over the Internet.


A Design of A Dynamic Configurational Multimedia Spreadsheet for Effective HCI (효과적인 HCI를 위한 동적 재구성 멀티미디어 스프레드쉬트 설계)

  • Jee Sung-Hyun
    • The Journal of the Korea Contents Association / v.6 no.1 / pp.14-22 / 2006
  • The multimedia visualization spreadsheet environment has been shown to be extremely effective in supporting the organized visualization of multi-dimensional data sets. In this paper, we design a visualization model consisting of a 2D arrangement of spreadsheet elements that can be configured at run time, in which each spreadsheet element has a novel framestack. As a distinguishing feature, the proposed model supports a 3D data structure for each element. This enables the visualization spreadsheet 1) to effectively manage, organize, and compactly encapsulate multi-dimensional data sets, 2) to reconfigure cell structures dynamically according to client requests, and 3) to process interactive user-interface operations rapidly. In several experiments with scientific users, the model was demonstrated to be a highly interactive visual browsing tool for 2D and 3D graphics and for rendering in each frame.


A Scene-based Tree Browsing Technique for Video Retrieval and Visual Summary (비디오 검색과 시각적 요약을 위한 장면 기반 계층적 브라우징 기법)

  • Im, Dong-Hyeok;Lee, Seok-Ryong;Jeong, Jin-Wan
    • Journal of KIISE:Databases / v.28 no.2 / pp.181-187 / 2001
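  • As the use of digital video becomes commonplace, research on video browsing is increasingly in demand. Among previously studied approaches, VCR-style sequential search applies the classical method used for analog video browsing to digital video, and static key-frame representation displays the frames that make up a video. Because neither approach fully exploits the characteristics of digital video, hierarchical browsing, which is based on the hierarchical relationships among video shots, and scene-based browsing, which is based on the relationships among scenes, have recently attracted attention. This paper reviews existing research on video browsing in detail and presents a browsing technique that not only allows direct access to each scene in a video, as in hierarchical and scene-based browsing, but also shows the overall structure of the video in an easily understood form, as in hierarchical browsing. The browsing result can also be used as a visual summary.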


Automatic Video Genre Identification Method in MPEG compressed domain

  • Kim, Tae-Hee;Lee, Woong-Hee;Jeong, Dong-Seok
    • Proceedings of the IEEK Conference / 2002.07c / pp.1527-1530 / 2002
  • A video summary is one of the tools that can provide fast and effective browsing of a lengthy video. A video summary consists of many key frames, which may be defined differently depending on the genre of the video. Consequently, a video summary constructed in a uniform manner may lead to inadequate results, and identifying the video genre is an important first step in generating a meaningful video summary. We propose a new method that classifies the genre of video data in the MPEG compressed bit-stream domain. Because the proposed method operates directly on the compressed bit-stream without decoding the frames, it has merits such as simple calculation and short processing time. The method uses only visual information, through spatial-temporal analysis, to classify the video genre. Experiments were conducted on six video genres: Cartoon, Commercial, Music Video, News, Sports, and Talk Show. The experimental results show more than 90% accuracy in genre classification for well-structured video data such as Talk Show and Sports.


Clustering Representative Annotations for Image Browsing (이미지 브라우징 처리를 위한 전형적인 의미 주석 결합 방법)

  • Zhou, Tie-Hua;Wang, Ling;Lee, Yang-Koo;Ryu, Keun-Ho
    • Proceedings of the Korean Information Science Society Conference / 2010.06c / pp.62-65 / 2010
  • Image annotations allow users to access a large image database with textual queries. However, because the text surrounding Web images is generally noisy, an efficient image annotation and retrieval system is highly desirable, and this requires effective image search techniques. Data mining techniques can be adopted to de-noise the search results and identify salient terms or phrases in them. Clustering algorithms make it possible to represent the visual features of images with a finite set of symbols. Annotation-based image search engines can obtain thousands of images for a given query, but their results also contain visual noise. In this paper, we present a new algorithm, Double-Circles, that allows a user to remove noisy results and characterize more precise representative annotations. We demonstrate our approach on images collected from Flickr image search. Experiments conducted on real Web images show the effectiveness and efficiency of the proposed model.


A Development of Management System for Publication and Operation of Clinical Contents Model's Outcomes (임상콘텐츠모형 산출물 홍보와 운영을 위한 관리시스템 개발)

  • Yun, Ji-Hyun;Ahn, Sun-Ju;Lee, Bo-Hye;So, Hye-Jin
    • Journal of the Korea Academia-Industrial cooperation Society / v.11 no.9 / pp.3398-3405 / 2010
  • In this paper, we develop CCM Manager, a management system that supports the efficient development, publication, and promotion of Clinical Contents Model (CCM) outcomes. We derived the manager and user functions from an analysis of the work process, from CCM development through management and publication, designed the database based on the CCM architecture, and developed a web-based system that lets each user easily access and search the contents. CCM Manager supports the sharing of results among model developers, and the distribution and promotion of model outcomes to general users, without requiring the manager to package and publish them manually. The paper shows that CCM Manager is superior to other clinical information model management systems in providing multiple search functions, visual model browsing methods, and management functions. The proposed CCM Manager is currently used in the CCM development process, and an analysis of its efficiency based on the results will be carried out in the future.

A Usability Evaluation on the Visualization of Information Extraction Output (정보추출결과의 시각화 표현방법에 관한 이용성 평가 연구)

  • Lee Jee-Yeon
    • Journal of the Korean Society for Library and Information Science / v.39 no.2 / pp.287-304 / 2005
  • The goal of this research is to evaluate the usability of visually browsing automatically extracted information. A domain-independent information extraction system was used to extract information from news-type texts to populate a visually browsable knowledge base. The information extraction system automatically generated Concept-Relation-Concept triples by applying various natural language processing techniques to the text portion of the news articles. To visualize the information stored in the knowledge base, we used PersonalBrain to develop the visualization portion of the user interface. PersonalBrain is a hyperbolic information visualization system that enables users to link information into a network of logical associations. To understand the usability of the visually browsable knowledge base, IS test subjects were observed while they used the visual interface and were interviewed afterward. By applying a qualitative test data analysis method, a number of usability problems and further research directions were identified.
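A knowledge base of Concept-Relation-Concept triples can be held as a simple adjacency map, which is roughly the structure a network-style browser needs. The sketch below is illustrative only: the triples are invented examples, and `build_graph` and `neighbours` are hypothetical helper names, not part of the system described in the abstract.

```python
from collections import defaultdict


def build_graph(triples):
    """Map each source concept to its outgoing (relation, target concept) links."""
    graph = defaultdict(list)
    for source, relation, target in triples:
        graph[source].append((relation, target))
    return dict(graph)


def neighbours(graph, concept):
    """Concepts directly linked from `concept`, in insertion order."""
    return [target for _, target in graph.get(concept, [])]


# Invented example triples, for illustration only.
triples = [
    ("Company A", "acquired", "Company B"),
    ("Company A", "located_in", "Seoul"),
    ("Company B", "produces", "Software"),
]
graph = build_graph(triples)
print(neighbours(graph, "Company A"))  # ['Company B', 'Seoul']
```

Browsing then amounts to repeatedly moving from the currently focused concept to one of its neighbours.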

Shot Boundary Detection Algorithm by Compensating Pixel Brightness and Object Movement (화소 밝기와 객체 이동을 이용한 비디오 샷 경계 탐지 알고리즘)

  • Lee, Joon-Goo;Han, Ki-Sun;You, Byoung-Moon;Hwang, Doo-Sung
    • Journal of the Korea Society of Computer and Information / v.18 no.5 / pp.35-42 / 2013
  • Shot boundary detection is an essential step for efficient browsing, sorting, and classification of video data. A robust shot detection method should overcome the disturbances caused by changes in pixel brightness and object movement between frames. In this paper, two shot boundary detection methods are presented that address these problems by using segmentation, object movement, and pixel brightness. The first method is based on a histogram that reflects object movements and a morphological dilation operation that accounts for pixel brightness. The second method uses the pixel brightness information of segmented blocks and whole blocks between frames. Experiments on digitized video data from the National Archive of Korea show that the proposed methods outperform the existing pixel-based and histogram-based methods.
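For context, the conventional histogram-based baseline that the abstract compares against can be sketched as below. This is not the proposed method: it omits the compensation for brightness and object movement, and the bin count, the threshold, and the frame format (flat lists of 0-255 gray values of equal length) are assumptions chosen for illustration.

```python
def histogram(frame, bins=4, max_value=256):
    """Count gray values per bin; `frame` is a flat list of values in [0, max_value)."""
    counts = [0] * bins
    for pixel in frame:
        counts[pixel * bins // max_value] += 1
    return counts


def shot_boundaries(frames, threshold=0.5):
    """Indices i where the histogram of frame i differs sharply from frame i - 1."""
    boundaries = []
    previous = histogram(frames[0])
    for i in range(1, len(frames)):
        current = histogram(frames[i])
        # Normalized L1 distance: 0.0 for identical histograms, 1.0 for disjoint ones.
        diff = sum(abs(a - b) for a, b in zip(previous, current)) / (2 * len(frames[i]))
        if diff > threshold:
            boundaries.append(i)
        previous = current
    return boundaries


dark = [10, 20, 30, 40]
bright = [200, 210, 220, 230]
print(shot_boundaries([dark, dark, bright, bright]))  # [2]
```

A global brightness change within a single shot can shift pixels between bins and trigger a false boundary here, which is the kind of disturbance the abstract says a robust method should overcome.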