• Title/Summary/Keyword: Digital audio files


A Study on Long-term Preservation of the Cultural Archetypes in Digital Audio Format (문화원형콘텐츠의 장기보존에 관한 연구 - 디지털 소리자료를 중심으로 -)

  • Suh, Hye-Ran
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.17 no.2 / pp.65-82 / 2006
  • The purpose of this study is to present basic information essential for planning the long-term preservation of digital audio files collected through the 'Cultural Archetypes Digitization Project' of the Korea Culture & Content Agency. The study discusses the needs and limitations of digitizing audio resources for long-term preservation, digital audio archiving models (the OAIS Reference Model and the AHDS Model), technical principles of audio digitization and preservation, preservation metadata, and storage media for digital audio files. It suggests compliance with the requisites listed in the RLG & NARA Audit Checklist for the Certification of Trusted Digital Repositories, and recommends reviewing the appropriateness of collaborating with other institutions on digital preservation projects.

Reversible Watermarking for Audio Using Recompression Method (재압축 기술을 이용한 오디오 파일에서의 가역 정보은닉)

  • Whang, Ho Young;Kim, Hyoung Joong
    • Journal of Digital Contents Society / v.14 no.2 / pp.199-206 / 2013
  • Various data compression methods have been developed to handle data within limited storage capacity and limited transmission speed. Recompression, one of the most recent of these technologies, can embed data regardless of the information entropy of the data. It separates the original multimedia data into blocks and embeds a 0 or a 1 according to whether each block is flipped. In this paper, the technology is applied to audio files, and reversible watermarking for audio files is implemented.

An Automatic Method of Detecting Audio Signal Tampering in Forensic Phonetics (법음성학에서의 오디오 신호의 위변조 구간 자동 검출 방법 연구)

  • Yang, Il-Ho;Kim, Kyung-Wha;Kim, Myung-Jae;Baek, Rock-Seon;Heo, Hee-Soo;Yu, Ha-Jin
    • Phonetics and Speech Sciences / v.6 no.2 / pp.21-28 / 2014
  • We propose a novel scheme for authenticating digital audio files that have been edited by inserting small audio segments from different environmental sources. The purpose of this research is to detect the inserted sections in a given audio file. We expect the proposed method to assist human investigators by flagging suspected audio sections that are considered to have been recorded or transmitted in different environments. GMM-UBM and GSV-SVM are applied to model the dominant environment of a given audio file. Four kinds of likelihood-ratio-based scores and an SVM score are used to measure the likelihood under the dominant environment model. We also use an ensemble score that combines these five scores. In the experimental results, the proposed method shows the lowest average equal error rate when the ensemble score is used. Even when the dominant environments were unknown, the proposed method gives similar accuracy.

Musician Search in Time-Series Pattern Index Files using Features of Audio (오디오 특징계수를 이용한 시계열 패턴 인덱스 화일의 뮤지션 검색 기법)

  • Kim, Young-In
    • Journal of the Korea Society of Computer and Information / v.11 no.5 s.43 / pp.69-74 / 2006
  • With the recent development of multimedia content-based retrieval technologies, musician retrieval using features of digital audio data has attracted great attention among music information retrieval technologies. However, indexing techniques for music databases have not been studied thoroughly. In this paper, we present a musician retrieval technique based on audio features that uses space split methods in a time-series pattern index file. We use audio features to retrieve the musician and a time-series pattern index file to search for candidate musicians. Experimental results show that the time-series pattern index file using the rotational split method is efficient for musician retrieval.

Analysis and Improving Suggestions for Knowledge and Lecture Podcasts (지식과 강의 팟캐스트의 현황 분석과 발전방향)

  • Han, Seong Won;Hur, Gi Taek;Kim, Eun Seok
    • Proceedings of the Korea Contents Association Conference / 2008.05a / pp.121-125 / 2008
  • As the capabilities and technology of mobile devices develop, it is now possible to play audio files downloaded from the internet on mobile devices. Audio and video files provided through the internet with an automatic update service have become known as 'podcasts'. Major broadcasters and universities in Europe and America use podcasts as one of their major services, so a large number of Knowledge and Lecture podcasts are actively being provided. In this paper, we analyze Knowledge and Lecture podcasts and suggest improvements.

A Study on Forgery Techniques of Smartphone Voice Recording File Structure and Metadata (스마트폰 음성녹음 파일 구조 및 메타데이터의 위변조 기법에 관한 연구)

  • Park, Jae Wan;Kwak, Won Jun;Lee, John Sanghyun
    • The Journal of the Convergence on Culture Technology / v.8 no.6 / pp.807-812 / 2022
  • Recently, as the number of voice recording files submitted as court evidence has increased, so has the number of cases claiming forgery. If the file structure and metadata of an audio recording, which serve as objective grounds, are completely forged, it is practically impossible to detect a sophisticated forgery. It is extremely rare for a court to reject a file structure and metadata analysis performed on a forged audio recording file. The purpose of this study is to prove that forging the file structure and metadata of a voice recording is easily possible. To this end, the study classifies the editing methods for voice recording files and shows that forgery detection is impossible when the 'mixed paste' function, which enables sophisticated editing, is applied. Moreover, experiments demonstrate that forgery of file structure and metadata is possible. Therefore, a stricter standard for judging admissibility is required when an audio recording file is adopted as digital evidence. This study will contribute not only to the integrity standard that judges apply when adopting digital evidence, but also to methods for constructing datasets for the artificial intelligence forgery detection of recorded files that is expected to be developed in the future.

Unleashing the Power of Digitization: National Mission for Manuscript's Analysis and Special Efforts in Enhancing Manuscript Usability and Preserving Cultural Heritage in Uttar Pradesh

  • Priyanka Jaiswal;Abhay Chaurasia;Ajay Pratap Singh
    • International Journal of Knowledge Content Development & Technology / v.14 no.3 / pp.7-18 / 2024
  • The present study focuses on the activities and efforts of the National Mission for Manuscripts (NMM) in the Uttar Pradesh region, which is known for its vast area, population, and rich cultural heritage. The aim is to examine the digitization work carried out by the NMM in this area, as digitization plays a crucial role in preserving our country's rich ancient heritage. The importance of safeguarding cultural heritage is universally acknowledged, and digitization serves as a vital tool in this endeavour. Through digitization, we can protect and preserve our heritage for future generations. The government has implemented several commendable initiatives for manuscript digitization, and the NMM stands as a prominent organization dedicated to the conservation of cultural heritage. The NMM possesses a diverse range of cultural heritage resources, including photographic slides, photographs, digital images, photo-negatives, motion pictures, audio spools, microfiche, LP records, endangered manuscripts, audio and videotapes, microfilms, digital audio and video files, and more. The mission has undertaken extensive digitization efforts to conserve and provide access to a significant portion of its collection. This study is unique as it explores the digital conservation and digitization practices of a premier institute working in the field of art and cultural heritage in Uttar Pradesh. With its extensive network of institutions, the mission aims to cover all manuscripts, digitize them, and consolidate them on a common platform for easy access and utilization.

ENF based Detection of Forgery and Falsification of Digital Files due to Quadratic Interpolation (이차 보간에 따른 ENF 기반의 위변조 디지털 파일 탐지 기법)

  • Park, Se Jin;Yoon, Ji Won
    • Journal of KIISE / v.45 no.3 / pp.311-320 / 2018
  • Recently, digital audio and video are increasingly used as evidence in criminal cases and all kinds of litigation, and scientific investigation using digital forensic techniques is developing. With advances in computing and file editing technologies, anyone can easily manipulate video files, and cases of digital data manipulation are increasing. As a result, the integrity and reliability of evidence are required. In this paper, we propose a technique that extracts the Electrical Network Frequency (ENF) of the power grid, which depends on the geographical environment of the power supply, and then performs signal processing for peak detection using QIFFT. Using a detection algorithm based on the standard deviation, falsification of a video file was confirmed with 73% accuracy and the forgery point was located.
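
The QIFFT peak-detection step mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the Hann window, the log-magnitude parabola, the 50 to 70 Hz search band, and the names `dft_bin` and `qifft_peak` are all assumptions chosen for illustration. The idea is to find the strongest DFT bin near the nominal mains frequency and then fit a parabola through that bin and its two neighbours to estimate the peak frequency between bins.

```python
import cmath
import math


def dft_bin(x, k):
    """Return the k-th DFT coefficient of sequence x (naive O(N) sum)."""
    n = len(x)
    return sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(x))


def qifft_peak(signal, fs, f_lo, f_hi):
    """Estimate the dominant frequency in [f_lo, f_hi] Hz.

    Hypothetical helper: picks the largest DFT bin in the band, then
    refines it by quadratic (parabolic) interpolation of log-magnitudes.
    """
    n = len(signal)
    # Hann window (an assumption) to reduce spectral leakage.
    windowed = [v * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, v in enumerate(signal)]
    k_lo = max(1, int(f_lo * n / fs))
    k_hi = min(n // 2 - 1, int(f_hi * n / fs))
    # Log-magnitudes for the band plus one neighbour bin on each side.
    log_mag = {k: math.log(abs(dft_bin(windowed, k)) + 1e-12)
               for k in range(k_lo - 1, k_hi + 2)}
    k = max(range(k_lo, k_hi + 1), key=lambda b: log_mag[b])
    a, b, c = log_mag[k - 1], log_mag[k], log_mag[k + 1]
    denom = a - 2 * b + c
    # Vertex of the parabola through the three points, in bins relative to k.
    offset = 0.0 if denom == 0 else 0.5 * (a - c) / denom
    return (k + offset) * fs / n


# Usage: a 1-second test tone slightly off the 60 Hz bin centre.
fs = 1000
tone = [math.sin(2 * math.pi * 60.3 * i / fs) for i in range(fs)]
estimate = qifft_peak(tone, fs, 50, 70)
```

With a 1 Hz bin spacing, the plain peak bin can only report whole-hertz values, while the interpolated estimate resolves the fractional deviation that ENF analysis relies on.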

Restoration of damaged speech files using deep neural networks (심층 신경망을 활용한 손상된 음성파일 복원 자동화)

  • Heo, Hee-Soo;So, Byung-Min;Yang, Il-Ho;Yoon, Sung-Hyun;Yu, Ha-Jin
    • The Journal of the Acoustical Society of Korea / v.36 no.2 / pp.136-143 / 2017
  • In this paper, we propose a method for restoring damaged audio files using a deep neural network. It differs from conventional restoration based on file carving. The purpose of our method is to infer lost information that cannot be restored by existing techniques such as file carving. We have devised methods that automate tasks which are essential for restoration but unsuitable for humans to perform. The study shows that damaged files which the conventional file carving method could not restore can be restored by using deep neural network tasks such as speech/non-speech decision and speech encoder recognition.

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

  • Miran Kim
    • Phonetics and Speech Sciences / v.15 no.2 / pp.13-20 / 2023
  • This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.
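
For readers unfamiliar with the word error rate (WER) used as the baseline metric in the abstract above, here is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and an ASR hypothesis, divided by the number of reference words. The abstract does not describe the study's text normalization, so the lowercasing and whitespace tokenization here, and the function name `word_error_rate`, are assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()   # assumed normalization: lowercase,
    hyp = hypothesis.lower().split()  # split on whitespace
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(prev[j] + 1,         # deletion
                          curr[j - 1] + 1,     # insertion
                          prev[j - 1] + cost)  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# Usage: one substitution ("sat" -> "sit") and one deleted "the".
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Because insertions are counted against the reference length, WER can exceed 1.0 when the hypothesis is much longer than the reference.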