• Title/Summary/Keyword: 소리 생성

Search Result 88, Processing Time 0.024 seconds

Imaging Inner Structure of Bukbawi at Mt. Palgong Provincial Park Using Ground Penetrating Radar (지하투과레이더를 활용한 팔공산 도립공원 북바위 내부구조 연구)

  • Kim, Hyeong-Gi;Baek, Seung-Ho;Kim, Seung-Sep;Lee, Na Young;Kwon, Jang-Soon
    • Economic and Environmental Geology
    • /
    • v.50 no.6
    • /
    • pp.487-495
    • /
    • 2017
  • A granite rock body, called 'Bukbawi', located on a mountaineering trail at Mt. Palgong Provincial Park is popular among the public because it resembles a percussion instrument. If someone hits the specific surface area of this rock body, people can hear drum-like sound. Such phenomenon may be geologically associated with exfoliation process of the granite body or miarolitic cavity developed after gasses escaped during formation of granite. To understand better the inner structure causing drum-like sound, we carried out a non-destructive ground-penetrating radar survey. In this study, as our primary target is very close to the surface, we utilized 1 GHz antennas to produce high-resolution near-surface images. In order to construct 3-D internal images, the measurements were conducted along a pre-defined grid. The processed radargrams revealed that the locations associated with 'drum' sound coincide with strong reflections. In addition, both reflection patterns of fracture and cavity were observed. To further quantify the observed reflections, we simulated GPR scans from a synthetic fracture in a granite body, filled with different materials. The simulated results suggest that both exfoliation process and miarolitic cavity may have contributed to the 'drum' phenomena. Furthermore, the radargrams showed a well-developed cavity signature where two major reflection planes were crossed. Thus, our study is an example of non-destructive geophysical studies that can promote Earth Science in the broader community by examining geological structures attracting the public.

Enhanced Sound Signal Based Sound-Event Classification (향상된 음향 신호 기반의 음향 이벤트 분류)

  • Choi, Yongju;Lee, Jonguk;Park, Daihee;Chung, Yongwha
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.5
    • /
    • pp.193-204
    • /
    • 2019
  • The explosion of data due to the improvement of sensor technology and computing performance has become the basis for analyzing the situation in the industrial fields, and various attempts to detect events based on such data are increasing recently. In particular, sound signals collected from sensors are used as important information to classify events in various application fields as an advantage of efficiently collecting field information at a relatively low cost. However, the performance of sound-event classification in the field cannot be guaranteed if noise can not be removed. That is, in order to implement a system that can be practically applied, robust performance should be guaranteed even in various noise conditions. In this study, we propose a system that can classify the sound event after generating the enhanced sound signal based on the deep learning algorithm. Especially, to remove noise from the sound signal itself, the enhanced sound data against the noise is generated using SEGAN applied to the GAN with a VAE technique. Then, an end-to-end based sound-event classification system is designed to classify the sound events using the enhanced sound signal as input data of CNN structure without a data conversion process. The performance of the proposed method was verified experimentally using sound data obtained from the industrial field, and the f1 score of 99.29% (railway industry) and 97.80% (livestock industry) was confirmed.

The Genealogy of Forbidden Sound -Political Aesthetics of Ambiguity in the Criticism of Japanese Style in Korean Society of the 1960s (일본적인 것, 혹은 금지된 '소리'의 계보 -한일국교정상화 성립기 '왜색(倭色)' 비판담론과 양의성의 정치미학)

  • Jeong, Chang-Hoon
    • Journal of Popular Narrative
    • /
    • v.25 no.1
    • /
    • pp.349-392
    • /
    • 2019
  • In the 1960s of Korea, the normalization of diplomatic relations between Korea and Japan led to a sense of a vigorous anxiety and fear that "Japan will once again come to the Korean peninsula". As a reaction to this, the discourse on the criticism of 'Japanese Style' strongly emerged. If the prior discourse of criticism was to express the national antipathy toward the colonial remnants that had not yet been disposed of, the critical discourse of the 1960s was the wariness of the newly created 'Japanese Style' in popular culture, and to grasp it as a symptomatic phenomenon that 'evil-minded Japan' was revealed. Thus, this new logic of criticism of the 'Japanese Style' had a qualitative difference from the existing ones. It was accompanied by a willingness to inspect and censor the 'masses' that grew up as consumers of transnational 'mass culture' that flowed and chained in the geopolitical order under the Cold War system. Therefore, the topology of 'popular things=Japanese things=consuming things' reveals the paradox of moral demands that existed within Korean society in the 1960s. This was to solidify the divisive circulation structure that caused them to avoid direct contact with the other called 'Japan', but at the same time, get as close to it as ever. It is a repetitive obsession that pushes the other to another side through the moral segregation that strictly draws a line of demarcation between oneself and the other, but on the other hand is attracted to the object and pulls it back to its side. This paper intends to listen to the different voices that have arisen in the repetitive obsession to understand the significance of the dissonance that has been repeated in the contemporary era. This will be an examination of the paradoxical object of Japan that has been repeatedly asked to build the internal control principle of Korean society, or to hide the oppressive and violent side of the power, and that can neither be accepted nor destroyed completely as part of oneself.

Conversion of Image into Sound Based on HSI Histogram (HSI 히스토그램에 기초한 이미지-사운드 변환)

  • Kim, Sung-Il
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.3
    • /
    • pp.142-148
    • /
    • 2011
  • The final aim of the present study is to develop the intelligent robot, emulating human synesthetic skills which make it possible to associate a color image with a specific sound. This can be done on the basis of the mutual conversion between color image and sound. As a first step of the final goal, this study focused on a basic system using a conversion of color image into sound. This study describes a proposed method to convert color image into sound, based on the likelihood in the physical frequency information between light and sound. The method of converting color image into sound was implemented by using HSI histograms through RGB-to-HSI color model conversion, which was done by Microsoft Visual C++ (ver. 6.0). Two different color images were used on the simulation experiments, and the results revealed that the hue, saturation and intensity elements of each input color image were converted into fundamental frequency, harmonic and octave elements of a sound, respectively. Through the proposed system, the converted sound elements were then synthesized to automatically generate a sound source with wav file format, using Csound.

The Emotional Boundary Decision in a Linear Affect-Expression Space for Effective Robot Behavior Generation (효과적인 로봇 행동 생성을 위한 선형의 정서-표정 공간 내 감정 경계의 결정 -비선형의 제스처 동기화를 위한 정서, 표정 공간의 영역 결정)

  • Jo, Su-Hun;Lee, Hui-Sung;Park, Jeong-Woo;Kim, Min-Gyu;Chung, Myung-Jin
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.540-546
    • /
    • 2008
  • In the near future, robots should be able to understand human's emotional states and exhibit appropriate behaviors accordingly. In Human-Human Interaction, the 93% consist of the speaker's nonverbal communicative behavior. Bodily movements provide information of the quantity of emotion. Latest personal robots can interact with human using multi-modality such as facial expression, gesture, LED, sound, sensors and so on. However, a posture needs a position and an orientation only and in facial expression or gesture, movements are involved. Verbal, vocal, musical, color expressions need time information. Because synchronization among multi-modalities is a key problem, emotion expression needs a systematic approach. On the other hand, at low intensity of surprise, the face could be expressed but the gesture could not be expressed because a gesture is not linear. It is need to decide the emotional boundaries for effective robot behavior generation and synchronization with another expressible method. If it is so, how can we define emotional boundaries? And how can multi-modality be synchronized each other?

  • PDF

Aesthetic Implications of the Algorithm Applied to New Media Art Works : A Focus on Live Coding (뉴미디어 예술 작품에 적용된 알고리즘의 미학적 함의 : 라이브 코딩을 중심으로)

  • Oh, Junho
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.3
    • /
    • pp.119-130
    • /
    • 2013
  • This paper researches the algorithm, whose materiality and expressiveness can be obtained through live coding. Live coding is an improvised genre of music that generates sounds while writing code in real time and projecting it onto a screen. Previous studies of live coding have focused on the development environment to support live coding performance effectively. However, this study examines the aesthetic attitude immanent in the realization of the algorithm through analyzing mostly used languages such as ChucK, Impromtu, and the visualization of live code and cases of "aa-cell" and "slub" performance. The aesthetic attitudes of live coding performance can be divided into algebraic and geometric attitudes. Algebraic attitudes underline the temporal development of concepts; geometric attitudes highlight the materialization of the spatial structure of concepts through image schemas. Such a difference echoes the tension between conception and materiality, which appears in both conceptual and concrete poetry. The linguistic question of whether conception or materiality is more greatly emphasized defines the expressiveness of the algorithm.

Adaptive Noise Canceller of Single Channel For Heart Sound Enhancement (심음 향상을 위한 단일채널 적응 잡음 제거기)

  • Lee, Chul-Hyun;Kim, Pil-Un;Lee, Yun-Jung;Chang, Yong-Min;Bae, Keun-Sung;Cho, Jin-Ho;Kim, Myoung-Nam
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.7
    • /
    • pp.973-982
    • /
    • 2010
  • In this paper, we proposed the single-channel adaptive noise canceller for the enhancement of heart sound (HS) in the auscultation signal. In case of either normal or emergency, a HS diagnosis is difficult due to the various signal source in the chest. Therefore, the HS enhancement is necessary. The conventional active noise canceller(ANC) has two channel, main signal and reference signal. For signal channel, the reference signal in ANC was generated by the proposed HS analyser and BS-Gate based on the characteristic of HS. This reference signal is suitable to the ANC condition. Experimental data were acquisited from MP36, SS30L in BIOPAC Inc., By the experiment, we confirmed that the proposed single-channel ANC was efficient for HS enhancement. And by the comparison with active linear enhancement, it was validate that the proposed ANC is not affected by the variation of a heartbeat.

Adaptive Keyframe-Based Tracking for Augmented Books (증강 책을 위한 적응형 키프레임 기반 트래킹)

  • Yoo, Jae-Sang;Cho, Kyu-Sung;Yang, Hyun-S.
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.4
    • /
    • pp.502-506
    • /
    • 2010
  • An augmented book is an application that augments such multimedia elements as virtual 3D objects generated by computer graphics, movie clips, or sound clips to a real book using AR technologies. It is intended to bring additional education and entertainment effects to users. For augmented books, this paper proposes an adaptive keyframe-based page tracking method to estimate the camera's 6 DOF pose in real-time after recognizing a page and performing wide-baseline keypoint matching. For a page tracking, proposed method in this paper chooses a proper keyframe and performs a tracking in two step of coarse-to-fine stage. As a result, the proposed method in this paper guarantees a robust tracking to view-point and illumination variations and real-time.

A study on Interactive-type Exhibition Using Fractal Images (프랙탈 이미지를 활용한 쌍방향 실감형 전시에 관한 연구)

  • Lim, Mi-Jeong;Cho, Hyong-Je;Choi, Gyoo-Seok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.15 no.5
    • /
    • pp.163-168
    • /
    • 2015
  • Recent exhibition's paradigm is changing from the existing unidirectional oriented exhibition form to a form of interactive hands-on exhibits that viewers can get and realistically feel a variety of information. Hands-on exhibit embodies the human interface by utilizing light, sound, pressure, etc. in time and space. In this paper, we have studied the creation of fractal image by the Mandelbrot technique and proposed the interaction method for it to be converted into a variety of forms. By using the proposed method, a variety of image transformation such as printmaking effect, sketch effect, Pop Art effect can be performed, according to clicking a certain fraction on the created fractal image screen by a user mouse. Interactive image generated in this study are expected to be used for trade shows, promotional products, media art design.

Non-Dialog Section Detection for the Descriptive Video Service Contents Authoring (화면해설방송 저작을 위한 비 대사 구간 검출)

  • Jang, Inseon;Ahn, ChungHyun;Jang, Younseon
    • Journal of Broadcast Engineering
    • /
    • v.19 no.3
    • /
    • pp.296-306
    • /
    • 2014
  • This paper addresses a problem of non-dialog section detection for the DVS authoring, the goal of which is to find meaningful section from the broadcasting audio, where audio description can be inserted. The broadcasting audio involves the presence of various sounds so that it first discriminates between speech and non-speech for each audio frame. Proposed method jointly exploits the inter-channels structure and speech source characteristics of the broadcasting audio whose number of channel is stereo. Also, rule based post-processing is finally applied to detect the non-dialog section whose length is appropriate for audio description. Proposed method provides more accurate detection compared to conventional method. Experimental results on real broadcasting contents show that qualitative superiority of the proposed method.