• Title/Summary/Keyword: Spatial Sound


DECODE: A Novel Method of DEep CNN-based Object DEtection using Chirps Emission and Echo Signals in Indoor Environment (실내 환경에서 Chirp Emission과 Echo Signal을 이용한 심층신경망 기반 객체 감지 기법)

  • Nam, Hyunsoo;Jeong, Jongpil
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.3
    • /
    • pp.59-66
    • /
    • 2021
  • Humans recognize surrounding objects mainly through visual and auditory information among the five senses (sight, hearing, smell, touch, taste). Recent object recognition research focuses mainly on the analysis of image sensor information. In this paper, various chirp audio signals were emitted into the observation space, the echoes were collected through a 2-channel receiving sensor and converted into spectral images, and an object recognition experiment in 3D space was conducted using a deep-learning-based image learning algorithm. The experiment was carried out in a general indoor environment with noise and echo, not under the ideal conditions of an anechoic room, and echo-based object recognition estimated the position of the object with 83% accuracy. In addition, by mapping the inference result to the observation space and a 3D spatial sound signal and outputting it as sound, it was possible to obtain visual information through sound by learning 3D sound. This suggests that object recognition research should use various echo information along with image information, and that this technology could be used for augmented reality through 3D sound.
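The emission and spectral-image steps described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the sweep range (500 to 3500 Hz), the 8 kHz sample rate, the frame length, and the helper names `linear_chirp`, `spectrogram`, and `peak_bins` are all assumptions, and a naive DFT stands in for whatever transform the paper used.

```python
import cmath
import math


def linear_chirp(f0, f1, duration, sample_rate):
    """Samples of a linear chirp sweeping from f0 to f1 Hz over `duration` seconds."""
    n = round(duration * sample_rate)
    k = (f1 - f0) / duration  # sweep rate in Hz per second
    return [math.sin(2 * math.pi * (f0 * t + 0.5 * k * t * t))
            for t in (i / sample_rate for i in range(n))]


def spectrogram(signal, frame_len, hop):
    """Magnitude spectrogram from a Hann-windowed naive DFT (rows are frames)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / (frame_len - 1))
              for i in range(frame_len)]
    rows = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        row = []
        for b in range(frame_len // 2 + 1):  # non-negative frequency bins only
            acc = sum(frame[i] * cmath.exp(-2j * math.pi * b * i / frame_len)
                      for i in range(frame_len))
            row.append(abs(acc))
        rows.append(row)
    return rows


def peak_bins(spec):
    """Index of the strongest frequency bin in each frame."""
    return [max(range(len(row)), key=row.__getitem__) for row in spec]


# Hypothetical parameters chosen only to keep the example small.
sample_rate = 8000
chirp = linear_chirp(500.0, 3500.0, 0.1, sample_rate)
spec = spectrogram(chirp, frame_len=64, hop=64)
peaks = peak_bins(spec)
```

Each row of `spec` is one time frame, so stacking the rows gives the kind of time-frequency image an image classifier could take as input. For a rising chirp, the strongest bin in `peaks` moves upward from the first frame to the last.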

Corticostriatal Connections of the Superior Temporal Regions in the Macaque Monkey

  • Jung, Yongwook;Hong, Sungwon
    • Animal cells and systems
    • /
    • v.7 no.4
    • /
    • pp.317-325
    • /
    • 2003
  • Corticostriatal connections of auditory areas within the rostral and caudal portions of the superior temporal gyrus (STG) and in the supratemporal plane (STP) of the pigtail macaque (Macaca nemestrina) were studied with particular emphasis on specific projections to the ventral striatum. Retrograde tracers were injected into five different regions of the ventral striatum (the ventromedial caudate nucleus, ventral shell, central shell, and dorsal core of the nucleus accumbens (NA), and the ventrolateral putamen) to identify the cells of origin. There were only a few projections from the auditory areas in the STP to the ventral striatum. However, the association (or belt) areas of the STG collectively had widespread corticostriatal projections characterized by differential topographic distributions. The rostral parts of the STG projected strongly to the ventromedial caudate nucleus. The midportion of the STG also projected to the same ventral striatal regions, but the connections were less extensive. Interestingly, the caudal portion of the STG had no connections to any subregion of the ventral striatum. These differential patterns of corticostriatal connectivity suggest that the ventromedial caudate nucleus is a major auditory convergence area and is mainly involved in sound recognition rather than spatial localization of sound sources.

Influence of Unsteady Wake on Turbulent Separated Flows over a Backward-Facing Step (후향 계단 주위 난류 박리 유동에 대한 비정상 후류의 영향)

  • Chun, Se-Jong;Sung, Hyung-Jin
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.27 no.12
    • /
    • pp.1708-1715
    • /
    • 2003
  • An experimental study was made of turbulent separated and reattaching flow over a backward-facing step, where an unsteady wake was generated by a spoked-wheel type wake generator with cylindrical rods in front of the separated flow. The influence of the unsteady wake was examined in terms of the rotating speed of the wake generator ($0 \leq St_H \leq 0.4$). A conditional averaging technique in conjunction with SBF was employed to elucidate the influence of the unsteady wake on the large-scale vortical structures of the separated flow. Special attention was paid to two-dimensional measurements of wall pressure with and without the unsteady wake. The wall-pressure fluctuations were used to predict the dipole sound source by Curle's integral formula. It was found that the reduction of the dipole sound source was due to the reduction of turbulent kinetic energy by the unsteady wake in the recirculation region.

Interactive Control Panel Layout Using a Constraint Satisfaction Algorithm (제약만족 알고리즘을 이용한 상호대화적 조종패널 배치)

  • Park, Sung-Joon;Jeong, Eui-S.;Chang, Soo-Y.
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.20 no.4
    • /
    • pp.85-97
    • /
    • 1994
  • An interactive and iterative control panel layout method based on the constraint satisfaction problem (CSP) technique was developed to generate an ergonomically sound panel design. This method attempts to incorporate a variety of relevant ergonomic principles and design constraints, and to generate an optimal or, at least, a "satisfactory" solution through an efficient search algorithm. The problem of seeking an ergonomically sound panel design should be viewed as a multi-criteria design problem, and most of the design objectives should be understood as constraints. Hence, a CSP technique was employed in this study to deal with the multi-constraint layout problem. An efficient search algorithm using "preprocess" and "look_ahead" procedures was developed to handle the vast amount of computation. In order to apply the CSP technique to the panel layout procedure, ergonomic principles such as spatial compatibility, frequency-of-use, importance, functional grouping, and sequence-of-use were formalized in CSP terms. The effectiveness of the proposed panel layout method was evaluated on example problems, and the results clearly showed that the generated layouts properly considered various ergonomic design principles.
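A toy version of constraint-based panel layout is sketched below. It uses backtracking with forward checking, which is one common form of look-ahead and is not claimed to be the paper's "look_ahead" procedure. The control names, the three slots, and the three constraint functions are hypothetical stand-ins for principles such as sequence-of-use and functional grouping.

```python
def solve(variables, domains, constraints, assignment=None):
    """Backtracking search with forward checking, a simple form of look-ahead."""
    assignment = assignment or {}
    if len(assignment) == len(variables):
        return assignment
    var = next(v for v in variables if v not in assignment)
    for value in domains[var]:
        trial = {**assignment, var: value}
        pruned = dict(domains)
        consistent = True
        for other in variables:
            if other in trial:
                continue
            # Keep only values of `other` compatible with the new assignment.
            pruned[other] = [w for w in domains[other]
                             if all(c(var, value, other, w) for c in constraints)]
            if not pruned[other]:
                consistent = False  # a future variable has no value left
                break
        if consistent:
            result = solve(variables, pruned, constraints, trial)
            if result is not None:
                return result
    return None


def different_slots(a, x, b, y):
    """No two controls may share a slot."""
    return x != y


def power_before_volume(a, x, b, y):
    """Sequence-of-use: 'power' sits to the left of 'volume'."""
    if (a, b) == ("power", "volume"):
        return x < y
    if (a, b) == ("volume", "power"):
        return y < x
    return True


def volume_next_to_tuner(a, x, b, y):
    """Functional grouping: 'volume' and 'tuner' occupy adjacent slots."""
    if {a, b} == {"volume", "tuner"}:
        return abs(x - y) == 1
    return True


controls = ["power", "volume", "tuner"]
slots = {name: [0, 1, 2] for name in controls}
layout = solve(controls, slots,
               [different_slots, power_before_volume, volume_next_to_tuner])
```

Expressing each ergonomic principle as a pairwise predicate keeps the search generic: adding a new principle means adding one function to the constraint list rather than changing `solve`.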


COMPUTATION OF AERODYNAMIC SOUNDS AT LOW MACH NUMBERS USING FINITE DIFFERENCE LATTICE BOLTZMANN METHOD

  • Kang H. K;Tsutahara M;Shikata K;Kim E. R;Kim Y. T;Lee Y. H
    • Journal of computational fluids engineering
    • /
    • v.10 no.1
    • /
    • pp.8-15
    • /
    • 2005
  • Aerodynamic sounds generated by a uniform flow around a two-dimensional circular cylinder at Re=150 are simulated by applying the finite difference lattice Boltzmann method. The third-order-accurate upwind scheme (UTOPIA) is used for the spatial derivatives, and the second-order-accurate Runge-Kutta scheme is applied for the time marching. We have succeeded in capturing pressure fluctuations that are very small compared with the pressure fluctuation around the circular cylinder and that have the same frequency as the Karman vortex street. The propagation velocity of the acoustic waves shows that the points of peak pressure are biased upstream due to the Doppler effect in the uniform flow. Downstream, on the other hand, the propagation is faster. It is also apparent that the amplitude of the sound pressure is proportional to $r^{-1/2}$, $r$ being the distance from the center of the circular cylinder. Furthermore, to investigate the effect of lattice dependence, 2D computations of the tonal noise radiated by a square cylinder and by a NACA0012 airfoil with a blunt trailing edge at high incidence and low Reynolds number are also carried out.

Age-related Deficits in Response Characteristics on Safety Warning of Intelligent Vehicle (지능형 자동차의 안전 경고음에 대한 고령운전자의 반응 특성)

  • Kim, Man-Ho;Lee, Yong-Tae;Son, Joon-Woo;Jang, Chee-Hwan
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.26 no.12
    • /
    • pp.131-137
    • /
    • 2009
  • Recent technological advances have made vehicles more intelligent in order to increase safety and comfort. An intelligent vehicle provides drivers with safety warning information through audible sounds, visual displays, and tactile devices. However, elderly drivers are known to experience declines in physical and cognitive abilities such as muscular strength, hearing, eyesight, short-term memory, and spatial perception. Therefore, possible age-related deficits should be considered when designing an effective warning system. This paper aims to evaluate the impact of advancing age on response performance to audible safety warnings, which are widely used for alerting drivers to hazards. In order to understand the effect of age-related hearing loss and movement slowing, three sound characteristics (frequency, intensity, and period) and three age groups (younger, middle, and older) are considered. Data were drawn from 38 drivers who drove a simulated rural road in a driving simulator. Experimental results show that age influences drivers' response performance. In conclusion, an appropriate range for a warning sound is suggested.

Variations of Abundance and Hatch Timing of Dungeness Crab Larvae in Southeastern Alaska: Implications for Climate Effect

  • Park, Won-Gyu;Shirley, Thomas C.
    • Animal cells and systems
    • /
    • v.12 no.4
    • /
    • pp.287-295
    • /
    • 2008
  • Variations in larval abundance and hatch timing of Dungeness crabs, Cancer magister Dana 1852, were investigated. Dungeness crab larvae were collected monthly at 16 stations arrayed in four transects (Upper Chatham, Icy Strait, Cross Sound, and Icy Point) in southeastern Alaska from May to September, 1997-2004. Larval abundance was highest in June at all transects except the Icy Point transect. Larval abundance was highest in the Icy Strait transect, moderate in the Upper Chatham and Cross Sound transects, and lowest in the Icy Point transect. Zoeae I (ZI) predominated in May; thereafter ZI decreased and late zoeal stages occurred. In May and June, small numbers of late stage larvae unusually co-occurred with ZI in three transects. These late stage larvae may have been transported from areas where hatching occurs earlier. The timing of ZI occurrence varied interannually and was related to degree-days during the egg incubation period of Dungeness crabs: larval hatching was later in 1997 and 2002, when temperatures were colder, and earlier in 1998, when temperatures were warmer. The distribution patterns of Dungeness crab larvae in southeastern Alaska were markedly different from those reported from other areas of the species' distribution range: larvae occurred much later in the year, and late stage larvae occurred in inland waters.

Sound Quality Enhancement in MPEG Surround by Using ILD Distortion (ILD DISTORTION을 이용한 MPEG SURROUND의 음질 개선)

  • Chon, Sang-Bae;Choi, In-Yong;Sung, Koeng-Mo
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.241-242
    • /
    • 2006
  • MPEG Surround is an audio coding technology that represents a multi-channel audio signal with downmixed audio signal(s) and very low bitrate side information based on Binaural Cue Coding. The side information consists of Inter-Channel Level Difference (ICLD), Inter-Channel Correlation, and payloads. The first two parameters correspond to the well-known spatial parameters in psychoacoustics, Inter-Aural Level Difference (ILD) and Inter-Aural Cross Correlation (IACC). Although ICLD is intended to provide a perceptually equivalent ILD to the listener, the ILD of the original multi-channel audio signal and that of the MPEG Surround encoded signal were different. The difference between the two ILD values is defined as ILD Distortion (ILDD). This paper shows how ILDD can be applied to enhance sound quality in MPEG Surround and by how much ILDD is decreased.
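The ILD and ILDD quantities named in this abstract can be illustrated with a broadband energy ratio. This is a simplified sketch under stated assumptions: the abstract does not give the exact formula, so the energy-ratio definition in dB, the signed difference, and the function names `ild_db` and `ild_distortion` are assumptions, and the inputs are treated as the two ear signals.

```python
import math


def ild_db(left, right, eps=1e-12):
    """Level difference in dB between two ear signals (assumed energy-ratio form)."""
    energy_left = sum(x * x for x in left)
    energy_right = sum(x * x for x in right)
    # eps avoids log(0) or division by zero for silent channels.
    return 10.0 * math.log10((energy_left + eps) / (energy_right + eps))


def ild_distortion(orig_left, orig_right, coded_left, coded_right):
    """ILDD taken here as coded ILD minus original ILD, in dB."""
    return ild_db(coded_left, coded_right) - ild_db(orig_left, orig_right)


# Hypothetical signals: the original is 6 dB louder on the left,
# while the coded version has lost the level difference entirely.
orig_l, orig_r = [1.0] * 100, [0.5] * 100
coded_l, coded_r = [1.0] * 100, [1.0] * 100
ildd = ild_distortion(orig_l, orig_r, coded_l, coded_r)
```

In this example `ildd` is about -6 dB, meaning the coded signal understates the original level difference. A practical measure would more likely be computed per frequency band and per time frame rather than over the whole signal.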


Pitch-shifted sound synthesis using digital waveguide model (피치 변화음의 합성을 위한 도파관 모델)

  • Cho, Sang-Jin;Kang, Myeong-Su;Chong, Ui-Pil
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.2
    • /
    • pp.127-131
    • /
    • 2009
  • In digital waveguide theory, traveling waves are represented by the general solution to the wave equation, which is a second-order linear partial differential equation. The movement of these waves can be implemented using only delay lines. A unit delay in the general digital waveguide describes a sampling time interval. However, in the space-based digital waveguide the unit delay implies the spatial sampling distance. Given these differences between the two models, the space-based digital waveguide model is known to be adequate for synthesizing pitch-shifted sounds such as vibrato, because the propagation distance can be directly controlled. In this paper, a time-based digital waveguide model that also synthesizes pitch-shifted sounds is proposed and compared with the space-based digital waveguide.
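The statement that traveling waves can be implemented with delay lines alone can be illustrated with a single recirculating delay line. This is a minimal time-based sketch, not the model proposed in the paper: the function names, the `loss` factor, and the impulse excitation are assumptions, and the loop length is fixed, so it shows a static pitch change rather than vibrato.

```python
from collections import deque


def waveguide_loop(excitation, num_samples, loss=1.0):
    """Recirculating delay line: each unit delay is one sampling interval."""
    line = deque(excitation)
    out = []
    for _ in range(num_samples):
        sample = line.popleft()      # wave arriving at the output end
        out.append(sample)
        line.append(loss * sample)   # feed it back into the line
    return out


def fundamental_hz(sample_rate, delay_len):
    """Pitch of the loop: one period lasts `delay_len` samples."""
    return sample_rate / delay_len


# A 4-sample loop excited by a single impulse repeats every 4 samples.
tone = waveguide_loop([1.0, 0.0, 0.0, 0.0], 12)
# Shortening the line from 100 to 50 samples doubles the pitch.
low = fundamental_hz(44100, 100)
high = fundamental_hz(44100, 50)
```

Because the delay length is an integer number of samples, the available pitches are quantized to `sample_rate / N`. A smooth pitch shift would need a fractional or time-varying delay, which is not shown here.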


A DNN-Based Personalized HRTF Estimation Method for 3D Immersive Audio

  • Son, Ji Su;Choi, Seung Ho
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.1
    • /
    • pp.161-167
    • /
    • 2021
  • This paper proposes a new personalized HRTF estimation method based on a deep neural network (DNN) model, with improved elevation reproduction using a notch filter. In a previous study, a DNN model was proposed that estimates the magnitude of the HRTF from anthropometric measurements [1]. However, since this method uses zero phase without estimating the phase, it causes internalization (i.e., inside-the-head localization) of sound when listening to spatial sound. We devise a method to estimate both the magnitude and the phase of the HRTF based on the DNN model. A personalized HRIR was estimated using anthropometric measurements, including detailed data on the head, torso, shoulders, and ears, as inputs to the DNN model. The estimated HRIR was then filtered with an appropriate notch filter to improve elevation reproduction. To evaluate the performance, both objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated HRTF are measured. For the subjective evaluation, a MUSHRA test and a preference test are conducted. As a result, the proposed method can give listeners a more immersive audio experience than the previous methods.
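The two objective measures named in this abstract can be sketched as follows. The abstract does not state the exact formulas, so this uses one common definition of each: RMSE over corresponding samples and LSD as the root mean square of the dB difference between magnitude responses. The function names and sample values are illustrative assumptions.

```python
import math


def rmse(reference, estimate):
    """Root mean square error between two equal-length sequences."""
    n = len(reference)
    return math.sqrt(sum((r - e) ** 2 for r, e in zip(reference, estimate)) / n)


def log_spectral_distance(ref_mag, est_mag, eps=1e-12):
    """RMS of the per-bin dB difference between two magnitude responses."""
    diffs = [20.0 * math.log10((r + eps) / (e + eps))
             for r, e in zip(ref_mag, est_mag)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


# Hypothetical magnitude responses over three frequency bins.
ref = [1.0, 0.5, 0.25]
est = [1.0, 0.5, 0.5]
err = rmse(ref, est)
lsd = log_spectral_distance(ref, est)
```

RMSE weights errors by their linear size, so it is dominated by high-level bins, whereas LSD works on a dB scale and treats a factor-of-two error the same at any level. Reporting both gives complementary views of estimation quality.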