• Title/Summary/Keyword: acoustic features (음향적 특징)

Search Result 437, Processing Time 0.03 seconds

Preparation of Biopolymer coated Magnetite And Magnetic Biopolymer Microsphere Particles for Medical Application (의학적 응용을 위한 생체 고분자로 피복 된 자성 나노 입자와 미소구체의 제조)

  • Ko, Sang-Gil;Cho, Jun-Hee;Ahn, Yang-kyu;Song, Ki-Chang;Choi, Eun-Jung
    • Journal of the Korean Magnetics Society / v.16 no.4 / pp.221-227 / 2006
  • We synthesized uniform nanometer-sized magnetite particles by chemical coprecipitation combined with a sonochemical method, using a surfactant such as oleic acid. X-ray diffraction confirmed the magnetite phase of the nanoparticles. For medical application, the magnetite nanoparticles were coated with biopolymer and biopolymer microspheres were prepared. Atomic force microscopy (AFM) was used to image the coated nanoparticles. Magnetic colloid suspensions containing particles with sodium oleate, chitosan, and $\beta$-glucan were prepared. The morphology of the magnetic biopolymer microsphere particles was characterized using an optical microscope. Magnetic hysteresis measurements were performed with a superconducting quantum interference device (SQUID) magnetometer at room temperature to investigate the magnetic properties of the biopolymer microspheres and the biopolymer-coated magnetite nanoparticles. Magnetic resonance (MR) imaging was used to investigate the biopolymer-coated nanoparticles and the biopolymer microspheres.

A Study on Lip-reading Enhancement Using Time-domain Filter (시간영역 필터를 이용한 립리딩 성능향상에 관한 연구)

  • 신도성;김진영;최승호
    • The Journal of the Acoustical Society of Korea / v.22 no.5 / pp.375-382 / 2003
  • Bimodal lip-reading is used to enhance the speech recognition rate in noisy environments. Detecting the correct lip image is the most important step, but stable performance is hard to obtain in dynamic environments because many factors degrade lip-reading performance: illumination changes, the speaker's pronunciation habits, variability of lip shape, and rotation or size changes of the lips. In this paper, we propose IIR filtering in the time domain for stable performance. Digital filtering in the time domain is well suited to removing noise from speech and improving recognition performance. While lip-reading on the whole lip image produces a massive amount of data, Principal Component Analysis (PCA) as a preprocessing step reduces the data quantity by extracting features without loss of image information. To observe speech recognition performance using only image information, we ran a recognition experiment on 22 words chosen from those available in a car service. We used a Hidden Markov Model (HMM) as the speech recognition algorithm to compare recognition performance on these words. As a result, while the recognition rate of lip-reading using PCA is 64%, applying the time-domain filter raises the recognition rate to 72.4%.
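
The abstract does not state the filter design, so the following is only a minimal sketch of time-domain IIR filtering applied to per-frame lip features. It assumes a first-order low-pass filter (exponential smoothing) run independently over each feature dimension, such as a PCA coefficient. The function names, the `alpha` parameter, and the sample values are hypothetical and are not taken from the paper.

```python
def iir_lowpass(sequence, alpha=0.5):
    """First-order IIR low-pass: y[t] = alpha * x[t] + (1 - alpha) * y[t-1]."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    filtered = []
    previous = None
    for x in sequence:
        # The first output is initialized to the first input sample.
        y = x if previous is None else alpha * x + (1.0 - alpha) * previous
        filtered.append(y)
        previous = y
    return filtered


def filter_feature_tracks(frames, alpha=0.5):
    """Filter each feature dimension independently along the time axis.

    frames: list of per-frame feature vectors (for example PCA coefficients).
    """
    if not frames:
        return []
    tracks = list(zip(*frames))  # one time sequence per feature dimension
    smoothed = [iir_lowpass(track, alpha) for track in tracks]
    return [list(frame) for frame in zip(*smoothed)]


# Hypothetical 2-D feature vectors for five video frames; the spike in the
# third frame stands in for a frame-level disturbance such as a lighting change.
frames = [[1.0, 0.0], [1.0, 0.0], [5.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
smoothed = filter_feature_tracks(frames, alpha=0.5)
```

A smaller `alpha` smooths more strongly but also delays genuine lip movements, so in practice the value would need tuning against recognition accuracy.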

Implementation of Embedded Speech Recognition System for Supporting Voice Commander to Control an Audio and a Video on Telematics Terminals (텔레메틱스 단말기 내의 오디오/비디오 명령처리를 위한 임베디드용 음성인식 시스템의 구현)

  • Kwon, Oh-Il;Lee, Heung-Kyu
    • Journal of the Institute of Electronics Engineers of Korea TC / v.42 no.11 / pp.93-100 / 2005
  • In this paper, we implement an embedded speech recognition system that supports various in-car application services, such as audio and video control, through a speech recognition interface. The system is implemented on and ported to a DSP board. Because the microphone type and the speech codec affect recognition accuracy, we optimize the simulation and test environment to effectively remove real in-car noise. We apply noise suppression and feature compensation algorithms to increase the accuracy of speech recognition in a car, and we use context-dependent tied-mixture acoustic modeling. The performance evaluation showed high accuracy for the proposed system in an office environment and even in a real car environment.

A Study on Improving Speech Recognition Rate (H/W, S/W) of Speech Impairment by Neurological Injury (신경학적 손상에 의한 언어장애인 음성 인식률 개선(H/W, S/W)에 관한 연구)

  • Lee, Hyung-keun;Kim, Soon-hub;Yang, Ki-Woong
    • Journal of the Korea Institute of Information and Communication Engineering / v.23 no.11 / pp.1397-1406 / 2019
  • In everyday mobile phone calls between people with speech impairment caused by neurological injury and non-disabled people, communication accuracy is often hindered by the combination of reduced pronunciation accuracy due to the neurological impairment and the pronunciation features of the impaired speaker. To address this problem, the H/W improvement is a MEMS (micro electro mechanical systems) microphone device that includes an induction line, which artificially corrects difficult vocalization according to the oral characteristics of the speech-impaired speaker and thereby improves the handling of out-of-vocabulary words. The S/W improvement is a decision tree with an invert function, and an improved matrix-vector RNN method is proposed that considers the characteristics of continuous words. Considering the characteristics of the H/W and S/W, a similar dictionary was created, contributing to improved speech intelligibility for smooth communication.

Seismic Stratigraphy of Upper Devonian Carbonates Area in Northern Alberta, Canada (캐나다 북부 알버타주 데본기 후기 탄산염암 지역의 탄성파 층서)

  • Lee, Min-Woo;Oh, Jin-Yong;Yun, Hye-Su
    • Economic and Environmental Geology / v.44 no.6 / pp.503-511 / 2011
  • The Upper Devonian Grosmont Formation in northern Alberta, Canada, underlies the erosional unconformity that formed between the Cretaceous and the Upper Devonian. The formation is divided into four units on the basis of intercalated shales and shows a typical shallowing-upward shelf environment. The four units (LG~UG3) could be separated using the seismic interpretation attributes of polarity, continuity, frequency/spacing, and amplitude; they show reflection characteristics of medium-high amplitude, medium-low frequency, good continuity, and subparallel reflection events. Based on in-situ core data, the formation can be interpreted as a shelf or platform. However, it is difficult to recognize the boundaries and sedimentary environment of individual parasequences from reflection attributes and features alone. Therefore, in this study we interpret at the level of the parasequence set. The parasequence set was formed by erosional unconformities together with systems tracts. The erosional unconformity can be recognized from facies data and from karst and erosional surfaces. Grosmont carbonate deposits, ranging from platform and shelf to shelf slope, are characterized by wedge-shaped strata with complex sigmoid-oblique progradational configurations, reflecting a depositional history of upbuilding and outbuilding in response to sea-level changes. Most of the sedimentary units are interpreted as platforms formed under regressive and lowstand conditions, an interpretation supported by the evidence. In particular, the shale layer at the base of the highstand systems tract represents the regressive-to-lowstand stage of sea level.

Korean Phoneme Recognition Using Self-Organizing Feature Map (SOFM 신경회로망을 이용한 한국어 음소 인식)

  • Jeon, Yong-Koo;Yang, Jin-Woo;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea / v.14 no.2 / pp.101-112 / 1995
  • Constructing a feature map-based phoneme classification system for speech recognition usually requires two procedures: clustering and labeling. In this paper, we present a phoneme classification system that uses Kohonen's Self-Organizing Feature Map (SOFM) as both clusterer and labeler. The SOFM is known to perform a self-organizing process that produces an optimal local topographical mapping of the signal space and yields reasonably high accuracy in recognition tasks. Consequently, the SOFM can be applied effectively to phoneme recognition. In addition, to improve the performance of the phoneme classification system, we propose a learning algorithm combined with the classical K-means clustering algorithm in the fine-tuning stage. To evaluate the proposed phoneme classification algorithm, we first use a total of 43 phonemes, which form six intra-class feature maps for six different phoneme classes. In speaker-dependent phoneme classification tests using these six feature maps, we obtain a recognition rate of $87.2\%$ and confirm that the proposed algorithm is an efficient method for improving recognition performance and convergence speed.
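
The two procedures named in the abstract, clustering and labeling, can be sketched roughly as follows. This is an illustrative one-dimensional SOFM in plain Python, not the authors' system: the linear decay schedules, the Gaussian neighborhood, the majority-vote labeling, and all names and toy data are assumptions, and the K-means fine-tuning stage is not shown.

```python
import math
import random


def best_matching_unit(weights, x):
    """Index of the unit whose weight vector is closest to x."""
    return min(range(len(weights)),
               key=lambda j: sum((wk - xk) ** 2 for wk, xk in zip(weights[j], x)))


def train_som(samples, n_units, epochs=50, lr0=0.5, radius0=None, seed=0):
    """Clustering step: train a 1-D self-organizing map, return its weights."""
    rng = random.Random(seed)
    dim = len(samples[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(n_units)]
    radius0 = radius0 if radius0 is not None else max(n_units / 2.0, 1.0)
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                    # linearly decaying learning rate
        radius = max(radius0 * (1.0 - frac), 0.5)  # shrinking neighborhood width
        for x in rng.sample(samples, len(samples)):  # shuffled pass over the data
            bmu = best_matching_unit(weights, x)
            for j, w in enumerate(weights):
                # Gaussian neighborhood on the map grid around the winner.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * radius ** 2))
                for k in range(dim):
                    w[k] += lr * h * (x[k] - w[k])
    return weights


def label_units(weights, samples, labels):
    """Labeling step: give each unit the majority label of the samples it wins."""
    votes = [{} for _ in weights]
    for x, lab in zip(samples, labels):
        unit_votes = votes[best_matching_unit(weights, x)]
        unit_votes[lab] = unit_votes.get(lab, 0) + 1
    return [max(v, key=v.get) if v else None for v in votes]


# Toy 2-D "feature vectors" for two hypothetical phoneme classes.
samples = [[0.05, 0.10], [0.10, 0.05], [0.00, 0.00],
           [0.95, 0.90], [0.90, 0.95], [1.00, 1.00]]
labels = ["a", "a", "a", "i", "i", "i"]
weights = train_som(samples, n_units=4, epochs=20)
unit_labels = label_units(weights, samples, labels)
```

A new feature vector would then be classified by looking up the label of its best-matching unit; units that win no training samples stay unlabeled (`None`).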

Seismic Stratigraphy and Sedimentary Environment of the Dukjuk-Do Sand Ridge in Western Gyeonggi Bay, Korea (경기만 서부 덕적도 사퇴의 탄성파층서 및 퇴적환경 연구)

  • Lee, Yoon-Oh;Choi, Sang-Il;Jeong, Gyo-Cheol
    • The Journal of Engineering Geology / v.24 no.1 / pp.9-21 / 2014
  • We examined high-resolution seismic data, side scan sonar data, surface sediments, and vibrocore samples from a sand ridge off the western part of Dukjuk-Do in Gyeonggi Bay to interpret its seismic stratigraphy and sedimentary environment. Based on the seismic data, the deposited sands are divided into three sedimentary units. $^{14}$C age data indicate that the top sequence (sequence I) formed at 5000-6000 yr BP, when a transgression resulted in strong shifting tides. Analyses of the vibrocore samples indicate that sequence II is a paleo-mudflat layer of intertidal sediments dominated by mud. Sequence III consists of terrestrial sediments that are presumed to have been deposited at the end of the Pleistocene, unconformably overlying the acoustic bedrock and Mesozoic granite. The side scan sonar data indicate that sand waves formed on the seabed on top of the sand ridge. Their general direction is $N20^{\circ}E$, which coincides with the direction of tidal flow. Sand ripples occur away from the top of the sand ridge and are distributed homogeneously across a sandy slope. Vibrocore analyses indicate that the surface sediments and core sediments (samples VC-1, -2, and -3) are homogeneous, without any internal structures, and are characterized by a mixture of medium and fine sand (1-$2{\phi}$).

A study on the location of microphones in measurement considering the frequency characteristics of elevator noise in households (세대 내 승강기 소음 주파수특성을 고려한 측정 시 마이크로폰 위치에 관한 연구)

  • Min-Woo Kang;Yang-Ki Oh
    • The Journal of the Acoustical Society of Korea / v.42 no.2 / pp.124-132 / 2023
  • As buildings become taller, the number of households increases and more of them are adjacent to an elevator. Elevators are therefore used more frequently, and elevator noise is bound to increase in the future. However, Korea currently has no legal standards for elevator noise and no measurement and evaluation methods that can clearly measure it. Although KS F ISO 16032 presents some methods for measuring elevator noise, it was not established specifically for elevator noise. It integrates the overall measurement methods for building equipment, and the position of the microphone is selected by the experimenter during measurement. Elevator noise is characterized by a low sound pressure level, with the noise in the mid-low frequency band being the important component. Even so, complaints from residents about elevator noise are increasing. In this study, we investigated which microphone position picks up elevator noise most sensitively during measurement. A total of 9 microphone positions, defined by distance from the wall and height from the floor, were measured and analyzed. The experiment confirmed that elevator noise has a very strong influence in the 63 Hz band. The measured value at the center point was identified as a factor that lowered the overall elevator noise level.
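
To illustrate how a single quiet position can lower an overall level, the sketch below combines readings from several microphone positions by energy averaging, $L_{avg} = 10 \log_{10}\left(\frac{1}{n}\sum_i 10^{L_i/10}\right)$. The abstract does not say how the nine positions were combined, so energy averaging is an assumption here, and the level values are invented for illustration only.

```python
import math


def energy_average_level(levels_db):
    """Energy (power) average of sound pressure levels given in dB."""
    if not levels_db:
        raise ValueError("at least one level is required")
    mean_energy = sum(10.0 ** (level / 10.0) for level in levels_db) / len(levels_db)
    return 10.0 * math.log10(mean_energy)


# Hypothetical 63 Hz band levels (dB) at nine microphone positions; the last
# value stands in for a quieter reading at the center point of the room.
levels = [40.0] * 8 + [30.0]
with_center = energy_average_level(levels)
without_center = energy_average_level(levels[:-1])
```

With these made-up numbers, including the quieter center reading pulls the averaged level slightly below the value obtained from the other eight positions.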

Salient Region Detection Algorithm for Music Video Browsing (뮤직비디오 브라우징을 위한 중요 구간 검출 알고리즘)

  • Kim, Hyoung-Gook;Shin, Dong
    • The Journal of the Acoustical Society of Korea / v.28 no.2 / pp.112-118 / 2009
  • This paper proposes a rapid algorithm for detecting the salient region in a music video browsing system, which can be applied to mobile devices and digital video recorders (DVRs). The input music video is decomposed into music and video tracks. For the music track, the music highlight, including the musical chorus, is detected through structure analysis using energy-based peak position detection. Using emotional models generated by the SVM-AdaBoost learning algorithm, the music signal of each music video is automatically classified into one of the predefined emotional classes. For the video track, the face scene containing the singer or actor/actress is detected using a boosted cascade of simple features. Finally, the salient region is generated by aligning the boundaries of the music highlight and the visual face scene. Users first select their favorite music videos from those on the mobile device or DVR, guided by each video's emotion information, and can then quickly browse the 30-second salient region produced by the proposed algorithm. A mean opinion score (MOS) test with a database of 200 music videos was conducted to compare the detected salient region with a predefined manual part. The MOS test results show that the salient region detected by the proposed method performed much better than the predefined manual part without audiovisual processing.
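
The abstract names energy-based peak position detection without giving details. As a rough sketch under stated assumptions, the code below computes the energy of non-overlapping frames, marks frames that exceed both neighbors as peaks, and places a fixed-length window around the strongest peak. The function names, the framing scheme, and the toy signal are hypothetical; the paper's structure analysis and its alignment with face scenes are not reproduced.

```python
def frame_energies(signal, frame_len):
    """Sum of squared samples for each non-overlapping frame."""
    return [sum(s * s for s in signal[i:i + frame_len])
            for i in range(0, len(signal) - frame_len + 1, frame_len)]


def energy_peaks(energies):
    """Indices of frames whose energy exceeds both neighboring frames."""
    return [i for i in range(1, len(energies) - 1)
            if energies[i] > energies[i - 1] and energies[i] > energies[i + 1]]


def highlight_window(peak_frame, n_frames, window_frames):
    """Window of up to window_frames frames around a peak, kept inside the track."""
    start = max(0, min(peak_frame - window_frames // 2, n_frames - window_frames))
    return start, min(start + window_frames, n_frames)


# Toy signal: a loud passage, a quiet gap, then a moderately loud passage.
signal = [0.1] * 4 + [1.0] * 4 + [0.1] * 4 + [0.5] * 4 + [0.1] * 4
energies = frame_energies(signal, frame_len=4)
peaks = energy_peaks(energies)
strongest = max(peaks, key=lambda i: energies[i])
window = highlight_window(strongest, len(energies), window_frames=3)
```

For real audio, the frame length and window length would be derived from the sample rate, for example so that the window spans the 30 seconds mentioned in the abstract.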

Development of the Holocene Sediments in Gamak Bay of the South Sea, Korea (남해 가막만의 현생퇴적층 발달특성)

  • Kim, So Ra;Lee, Gwang Soo;Choi, Dong Lim;Kim, Dae Choul;Lee, Tae Hee;Seo, Young Kyo
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY / v.19 no.2 / pp.131-146 / 2014
  • High-resolution seismic profiles coupled with sediment sampling were analyzed to investigate the acoustic characteristics and distribution patterns of the late Holocene sediments in Gamak Bay of the South Sea, Korea. The mean grain size of the surficial sediment is around $6.3{\sim}9.7{\Phi}$. Sediments in the bay consist of silt and clay, with a progressive decrease toward the inner bay. The seismic sedimentary sequence overlying the acoustic basement can be divided into two sedimentary units (GB I and II) by a prominent mid-reflector (Maximum Flooding Surface; MFS). The acoustic basement occurs at depths between 20 m and 40 m below sea level and deepens gradually southward. GB I, which mostly occupies the channel fill, is characterized by reflection-free seismic facies. It is considered to have formed as the late Transgressive System Tract (TST) and is interpreted as tidal environment deposits. The MFS appears at depths of about 15~28 m below sea level and is well defined by even and continuous reflectors on the seismic profile. GB II, overlying the MFS, is acoustically transparent to semitransparent with parallel internal reflectors. GB II is interpreted as the Highstand System Tract (HST), probably deposited during the last 6,000 yrs when the sea level was close to the present level. In particular, it is thought that GB II was subdivided into two layers (GB II-a and II-b) by an HST-reflector and that this division is related to wind, sea water flux, and tidal current.