• Title/Summary/Keyword: 주파수변환 (frequency transform)


Entropy-Based 6 Degrees of Freedom Extraction for the W-band Synthetic Aperture Radar Image Reconstruction (W-band Synthetic Aperture Radar 영상 복원을 위한 엔트로피 기반의 6 Degrees of Freedom 추출)

  • Hyokbeen Lee;Duk-jin Kim;Junwoo Kim;Juyoung Song
    • Korean Journal of Remote Sensing / v.39 no.6_1 / pp.1245-1254 / 2023
  • Significant research has been conducted on W-band synthetic aperture radar (SAR) systems that use 77 GHz frequency-modulated continuous wave (FMCW) radar. To reconstruct a high-resolution W-band SAR image, the point cloud acquired from stereo cameras or LiDAR must be transformed with 6 degrees of freedom (DOF) and applied to the SAR signal processing. However, images acquired from different sensors have different geometric structures, which makes them difficult to match. In this study, we present a method that extracts an optimized depth map by obtaining the 6 DOF of the point cloud with a gradient descent method based on the entropy of the SAR image. An experiment was conducted to reconstruct a tree, a major object in road environments, using the constructed W-band SAR system. Compared with SAR images reconstructed from radar coordinates, the SAR image reconstructed using the entropy-based gradient descent method showed a decrease of 53.2828 in mean square error and an increase of 0.5529 in the structural similarity index.
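
The abstract above does not define the entropy it minimizes. A minimal sketch, assuming the common choice of Shannon entropy over the normalized pixel power of the image, is given here; the function name `image_entropy` and the toy images are illustrative and not taken from the paper. A well-focused image concentrates energy in few pixels and therefore scores lower than a smeared one, which is what makes entropy usable as a cost for gradient descent over the 6 DOF.

```python
import numpy as np

def image_entropy(image):
    """Shannon entropy of the normalized pixel power of a real or complex image.

    Assumed definition for illustration; the paper's exact cost may differ.
    """
    power = np.abs(np.asarray(image)) ** 2
    total = power.sum()
    if total == 0:
        return 0.0
    p = power.ravel() / total
    p = p[p > 0]  # skip empty pixels to avoid log(0)
    return float(-(p * np.log(p)).sum())

# Toy comparison: a single bright pixel versus a uniformly smeared image.
focused = np.zeros((8, 8))
focused[4, 4] = 1.0
blurred = np.ones((8, 8))
print(image_entropy(focused), image_entropy(blurred))
```

In an optimization loop, this value would be evaluated on the SAR image reconstructed for each candidate set of 6 DOF parameters, and the parameters would be stepped in the direction that lowers it.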

Study on Analysis of Queen Bee Sound Patterns (여왕벌 사운드 패턴 분석에 대한 연구)

  • Kim Joon Ho;Han Wook
    • The Journal of the Convergence on Culture Technology / v.9 no.5 / pp.867-874 / 2023
  • Recently, rapid climate change has been causing many problems in the bee ecosystem. The decline in the bee population and changes in the flowering period are having a huge impact on beekeepers' harvests. Since it is impossible to continuously observe the inside of a hive with the naked eye, most beekeepers rely on experience-based knowledge about the state of the hive. Interest is therefore focused on smart beekeeping that incorporates IoT technology. In particular, for swarming, one of the most important aspects of beekeeping, it is known empirically that the swarming time can be determined from the sound of the queen bee, but there has been no way to analyze this systematically with data. Simply recording and analyzing the queen bee's sound does not solve problems such as noise around the hive and the inability to record continuously. In this study, we developed a system that records queen bee sounds in a real-time cloud system and analyzes sound patterns. After receiving real-time analog sound from the hive through multiple channels and converting it to digital, we discovered a sound pattern that was continuously output in the queen bee sound frequency band. By accessing the cloud system, users can monitor sounds around the hive, temperature and humidity inside the hive, weight, and internal movement data. The system developed in this study made it possible to analyze the sound patterns of the queen bee and learn about the situation inside the hive. This should make it possible to predict the swarming period of bees or to provide information for controlling it.

Spontaneous Speech Emotion Recognition Based On Spectrogram With Convolutional Neural Network (CNN 기반 스펙트로그램을 이용한 자유발화 음성감정인식)

  • Guiyoung Son;Soonil Kwon
    • The Transactions of the Korea Information Processing Society / v.13 no.6 / pp.284-290 / 2024
  • Speech emotion recognition (SER) is a technique that analyzes a speaker's voice patterns, including vibration, intensity, and tone, to determine their emotional state. Interest in artificial intelligence (AI) techniques has grown, and they are now widely used in medicine, education, industry, and the military. However, most existing studies have attained impressive results using acted speech recorded by skilled actors in controlled environments for various scenarios. There is a mismatch between acted and spontaneous speech, since acted speech includes more explicit emotional expressions than spontaneous speech. For this reason, spontaneous speech emotion recognition remains a challenging task. This paper aims to conduct emotion recognition and improve performance using spontaneous speech data. To this end, we implement deep learning-based speech emotion recognition using the VGG (Visual Geometry Group) network after converting 1-dimensional audio signals into a 2-dimensional spectrogram image. The experimental evaluations are performed on the Korean spontaneous emotional speech database from AI-Hub, consisting of 7 emotions, i.e., joy, love, anger, fear, sadness, surprise, and neutral. Using the 2-dimensional time-frequency spectrogram, we achieved an average accuracy of 83.5% for adults and 73.0% for young people. In conclusion, our findings demonstrated that the suggested framework outperformed current state-of-the-art techniques for spontaneous speech and showed promising performance despite the difficulty of quantifying emotional expression in spontaneous speech.
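
The step of converting a 1-dimensional audio signal into a 2-dimensional spectrogram can be sketched with a short-time Fourier transform. This is a minimal illustration under assumed settings: the frame length, hop size, Hann window, and the function name `spectrogram` are not specified in the abstract, and the paper's actual preprocessing may differ.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Log-power spectrogram with shape (frequency bins, time frames).

    Assumes len(signal) >= frame_len; parameter values are illustrative.
    """
    signal = np.asarray(signal, dtype=float)
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return 10.0 * np.log10(power.T + 1e-12)  # small offset avoids log(0)

# One second of a 1 kHz tone sampled at 8 kHz as a stand-in for speech.
sr = 8000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 1000 * t)
S = spectrogram(tone)
print(S.shape)
```

The resulting 2-D array can be treated as a single-channel image and resized or stacked as needed before being passed to a convolutional network such as VGG.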

Estimation of Structural Properties from the Measurements of Phase Velocity and Attenuation Coefficient in Trabecular Bone (해면질골에서 위상속도 및 감쇠계수 측정에 의한 구조적 특성 평가)

  • Lee, Kang-Il
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.661-667 / 2009
  • Trabecular-bone-mimicking phantoms consisting of parallel-nylon-wire arrays were used to investigate correlations of phase velocity and attenuation coefficient with structural properties in trabecular bone. Trabecular separation (Tb.Sp) of the 7 trabecular-bone-mimicking phantoms ranged from 300 to $900\;{\mu}m$ and volume fraction (VF) from 1.6% to 8.7%. Phase velocity and attenuation coefficient of the phantoms were measured using a through-transmission method in water, with a matched pair of broadband unfocused transducers with a diameter of 12.7 mm and a center frequency of 1 MHz. Phase velocity and attenuation coefficient at 1 MHz decreased almost linearly with increasing Tb.Sp and increased almost linearly with increasing VF. The simple and multiple linear regression models with phase velocity and attenuation coefficient as independent variables and Tb.Sp and VF as dependent variables demonstrated that the coefficients of determination for the prediction of VF were higher than those for the prediction of Tb.Sp. The results obtained in the trabecular-bone-mimicking phantoms were consistent with those in human trabecular bone, suggesting that the structural properties can be estimated from measurements of phase velocity and attenuation coefficient in trabecular bone.

Selectively Partial Encryption of Images in Wavelet Domain (웨이블릿 영역에서의 선택적 부분 영상 암호화)

  • ;Dujit Dey
    • The Journal of Korean Institute of Communications and Information Sciences / v.28 no.6C / pp.648-658 / 2003
  • As the use of image and video content increases, security becomes a concern for paid image data and for data requiring confidentiality. This paper proposes an image encryption methodology to hide image information. Its target data is the result of quantization in the wavelet domain. The method encrypts only part of the image data rather than the whole of the original image, and involves three types of data selection. First, using the fact that the wavelet transform decomposes the original image into frequency sub-bands, only some of the sub-bands are included in encryption to make the resulting image unrecognizable. Second, in the data representing each pixel, only the MSBs are taken for encryption. Finally, the pixels to be encrypted in a specific sub-band are selected randomly using an LFSR (Linear Feedback Shift Register). Part of the encryption key is used as the seed value of the LFSR and to select the parallel output bits of the LFSR for random selection, which increases the strength of the encryption algorithm. Experiments with a software implementation of the proposed methods on about 500 images showed that encrypting only about 1/1000 of the original image data is enough to make the original image unrecognizable. The proposed methods are therefore efficient image encryption methods that achieve a high encryption effect with a small amount of encryption. In addition, this paper proposes several encryption schemes that differ in the selection of sub-bands and in the number of LFSR output bits used for pixel selection, and shows that there exists a trade-off between execution time and encryption effect. This means that the proposed methods can be used selectively according to the application area. Also, because the proposed methods are performed in the application layer, they are expected to be a good solution to the end-to-end security problem, which is emerging as an important issue in networks with both wired and wireless sections.
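
The LFSR-driven pixel selection described above can be sketched as follows. This is a hypothetical illustration, not the paper's scheme: the 16-bit register with taps 16, 14, 13, 11, the selection rule, the 8-bit coefficient width, and the names `lfsr16` and `toggle_selected_msbs` are all assumptions, and flipping the MSB stands in for whatever cipher the authors actually apply to the selected bits.

```python
def lfsr16(seed):
    """16-bit Fibonacci LFSR with taps 16, 14, 13, 11; yields successive states."""
    state = seed & 0xFFFF
    if state == 0:
        raise ValueError("seed must be non-zero")
    while True:
        bit = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
        state = (state >> 1) | (bit << 15)
        yield state

def toggle_selected_msbs(coeffs, seed, select_bits=2, msb=7):
    """Flip the MSB of coefficients picked by the LFSR.

    A coefficient is selected when the low `select_bits` bits of the current
    LFSR state are all zero. Assumes non-negative integer coefficients that
    fit in msb + 1 bits. Applying the function twice with the same seed
    restores the input.
    """
    gen = lfsr16(seed)
    mask = (1 << select_bits) - 1
    out = list(coeffs)
    for i in range(len(out)):
        if next(gen) & mask == 0:
            out[i] ^= 1 << msb
    return out

subband = list(range(64))  # stand-in for quantized wavelet coefficients
scrambled = toggle_selected_msbs(subband, seed=0xACE1)
restored = toggle_selected_msbs(scrambled, seed=0xACE1)
print(restored == subband)
```

Raising `select_bits` selects fewer coefficients, which mirrors the trade-off the abstract mentions between execution time and encryption effect.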

Digital Hologram Compression Technique By Hybrid Video Coding (하이브리드 비디오 코팅에 의한 디지털 홀로그램 압축기술)

  • Seo, Young-Ho;Choi, Hyun-Jun;Kang, Hoon-Jong;Lee, Seung-Hyun;Kim, Dong-Wook
    • Journal of the Institute of Electronics Engineers of Korea SP / v.42 no.5 s.305 / pp.29-40 / 2005
  • As the base of digital holography has expanded, discussion of compression technology is expected, since an international standard defining compression techniques for 3D images and video has been progressing in the form of 3DAV, a part of MPEG. As the case of 3DAV shows, the coding technique is highly likely to take a hybrid form that merges, refines, or mixes various previous techniques. Therefore, we present the relationship between various image/video coding techniques and digital holograms. In this paper, we propose an efficient coding method for digital holograms using standard compression tools for video and images. First, we convert fringe patterns into video data using the principle of CGH (Computer Generated Hologram), and then encode the data. The proposed hybrid compression algorithm is made up of several methods: pre-processing for the transform, local segmentation with global information of the object image, frequency transform for coding, scanning to turn the fringe into a video stream, classification of coefficients, and hybrid video coding. The tool for still image coding is JPEG2000, and the tools for video coding include international compression standards such as MPEG-2, MPEG-4, and H.264, together with various lossless compression algorithms. The proposed algorithm showed better reconstruction properties than previous research at compression rates four to eight times higher. We therefore expect the proposed technique to serve as a useful preliminary study for digital hologram coding.

Crosshole EM 2.5D Modeling by the Extended Born Approximation (확장된 Born 근사에 의한 시추공간 전자탐사 2.5차원 모델링)

  • Cho, In-Ky;Suh, Jung-Hee
    • Geophysics and Geophysical Exploration / v.1 no.2 / pp.127-135 / 1998
  • The Born approximation is widely used for solving complex scattering problems in electromagnetics. Approximating the total internal electric field by the background field is reasonable for small material contrasts, as long as the scatterer is not too large and the frequency is not too high. However, in many geophysical applications, moderate and high conductivity contrasts cause both the real and imaginary parts of the internal electric field to differ greatly from the background field. In the extended Born approximation, which can dramatically improve the accuracy of the Born approximation, the total electric field in the integral over the scattering volume is approximated by the background electric field projected onto a depolarization tensor. The finite difference and finite element methods are usually used in EM scattering problems with a 2D model and a 3D source, because of their capability for simulating complex subsurface conductivity distributions. The price paid for a 3D source is that many wavenumber-domain solutions and their inverse Fourier transform must be computed. In these differential equation methods, the whole area, including homogeneous regions, must be discretized, which increases the number of nodes and the matrix size. The differential equation methods therefore need a lot of computing time and large memory. In this study, an EM modeling program for a 2D model and a 3D source is developed, based on the extended Born approximation. The solution is very fast and stable. Using the program, crosshole EM responses with a vertical magnetic dipole source are obtained and compared with 3D integral equation solutions. The agreement between the integral equation solution and the extended Born approximation is remarkable over the entire frequency range, but degrades as the conductivity contrast between the anomalous body and the background medium increases. The extended Born approximation is accurate when the conductivity contrast is lower than 1:10. Therefore, the location and conductivity of the anomalous body can be estimated effectively by the extended Born approximation, although quantitative estimation of conductivity is difficult when the conductivity contrast is too high.
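
The difference between the two approximations described above can be written compactly. The notation below is an assumption, following the usual integral-equation form rather than the paper itself: $\mathbf{E}_b$ is the background field, $\mathbf{G}$ the Green's tensor of the background medium (with constant factors absorbed), $\Delta\sigma$ the anomalous conductivity, and $\boldsymbol{\Gamma}$ the depolarization tensor.

```latex
\begin{align}
\mathbf{E}(\mathbf{r}) &= \mathbf{E}_b(\mathbf{r})
  + \int_V \mathbf{G}(\mathbf{r},\mathbf{r}')\,\Delta\sigma(\mathbf{r}')\,
    \mathbf{E}(\mathbf{r}')\,dv' \\
\text{Born:}\quad \mathbf{E}(\mathbf{r}') &\approx \mathbf{E}_b(\mathbf{r}') \\
\text{Extended Born:}\quad \mathbf{E}(\mathbf{r}') &\approx
  \boldsymbol{\Gamma}(\mathbf{r}')\,\mathbf{E}_b(\mathbf{r}'), \qquad
  \boldsymbol{\Gamma}(\mathbf{r}') =
  \Big[\mathbf{I} - \int_V \mathbf{G}(\mathbf{r}',\mathbf{r}'')\,
    \Delta\sigma(\mathbf{r}'')\,dv''\Big]^{-1}
\end{align}
```

In both cases the unknown field inside the integral is replaced by a known quantity, so no large system has to be solved; the extended form differs only in the local correction $\boldsymbol{\Gamma}$ applied to the background field.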


A 10b 200MS/s 75.6mW $0.76mm^2$ 65nm CMOS Pipeline ADC for HDTV Applications (HDTV 응용을 위한 10비트 200MS/s 75.6mW $0.76mm^2$ 65nm CMOS 파이프라인 A/D 변환기)

  • Park, Beom-Soo;Kim, Young-Ju;Park, Seung-Jae;Lee, Seung-Hoon
    • Journal of the Institute of Electronics Engineers of Korea SD / v.46 no.3 / pp.60-68 / 2009
  • This work proposes a 10b 200MS/s 65nm CMOS ADC for high-definition video systems such as HDTV, which require high resolution and fast operating speed simultaneously. The proposed ADC employs a four-step pipeline architecture to minimize power consumption and chip area. The input SHA based on four capacitors reduces the output signal range from $1.4V_{p-p}$ to $1.0V_{p-p}$, considering high input signal levels at a low supply voltage of 1.2V. The proposed three-stage amplifiers in the input SHA and MDAC1 overcome the low output resistance problem commonly observed in a 65nm CMOS process. The proposed multipath frequency-compensation technique enables the conventional RNMC-based three-stage amplifiers to achieve stable operation at a high sampling rate of 200MS/s. The conventional switched-bias power-reduction technique in the sub-ranging flash ADCs further reduces power consumption, while the reference generator integrated on chip with optional off-chip reference voltages allows versatile system applications. The prototype ADC in a 65nm CMOS technology demonstrates a measured DNL and INL within 0.19LSB and 0.61LSB, respectively. The ADC shows a maximum SNDR of 54.8dB and 52.4dB and a maximum SFDR of 72.9dB and 64.8dB at 150MS/s and 200MS/s, respectively. The proposed ADC occupies an active die area of $0.76mm^2$ and consumes 75.6mW at a 1.2V supply voltage.
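
As a rough way to read the reported SNDR, the standard ideal-quantizer relation ENOB = (SNDR - 1.76) / 6.02 converts it to an effective number of bits. The abstract does not report ENOB, so the sketch below is illustrative arithmetic only, and the function name `enob` is an assumption.

```python
def enob(sndr_db):
    """Effective number of bits from SNDR in dB (ideal-quantizer relation)."""
    return (sndr_db - 1.76) / 6.02

# SNDR reported in the abstract above at 200MS/s.
print(round(enob(52.4), 2))
```

For the 52.4dB figure this works out to roughly 8.4 effective bits from the 10b converter.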

A Non-Calibrated 2x Interleaved 10b 120MS/s Pipeline SAR ADC with Minimized Channel Offset Mismatch (보정기법 없이 채널 간 오프셋 부정합을 최소화한 2x Interleaved 10비트 120MS/s 파이프라인 SAR ADC)

  • Cho, Young-Sae;Shim, Hyun-Sun;Lee, Seung-Hoon
    • Journal of the Institute of Electronics and Information Engineers / v.52 no.9 / pp.63-73 / 2015
  • This work proposes a 2-channel time-interleaved (T-I) 10b 120MS/s pipeline SAR ADC that minimizes offset mismatch between channels without any calibration scheme. The proposed ADC employs a 2-channel SAR and T-I topology based on a 2-step pipeline ADC, with 4b and 7b in the first and second stages, for a high conversion rate and low power consumption. Analog circuits such as the comparator and residue amplifier are shared between channels to minimize power consumption, chip area, and the offset mismatch that limits ADC linearity in the conventional T-I architecture, without any calibration scheme. A TSPC D flip-flop with a short propagation delay and a small number of transistors is used in the SAR logic instead of the conventional static D flip-flop to achieve high-speed SAR operation as well as low power consumption and small chip area. Three separate reference voltage drivers for the 4b SAR, the 7b SAR circuits, and a single residue amplifier prevent undesirable disturbance among the reference voltages caused by their different switching operations and minimize gain mismatch between channels. High-frequency clocks with a controllable duty cycle are generated on chip to eliminate the need for complicated external high-frequency clocks for SAR operation. The prototype ADC in a 45nm CMOS technology demonstrates a measured DNL and INL within 0.69LSB and 0.77LSB, with a maximum SNDR and SFDR of 50.9dB and 59.7dB at 120MS/s, respectively. The proposed ADC occupies an active die area of $0.36mm^2$ and consumes 8.8mW at a 1.1V supply voltage.

Quantitative Electroencephalogram Markers for Predicting Cerebral Amyloid Pathology in Non-Demented Older Individuals With Depression: A Preliminary Study (비치매 노인 우울증 환자에서 대뇌 아밀로이드 병리 예측을 위한 정량화 뇌파 지표: 예비연구)

  • Park, Seon Young;Chae, Soohyun;Park, Jinsick;Lee, Dong Young;Park, Jee Eun
    • Sleep Medicine and Psychophysiology / v.28 no.2 / pp.78-85 / 2021
  • Objectives: When elderly patients show depressive symptoms, discriminating between depressive disorder and the prodromal phase of Alzheimer's disease is important. We tested whether a quantitative electroencephalogram (qEEG) marker was associated with cerebral amyloid-β (Aβ) deposition in older adults with depression. Methods: Non-demented older individuals (≥ 55 years) diagnosed with depression were included in the analyses (n = 63; 76.2% female; mean age ± standard deviation 73.7 ± 6.87 years). The participants were divided into Aβ+ (n = 32) and Aβ- (n = 31) groups based on amyloid PET assessment. EEG was recorded during a 7 min eye-closed (EC) phase and a 3 min eye-open (EO) phase, and all EEG data were analyzed using Fourier transform spectral analysis. We tested interaction effects among Aβ positivity, condition (EC vs. EO), laterality (left, midline, or right), and polarity (frontal, central, or posterior) for EEG alpha band power. Then, the EC-to-EO alpha reactivity index (ARI) was examined as a neurophysiological marker for predicting Aβ+ in depressed older adults. Results: The mean power spectral density of the alpha band in the EO phase showed a significant difference between the Aβ+ and Aβ- groups (F = 6.258, p = 0.015). A significant 3-way interaction was observed among Aβ positivity, condition, and laterality on alpha-band power after adjusting for age, sex, educational years, global cognitive function, medication use, and white matter hyperintensities on MRI (F = 3.720, p = 0.030). However, post-hoc analyses showed no significant difference in ARI according to Aβ status in any region of interest. Conclusion: Among older adults with depression, increased alpha band power in the EO phase was associated with Aβ positivity. However, the EC-to-EO ARI was not confirmed as a predictor of Aβ+ in depressed older individuals. Future studies with larger samples are needed to confirm our results.
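
The Fourier-based alpha band power and the EC-to-EO reactivity index can be sketched as below. Several details are assumptions not stated in the abstract: the 8 to 12 Hz band limits, the plain periodogram estimate, the ARI definition (EC - EO) / EC, and the names `band_power` and `alpha_reactivity`. The sine waves are synthetic stand-ins for EEG, not study data.

```python
import numpy as np

def band_power(signal, fs, f_lo=8.0, f_hi=12.0):
    """Periodogram power summed over [f_lo, f_hi] Hz, up to a constant scale."""
    x = np.asarray(signal, dtype=float)
    x = x - x.mean()  # remove DC offset
    psd = np.abs(np.fft.rfft(x)) ** 2 / (fs * len(x))
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    return float(psd[band].sum() * (freqs[1] - freqs[0]))

def alpha_reactivity(ec, eo, fs):
    """Assumed ARI: relative drop in alpha power from eyes-closed to eyes-open."""
    p_ec = band_power(ec, fs)
    p_eo = band_power(eo, fs)
    return (p_ec - p_eo) / p_ec

# Synthetic example: a 10 Hz rhythm whose amplitude halves when eyes open.
fs = 250
t = np.arange(1000) / fs
ec = 2.0 * np.sin(2 * np.pi * 10 * t)
eo = 1.0 * np.sin(2 * np.pi * 10 * t)
print(alpha_reactivity(ec, eo, fs))
```

Because power scales with amplitude squared, halving the amplitude removes three quarters of the alpha power in this toy case.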