Search | Korea Science

Adaptive Kernel Function of SVM for Improving Speech/Music Classification of 3GPP2 SMV

Lim, Chung-Soo;Chang, Joon-Hyuk
- ETRI Journal
- /
- v.33 no.6
- /
- pp.871-879
- /
- 2011
Because a wide variety of multimedia services are provided through personal wireless communication devices, the demand for efficient bandwidth utilization becomes stronger. This demand naturally results in the introduction of the variable bitrate speech coding concept. One exemplary work is the selectable mode vocoder (SMV) that supports speech/music classification. However, because it has severe limitations in its classification performance, a couple of works to improve speech/music classification by introducing support vector machines (SVMs) have been proposed. While these approaches significantly improved classification accuracy, they did not consider correlations commonly found in speech and music frames. In this paper, we propose a novel and orthogonal approach to improve the speech/music classification of SMV codec by adaptively tuning SVMs based on interframe correlations. According to the experimental results, the proposed algorithm yields improved results in classifying speech and music within the SMV framework.
https://doi.org/10.4218/etrij.11.0110.0780 인용 PDF KSCI

영상압축 : Digital Image Compression

Kim, Gyeong-Seop
- Korean Journal of Digital Imaging in Medicine
- /
- v.4 no.1
- /
- pp.166-180
- /
- 1998
$\cdot$ 영상 압축은 영상의 통계학적 분포, 반복성을 이용하여 빈도가 높은 데이터는 적은 수의 bits를, 빈도가 낮은 데이터에는 보다 많은 수의 bits를 할당하여 전체 영상을 나타내는 bits 수를 줄이는 것임. $\cdot$ 영상 압축은 크게 Lossy Coding, Lossless Coding으로 나뉘며, Lossy coding은 DCT, 양자화기, VLC Codes를 쓰며 압축 율은 높으나 원래의 영상을 정확히 복원하지 못함. $\cdot$ 영상 압축에 대한 국제 규격 협회는 JPEG, MPEG I, MPEG II, MPEG IV, H.261, H.263 등이 있으나 본 seminar에서는 JPEG 규격만 논함. $\cdot$ 의학 영상은 Resolution이 크고 study 단위로 관리되기 때문에 영상 데이터량이 많으나 진단의 목적으로 쓰이기 때문에 주로 lossless 압축을 쓰게 되나 압축율이 낮음.(3:1 이하). 최근에는 Fractal, Wavelet Coding을 통한 압축율을 증가 시키는 Image Compression Algorithms이 활용됨. $\cdot$ MPEG은 동영상의 압축 표준안이며, 동영상은 한frame 당 25개 이상의 정지 화상으로 이루어지기 때문에 JPEG 규격에서 사용되었던 기법이 그대로 활용되며 영상과 영상간, 또는 frame과 frame 간의 여상의 변화, 움직임을 Vector로 coding하는 interframe Coding 기법을 활용하나 설명하기에는 광범위한 topic이므로 본 seminar에서는 생략함.
PDF

System Design and Realization for Real Time DVR System with Robust Video Watermarking (강인한 비디오 워터마킹을 적용한 실시간 DVR 시스템의 설계 구현)

Ryu Kwang-Ryol;Kim Ja-Hwan
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.6
- /
- pp.1019-1024
- /
- 2006
A system design and realization for real time DVR system with robust video watermarking algorithm for contents security is presented in this paper. The robust video watermarking is used the intraframe space region and interframe insertion simultaneously, and to be processed at real time on image data and algorithm is used the 64bits special purpose quad DSP processor with assembly and soft pipeline codes. The experimental result shows that the processing time takes about 2.5ms in the D1 image per frame for 60% moving image.
PDF KSCI

Effect of Input Data Video Interval and Input Data Image Similarity on Learning Accuracy in 3D-CNN

Kim, Heeil;Chung, Yeongjee
- International Journal of Internet, Broadcasting and Communication
- /
- v.13 no.2
- /
- pp.208-217
- /
- 2021
3D-CNN is one of the deep learning techniques for learning time series data. However, these three-dimensional learning can generate many parameters, requiring high performance or having a significant impact on learning speed. We will use these 3D-CNNs to learn hand gesture and find the parameters that showed the highest accuracy, and then analyze how the accuracy of 3D-CNN varies through input data changes without any structural changes in 3D-CNN. First, choose the interval of the input data. This adjusts the ratio of the stop interval to the gesture interval. Secondly, the corresponding interframe mean value is obtained by measuring and normalizing the similarity of images through interclass 2D cross correlation analysis. This experiment demonstrates that changes in input data affect learning accuracy without structural changes in 3D-CNN. In this paper, we proposed two methods for changing input data. Experimental results show that input data can affect the accuracy of the model.
https://doi.org/10.7236/IJIBC.2021.13.2.208 인용 PDF KSCI

Interframe interpolation technique based on variable skip rate (가변 스킵율 기반의 프레임간 보간 기법)

Kim, Dong-wook;Choi, Yeon-sung
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.25 no.3B
- /
- pp.510-518
- /
- 2000
A new video interpolation technique based on variable skip rate of video sequence is proposed in this paper. in the proposed technique. the determination whether a frame is skipped or not is done by the degree of motion complexity of the frame. If the motion complexity of a frame is low the frame is skipped. otherwise it is coded and transmitted. To determine the motion complexity of a frame a new technique using MEF (moving edge in frame),the set of pixels considered as moving edges in a frame. is introduced. In the course of decoding and interpolating of receiver., the motion field is segmented. For the purpose of dividing vector field morphological filtering is applied. Morphological filtering also used to smooth the boundaries between the changed and unchanged region. In the simulation results, the proposed technique shows higher quality and lower fluctuation of picture quality than the conventional techniques on conditioning of the same bit rate.
PDF

Geometric distortion correction of fluorescein ocular fundus photographs (형광 안저 사진의 기하 왜곡 교정)

권갑현;하영호;김수중
- Progress in Medical Physics
- /
- v.2 no.2
- /
- pp.183-192
- /
- 1991
Ophthalmoscopy following the intravenous injection of fluorescein has gained great diagnostic importance in ophthalmology. This technique provides sequential evaluation of the anatomic and physiologic status of the choroidal and retinal vasculature. In order to detect the changes between fluorescein ocular fundus image frames, the direct subtraction of the two frames is inadequate because of geometric distortions and background gray level differences in two images. In this study, a scheme for the correction of the geometric distortions is proposed. Precise control point coordinate values for transformation functions are manually determined after the process including a series of blood vessel detection and thinning, and one frame is mapped to another, and then a geometric distortion corrected image is obtained. When the corrected image is used in interframe change detections, a sucessful result is ensured.
PDF

Pareto Optimized EDCA Parameter Control for Wireless Local Area Networks

Kim, Minseok;Oh, Wui Hwan;Chung, Jong-Moon;Lee, Bong Gyou;Seo, Myunghwan;Kim, Jung-Sik;Cho, Hyung-Weon
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.10
- /
- pp.3458-3474
- /
- 2014
The performance of IEEE 802.11e enhanced distributed channel access (EDCA) is influenced by several interactive parameters that make quality of service (QoS) control complex and difficult. In EDCA, the most critical performance influencing parameters are the arbitration interframe space (AIFS) and contention window size (CW) of each access category (AC). The objective of this paper is to provide a scheme for parameter control such that the throughput per station as well as the overall system throughput of the network is maximized and controllable. For this purpose, a simple and accurate analytical model describing the throughput behavior of EDCA networks is presented in this paper. Based on this model, the paper further provides a scheme in which a Pareto optimal system configuration is obtained via an appropriate CW control for a given AIFS value, which is a different approach compared to relevant papers in the literature that deal with CW control only. The simulation results confirm the effectiveness of the proposed method which shows significant performance improvements compared to other existing algorithms.
https://doi.org/10.3837/tiis.2014.10.010 인용 PDF KSCI KPUBS HTML

Two-stage variable block-size multiresolution motion estiation in the wavelet transform domain (웨이브렛 변환영역에서의 2단계 가변 블록 다해상도 움직임 추정)

김성만;이규원;정학진;박규태
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.22 no.7
- /
- pp.1487-1504
- /
- 1997
In this paper, the two-stage variable block-size multiresolution motion algorithm is proposed for an interframe coding scheme in the wavelet decomposition. An optimal bit allocagion between motion vectors and the prediction error in sense of minimizing the total bit rate is obtained by the proposed algorithm. The proposed algorithm consists of two stages for motion estimatation and only the first stage can be separated and run on its own. The first stage of the algorithm introduces a new method to give the lower bit rate of the displaced frame difference as well as a smooth motion field. In the second stage of the algorithm, the technique is introduced to have more accurate motion vectors in detailed areas, and to decrease the number of motion vectors in uniform areas. The algorithm aims at minimizin gthe total bit rate which is sum of the motion vectors and the displaced frame difference. The optimal bit allocation between motion vectors and displaced frame difference is accomplished by reducing the number of motion vectors in uniform areas and it is based on a botom-up construction of a quadtree. An entropy criterion aims at the control of merge operation. Simulation resuls show that the algorithm lends itself to the wavelet based image sequence coding and outperforms the conventional scheme by up to the maximum 0.28 bpp.
PDF

Scalable Video Coding with Low Complex Wavelet Transform (공간 웨이블릿 변환의 복잡도를 줄인 스케일러블 비디오 부호화에 관한 연구)

Park Seong-Ho;Jeong Se-Yoon;Kim Won-Ha
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.42 no.3 s.303
- /
- pp.53-62
- /
- 2005
In the decoding process of interframe Wavelet coding, the Wavelet transform requires huge computational complexity. Since the decoder may need to be used in various devices such as PDAs, notebooks, or PC, the decoder's complexity should be adapted to the processor's computational power. So, it is natural that the low complexity codec is also required for scalable video coding. In this paper, we develop a method of controlling and lowering the complexity of the spatial Wavelet transform while sustaining the same coding efficiency as the conventional spatial Wavelet transform. In addition, the proposed method may alleviate the ringing effect for slowly changing image sequences.
PDF KSCI

An Efficient Compression Algorithm for Simple Computer Cell Animation (단순 컴퓨터 셀 애니메이션 영상에 효율적인 압축 알고리듬)

민병석;정제창;최병욱
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.27 no.3A
- /
- pp.211-220
- /
- 2002
In this paper, we propose an efficient algorithm to compress simple computer cell animation at very low bit rate. The structure of proposed algorithm consists of intra frame coding and inter frame coding. In inter frame coding, animation is encoded by color quantization using a palette, rearrangement of index, ADPCM used in JPEG-LS, mapping, classification, and entropy coding. In interframe coding, classifying the characteristics of motion, animation is encoded by block based motion replenishment. Experimental results show that the proposed methods turns out to outperform conventional methods including Flash, FLC, Motion-JPEG, MPEG-1, and MPEG-4.
PDF KSCI

Search Result 69, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)