Search | Korea Science

Saliency Detection Using Entropy Weight and Weber's Law (엔트로피 가중치와 웨버 법칙을 이용한 세일리언시 검출)

Lee, Ho Sang; Moon, Sang Whan; Eom, Il Kyu
- Journal of the Institute of Electronics and Information Engineers
- v.54 no.1
- pp.88-95
- 2017
In this paper, we present a saliency detection method using entropy weights and Weber contrast in the wavelet transform domain. Our method builds on the commonly used conventional algorithms, which combine a local bottom-up approach with a global top-down approach. First, we perform a multi-level wavelet transform on CIE Lab color images and obtain the global saliency by adding the local Weber contrasts to the corresponding low-frequency wavelet coefficients. Next, the local saliency is obtained by applying a Gaussian filter weighted by the entropy of the wavelet high-frequency subbands. The final saliency map is obtained by nonlinearly combining the local and global saliencies. To evaluate the proposed method, we perform computer simulations on two image databases. The simulation results show that the proposed method outperforms the conventional algorithms.
https://doi.org/10.5573/ieie.2017.54.1.088
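
The abstract does not give the contrast formula, so the following is only a minimal sketch of the textbook Weber contrast, |I - I_b| / I_b, computed on a plain intensity grid. The function name `weber_contrast`, the `radius` and `eps` parameters, and the choice of the neighbourhood mean as the background I_b are illustrative assumptions, not the authors' wavelet-domain formulation.

```python
def weber_contrast(image, radius=1, eps=1e-6):
    """Local Weber contrast |I - I_b| / I_b for a 2D list of intensities.

    Assumption: the background I_b is the mean of the surrounding
    (2*radius+1)^2 window, excluding the centre pixel itself.
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ys = range(max(0, y - radius), min(h, y + radius + 1))
            xs = range(max(0, x - radius), min(w, x + radius + 1))
            neigh = [image[j][i] for j in ys for i in xs if (j, i) != (y, x)]
            if not neigh:
                continue  # a 1x1 image has no background to compare against
            background = sum(neigh) / len(neigh)
            # eps guards against division by zero on black backgrounds
            out[y][x] = abs(image[y][x] - background) / (background + eps)
    return out


flat = [[5.0] * 3 for _ in range(3)]
spot = [[10.0, 10.0, 10.0], [10.0, 20.0, 10.0], [10.0, 10.0, 10.0]]
flat_contrast = weber_contrast(flat)
spot_contrast = weber_contrast(spot)
```

A uniform patch yields zero contrast everywhere, while a pixel twice as bright as its surroundings yields a contrast close to 1.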

An Image Merging Method for Two High Dynamic Range Images of Different Exposure (노출 시간이 다른 두 HDR 영상의 융합 기법)

Kim, Jin-Heon
- Journal of Korea Multimedia Society
- v.13 no.4
- pp.526-534
- 2010
This paper describes an algorithm that merges two HDR pictures taken with different exposure times for display on LDR devices such as LCDs or CRTs. The proposed method does not generate a radiance map but merges the images directly, using weights computed from the inputs. The weights are first produced on a per-pixel basis and then blended with a Gaussian function. This step prevents the sparkle noise that abrupt changes in the weights can cause and contributes to a smooth transition between the two images. The chrominance information of the images is merged by a weighted averaging scheme using the deviations of the RGB averages and their differences. The algorithm represents the unsaturated areas of the two original images well and joins the image information smoothly. Because it uses only two input images and automatically tunes the whole internal process to them, it can operate autonomously when built into HDR cameras that use a double-shuttering scheme or double sensor cells.

Threshold Selection Method for Capacity Optimization of the Digital Watermark Insertion (디지털 워터마크의 삽입용량 최적화를 위한 임계값 선택방법)

Lee, Kang-Seung; Park, Ki-Bum
- Journal of the Institute of Convergence Signal Processing
- v.10 no.1
- pp.49-59
- 2009
This paper proposes a watermarking algorithm that optimizes digital watermark insertion capacity at an experimentally determined threshold, using characteristics of the human visual system (HVS), adaptive scale factors, and weight functions based on the discrete wavelet transform. After the original image is decomposed by a 3-level discrete wavelet transform, the watermarks are inserted into all subbands except the baseband, in the significant coefficients selected by the experimental threshold in the wavelet domain. The HVS-based adaptive scale factors and weight functions are used in optimizing the insertion capacity in order to enhance robustness and invisibility. The watermarks consist of Gaussian random sequences and are detected by correlation. The experimental results show that the algorithm preserves good image quality under various attacks such as JPEG lossy compression, noise addition, cropping, blurring, sharpening, and linear and non-linear filtering.
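
The abstract states only that the watermark is a Gaussian random sequence detected by correlation. As a rough illustration of that idea, the sketch below embeds a seeded N(0,1) sequence additively into a list of coefficients and detects it by comparing the mean product against a threshold. The additive rule, the strength `alpha`, the `alpha / 2` threshold, and all function names are assumptions made for this example; the paper's HVS-based scale factors and wavelet decomposition are not reproduced.

```python
import random


def make_watermark(n, seed):
    """Seeded sequence of n samples drawn from N(0, 1)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def embed(coeffs, watermark, alpha):
    """Additive embedding (assumed rule): c' = c + alpha * w."""
    return [c + alpha * w for c, w in zip(coeffs, watermark)]


def correlation(coeffs, watermark):
    """Mean product of the coefficients and the candidate watermark."""
    return sum(c * w for c, w in zip(coeffs, watermark)) / len(watermark)


def detect(coeffs, watermark, alpha):
    """Declare the watermark present if the correlation exceeds alpha / 2."""
    return correlation(coeffs, watermark) > alpha / 2


# Stand-in "wavelet coefficients": zero-mean Gaussian noise (hypothetical data).
host_rng = random.Random(0)
host = [host_rng.gauss(0.0, 5.0) for _ in range(4000)]

wm = make_watermark(4000, seed=1)
other = make_watermark(4000, seed=2)
marked = embed(host, wm, alpha=2.0)
```

With the correct sequence the correlation concentrates near `alpha`, while an unrelated sequence or an unmarked host gives a value near zero, which is why a threshold between the two separates them.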

Performance Improvement of Variable Vocabulary Speech Recognizer (가변어휘 음성인식기의 성능개선)

Kim Seunghi; Kim Hoi-Rin
- Proceedings of the Acoustical Society of Korea Conference
- autumn
- pp.21-24
- 1999
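This paper describes work on improving the performance of a variable-vocabulary speech recognizer. A total of 40 context-independent phoneme models, including silence, are used. Using LDA, more useful information is packed into feature vectors of the same dimension, and performance is improved by applying different weights to the Gaussian distribution and the mixture weights during likelihood computation. In experiments using only the ETRI POW 3848 DB, an error rate reduction of $21.7\%$ was confirmed. Experiments using the POW 3848 DB, PC 168 DB, and PBW 445 DB were also conducted to account for noisy and vocabulary-independent environments; in the vocabulary-independent recognition experiment using the PBW 445 DB, an error rate reduction of $56.8\%$ was obtained.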

Convergence Analysis on Bilateral Filter with a Fixed Point Iteration (고정점 반복을 이용한 양방향 필터의 수렴 분석)

Ham, Bumsub; Sohn, Kwanghoon
- Proceedings of the Korean Society of Broadcast Engineers Conference
- 2011.07a
- pp.11-13
- 2011
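The bilateral filter is an edge-preserving smoothing filter used in a variety of applications such as denoising, reflection removal, and stereo matching. In addition to the spatial-domain kernel (spatial kernel) used in the conventional Gaussian filter, it uses an intensity-domain kernel (range kernel) that assigns high weights to pixels of similar intensity, thereby smoothing while preserving edges. Furthermore, unlike the anisotropic diffusion filter, the bilateral filter always guarantees convergence. This paper therefore applies fixed-point iteration theory to prove mathematically that the bilateral filter converges.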
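
To make the two kernels concrete, here is a minimal one-dimensional sketch of a bilateral filter with Gaussian spatial and range kernels. The function name, the window `radius`, and the `sigma_s` / `sigma_r` defaults are assumptions chosen for illustration; the sketch shows a single filtering pass and does not reproduce the paper's fixed-point convergence analysis.

```python
import math


def bilateral_filter_1d(signal, radius=2, sigma_s=1.0, sigma_r=10.0):
    """One pass of a 1D bilateral filter.

    Each output sample is a weighted mean of its neighbours, where the
    weight is the product of a spatial kernel (distance in position) and
    a range kernel (distance in intensity).
    """
    n = len(signal)
    out = []
    for i in range(n):
        num = 0.0
        den = 0.0
        for j in range(max(0, i - radius), min(n, i + radius + 1)):
            w_s = math.exp(-((i - j) ** 2) / (2 * sigma_s ** 2))
            w_r = math.exp(-((signal[i] - signal[j]) ** 2) / (2 * sigma_r ** 2))
            num += w_s * w_r * signal[j]
            den += w_s * w_r
        out.append(num / den)  # den >= 1 because the j == i term has weight 1
    return out


step = [0.0, 0.0, 0.0, 0.0, 100.0, 100.0, 100.0, 100.0]
smoothed = bilateral_filter_1d(step)
```

On the step signal, samples across the edge differ by far more than `sigma_r`, so their range weights are negligible and the edge survives the smoothing almost unchanged.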

Perceptual Data Hiding Model with Adaptive Watermark Strength (적응적 워터마크 삽입강도를 갖는 지각적 데이터 은닉 모델)

조영웅;장봉주;김응수;문광석;권기룡
- Proceedings of the Korea Multimedia Society Conference
- 2002.11b
- pp.287-290
- 2002
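To protect the copyright of digital content, this paper proposes a perceptual data hiding model with adaptive watermark embedding strength in the wavelet transform domain as a more effective way of maintaining robustness and invisibility. First, the image is decomposed into four levels of multiresolution using the 9/7 biorthogonal wavelet filter. Next, perceptually significant coefficients (PSCs) are selected through successive subband quantization, and watermark information is embedded only in the selected coefficients. The perceptual model uses an NVF (noise visibility function), estimated with a stationary generalized Gaussian model, so that the watermark is embedded adaptively according to edge, texture, and flat regions. The NVF is determined by the variance and shape parameter within each subband. To obtain adaptive embedding strength, the strength in edge and texture regions is determined by the frequency sensitivity of each subband, while the strength in flat regions uses a statistical weight based on the local characteristics of the image. The embedded watermark is a random sequence drawn from N(0,1). Experiments against various attacks confirm the invisibility and robustness of the proposed method.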

Speech analysis using the Robust Time-Weighted Kalman filtering (시간가중치의 로버스트 칼만필터를 이용한 음성분석)

최홍섭;안수길
- The Journal of the Acoustical Society of Korea
- v.11 no.1E
- pp.73-78
- 1992
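A Kalman filter is applied to the analysis of speech, which is a time-varying signal. Conventional speech analysis mainly uses linear predictive coding, a frame-based method, but this is not well suited to capturing the time-varying characteristics of speech. The Kalman filter, widely used as a sequential estimation technique, is therefore applied to speech analysis. In addition, for time-varying signals such as speech, an appropriate weight is applied to the noise variance of past signals to reduce the influence of past signals on the current estimate, and this is used for parameter estimation in the transition regions of speech. The modeling error in the speech signal model is generally assumed to be white Gaussian noise; this causes little error in feature parameter estimation for unvoiced sounds such as consonants, but for voiced sounds such as vowels the pulse train that serves as the excitation signal during speech production causes a large modeling error. The modeling error signal is therefore assumed to follow a non-Gaussian probability distribution, and a robust Kalman filter is used to extract feature parameters from synthesized speech.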

A Study on Speaker-Independent Speech Recognition Using a Hybrid System of Semi-Continuous HMM and RBF (반연속 HMM과 RBF 혼합 시스템을 이용한 화자독립 음성인식에 관한 연구)

Moon Yun Joo; June Sun Do; Kang Chul Ho
- Proceedings of the Acoustical Society of Korea Conference
- spring
- pp.36-39
- 1999
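This paper applies a hybrid of the conventional semi-continuous HMM and the RBF (Radial Basis Function) neural network algorithm to speech recognition. In training, the conventional semi-continuous HMM obtains mixture probabilities that probabilistically model the features of the input speech from L Gaussian probability densities shared across all models and states and from the mixture density coefficients that determine the weight of each Gaussian density; it then learns the initial probabilities, transition probabilities, observation probabilities, mean vectors $\mu$, and covariance matrices $\Sigma$ using maximum likelihood and the Baum-Welch algorithm. The proposed RBF/semi-continuous HMM hybrid, by contrast, adds a modified form of the RBF and determines the semi-continuous HMM observation parameters with the RBF, yielding a more discriminative speaker-independent recognition system. In recognition experiments, it achieves a higher recognition rate than the conventional semi-continuous HMM.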

Optimized Polynomial RBF Neural Networks Based on PSO Algorithm (PSO 기반 최적화 다항식 RBF 뉴럴 네트워크)

Baek, Jin-Yeol; Oh, Sung-Kwun
- Proceedings of the KIEE Conference
- 2008.07a
- pp.1887-1888
- 2008
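This paper designs a polynomial RBF neural network (Polynomial Radial Basis Function Neural Network; pRBFNN) based on fuzzy inference and identifies the model parameters using the PSO (Particle Swarm Optimization) algorithm. The proposed model is expressed as functional modules, namely the premise, consequent, and inference parts, through fuzzy rules written in "IF-THEN" form. The structure of the input-space partition in the premise part is determined by HCM clustering; the Gaussian function commonly used in earlier work is employed as the RBF, and a cone-shaped linear function is also proposed. In addition, to reflect the characteristics of the data set when partitioning the input space, a distribution constant is considered for each input, improving the precision of the partition. In the consequent part, a pRBFNN is proposed in which the conventional constant connection weights are expressed as polynomials. To evaluate the proposed model, the gas furnace time-series data used by Box and Jenkins is applied, and approximation and generalization ability are discussed in comparison with existing models.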

Image Mosaicing Based on Normalized Correlation and Rectangle-to-Quadrilateral Perspective Transformation (정규상관과 직사각형-사변형 투영 변환에 기반한 영상 모자익)

Kim, Dong-Geun; Jang, Byeong-Tae
- The KIPS Transactions:PartB
- v.8B no.3
- pp.311-318
- 2001
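This paper proposes a method for computing the planar projective transformation between overlapping images. The method is based on normalized correlation and a rectangle-to-quadrilateral planar projective transformation. The global translation is computed by block matching, and a simulated annealing (SA) algorithm is used on a Gaussian image pyramid to find the four corresponding points that maximize the normalized correlation coefficient over the overlapping region. The planar projective transformation is computed from the rectangle-to-quadrilateral mapping at these corresponding points, and finally the RGB color values in the overlapping region are blended with linear weights. In the experiments, three images are composited into one large mosaic image.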
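
The final blending step can be illustrated with a short sketch. It assumes the two images are already aligned and blends one row of the overlapping region with a weight that ramps linearly from the left image to the right image; the function name `blend_overlap` and the left-to-right ramp are illustrative assumptions, since the abstract says only that linear weights are used.

```python
def blend_overlap(left_row, right_row):
    """Blend one row of RGB pixels from the overlap of two aligned images.

    The weight alpha ramps linearly from 0 (pure left image) at the first
    overlap column to 1 (pure right image) at the last overlap column.
    """
    n = len(left_row)
    if n != len(right_row):
        raise ValueError("overlap rows must have the same length")
    blended = []
    for k, (p, q) in enumerate(zip(left_row, right_row)):
        alpha = k / (n - 1) if n > 1 else 0.5
        blended.append(tuple((1 - alpha) * a + alpha * b for a, b in zip(p, q)))
    return blended


left = [(100.0, 100.0, 100.0)] * 5
right = [(200.0, 200.0, 200.0)] * 5
row = blend_overlap(left, right)
```

The ramp removes the visible seam that a hard cut between the two images would leave when their exposures differ slightly.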
