Search | Korea Science

Marker Recognition System for the User Interface of a Serious Case (중증환자 인터페이스를 위한 마커 인식 시스템)

So, In-Mi;Kang, Sun-Kyung;Kim, Young-Un;Jung, Sung-Tae
- The KIPS Transactions:PartB
- /
- v.14B no.3 s.113
- /
- pp.191-198
- /
- 2007
In this paper, we present a marker detection and recognition method from camera image for a disabled person to interact with a server system which can control appliance of surrounding environment. It converts the camera image to a binary image by using multi-threshold and extracts contours of objects in the binary image. After that, it approximates the contours to a list of line segments. It finds rectangular markers by using geometrical features which are extracted from the approximated line segments. It normalizes the shape of extracted markers into exact squares by using the warping technique. It extracts feature vectors from marker image by using principal component analysis and then recognizes the marker. The proposed marker recognition system is robust for light change by using multi-threshold. Also, it is robust for angular variation of camera by using warping technique and principal component analysis. Experimental results show that the proposed method achieves 100% recognition rate at maximum for 21 markers and execution speed of 12 frames/sec.
https://doi.org/10.3745/KIPSTB.2007.14-B.3.191 인용 PDF KSCI

A Study on Illumination Normalization Method based on Bilateral Filter for Illumination Invariant Face Recognition (조명 환경에 강인한 얼굴인식 성능향상을 위한 Bilateral 필터 기반 조명 정규화 방법에 관한 연구)

Lee, Sang-Seop;Lee, Su-Young;Kim, Joong-Kyu
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.47 no.4
- /
- pp.49-55
- /
- 2010
Cast shadow caused by an illumination condition can produce troublesome effects for face recognition system using reflectance image. Consequently, we need to separate cast shadow area from feature area for improvement of recognition accuracy. A Bilateral filter smooths image while preserving edges, by means of a nonlinear combination of nearby pixel values. Processing such characteristics, this method is suited to our purpose in illumination estimation process based on Retinex. Therefore, in this paper, we propose a new illumination normalization method based on the Bilateral filter in face images. The proposed method produces a reflectance image that is preserved relatively exact cast shadow area, because coefficient of filter is designed to multiply proximity and discontinuity of pixels in input image. Performance of our method is measured by a recognition accuracy of principle component analysis(PCA) and evaluated to compare with other conventional illumination normalization methods.
PDF KSCI

A Study on Segmental Duratio Control for the Kroean TTS (한국어 문음성 변환기의 음운지속시간 제어에 관한 연구)

김인영
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.143-146
- /
- 1998
자연스러운 한국어의 음성합성을 위해서는 음운의 지속시간의 제어가 매우 중요하다. 본 연구에서는 POW3848 어절에 대한 음성 데이터에 대해 음운 세그먼트, 음운 라벨링, 품사 태깅을 행한 음성 데이터베이스를 구축하여 한국어 음운의 지속시간을 변화시키는 시간 특징을 통계적으로 분석하였다. 이 시간 특징들 중 변화 폭이 큰 요인들을 제어요소로 각 음운의 고유길이를 최대한 배제하고 단지 음운 발성 환경의 영향에 의한 지속시간 변화만을 고려하는 정규화 지속시간에 대한 회귀트리로 한국어 음운 지속시간을 모델화 하였다. 제안된 음운 지속시간 모델을 실시간 제어 알고리즘으로 구현하여 평가한 결과, 음운 지속시간 예측오차의 88% 정도가 25ms이내 이었고 예측치와 관측치 간의 다중 상관관계수는 0.92 정도로 평가되어, 제안된 모델의 타당성이 입증되었다.
PDF

A Study on the Perception of Disaster and Crisis Using SNS(Twitter) (SNS(트위터)를 활용한 재난 및 위기상황 인식에 관한 연구)

Choi, YeonHo;Hyun, SangYeop;Shin, Younsoon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.11a
- /
- pp.483-486
- /
- 2021
재난 및 위기상황이 발생하면 해당 상황을 신속하고 정확하게 파악해야 많은 사람들을 구조할 수 있다. 본 논문은 SNS에서 재난 및 위기 상황을 정확하게 인식하는 연구를 진행한다. 텍스트 정규화, 워드 토큰화, 단어 임베딩 과정을 통해 전처리를 진행하고 키워드와 여러 특징들을 뽑아 SVM classifier를 사용하여 분류 작업을 실시한다. 실험결과 재난과 연관이 있는 경우에 해시태그의 빈도수, URL 빈도수, 두 키워드간의 거리가 다른 특징들의 조합보다 더 좋은 결과를 나타내었다.
https://doi.org/10.3745/PKIPS.y2021m11a.483 인용 PDF

Disease Region Pattern Recognition Algorithm of Gastrointestinal Image using Wavelet Transform and Neural Network (Wavelet변환과 신경회로망에 의한 위장 영상의 질환 부위 패턴 인식 알고리즘)

이상복;이주신
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.36S no.5
- /
- pp.70-77
- /
- 1999
본 논문에서는 Wavelet을 이용한 위장 영상의 질환 부위 특징을 추출하여 질환 부위 패턴을 인식할 수 있는 알고리즘을 제안하였다. 전처리 과정으로서 위장 영상이 형태정보는 입력 영상을 DWT(Discrete wavelet transform)에 의해 4레벨 DWT 계수 행렬을 구하고 계수 행렬의 특징에 따라 저주파 계수 행렬로부터 저주파 특징 파라미터 32개, 수평 고주파 계수 행렬로부터 수평 고주파 특징 파라미터 16개, 수직 고주파 계수 행렬로부터 수직 고주파 특징 파라미터 16개, 그리고, 대각 고주파 계수 행렬로부터 대각 고주파 특징 파라미터 32개 등 모두 96개의 특징 파라미터를 추출한 후 각각의 특징 파라미터를 최대 값+0.5로 최소 값을 -0.5로 정규화 하여 신경회로망의 입력 벡터로 사용하였다. 위장 영상 패턴 인식을 위한 신경회로망은 교사 학습을 요구하는 다층 구조의 오차 역전파(Error back propagation)알고리즘으로 하였고 구조적 특성을 이용하여 입력층, 중간층, 출력층의 계층 구조로 설계하였다. 설계된 신경회로망의 학습은 학습계수를 0.2로 모우멘텀을 0.6으로 설정하여 출력층 최대오차가 0.01보다 작을 때까지 수행하였으며 약 8000회 정도 학습한 결과 설정값 보다 작은 결과를 얻었고 질환의 종류나 위치, 크기에 관계없이 100%의 인식률을 얻었다.
PDF

A Feature Map Generation Method for MSFC-Based Feature Compression without Min-Max Signaling in VCM (VCM 의 MSFC 기반 특징 압축을 위한 Min-Max 시그널링을 제외한 특징맵 생성 기법)

Dong-Ha Kim;Yong-Uk Yoon;Jae-Gon Kim
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.11a
- /
- pp.79-81
- /
- 2022
MPEG-VCM(Video Coding for Machines)에서는 머신비전(machine vision) 네트워크의 백본(backbone)에서 추출된 이미지/비디오 특징 압축을 위한 표준화를 진행하고 있다. 현재 VCM 표준기술 탐색 과정에서 가장 좋은 압축 성능을 보이는 MSFC(Multi-Scale Feature compression) 기반 압축 네트워크 모델은 추출된 멀티-스케일 특징을 단일-스케일 특징으로 변환하여 특징맵으로 구성하고 이를 VVC 로 압축한다. 본 논문에서는 MSFC 기반 압축 모델에서 Min-Max 값 시그널링을 제외한 최소-최대(Min-Max) 정규화를 포함한 개선된 특징맵 생성 기법을 제시한다. 즉, 제안기법은 VCM 디코더에서의 특징맵 복원을 위한 Min-Max 값을 학습 기반으로 생성함으로써 Min-Max 시그널링의 비트 오버헤드 절감뿐만 아니라 별도의 시그널링 기제를 생략한 보다 단순한 전송 비트스트림 구성을 가능하게 한다. 실험결과 제안기법은 이미지 앵커(Anchor) 대비 BPP-mAP 성능에서 83.24% BD-rate 이득을 보이며, 이는 기존 MSFC 보다 1.74%정도 다소 떨어지지만 별도의 Min-Max 시그널링 없이도 기존의 성능을 유지할 수 있음을 보인다.
PDF

A Single-End-Point DTW Algorithm for Keyword Spotting (핵심어 검출을 위한 단일 끝점 DTW알고리즘)

최용선;오상훈;이수영
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.3
- /
- pp.209-219
- /
- 2004
In order to implement a real time hardware for keyword spotting, we propose a Single-End-Point DTW(SEP-DTW) algorithm which is simple and less complex for computation. The SEP-DTW algorithm only needs a single end point which enables efficient applications, and it has a small wont of computations because the global search area is divided into successive local search areas. Also, we adopt new local constraints and a new distance measure for a better performance of the SEP-DTW algorithm. Besides, we make a normalization of feature same vectors so that they have the same variance in each frequency bin, and each frame has the same energy levels. To construct several reference patterns for each keyword, we use a clustering algorithm for all training patterns, and mean vectors in every cluster are taken as reference patterns. In order to detect a key word for input streams of speech, we measure the distances between reference patterns and input pattern, and we make a decision whether the distances are smaller than a pre-defined threshold value. With isolated speech recognition and keyword spotting experiments, we verify that the proposed algorithm has a better performance than other methods.
PDF KSCI

LSTM-based Business Process Remaining Time Prediction Model Featured in Activity-centric Normalization Techniques (액티비티별 특징 정규화를 적용한 LSTM 기반 비즈니스 프로세스 잔여시간 예측 모델)

Ham, Seong-Hun;Ahn, Hyun;Kim, Kwanghoon Pio
- Journal of Internet Computing and Services
- /
- v.21 no.3
- /
- pp.83-92
- /
- 2020
Recently, many companies and organizations are interested in predictive process monitoring for the efficient operation of business process models. Traditional process monitoring focused on the elapsed execution state of a particular process instance. On the other hand, predictive process monitoring focuses on predicting the future execution status of a particular process instance. In this paper, we implement the function of the business process remaining time prediction, which is one of the predictive process monitoring functions. In order to effectively model the remaining time, normalization by activity is proposed and applied to the predictive model by taking into account the difference in the distribution of time feature values according to the properties of each activity. In order to demonstrate the superiority of the predictive performance of the proposed model in this paper, it is compared with previous studies through event log data of actual companies provided by 4TU.Centre for Research Data.
https://doi.org/10.7472/jksii.2020.21.3.83 인용 PDF KSCI HTML

Human Activity Recognition using View-Invariant Features and Probabilistic Graphical Models (시점 불변인 특징과 확률 그래프 모델을 이용한 인간 행위 인식)

Kim, Hyesuk;Kim, Incheol
- Journal of KIISE
- /
- v.41 no.11
- /
- pp.927-934
- /
- 2014
In this paper, we propose an effective method for recognizing daily human activities from a stream of three dimensional body poses, which can be obtained by using Kinect-like RGB-D sensors. The body pose data provided by Kinect SDK or OpenNI may suffer from both the view variance problem and the scale variance problem, since they are represented in the 3D Cartesian coordinate system, the origin of which is located on the center of Kinect. In order to resolve the problem and get the view-invariant and scale-invariant features, we transform the pose data into the spherical coordinate system of which the origin is placed on the center of the subject's hip, and then perform on them the scale normalization using the length of the subject's arm. In order to represent effectively complex internal structures of high-level daily activities, we utilize Hidden state Conditional Random Field (HCRF), which is one of probabilistic graphical models. Through various experiments using two different datasets, KAD-70 and CAD-60, we showed the high performance of our method and the implementation system.
https://doi.org/10.5626/JOK.2014.41.11.927 인용

An Integrated Face Detection and Recognition System (통합된 시스템에서의 얼굴검출과 인식기법)

박동희;이규봉;이유홍;나상동;배철수
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2003.05a
- /
- pp.165-170
- /
- 2003
This paper presents an integrated approach to unconstrained face recognition in arbitrary scenes. The front end of the system comprises of a scale and pose tolerant face detector. Scale normalization is achieved through novel combination of a skin color segmentation and log-polar mapping procedure. Principal component analysis is used with the multi-view approach proposed in[10] to handle the pose variations. For a given color input image, the detector encloses a face in a complex scene within a circular boundary and indicates the position of the nose. Next, for recognition, a radial grid mapping centered on the nose yields a feature vector within the circular boundary. As the width of the color segmented region provides an estimated size for the face, the extracted feature vector is scale normalized by the estimated size. The feature vector is input to a trained neural network classifier for face identification. The system was evaluated using a database of 20 person's faces with varying scale and pose obtained on different complex backgrounds. The performance of the face recognizer was also quite good except for sensitivity to small scale face images. The integrated system achieved average recognition rates of 87% to 92%.
PDF

Search Result 357, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)