Search | Korea Science

Classification of infant cries using 3D feature vectors (3D 특징 벡터를 이용한 영아 울음소리 분류)

Park, JeongHyeon;Kim, MinSeo;Choi, HyukSoon;Moon, Nammee
- Proceedings of the Korea Information Processing Society Conference / 2022.11a / pp.597-599 / 2022
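Infants express all of their needs through crying, a nonverbal form of communication. However, it is difficult to determine what an infant's cry means, and many studies have tried to interpret infant cries. In this paper, we propose classifying infant cries using 3D feature vectors. We use the Donate-a-corpus-cry dataset, whose data are labeled with five classes: belly pain, burping, discomfort, hunger, and tiredness. The data are augmented by tempo adjustment, which changes the playback speed to 90% and 110% of the original. The audio is converted into Spectrogram, Mel-Spectrogram, and MFCC features, and the resulting 2D feature vectors are stacked into a single 3D feature vector. The 3D feature vectors are then used to train ResNet and EfficientNet models. As a result, the 2D feature vectors achieved an F1 score of 0.89 and the 3D feature vectors an F1 score of 0.98, an improvement of 0.09.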
https://doi.org/10.3745/PKIPS.y2022m11a.597
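The stacking step described in the abstract above can be sketched as follows. This is a minimal illustration and not the authors' code: the feature maps are toy nested lists standing in for Spectrogram, Mel-Spectrogram, and MFCC outputs, and the nearest-neighbour resize to a common shape (`resize_nearest`) is an assumption, since the abstract does not say how maps of different sizes are aligned.

```python
def resize_nearest(feature, rows, cols):
    """Resize a 2D list to rows x cols by nearest-neighbour sampling."""
    src_rows, src_cols = len(feature), len(feature[0])
    return [
        [feature[r * src_rows // rows][c * src_cols // cols] for c in range(cols)]
        for r in range(rows)
    ]


def stack_features(features, rows, cols):
    """Stack several 2D feature maps into one (channels, rows, cols) list."""
    return [resize_nearest(f, rows, cols) for f in features]


# Toy stand-ins for the three 2D features (hypothetical values and shapes).
spectrogram = [[float(r + c) for c in range(8)] for r in range(6)]
mel_spectrogram = [[float(r * c) for c in range(8)] for r in range(4)]
mfcc = [[float(r - c) for c in range(8)] for r in range(2)]

cube = stack_features([spectrogram, mel_spectrogram, mfcc], rows=4, cols=8)
shape = (len(cube), len(cube[0]), len(cube[0][0]))
print(shape)  # (3, 4, 8)
```

A three-channel array of this shape can be passed to an image classifier in the same way as an RGB image, which is presumably what makes ResNet and EfficientNet applicable here.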

Vector Silhouette Extraction for Creating a Blueprint of Cultural Assets (문화재의 도면 생성을 위한 벡터 실루엣 추출)

Jung-Il Jung;Jinsoo Cho
- Proceedings of the Korea Information Processing Society Conference / 2008.11a / pp.192-195 / 2008
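In this paper, we propose a method for generating drawing silhouettes of cultural assets using advancing 3D graphics technology. The vector silhouette extraction process, which generates drawings of a cultural asset from 3D data precisely measured with a 3D scanner, works as follows. First, the measured 3D data are moved into a normalized 3D space, and then every edge in the data is detected to build an edge list. The edge list is then classified into contour edges and crease edges: contour edges form the outline silhouette of the cultural asset, and the crease edges (all edges other than contour edges) form the interior-pattern silhouette that represents the asset's surface features. The interior-pattern silhouette is extracted by comparing a user-supplied threshold with the dot product of the direction vectors of the two faces that form a crease edge. The extracted vector silhouettes are divided into outline silhouettes and interior-pattern silhouettes, and using both made it possible to generate drawing silhouettes that support interpretation of the asset's structure and surface features.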
https://doi.org/10.3745/PKIPS.y2008m011a.192
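The threshold test on crease edges described in the abstract above might look like the sketch below. It assumes a triangle mesh, takes the "direction vectors" of the two faces to be their unit normals, and treats an edge as a visible crease when the dot product falls below the threshold; the direction of that comparison and all names are assumptions, and contour-edge detection is not shown.

```python
import math
from collections import defaultdict


def face_normal(vertices, face):
    """Unit normal of a triangle given by three vertex indices."""
    a, b, c = (vertices[i] for i in face)
    u = [b[k] - a[k] for k in range(3)]
    v = [c[k] - a[k] for k in range(3)]
    n = [u[1] * v[2] - u[2] * v[1],
         u[2] * v[0] - u[0] * v[2],
         u[0] * v[1] - u[1] * v[0]]
    length = math.sqrt(sum(x * x for x in n))
    return [x / length for x in n]


def crease_edges(vertices, faces, threshold):
    """Return edges whose two adjacent face normals have dot product < threshold."""
    edge_faces = defaultdict(list)  # edge list: edge -> indices of adjacent faces
    for fi, face in enumerate(faces):
        for k in range(3):
            edge = tuple(sorted((face[k], face[(k + 1) % 3])))
            edge_faces[edge].append(fi)
    normals = [face_normal(vertices, f) for f in faces]
    result = []
    for edge, adjacent in sorted(edge_faces.items()):
        if len(adjacent) == 2:
            n1, n2 = normals[adjacent[0]], normals[adjacent[1]]
            if sum(p * q for p, q in zip(n1, n2)) < threshold:
                result.append(edge)
    return result


# Two triangles folded at a right angle along the shared edge (0, 1).
vertices = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)]
faces = [(0, 1, 2), (1, 0, 3)]
print(crease_edges(vertices, faces, threshold=0.5))  # [(0, 1)]
```

Raising the threshold keeps shallower folds and so produces a denser interior pattern; lowering it keeps only sharp creases.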

Robust 3D Facial Landmark Detection Using Angular Partitioned Spin Images (각 분할 스핀 영상을 사용한 3차원 얼굴 특징점 검출 방법)

Kim, Dong-Hyun;Choi, Kang-Sun
- Journal of the Institute of Electronics and Information Engineers / v.50 no.5 / pp.199-207 / 2013
Spin images, which efficiently represent the surface features of 3D mesh models, have been used to detect facial landmark points. However, at a given point, different normal directions can lead to quite different spin images. Moreover, since 3D points are projected to the 2D ($\alpha$-$\beta$) space during spin image generation, surface features cannot be described clearly. In this paper, we present a method to detect 3D facial landmarks using improved spin images obtained by partitioning the search area with respect to angle. Generating sub-spin images for angularly partitioned 3D spaces yields more distinctive features describing the corresponding surfaces and improves landmark detection performance. To generate spin images that are robust to inaccurate surface normal directions, we average each surface normal with its neighboring normal vectors. The experimental results show that the proposed method increases landmark detection accuracy by about 34% over a conventional method.
https://doi.org/10.5573/ieek.2013.50.5.199 KSCI
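To make the ($\alpha$-$\beta$) projection and the angular partition in the abstract above concrete, here is a small sketch. The spin-image coordinates follow the usual definition ($\beta$ is the signed height along the normal, $\alpha$ the distance from the normal line). The sector assignment uses a hypothetical reference tangent `ref`, which is assumed to be a unit vector perpendicular to the normal; the abstract does not say how the paper fixes that reference direction.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def average_normal(normals):
    """Average a normal with its neighbours and renormalise to unit length."""
    s = [sum(n[k] for n in normals) for k in range(3)]
    length = math.sqrt(dot(s, s))
    return [x / length for x in s]


def spin_coords(p, n, x):
    """Spin-image coordinates (alpha, beta) of x relative to the oriented point (p, n)."""
    d = [x[k] - p[k] for k in range(3)]
    beta = dot(n, d)                                       # signed height along n
    alpha = math.sqrt(max(dot(d, d) - beta * beta, 0.0))   # distance from the normal line
    return alpha, beta


def sector_index(p, n, ref, x, num_sectors):
    """Angular partition: index of the sector around n that contains x."""
    d = [x[k] - p[k] for k in range(3)]
    # Second tangent axis t2 = n x ref completes the local frame.
    t2 = [n[1] * ref[2] - n[2] * ref[1],
          n[2] * ref[0] - n[0] * ref[2],
          n[0] * ref[1] - n[1] * ref[0]]
    angle = math.atan2(dot(d, t2), dot(d, ref)) % (2 * math.pi)
    return min(int(angle / (2 * math.pi / num_sectors)), num_sectors - 1)


p, n, ref = (0, 0, 0), (0, 0, 1), (1, 0, 0)
print(spin_coords(p, n, (3.0, 0.0, 4.0)))          # (3.0, 4.0)
print(sector_index(p, n, ref, (-1.0, 1.0, 0.0), 4))  # 1
```

Accumulating a separate ($\alpha$, $\beta$) histogram per sector gives one sub-spin image for each angular partition, so points at the same ($\alpha$, $\beta$) but on different sides of the normal are no longer merged.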

3D Motion Estimation Using Optical Flow (Optical Flow를 이용한 3차원 운동 정보에 관한 연구)

조혜리;이경무;이상욱
- Proceedings of the IEEK Conference / 2000.09a / pp.845-848 / 2000
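A motion vector is the 2D velocity vector formed in the image when the velocity of a 3D object, which arises from relative movement between the viewing camera and the observed object, is projected onto the 2D image. The movement of an object in the image is important information about motion in 3D space and is used for object tracking. This paper addresses the problem of estimating camera motion from a sequence of consecutive 2D intensity images. In conventional feature-based tracking, tracking becomes difficult when the feature points of the model and the background cannot be separated during low-level image processing or when model features are lost. For motion caused by translation and rotation of the camera and the 3D object, many of the 3D target features disappear, so errors can accumulate considerably. To solve these problems, we propose a technique that recovers camera motion information using both target and background features. The proposed 3D camera motion estimation technique consists of two main steps: an optical flow search over many features of the 3D model and the background in two consecutive images, and estimation of the camera motion from the resulting motion vectors using the camera's nonlinear motion equations and a Lagrange multiplier.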
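The definition of a motion vector in the abstract above (a 3D velocity projected to a 2D image velocity) can be illustrated with the standard instantaneous rigid-motion model. The sketch assumes a pinhole camera with unit focal length and a static scene; the abstract does not give the paper's own equations, and the Lagrange-multiplier estimation step is not shown. `numeric_flow` is only a finite-difference check of the closed form.

```python
def motion_field(x, y, depth, translation, rotation):
    """2D image velocity (u, v) at the normalised image point (x, y).

    The camera moves with linear velocity `translation` and angular
    velocity `rotation`; `depth` is the Z coordinate of the scene point.
    """
    tx, ty, tz = translation
    wx, wy, wz = rotation
    u = (-tx + x * tz) / depth + wx * x * y - wy * (1 + x * x) + wz * y
    v = (-ty + y * tz) / depth + wx * (1 + y * y) - wy * x * y - wz * x
    return u, v


def project(point):
    """Perspective projection of a 3D point onto the image plane Z = 1."""
    px, py, pz = point
    return px / pz, py / pz


def numeric_flow(point, translation, rotation, dt=1e-6):
    """Finite-difference check: move the point as seen from the camera, then reproject."""
    px, py, pz = point
    tx, ty, tz = translation
    wx, wy, wz = rotation
    # Velocity of a static point relative to the moving camera: -T - w x P.
    vel = (-tx - (wy * pz - wz * py),
           -ty - (wz * px - wx * pz),
           -tz - (wx * py - wy * px))
    moved = (px + vel[0] * dt, py + vel[1] * dt, pz + vel[2] * dt)
    (x0, y0), (x1, y1) = project(point), project(moved)
    return (x1 - x0) / dt, (y1 - y0) / dt


point = (0.4, -0.2, 2.0)
T, w = (0.1, 0.2, 0.3), (0.01, -0.02, 0.03)
x, y = project(point)
analytic = motion_field(x, y, point[2], T, w)
numeric = numeric_flow(point, T, w)
print(analytic, numeric)
```

Each measured optical-flow vector gives two such equations in the six unknown motion parameters (plus depth), which is why many features from both the target and the background help constrain the estimate.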

Design of the 3D Object Recognition System with Hierarchical Feature Learning (계층적 특징 학습을 이용한 3차원 물체 인식 시스템의 설계)

Kim, Joohee;Kim, Dongha;Kim, Incheol
- KIPS Transactions on Software and Data Engineering / v.5 no.1 / pp.13-20 / 2016
In this paper, we propose an object recognition system that uses hierarchical feature learning to identify an object's category, instance name, and several attributes from its color and depth images. In the preprocessing stage, the system transforms the depth images of the object into surface normal vectors, which represent the object's shape information more precisely. In the feature learning stage, it extracts a set of patch features and image features from each pair of color image and surface normal vectors through two-layered learning. The system then trains a set of independent classification models with labeled feature vectors and the SVM learning algorithm. Through experiments with the UW RGB-D Object Dataset, we verify the performance of the proposed object recognition system.
https://doi.org/10.3745/KTSDE.2016.5.1.13 KSCI
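The preprocessing step in the abstract above, turning a depth image into surface normal vectors, can be approximated as below. This is a sketch under simplifying assumptions (central differences, unit pixel spacing, no camera intrinsics), not the system's actual implementation; `depth_to_normals` is a hypothetical name.

```python
import math


def depth_to_normals(depth):
    """Estimate a unit surface normal for each interior pixel of a 2D depth map.

    Uses central differences with unit pixel spacing; border pixels,
    which lack two neighbours, are left as None.
    """
    rows, cols = len(depth), len(depth[0])
    normals = [[None] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            dz_dx = (depth[r][c + 1] - depth[r][c - 1]) / 2.0
            dz_dy = (depth[r + 1][c] - depth[r - 1][c]) / 2.0
            n = (-dz_dx, -dz_dy, 1.0)
            length = math.sqrt(sum(v * v for v in n))
            normals[r][c] = tuple(v / length for v in n)
    return normals


flat = [[5.0] * 3 for _ in range(3)]                      # constant depth
ramp = [[float(c) for c in range(3)] for _ in range(3)]   # depth grows with column
print(depth_to_normals(flat)[1][1])
print(depth_to_normals(ramp)[1][1])
```

The flat patch yields a normal pointing straight along the viewing axis, while the ramp tilts it, which is the shape cue that raw depth values alone do not expose directly.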

3D Content Model Hashing Based on Object Feature Vector (객체별 특징 벡터 기반 3D 콘텐츠 모델 해싱)

Lee, Suk-Hwan;Kwon, Ki-Ryong
- Journal of the Institute of Electronics Engineers of Korea CI / v.47 no.6 / pp.75-85 / 2010
This paper presents a robust 3D model hashing method based on object feature vectors for 3D content authentication. The proposed method selects the feature objects with the largest area in a 3D model containing various objects and groups the distances of the normalized vertices in those feature objects. We then permute the groups in each object using a permutation key and generate the final binary hash through a binarization process that uses the group coefficients and a random key. Hash robustness is thus improved by the group coefficients derived from the distance distribution of vertices in each object group, and hash uniqueness is improved by the binarization process with a permutation key and a random key. Experimental results verify that the proposed hashing is both robust against various mesh and geometric editing operations and unique.
KSCI
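The pipeline in the abstract above (group vertex distances, permute the groups with a key, binarize with a second key) is sketched below as a toy for a single object. Nearly every detail is an assumption: distances are measured from the centroid, the group coefficient is taken to be the fraction of vertices in each distance band, and the binarization thresholds are drawn from a seeded generator. The paper's actual coefficients and binary process are not given in the abstract.

```python
import math
import random


def model_hash(vertices, num_groups, perm_key, random_key):
    """Toy binary hash built from the distribution of vertex distances."""
    n = len(vertices)
    centroid = [sum(v[k] for v in vertices) / n for k in range(3)]
    dists = [math.dist(v, centroid) for v in vertices]
    longest = max(dists)
    normalized = [d / longest for d in dists]   # scale-invariant, in [0, 1]
    # Assumed group coefficient: fraction of vertices in each distance band.
    counts = [0] * num_groups
    for d in normalized:
        counts[min(int(d * num_groups), num_groups - 1)] += 1
    coeffs = [c / n for c in counts]
    # Key-dependent permutation of the groups.
    order = list(range(num_groups))
    random.Random(perm_key).shuffle(order)
    permuted = [coeffs[i] for i in order]
    # Binarise against key-dependent thresholds around the uniform share.
    rng = random.Random(random_key)
    return [1 if c > rng.uniform(0.5, 1.5) / num_groups else 0 for c in permuted]


vertices = [(0, 0, 0), (1, 0, 0), (0, 2, 0), (0, 0, 3), (1, 1, 1), (2, 0, 1)]
scaled = [(2 * x, 2 * y, 2 * z) for x, y, z in vertices]   # uniform scaling
rotated = [(-y, x, z) for x, y, z in vertices]             # 90-degree turn about Z
print(model_hash(vertices, 4, perm_key=7, random_key=11))
print(model_hash(scaled, 4, perm_key=7, random_key=11))
print(model_hash(rotated, 4, perm_key=7, random_key=11))
```

Because the hash depends only on normalized distances from the centroid, the scaled and rotated copies produce the same bits as the original in this toy, which is the kind of robustness to geometric editing the abstract refers to.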

Dynamic Facial Expression of Fuzzy Modeling Using Probability of Emotion (감정확률을 이용한 동적 얼굴표정의 퍼지 모델링)

Kang, Hyo-Seok;Baek, Jae-Ho;Kim, Eun-Tai;Park, Mignon
- Journal of the Korean Institute of Intelligent Systems / v.19 no.1 / pp.1-5 / 2009
This paper proposes applying a 2D emotion recognition database based on the mirror-reflected method to a 3D application. It also builds a fuzzy model of facial expressions using emotion probabilities. The proposed facial expression function applies fuzzy theory to three basic movements for facial expressions. The method takes the feature vectors for emotion recognition that the 2D application obtains from mirror-reflected multi-images and applies them to the 3D application. We can thus obtain, for a real model, a model based on the fuzzy nonlinear facial expression of a 2D model. We use the average probabilities of six basic expressions: happy, sad, disgust, angry, surprise, and fear. Furthermore, dynamic facial expressions are generated via fuzzy modeling. This paper compares and analyzes the feature vectors of a real model with those of a 3D human-like avatar.
https://doi.org/10.5391/JKIIS.2009.19.1.001 KSCI

Outer-line measurement for 3D reconstruction of huge structures (거대한 구조물의 3차원 영상 재구성을 위한 외곽선 길이 정보 추출)

Jeon, Byung-Seung;Park, Jung-Min;Kim, Young-Joong;Ko, Han-Seok;Hwang, In-Joon;Lim, Myo-Taeg
- Proceedings of the KIEE Conference / 2008.10b / pp.280-281 / 2008
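This paper proposes a method for the 3D image reconstruction of large structures that finds feature points in the acquired 2D images, combines them into lines, and then extracts line-length information. To extract the outline-length information of a huge structure, images are acquired with a wide-angle camera. The outlines in the image determine the model's tilt, shape, and size, but using a wide-angle camera introduces barrel distortion, perspective projection distortion, and similar effects. Outline information is extracted in the following order: first, a 2D image of the model is acquired, and a distortion-corrected grayscale image is obtained from it. The SUSAN algorithm is then used to remove noise from this grayscale image and to find feature points. SUSAN has a low computational cost and is highly robust to noise, which makes it an effective technique for obtaining feature points in images. The feature points are mapped into a 3D vector space and represented as points and lines along the X, Y, and Z axes, and the vector lengths are obtained from the coordinates of the start and end points. Using these vector data and OpenGL, a library for 3D image reconstruction, we developed software that reconstructs huge structures in 3D space.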
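The final measurement step in the abstract above, obtaining a vector length from the start and end coordinates, amounts to a Euclidean distance. A minimal sketch, with hypothetical feature points and segment indices:

```python
import math


def segment_lengths(points, segments):
    """Length of each outline segment, given 3D feature points and (start, end) index pairs."""
    return [math.dist(points[start], points[end]) for start, end in segments]


# Hypothetical feature points (X, Y, Z) and the outline segments joining them.
points = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (3.0, 4.0, 12.0)]
segments = [(0, 1), (1, 2), (0, 2)]
print(segment_lengths(points, segments))
```

The lengths are in whatever units the 3D coordinates use, so a known reference length in the scene would be needed to convert them to physical dimensions.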

AdaBoost-based Gesture Recognition Using Time Interval Window Applied Global and Local Feature Vectors with Mono Camera (모노 카메라 영상기반 시간 간격 윈도우를 이용한 광역 및 지역 특징 벡터 적용 AdaBoost기반 제스처 인식)

Hwang, Seung-Jun;Ko, Ha-Yoon;Baek, Joong-Hwan
- Journal of the Korea Institute of Information and Communication Engineering / v.22 no.3 / pp.471-479 / 2018
Smart TVs based on Android and iOS set-top boxes have recently become widespread. This paper proposes a new approach that controls the TV with gestures, moving beyond the era of the remote control. The AdaBoost algorithm is applied to gesture recognition with a mono camera. First, we extract body coordinates using a CamShift-based body tracking and estimation algorithm built on Gaussian background removal. Using global and local feature vectors, we recognize gestures whose speed varies. The time-interval trajectories of the hand and wrist are tracked, and AdaBoost with the CART algorithm is used to train and classify gestures. The CART algorithm is also used to search for the principal feature vectors with a high classification success rate. As a result, 24 optimal feature vectors were found, which showed a lower error rate (3.73%) and a higher accuracy rate (95.17%) than the existing algorithm.
https://doi.org/10.6109/jkiice.2018.22.3.471 KSCI
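The boosting step named in the abstract above can be illustrated with a compact discrete AdaBoost. This sketch substitutes one-level decision stumps for full CART trees and uses a toy one-dimensional dataset instead of hand and wrist trajectory features, so it shows the general technique rather than the paper's classifier.

```python
import math


def train_stump(samples, labels, weights):
    """Best one-level tree as (weighted error, feature index, threshold, polarity)."""
    best = None
    for j in range(len(samples[0])):
        for threshold in sorted({s[j] for s in samples}):
            for polarity in (1, -1):
                preds = [polarity if s[j] >= threshold else -polarity for s in samples]
                err = sum(w for w, p, y in zip(weights, preds, labels) if p != y)
                if best is None or err < best[0]:
                    best = (err, j, threshold, polarity)
    return best


def adaboost(samples, labels, rounds):
    """Discrete AdaBoost over decision stumps; labels must be +1 or -1."""
    n = len(samples)
    weights = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, j, threshold, polarity = train_stump(samples, labels, weights)
        err = min(max(err, 1e-10), 1 - 1e-10)   # avoid division by zero
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, j, threshold, polarity))
        # Re-weight: misclassified samples gain weight, then normalise.
        for i, s in enumerate(samples):
            pred = polarity if s[j] >= threshold else -polarity
            weights[i] *= math.exp(-alpha * labels[i] * pred)
        total = sum(weights)
        weights = [w / total for w in weights]
    return model


def predict(model, sample):
    """Sign of the alpha-weighted vote of all stumps."""
    score = sum(a * (p if sample[j] >= t else -p) for a, j, t, p in model)
    return 1 if score >= 0 else -1


# Toy data: the positive class lies on both sides of the negative class,
# so no single threshold separates it.
samples = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
labels = [1, 1, -1, -1, 1, 1]
model = adaboost(samples, labels, rounds=3)
print([predict(model, s) for s in samples])  # [1, 1, -1, -1, 1, 1]
```

A single stump misclassifies at least two of the six samples here, while three boosted stumps fit all of them, which is the effect of combining weak learners that the method relies on.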

RGB-D Image Feature Point Extraction and Description Method for 3D Object Recognition (3차원 객체 인식을 위한 RGB-D 영상 특징점 추출 및 특징 기술자 생성 방법)

Park, Noh-Young;Jang, Young-Kyoon;Woo, Woon-Tack
- Proceedings of the Korean Information Science Society Conference / 2012.06c / pp.448-450 / 2012
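In this paper, we use a Kinect-style RGB-D image sensor and propose a method that extracts, from the depth image, surface normal vectors representing the geometric information of a 3D object and renders the result as an image. Feature points and feature descriptors of the depth image are then extracted from the generated image to improve 3D object recognition performance. We also propose a recognition method based on learning a codebook that can distinguish the generated RGB-D feature descriptors on a per-object basis, which further improves object recognition performance. We experimentally demonstrated that the proposed RGB-D feature extraction and learning method is robust to environmental changes such as the presence or absence of texture and camera rotation and translation. The method can be applied to 3D object and space recognition and tracking with Kinect-style RGB-D images, or to augmented reality systems that build on them.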
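The codebook-based recognition mentioned in the abstract above is commonly realized by assigning each descriptor to its nearest codeword and summarizing an object as a histogram of those assignments. The sketch below assumes that scheme and a codebook that has already been learned (for example by clustering); the abstract does not specify either, and the 2D descriptors are hypothetical stand-ins for much longer real ones.

```python
import math


def nearest_codeword(descriptor, codebook):
    """Index of the codeword closest to the descriptor (Euclidean distance)."""
    return min(range(len(codebook)), key=lambda i: math.dist(descriptor, codebook[i]))


def codebook_histogram(descriptors, codebook):
    """Normalised histogram of codeword assignments for one object."""
    counts = [0] * len(codebook)
    for d in descriptors:
        counts[nearest_codeword(d, codebook)] += 1
    total = len(descriptors)
    return [c / total for c in counts]


# Hypothetical 2D descriptors and a three-word codebook.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, 0.2), (0.9, 1.2), (4.8, 5.1), (5.2, 4.9)]
print(codebook_histogram(descriptors, codebook))  # [0.25, 0.25, 0.5]
```

Two objects can then be compared, or fed to a classifier, through these fixed-length histograms even when they produce different numbers of descriptors.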

