• Title/Abstract/Keywords: feature reconstruction

218 search results, processing time 0.027 s

Super Resolution Fusion Scheme for General- and Face Dataset

  • 문준원;김재석
    • 한국멀티미디어학회논문지 / Vol. 22, No. 11 / pp. 1242-1250 / 2019
  • Super-resolution techniques aim to convert a low-resolution image with coarse details into a corresponding high-resolution image with refined details. Over the past decades, performance has improved greatly owing to progress in deep learning models. However, a universal solution for various objects remains a challenging issue. We observe that super-resolution models trained on a general dataset perform poorly on faces. In this paper, we propose a super-resolution fusion scheme that works well for both general and face datasets to achieve a more universal solution. In addition, an object-specific feature extractor is employed for better reconstruction performance. In our experiments, we compare our fusion image with super-resolved images from one of the state-of-the-art deep learning models trained with the DIV2K and FFHQ datasets. Quantitative and qualitative evaluations show that our fusion scheme works well for both datasets. We expect our fusion scheme to be effective on other objects with poor performance, which would lead to universal solutions.

A study on the development of an image processing technique for tracing the movement of heart valves in echocardiograms (I)

  • 육인수;김재익;최홍호
    • 대한의용생체공학회:학술대회논문집 / 대한의용생체공학회 1997 Spring Conference / pp. 88-91 / 1997
  • One of the most significant features of diagnostic ultrasonic instruments is the ability to display information on the soft tissues in the body in real time. In this paper, we carried out a basic study on digital moving-image processing for tracing the movement of heart valves in echocardiograms. A digital moving-image file was made from analog echocardiograms, and each frame was converted to a 256-gray-level image. The ROI (region of interest) was placed on a heart valve region to process images efficiently. Images were processed using image enhancement filters and morphology filters. The results show that the processed images were more enhanced than the original images. When a moving image is reconstructed from these enhanced images, the movement of heart valves can be traced more easily. In this study, we suggest the usefulness of moving-image reconstruction using enhanced images.
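
The per-frame processing described in this abstract (ROI cropping, enhancement, morphology filtering) can be sketched roughly as follows. The abstract does not name the specific filters, so the linear contrast stretch and the 3x3 grayscale opening used here are assumptions chosen for illustration, and all function names are hypothetical.

```python
def crop(frame, top, left, height, width):
    """Cut the region of interest out of a frame (list of rows of 0-255 ints)."""
    return [row[left:left + width] for row in frame[top:top + height]]

def stretch(img):
    """Linear contrast stretch to the full 0-255 range (assumed enhancement filter)."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:  # flat image: nothing to stretch
        return [row[:] for row in img]
    return [[(p - lo) * 255 // (hi - lo) for p in row] for row in img]

def _morph(img, pick):
    """Apply `pick` (min or max) over each pixel's 3x3 neighbourhood, clipped at borders."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - 1), min(h, y + 2))
                      for i in range(max(0, x - 1), min(w, x + 2))]
            row.append(pick(window))
        out.append(row)
    return out

def erode(img):
    return _morph(img, min)

def dilate(img):
    return _morph(img, max)

def opening(img):
    """Erosion followed by dilation: removes bright specks smaller than the window."""
    return dilate(erode(img))

def process_frame(frame, roi):
    """roi = (top, left, height, width); returns the enhanced, filtered ROI."""
    return opening(stretch(crop(frame, *roi)))
```

Applying `process_frame` to every frame and reassembling the results in order would give the kind of enhanced moving image the abstract refers to.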

Identifying potential mergers of globular clusters: a machine-learning approach

  • Pasquato, Mario
    • 천문학회보 / Vol. 39, No. 2 / p. 89 / 2014
  • While the current consensus view holds that galaxy mergers are commonplace, it is sometimes speculated that Globular Clusters (GCs) may also have undergone merging events, possibly resulting in massive objects with a strong metallicity spread such as Omega Centauri. Galaxies are mostly distant, unresolved systems whose mergers are most likely wet, resulting in observational as well as modeling difficulties, but GCs are resolved into stars that can be used as discrete dynamical tracers, and their mergers might have been dry, therefore easily simulated with an N-body code. It is, however, difficult to determine the observational parameters best suited to reveal a history of merging based on the positions and kinematics of GC stars, if evidence of merging is observable at all. To overcome this difficulty, we investigate the applicability of supervised and unsupervised machine learning to the automatic reconstruction of the dynamical history of a stellar system. In particular, we test whether statistical clustering methods can classify simulated systems into monolithic versus merger products. We run direct N-body simulations of two identical King-model clusters undergoing a head-on collision resulting in a merged system, and other simulations of isolated King models with the same total number of particles as the merged system. After several relaxation times elapse, we extract a sample of snapshots of the sky-projected positions of particles from each simulation at different dynamical times, and we run a variety of clustering and classification algorithms to classify the snapshots into two subsets in a relevant feature space.

A Design and Implementation of Functional Array for Improvement of the Traversal Time

  • 주형석;유원희
    • 한국정보처리학회논문지 / Vol. 3, No. 5 / pp. 1262-1272 / 1996
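  • Because pure functional languages are referentially transparent, no object can be updated in place. When an aggregate data structure such as an array is updated, both the original array and the updated array must be kept to preserve referential transparency. An effective method is therefore needed that reduces the cost of maintaining data while preserving referential transparency for all data. In this paper, we present and verify a functional array that addresses this problem. For the verification, the proposed method was implemented on a combinator graph reducer. The proposed method reduced the traversal time of array operations, and both updates and access to the most recent version took constant time, without reconstructing the updated data at run time.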
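
One well-known way to obtain the properties this abstract describes (old versions stay readable, while updates and reads of the newest version cost constant time) is the rerooting, or shallow-binding, scheme for persistent arrays. The sketch below illustrates that general technique; it is an assumption for illustration and not necessarily the design used in the paper, and the class name `FunctionalArray` is hypothetical.

```python
class FunctionalArray:
    """One immutable-looking version of an array.

    Exactly one version (the "root") holds the flat list. Every other
    version stores a single difference (index, value) relative to `base`.
    """

    def __init__(self, items):
        self._data = list(items)  # flat list if this version is the root
        self._index = None
        self._value = None
        self._base = None

    def _reroot(self):
        """Make this version the root by reversing the diff chain."""
        chain = []
        node = self
        while node._data is None:
            chain.append(node)
            node = node._base
        root = node
        for node in reversed(chain):  # node._base is the current root
            data = root._data
            i, new = node._index, node._value
            old = data[i]
            data[i] = new
            node._data, node._index, node._value, node._base = data, None, None, None
            root._data, root._index, root._value, root._base = None, i, old, node
            root = node

    def get(self, i):
        self._reroot()  # no work at all when this is already the newest version
        return self._data[i]

    def set(self, i, value):
        """Return a new version; `self` still reads as before."""
        self._reroot()
        data = self._data
        new = FunctionalArray.__new__(FunctionalArray)
        new._data, new._index, new._value, new._base = data, None, None, None
        old = data[i]
        data[i] = value
        self._data, self._index, self._value, self._base = None, i, old, new
        return new

    def to_list(self):
        self._reroot()
        return list(self._data)
```

For example, after `a = FunctionalArray([1, 2, 3])` and `b = a.set(0, 10)`, reading `b` costs constant time, while reading `a` first pays for walking back one difference. The trade-off is that alternating between distant versions repeatedly pays the rerooting cost.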

Person Re-identification using Sparse Representation with a Saliency-weighted Dictionary

  • Kim, Miri;Jang, Jinbeum;Paik, Joonki
    • IEIE Transactions on Smart Processing and Computing / Vol. 6, No. 4 / pp. 262-268 / 2017
  • Intelligent video surveillance systems have been developed to monitor wide areas and find specific target objects using a large-scale database. However, person re-identification presents some challenges, such as pose change and occlusion. To solve these problems, this paper presents an improved person re-identification method using sparse representation and saliency-based dictionary construction. The proposed method consists of three parts: i) feature description based on salient colors and textures for dictionary elements, ii) orthogonal atom selection using cosine similarity to deal with pose and viewpoint change, and iii) measurement of reconstruction error to rank the gallery entries corresponding to a probe object. The proposed method provides good performance because the robust descriptors used as dictionary atoms are generated by weighting salient features, and dictionary atoms are selected so as to reduce the excessive redundancy that causes low accuracy. Therefore, the proposed method can be applied in a large-scale database surveillance system to search for a specific object.
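
Steps ii) and iii) of this abstract can be illustrated with a small sketch: near-duplicate atoms are dropped by a cosine-similarity threshold, and gallery identities are ranked by how well their remaining atoms reconstruct the probe. The sparse coder here is plain matching pursuit, and the threshold `max_cos`, the iteration count, and all function names are assumptions for illustration; the paper's actual solver and saliency weighting are not given in the abstract. Feature vectors are assumed to be non-zero.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    return math.sqrt(dot(a, a))

def cosine(a, b):
    return dot(a, b) / (norm(a) * norm(b))

def select_atoms(candidates, max_cos=0.9):
    """Greedily keep atoms whose |cosine| to every kept atom is below max_cos."""
    kept = []
    for cand in candidates:
        if all(abs(cosine(cand, k)) < max_cos for k in kept):
            kept.append(cand)
    return kept

def reconstruction_error(probe, atoms, n_iter=10):
    """Matching pursuit: repeatedly subtract the projection onto the best atom."""
    residual = list(probe)
    for _ in range(n_iter):
        best = max(atoms, key=lambda a: abs(dot(residual, a)) / norm(a))
        coef = dot(residual, best) / dot(best, best)
        residual = [r - coef * b for r, b in zip(residual, best)]
    return norm(residual)

def rank_gallery(probe, gallery):
    """gallery maps identity -> list of feature vectors; best match comes first."""
    errors = {pid: reconstruction_error(probe, select_atoms(feats))
              for pid, feats in gallery.items()}
    return sorted(errors, key=errors.get)
```

A lower residual norm means the identity's atoms explain the probe better, so that identity is ranked higher.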

Tiled Stereo Display System for Immersive Telemeeting

  • Kim, Ig-Jae;Ahn, Sang-Chul;Kim, Hyoung-Gon
    • Journal of Information Display / Vol. 8, No. 4 / pp. 27-31 / 2007
  • In this paper, we present an efficient tiled stereo display system for tangible meetings. For a tangible meeting, it is important to provide an immersive, high-resolution display that covers the field of view and gives the local user the same environment as the remote site. To achieve this, a high-resolution image needs to be transmitted to reconstruct the remote world, and it should be shown on a tiled display. However, it is hard to transmit high-resolution images in real time because of limited network bandwidth, so we receive multiple images and reconstruct the remote world from them in advance. Then, we update only the specific area where the remote user is located by receiving a low-resolution image in real time. We synthesize the transmitted image into the existing environment map of the remote world and display it as a stereo image. For this, we developed a new system that supports GPU-based real-time warping and blending and automatic feature extraction using machine vision techniques.

Mobile Web Capture notes system Research on learning maturity

  • 이연란;임영환
    • 만화애니메이션 연구 / Serial No. 32 / pp. 363-381 / 2013
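  • This paper captures, frame by frame, only the important regions of a learning video that need to be relearned when offline learning content is reviewed on the mobile web. Each captured frame is stored as an image together with the elapsed learning time in the video, along with a memo containing explanatory notes. Because only the regions the learner needs are captured for relearning, a learner-centered, customized system can be applied. The capture program captures frames in the order the user selects, regardless of the original learning order, so user-centered, storytelling-style learning can be applied. Compared with studying the entire content, the capture system saves learning time, and its learner-centered frame reconstruction contributes positively to learning effectiveness through customized learning.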

Stereo Vision-Based 3D Pose Estimation of Product Labels for Bin Picking

  • 우다야 위제나야카;최성인;박순용
    • 제어로봇시스템학회논문지 / Vol. 22, No. 1 / pp. 8-16 / 2016
  • In the field of computer vision and robotics, bin picking is an important application area in which object pose estimation is necessary. Different approaches, such as 2D feature tracking and 3D surface reconstruction, have been introduced to estimate the object pose accurately. We propose a new approach where we can use both 2D image features and 3D surface information to identify the target object and estimate its pose accurately. First, we introduce a label detection technique using Maximally Stable Extremal Regions (MSERs) where the label detection results are used to identify the target objects separately. Then, the 2D image features on the detected label areas are utilized to generate 3D surface information. Finally, we calculate the 3D position and the orientation of the target objects using the information of the 3D surface.

3D Spatial Interaction Method using Visual Dynamics and Meaning Production of Character

  • Lim, Sooyeon
    • International journal of advanced smart convergence / Vol. 7, No. 3 / pp. 130-139 / 2018
  • This study analyzes the relationship between characters and human meaning production through research on character visualization artworks, and uses the results to develop a creative platform that visually expresses the formative and semantic dynamics of characters. The proposed 3D spatial interaction system using character visualization transforms characters in real time through interaction with the user and deconstruction of the character structure. Transformations of characters that reflect the intentions of the viewers provide a dynamic visual representation and maximize the efficiency of meaning transfer by producing various related meanings. The dynamic deconstruction and reconstruction of characters provided by this system creates special shapes that viewers could not have imagined before and further extends the range of interpretation of the characters' meaning. Therefore, the proposed system not only induces an active viewing attitude in viewers but also gives them an opportunity to enjoy the artwork and demonstrate creativity as creators. The system induces new gestures from the viewer in real time by transforming characters in accordance with the viewer's gestures, and it has the feature of exchanging emotions with viewers.

An Efficient Spatial Error Concealment Technique Using Adaptive Edge-Oriented Interpolation

  • 박선규;김원기;정제창
    • 한국통신학회논문지 / Vol. 32, No. 5C / pp. 487-495 / 2007
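  • When compressed video is transmitted over an error-prone channel, an error concealment technique is needed to maintain consistent picture quality. In this paper, we propose a spatial error concealment technique for block-based video compression. When a slice-level error occurs in the spatial domain, the proposed technique uses the boundary pixels of the blocks above and below the erroneous block to find the fine edge directions of the upper and lower outer regions of the damaged block, and partially restores each of those regions. For the inner region that remains unrestored, it uses the neighboring boundary pixels to find the overall edge direction within the block and adaptively produces the final restored image, which makes it a two-stage boundary matching technique. The technique not only has lower complexity than existing error concealment methods but also restores edge continuity well, yielding restored images that are superior both objectively and subjectively.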
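
The core idea of edge-oriented concealment, estimating an edge direction from the intact boundary pixels above and below the lost region and then interpolating along it, can be sketched in a simplified single-stage form. This is an illustration under stated assumptions, not the two-stage method of the paper: it estimates one global direction per lost band by boundary matching, and the names `best_shift`, `conceal`, and `max_shift` are hypothetical. It assumes `max_shift` is smaller than the row length.

```python
import math

def best_shift(top, bottom, max_shift=2):
    """Horizontal shift d that best matches top[x] with bottom[x + d]."""
    n = len(top)

    def cost(d):
        diffs = [abs(top[x] - bottom[x + d]) for x in range(n) if 0 <= x + d < n]
        return sum(diffs) / len(diffs)

    # Ties are broken in favour of the smallest displacement.
    return min(range(-max_shift, max_shift + 1), key=lambda d: (cost(d), abs(d)))

def _clamp_round(v, n):
    """Round half up, then clamp to a valid column index."""
    return min(max(int(math.floor(v + 0.5)), 0), n - 1)

def conceal(top, bottom, height, max_shift=2):
    """Fill `height` lost rows lying between the boundary rows `top` and `bottom`."""
    n = len(top)
    d = best_shift(top, bottom, max_shift)
    rows = []
    for r in range(1, height + 1):
        t = r / (height + 1)  # 0 at the top boundary, 1 at the bottom boundary
        row = []
        for x in range(n):
            # The edge line through (x, r) meets the top row at x - t*d
            # and the bottom row at x + (1 - t)*d.
            xt = _clamp_round(x - t * d, n)
            xb = _clamp_round(x + (1 - t) * d, n)
            row.append((1 - t) * top[xt] + t * bottom[xb])
        rows.append(row)
    return rows
```

With `d = 0` this reduces to ordinary vertical linear interpolation; a non-zero `d` keeps a slanted edge from being smeared into a staircase.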