• 제목/요약/키워드: Visual model

검색결과 2,038건 처리시간 0.027초

멀티모달 맥락정보 융합에 기초한 다중 물체 목표 시각적 탐색 이동 (Multi-Object Goal Visual Navigation Based on Multimodal Context Fusion)

  • 최정현;김인철
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제12권9호
    • /
    • pp.407-418
    • /
    • 2023
  • MultiOn(Multi-Object Goal Visual Navigation)은 에이전트가 미지의 실내 환경 내 임의의 위치에 놓인 다수의 목표 물체들을 미리 정해준 일정한 순서에 따라 찾아가야 하는 매우 어려운 시각적 탐색 이동 작업이다. MultiOn 작업을 위한 기존의 모델들은 행동 선택을 위해 시각적 외관 지도나 목표 지도와 같은 단일 맥락 지도만을 이용할 뿐, 다양한 멀티모달 맥락정보에 관한 종합적인 관점을 활용할 수 없다는 한계성을 가지고 있다. 이와 같은 한계성을 극복하기 위해, 본 논문에서는 MultiOn 작업을 위한 새로운 심층 신경망 기반의 에이전트 모델인 MCFMO(Multimodal Context Fusion for MultiOn tasks)를 제안한다. 제안 모델에서는 입력 영상의 시각적 외관 특징외에 환경 물체의 의미적 특징, 목표 물체 특징도 함께 포함한 멀티모달 맥락 지도를 행동 선택에 이용한다. 또한, 제안 모델은 점-단위 합성곱 신경망 모듈을 이용하여 3가지 서로 이질적인 맥락 특징들을 효과적으로 융합한다. 이 밖에도 제안 모델은 효율적인 이동 정책 학습을 유도하기 위해, 목표 물체의 관측 여부와 방향, 그리고 거리를 예측하는 보조 작업 학습 모듈을 추가로 채용한다. 본 논문에서는 Habitat-Matterport3D 시뮬레이션 환경과 장면 데이터 집합을 이용한 다양한 정량 및 정성 실험들을 통해, 제안 모델의 우수성을 확인하였다.

초등학생의 시력건강행위 영향요인 (Factors Related to Visual Health Promotion Behavior of Elementary School Aged Children)

  • 김정숙;오진주
    • 지역사회간호학회지
    • /
    • 제12권1호
    • /
    • pp.142-149
    • /
    • 2001
  • The health education for elementary school students is a very important factor in the development of adult health practices. Particularly, eyesight is difficult to recover if lost. Therefore, prevention is better than cure. This study was conducted to investigate the factors that affect the visual health behavior of elementary school students and to furnish basic materials and directions for the promotion of elementary school health. The investigation was carried out for 4 days from 9. 18. 2000 to 9. 21. 2000 for 199 children in 3 elementary schools. A questionnaire was composed of 3 questions about general property. 20 questions about visual health behavior. 7 questions about visual self-efficacy. 5 questions about visual motivation. 16 questions about self-conception. 20 questions about the health locus of control. The data was analysed by an SAS program for t-test. ANOVA. correlation, and multiple regression tests. The results are as follows. 1. The visual health behavior of elementary school children was good (average 52.53). 2. For visual health behavior, school, year, and sex were influential factors. economic levels were not. 3. Visual health behavior had a significant correlation with visual self-efficacy, visual health motives and self-conception. but not with the locus of control. 4. In the multiple regression test, visual self-efficacy and self-conception were significant prediction factors -- the suitability of the regression model was 30.8%. Suggestions from the results are as follows: First, school year and sex had a significant influence on visual health behavior: therefore, it is necessary to consider these two factors when education programs are developed. Second, this study was carried out for students in a partial area only. Therefore, repeated studies for a large sample are necessary for the future.

  • PDF

시각 시스템 모델을 이용한 Subband 코딩 (On Using the Human Visual System Model for Subband Coding)

  • 박용철;김근숙;차일환;윤대희
    • 대한전자공학회논문지
    • /
    • 제27권6호
    • /
    • pp.937-943
    • /
    • 1990
  • In this paper, a subband coding scheme using the human visual system(HVS) model for encoding monochrome images is proposed to produce perceptually higher quality images compared with the regular subband coding scheme. The proposed approach first transforms the intensity image to the density image by a point nonlinear transformation. A frequency band dexomposition of the density image is carried out by means of 2-D seaprable quadrature mirror filters, which split the density image spectrum into 16 equall rate subbands. Bits are allocated among the subbands to minimize the weighted mean squar error (WMSE) for differential pulse code modulation(DPCM) coding of the subbands. The weight for each subband is calculated from the modulation transfer function (MTF) of the HVS model at corresponding frequencies. The performances of the proposed approach are evaluated for 256 * 256 monochrome images at the bit rates of 0.5, 0.75 and 1.0 bita per pixel. Computer simulation results indicate that using the HVS model yields more pleasing reconstructed images than regular subband coding approach which does not use HVS model.

  • PDF

깊이 센서를 이용한 능동형태모델 기반의 객체 추적 방법 (Active Shape Model-based Object Tracking using Depth Sensor)

  • 정훈조;이동은
    • 디지털산업정보학회논문지
    • /
    • 제9권1호
    • /
    • pp.141-150
    • /
    • 2013
  • This study proposes technology using Active Shape Model to track the object separating it by depth-sensors. Unlike the common visual camera, the depth-sensor is not affected by the intensity of illumination, and therefore a more robust object can be extracted. The proposed algorithm removes the horizontal component from the information of the initial depth map and separates the object using the vertical component. In addition, it is also a more efficient morphology, and labeling to perform image correction and object extraction. By applying Active Shape Model to the information of an extracted object, it can track the object more robustly. Active Shape Model has a robust feature-to-object occlusion phenomenon. In comparison to visual camera-based object tracking algorithms, the proposed technology, using the existing depth of the sensor, is more efficient and robust at object tracking. Experimental results, show that the proposed ASM-based algorithm using depth sensor can robustly track objects in real-time.

Implementation of Low-cost Autonomous Car for Lane Recognition and Keeping based on Deep Neural Network model

  • Song, Mi-Hwa
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제13권1호
    • /
    • pp.210-218
    • /
    • 2021
  • CNN (Convolutional Neural Network), a type of deep learning algorithm, is a type of artificial neural network used to analyze visual images. In deep learning, it is classified as a deep neural network and is most commonly used for visual image analysis. Accordingly, an AI autonomous driving model was constructed through real-time image processing, and a crosswalk image of a road was used as an obstacle. In this paper, we proposed a low-cost model that can actually implement autonomous driving based on the CNN model. The most well-known deep neural network technique for autonomous driving is investigated and an end-to-end model is applied. In particular, it was shown that training and self-driving on a simulated road is possible through a practical approach to realizing lane detection and keeping.

자동차 시뮬레이터의 가상환경 구성에 대한 연구 (Construction of Virtual Environment for a Vehicle Simulator)

  • 장재원;손권;최경현
    • 한국자동차공학회논문집
    • /
    • 제8권4호
    • /
    • pp.158-168
    • /
    • 2000
  • Vehicle driving simulators can provide engineers with benefits on the development and modification of vehicle models. One of the most important factors to realistic simulations is the fidelity given by a motion system and a real-time visual image generation system. Virtual reality technology has been widely used to achieve high fidelity. In this paper the virtual environment including a visual system like a head-mounted display is developed for a vehicle driving simulator system by employing the virtual reality technique. virtual vehicle and environment models are constructed using the object-oriented analysis and design approach. Accordint to the object model a three dimensional graphic model is developed with CAD tools such as Rhino and Pro/E. For the real-time image generation the optimized IRIS Performer 3D graphics library is embedded with the multi-thread methodology. Compared with the single loop apprach the proposed methodology yields an acceptable image generation speed 20 frames/sec for the simulator.

  • PDF

Visual MINTEQ모델을 이용한 인산염의 용해에 미치는 석회석의 영향 규명 (Explanation of the Effect of Limestone on the Dissolution of a Phosphate with the Visual MINTEQ Model)

  • 김학성;정연태
    • 한국물환경학회지
    • /
    • 제24권3호
    • /
    • pp.285-290
    • /
    • 2008
  • This study was done to explain the role of limestone which might intervene in the phosphorus cycle in a lake. The effects of limestone on the dissolution of phosphate were estimated by simulations with the computer model Visual MINTEQ, which is designed for the chemical equilibrium calculations. According to the calculations limestone shows remarkable effects for the suppression of phosphate dissolution. The limestone can suppress the dissolution of phosphates by sacrificing themselves to acids, and as a consequence can increase the hardness and alkalinity of the lake. Both hardness and alkalinity play an important role in reducing soluble P and thus alleviate the eutrophication potential.

가상현실을 이용한 실시간 차량 그래픽 주행 시뮬레이터 (A Real-Time Graphic Driving Simulator Using Virtual Reality Technique)

  • 장재원;손권;최경현;송남용
    • 한국정밀공학회지
    • /
    • 제17권7호
    • /
    • pp.80-89
    • /
    • 2000
  • Driving simulators provide engineers with a power tool in the development and modification stages of vehicle models. One of the most important factors to realistic simulations is the fidelity obtained by a motion bed and a real-time visual image generation algorithm. Virtual reality technology has been widely used to enhance the fidelity of vehicle simulators. This paper develops the virtual environment for such visual system as head-mounted display for a vehicle driving simulator. Virtual vehicle and environment models are constructed using the object-oriented analysis and design approach. Based on the object model, a three-dimensional graphic model is completed with CAD tools such as Rhino and Pro/ENGINEER. For real-time image generation, the optimized IRIS Performer 3D graphics library is embedded with the multi-thread methodology. The developed software for a virtual driving simulator offers an effective interface to virtual reality devices.

  • PDF

감성정보검색을 위한 지식베이스 구축방법 (The Method to Build Knowledge-Base for User's Preference Retrieval)

  • 김돈한
    • 한국감성과학회:학술대회논문집
    • /
    • 한국감성과학회 2008년도 추계학술대회
    • /
    • pp.5-8
    • /
    • 2008
  • This study proposed the Knowledge Base Building method reflecting the user's preferences based on the fuzzy set theory to develop information contents which support pedestrian's navigation. This research evaluated subject's preferences on the commercial spaces set to the hypothetical destination. Also it surveyed the causal relationship between the visual characteristics and the emotional characteristics to propose the methods of Navigation Knowledge Base (NKB). The NKB was composed by three elements; 1.the correlation model between emotional characteristics, 2.the causal relationship between visual characteristics and emotional characteristics, 3.the transformation model between visual characteristics and the physical characteristics.

  • PDF

Visual Basic을 이용한 강뼈대 구조물의 비선형 해석 (Nonlinear Analysis of Steel Frames Using Visual Basic)

  • 윤영조;김선희;이종석
    • 한국전산구조공학회:학술대회논문집
    • /
    • 한국전산구조공학회 1999년도 가을 학술발표회 논문집
    • /
    • pp.403-410
    • /
    • 1999
  • General1y, H-section is used for columns and beams in the middle and low steel building, But it has a strong and weak axis. Thus if H-section is used for columns, the structure needs reinforcement on the weak axis. Therefore recently, square holler section(S.H.S) is used for columns because it is able to coiler the vulnerability of H-section. Structural analysis is usually executed under the assumption that connections are either ideally pinned joint or fully rigid joint. Actually all connections are semi-rigid which possess a rotational stiffness. Therefore it can be designed economically as using the property of connections which has a rotational stiffness. This paper presents a prediction model curve which is fitted Kishi-Chen power Model about the behavior of connection between H-beam and S.H.S column. Non-linear analysis program was considered the non-linearity of semi-rigid connection and the geometrical non-linearity under the effect of axial force. It was programed by FORTRAN90 and Visual Basic.

  • PDF