• 제목/요약/키워드: scene understanding

검색결과 108건 처리시간 0.025초

Text Extraction from Complex Natural Images

  • Kumar, Manoj;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • 제6권2호
    • /
    • pp.1-5
    • /
    • 2010
  • The rapid growth in communication technology has led to the development of effective ways of sharing ideas and information in the form of speech and images. Understanding this information has become an important research issue and drawn the attention of many researchers. Text in a digital image contains much important information regarding the scene. Detecting and extracting this text is a difficult task and has many challenging issues. The main challenges in extracting text from natural scene images are the variation in the font size, alignment of text, font colors, illumination changes, and reflections in the images. In this paper, we propose a connected component based method to automatically detect the text region in natural images. Since text regions in mages contain mostly repetitions of vertical strokes, we try to find a pattern of closely packed vertical edges. Once the group of edges is found, the neighboring vertical edges are connected to each other. Connected regions whose geometric features lie outside of the valid specifications are considered as outliers and eliminated. The proposed method is more effective than the existing methods for slanted or curved characters. The experimental results are given for the validation of our approach.

노마드적 공간에서 나타나는 유연성에 관한 연구 (A Study on Expression Characteristics of Flexibility in Nomadic Space)

  • 윤주희;김개천
    • 한국실내디자인학회논문집
    • /
    • 제20권3호
    • /
    • pp.119-126
    • /
    • 2011
  • Recently, in the fields of fashion, advertisement, film, literature, philosophy, etc., the word, 'Nomad', is being used frequently across the overall society. The contemporary society is actively incorporating "nomadic thinking" as a new social phenomenon across the boundaries of conventional fields. This is not an exception in the field of space design. This study, via the contemporary nomadic thinking, examined the relationship between space design's application possibility as a new trend and flexible space; then categorized the characteristics of flexible space into flexibility, temporariness, changeability, and correlation; and then analyzed expression characteristics of flexible space. As for unrestricted expression of scene, it was recognized that separation of scene and space leads space to meet the needs of surrounding environment and users; formation of changeable space enables uses of space from various perspectives; and combining external factors (energy, media technologies) with space leads space to self-evolution. Space is perceived as an living organism that is flexibly corresponding, via realistic movement and virtual movement, to the indefinite, diversified thinking of the contemporary society. Therefore, this study illuminates that nomadic thinking has significance as basic thinking to predict development and characteristics of design thinking through understanding the contemporary society with the basic thinking system that has been inherent without restrictions of being fixed to the present, past, and future.

도로교통 영상처리를 위한 고속 영상처리시스템의 하드웨어 구현 (An Onboard Image Processing System for Road Images)

  • 이운근;이준웅;조석빈;고덕화;백광렬
    • 제어로봇시스템학회논문지
    • /
    • 제9권7호
    • /
    • pp.498-506
    • /
    • 2003
  • A computer vision system applied to an intelligent safety vehicle has been required to be worked on a small sized real time special purposed hardware not on a general purposed computer. In addition, the system should have a high reliability even under the adverse road traffic environment. This paper presents a design and an implementation of an onboard hardware system taking into account for high speed image processing to analyze a road traffic scene. The system is mainly composed of two parts: an early processing module of FPGA and a postprocessing module of DSP. The early processing module is designed to extract several image primitives such as the intensity of a gray level image and edge attributes in a real-time Especially, the module is optimized for the Sobel edge operation. The postprocessing module of DSP utilizes the image features from the early processing module for making image understanding or image analysis of a road traffic scene. The performance of the proposed system is evaluated by an experiment of a lane-related information extraction. The experiment shows the successful results of image processing speed of twenty-five frames of 320$\times$240 pixels per second.

Collective Interaction Filtering Approach for Detection of Group in Diverse Crowded Scenes

  • Wong, Pei Voon;Mustapha, Norwati;Affendey, Lilly Suriani;Khalid, Fatimah
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권2호
    • /
    • pp.912-928
    • /
    • 2019
  • Crowd behavior analysis research has revealed a central role in helping people to find safety hazards or crime optimistic forecast. Thus, it is significant in the future video surveillance systems. Recently, the growing demand for safety monitoring has changed the awareness of video surveillance studies from analysis of individuals behavior to group behavior. Group detection is the process before crowd behavior analysis, which separates scene of individuals in a crowd into respective groups by understanding their complex relations. Most existing studies on group detection are scene-specific. Crowds with various densities, structures, and occlusion of each other are the challenges for group detection in diverse crowded scenes. Therefore, we propose a group detection approach called Collective Interaction Filtering to discover people motion interaction from trajectories. This approach is able to deduce people interaction with the Expectation-Maximization algorithm. The Collective Interaction Filtering approach accurately identifies groups by clustering trajectories in crowds with various densities, structures and occlusion of each other. It also tackles grouping consistency between frames. Experiments on the CUHK Crowd Dataset demonstrate that approach used in this study achieves better than previous methods which leads to latest results.

영화 시네마 천국의 테마음악 기능분석 (Movie 'Cinema Paradiso' theme music function analysis)

  • 임주희
    • 한국산학기술학회논문지
    • /
    • 제19권12호
    • /
    • pp.561-568
    • /
    • 2018
  • 본 논문은 이탈리아 출신의 영화음악가 엔니오 모리코네(Ennio Morricone)의 '시네마천국(Cinema Paradiso)'에 사용된 영화음악을 아론 코플랜드(Aron Copland)가 제시한 다섯 가지의 기능론과 현대의 대표적 영화음악의 기능 두 가지를 중심으로 분석한 논문이다. 영화 장면에 따른 각각의 테마음악을 변주시키는 변주법과 기능론을 제시함으로써 영화음악을 전공하는 학생들에게 음악의 테마 변주 테크닉과 장면에 따른 기능론을 습득시키기는데 도움이 될 것으로 판단된다. 엔니오 모리코네는 1960년대 <황야의 무법자>등의 서부영화에서 탁월한 능력을 보이고 1980년대에 들어 <미션(Mission)>, <시네마 천국(Cinema Paradiso)>등에서 전성기를 맞았다고 평가되어진다. 또한 그의 작품들은 테마음악의 변주를 통하여 주인공 인물의 내적 표현과 장면의 효과를 극대화시키는 등 영화음악의 기능론을 다루기에 매우 효과적이다. 특히 '시네마 천국'에서는 테마음악의 다양한 변주를 통하여 영화의 일관성과 다양성을 동시에 추구하였다. 그리고 특정한 분위기를 강조하여 장면의 연속성과 영화의 진행감을 주고자 하였다. 본 연구를 통해 엔니오 모리코네의 '시네마 천국'의 음악을 연구 분석하여 영화음악의 테마 변주법 연구에 의미를 두고자 한다.

상담 장면에서의 명리의 활용에 대한 국내 연구 동향 분석 (National Research Trends Regarding Use of the Four Pillars of Destiny in the Counseling Realm)

  • 홍성규;곽희용;김종우;정선용
    • 동의신경정신과학회지
    • /
    • 제31권4호
    • /
    • pp.289-299
    • /
    • 2020
  • Objectives: The aim of this study is to investigate current research trends of Four Pillars of Destiny and verify its values and potential in the counselling scene, as the Four Pillars of Destiny's territory has been expanding to counselling, medical and psychiatric realm nowadays. Methods: The studies were searched from psychotherapy to general consultation, directly or indirectly related to counseling and Four Pillars of Destiny. Twenty-one published research studies were selected for analysis. The studies were categorized into 7 groups, meta-analysis, comparison with other personality tests, user's trend analysis, utilization in job counseling, disease prediction study, utilization in treatment counseling, and use in Korean medicine. Results: The selected studies attempted to expand Four Pillars of Destiny's usage through combination with other fields such as artificial intelligence, Korean medicine, and personality test. Furthermore by analyzing Four Pillars of Destiny itself to extract its key elements in counseling, such as therapeutic counseling factors and occupational counseling factors. Conclusions: At present, there are no standard use of Four Pillars of Destiny in counseling scene, for no large-scale research has been conducted or completed on this subject. This current status quo leads this paper to end up just understanding the counseling factors and possibilities of Four Pillars of Destiny rather than its psychological theory and clinical effect. However, this research trend analysis will be helpful in preparing future studies investigating Four Pillars of Destiny's counseling effect, application in the counseling scene and its psychological theory. Also, further studies, including confirmation of the theory through the operational definition, prospective research, control study, statistical technique are required in order to evaluate Four Pillars of Destiny's psychological theory and its effects to verify its use in clinical scenes.

농촌 농특산품 전시판매시설 디자인 소비자 의식 분석 및 디자인 개발 - 농촌관광마을을 중심으로 - (An Analysis on Consumers' Awareness of a Rural Specialties Exhibition Shop and the Design Development : Focusing on Rural Tourism Village)

  • 진혜련;서지예;조록환
    • 농촌계획
    • /
    • 제20권4호
    • /
    • pp.253-262
    • /
    • 2014
  • This, an association research for design-improvement and model-development of exhibition shops at rural tourism communities, is to secure objective data by analyzing customers' awareness-tendency of and demand for agricultural-specialty exhibition shops. Survey-questions for finding out consumers' awareness-tendency and demand were determined through brainstorming of a professional council, 30 rural communities of which visit-rate by consumers is considerably high were selected for the recruit of 200 consumers. For investigation and analysis, survey and in-depth interview were carried out at the scene with the application of frequency analysis and summarization of their opinions, which revealed that they have a strong will to visit the rural tourism communities for the purchase of agricultural specialties along with the experience of learning-program and on-the-scene direct dealing and that their viewpoint on the direct dealing at the scene was very positive. Also it was confirmed hat their satisfaction with the purchase of agricultural specialties by on-the-scene direct dealing, their pleasure at the purchase, their satisfaction with services and their intention for re-purchase of them were very high while their satisfaction with the exhibition shops was very low. With on-the-scene survey, the consumers' opinions could be listened to in depth. Almost all of them said their satisfaction with the trip to those rural tourism communities was considerably high since they could go to those communities themselves to relieve the stress from their modern life, to experience healing and to see the goods on the scene. Their satisfaction also was attributed to the fact that they have enough trust in purchase along with feeling the warm-heartedness of rural residents. As to their awareness of exhibition shops, they showed a positive response to the on-the-scene direct dealing at rural communities while they, thinking that the space in those exhibition shops was not sufficiently wide, demanded for more systematic counters in more accessible and affordable exhibition shops so that they might be more satisfied with the exhibition shops. Their demand for the necessity of exhibition shops selling agricultural specialties was found to be over 80%, which indicates that the necessity is very high. As to the suitability of function, they have the opinion that the business at those shops had better be focused on sales since they have the understanding of information when they take a trip to the rural communities, while there was another opinion: since agricultural products are seasonal items they should be exhibited and sold at the same time. More than 90% of the respondents had a positive viewpoint on direct dealing of agricultural specialties on the scene, which showed that their response to it was very high. They preferred the permanent shops equipped with roll-around table-booths. In addition, it was revealed that they want systematic exhibition shops in rural communities because they frequent those communities for on-the-scene direct purchase. The preferred type and opinion resulting from estimation of consumers' demands have been reflected for development of practical designs. The structure of variable principles has been designed so that the types of display-case and table-booth might be created. The result of this study is a positive data as a design model which can be utilized at rural communities and will be commercialized for the verification of its validity.

디지털영화의 플랑세캉스 사용에 관하여 (About the Use of Plan-Sequence in Digital Films)

  • 이지현
    • 트랜스-
    • /
    • 제3권
    • /
    • pp.1-28
    • /
    • 2017
  • 영화미학의 영역이 확장됨에 따라 고전적 연출의 미장센 개념 또한 그 영역을 확장하게 되었다. 본고는 영화 미장센의 개념을 공간적인 연관성을 통해 들여다보면서, 이를 프레임과 쇼트의 차원, 신과 시퀀스의 연속성 개념, 그리고 미학적 차원에서의 플랑세캉스 개념에 이르기까지 세부적 연관성을 통해 들여다보려 한다. 이 과정에서 디지털영화에서 플랑세캉스 사용에 관한 현재적 의미를 찾을 수 있을 것이라 기대한다. 현대영화에 이르러 미장센의 요소는 효율적인 측면에서 더 강조되는 측면이 있지만, 본고는 여전히 고전주의적 미학 체계가 여전히 중요하다는 점을 이르고 있다. 이를 위해 과거 리얼리즘 미학의 최고 테크닉으로 불렸던 플랑세캉스(plan-séquence) 개념이 현대의 디지털영화에서 어떻게 활용되고 있는지를 살핀다. 다양한 미장센의 영역 중에서도 특히 플랑세캉스의 '길게 찍기(long take)'는 사건을 준비하고 응용하는 과정에서 유용하게 활용되는 사실주의의 도구로 쓰인다. 영화 자체가 상업적으로 활용되는 현실의 영역에서, 디지털영화가 플랑세캉스 개념을 통해 '지정학적 무의식'의 알레고리를 활용할 수 있는 개념적 도구임을 본고는 말하고 있다. 이 과정에서 플랑세캉스가 이루고자 하는 거시적 목표가 '시각적 매개 변수'가 아닌, '지속시간'과 관련한 특정 의도로 보는 편이 더 적합하다는 것을 알게 된다. 이때 지속시간 개념이 크로노스(Chronos)가 아닌, 카이로스(kairos)로서의 '주관적이고 의식적인 시간'이라는 점도 본 연구 과정에서 드러나는 성과라 할 수 있다.

  • PDF

Social Pedestrian Group Detection Based on Spatiotemporal-oriented Energy for Crowd Video Understanding

  • Huang, Shaonian;Huang, Dongjun;Khuhroa, Mansoor Ahmed
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권8호
    • /
    • pp.3769-3789
    • /
    • 2018
  • Social pedestrian groups are the basic elements that constitute a crowd; therefore, detection of such groups is scientifically important for modeling social behavior, as well as practically useful for crowd video understanding. A social group refers to a cluster of members who tend to keep similar motion state for a sustained period of time. One of the main challenges of social group detection arises from the complex dynamic variations of crowd patterns. Therefore, most works model dynamic groups to analysis the crowd behavior, ignoring the existence of stationary groups in crowd scene. However, in this paper, we propose a novel unified framework for detecting social pedestrian groups in crowd videos, including dynamic and stationary pedestrian groups, based on spatiotemporal-oriented energy measurements. Dynamic pedestrian groups are hierarchically clustered based on energy flow similarities and trajectory motion correlations between the atomic groups extracted from principal spatiotemporal-oriented energies. Furthermore, the probability distribution of static spatiotemporal-oriented energies is modeled to detect stationary pedestrian groups. Extensive experiments on challenging datasets demonstrate that our method can achieve superior results for social pedestrian group detection and crowd video classification.

Fight Detection in Hockey Videos using Deep Network

  • Mukherjee, Subham;Saini, Rajkumar;Kumar, Pradeep;Roy, Partha Pratim;Dogra, Debi Prosad;Kim, Byung-Gyu
    • Journal of Multimedia Information System
    • /
    • 제4권4호
    • /
    • pp.225-232
    • /
    • 2017
  • Understanding actions in videos is an important task. It helps in finding the anomalies present in videos such as fights. Detection of fights becomes more crucial when it comes to sports. This paper focuses on finding fight scenes in Hockey sport videos using blur & radon transform and convolutional neural networks (CNNs). First, the local motion within the video frames has been extracted using blur information. Next, fast fourier and radon transform have been applied on the local motion. The video frames with fight scene have been identified using transfer learning with the help of pre-trained deep learning model VGG-Net. Finally, a comparison of the methodology has been performed using feed forward neural networks. Accuracies of 56.00% and 75.00% have been achieved using feed forward neural network and VGG16-Net, respectively.