• Title/Summary/Keyword: performance video

A Study on the Effect of Physical Upward and Downward Movement Experience on Psychological Judgements (신체의 상향·하향 이동경험이 심리적 판단에 미치는 영향에 관한 연구)

  • Lee, Luri;Lee, Seung-yon;Chung, Hyun Jung
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.8 no.4 / pp.183-196 / 2018
  • Since the late 2000s, attention has turned to studies that take the view that behavior shapes human thoughts and minds, just as thoughts and minds shape behavior. A physical experience evokes a metaphorically connected abstract concept, which ultimately affects the judgment or evaluation of a particular object. However, studies carried out so far have been limited to differences in perception and judgment depending on the objects viewed, touched, or carried. In this study, we examined whether physical movement of the body in the upward or downward direction affects psychological judgment differently. In the first experiment, pairs of words considered to be metaphorically connected were tested. In the second experiment, the subjects tried to solve a complicated calculation problem in a short time, then watched a video related to upward or downward movement, after which their psychological judgment was measured. As a result, 'downward movement' of the body was found to have a metaphorical connection with 'closure', while 'upward movement' is related to 'progress'. Compared with the upward-experience group, the downward-experience group showed lower intention to reverse their own decisions, higher confidence in those decisions, and higher expectations for performance.

Detecting Vehicles That Are Illegally Driving on Road Shoulders Using Faster R-CNN (Faster R-CNN을 이용한 갓길 차로 위반 차량 검출)

  • Go, MyungJin;Park, Minju;Yeo, Jiho
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.21 no.1 / pp.105-122 / 2022
  • According to statistics on fatal crashes on expressways over the last 5 years, fatalities on road shoulders have been 3 times as high as other expressway fatalities. This suggests that crashes on road shoulders tend to be fatal, and that it is important to prevent them by cracking down on vehicles intruding onto the shoulder. Therefore, this study proposes a method to detect vehicles that violate the shoulder lane using Faster R-CNN. Vehicles were detected with Faster R-CNN, and an additional reading module was configured to determine whether a shoulder violation had occurred. For experiments and evaluation, GTAV, a simulation game that can reproduce situations similar to the real world, was used. 1,800 training images and 800 evaluation images were generated and processed, and performance was measured for ZFNet and VGG16 as the threshold value changed. As a result, the detection rate was 99.2% for ZFNet at a threshold of 0.8 and 93.9% for VGG16 at a threshold of 0.7, and the average detection time was 0.0468 seconds for ZFNet and 0.16 seconds for VGG16, so the detection rate of ZFNet was about 7% higher. The speed was also confirmed to be about 3.4 times faster. These results show that even a relatively uncomplicated network can detect a vehicle that violates the shoulder lane at high speed without pre-processing the input image. They suggest that this algorithm can be used to detect violations of designated lanes if sufficient training datasets based on actual video data are obtained.
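
The abstract does not describe how the additional reading module decides that a detected vehicle is on the shoulder. The following is a minimal sketch of one possible decision rule, not the authors' method. It assumes the detector returns boxes with confidence scores and that the shoulder is given as an image-space polygon; the `Detection` type, `shoulder_polygon`, and the rule of testing the bottom-center of each box are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    x1: float
    y1: float
    x2: float
    y2: float
    score: float  # detector confidence in [0, 1]


def point_in_polygon(x, y, polygon):
    """Ray-casting test: True if (x, y) lies inside the polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        xa, ya = polygon[i]
        xb, yb = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (ya > y) != (yb > y):
            x_cross = xa + (y - ya) * (xb - xa) / (yb - ya)
            if x < x_cross:
                inside = not inside
    return inside


def shoulder_violations(detections, shoulder_polygon, threshold=0.8):
    """Keep detections above the score threshold whose bottom-center
    point (an assumed proxy for the road contact point) is in the shoulder."""
    violations = []
    for det in detections:
        if det.score < threshold:
            continue
        cx = (det.x1 + det.x2) / 2.0
        cy = det.y2
        if point_in_polygon(cx, cy, shoulder_polygon):
            violations.append(det)
    return violations


# Hypothetical shoulder region: a strip along the left edge of the image.
shoulder = [(0, 0), (100, 0), (100, 400), (0, 400)]
detections = [
    Detection(20, 100, 80, 200, 0.95),    # on the shoulder, confident
    Detection(150, 100, 250, 200, 0.99),  # in a regular lane
    Detection(10, 50, 90, 150, 0.50),     # on the shoulder, below threshold
]
flagged = shoulder_violations(detections, shoulder, threshold=0.8)
```

Raising or lowering `threshold` trades missed vehicles against false alarms, which is the kind of trade-off the reported threshold sweep (0.7 and 0.8) explores.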

A Study on the Development of an Indoor Positioning Support System for Providing Landmark Information (랜드마크 정보 제공을 위한 실내위치측위 지원 시스템 구축에 관한 연구)

  • Ock-Woo NAM;Chang-Soo SHIN;Yun-Soo CHOI
    • Journal of the Korean Association of Geographic Information Studies / v.26 no.4 / pp.130-144 / 2023
  • Recently, various positioning technologies based on signal-based and image-based positioning have been researched to obtain accurate indoor location information. Among these, various studies are being conducted on image positioning technology, which determines the location of a mobile terminal using images acquired through cameras and sensor data collected as needed. Video-based positioning determines indoor location by matching mobile terminal photos with virtual landmark images, and for this purpose it is necessary to build indoor spatial information about various landmarks such as billboards, vending machines, and ATMs. To construct this indoor spatial information, panoramic images in the form of a road view and accurate 3D survey results were obtained through c 13 buildings of the Electronics and Telecommunications Research Institute (ETRI). When the final 3D total station results were compared with the terrestrial lidar panoramic image coordinates, the coordinate and distance accuracy was within about 0.10 m, confirming that accurate landmark construction for use in indoor positioning was possible. By using these terrestrial lidar results to perform the 3D landmark modeling needed for image positioning, it was possible to more quickly model landmark information that could not be constructed through 3D modeling from existing as-built drawings alone.

One-shot multi-speaker text-to-speech using RawNet3 speaker representation (RawNet3를 통해 추출한 화자 특성 기반 원샷 다화자 음성합성 시스템)

  • Sohee Han;Jisub Um;Hoirin Kim
    • Phonetics and Speech Sciences / v.16 no.1 / pp.67-76 / 2024
  • Recent advances in text-to-speech (TTS) technology have significantly improved the quality of synthesized speech, reaching a level where it can closely imitate natural human speech. In particular, TTS models offering various voice characteristics and personalized speech are widely used in fields such as artificial intelligence (AI) tutors, advertising, and video dubbing. Accordingly, in this paper, we propose a one-shot multi-speaker TTS system that can ensure acoustic diversity and synthesize personalized voices by generating speech from the utterances of unseen target speakers. The proposed model integrates a speaker encoder into a TTS model consisting of the FastSpeech2 acoustic model and the HiFi-GAN vocoder. The speaker encoder, based on the pre-trained RawNet3, extracts speaker-specific voice features. Furthermore, the proposed approach includes not only an English one-shot multi-speaker TTS but also a Korean one-shot multi-speaker TTS. We evaluate the naturalness and speaker similarity of the generated speech using objective and subjective metrics. In the subjective evaluation, the proposed Korean one-shot multi-speaker TTS obtained a naturalness mean opinion score (NMOS) of 3.36 and a similarity MOS (SMOS) of 3.16. The objective evaluation of the proposed English and Korean one-shot multi-speaker TTS showed a prediction MOS (P-MOS) of 2.54 and 3.74, respectively. These results indicate that the proposed model improves on the baseline models in terms of both naturalness and speaker similarity.

Development of a Slope Condition Analysis System using IoT Sensors and AI Camera (IoT 센서와 AI 카메라를 융합한 급경사지 상태 분석 시스템 개발)

  • Seungjoo Lee;Kiyen Jeong;Taehoon Lee;YoungSeok Kim
    • Journal of the Korean Geosynthetics Society / v.23 no.2 / pp.43-52 / 2024
  • Recent abnormal climate conditions have increased the risk of slope collapses, which frequently result in significant loss of life and property due to the absence of early prediction and warning dissemination. In this paper, we develop a slope condition analysis system that uses IoT sensors and an AI-based camera to assess the condition of slopes. To develop the system, we designed hardware and firmware for measurement sensors that take the ground conditions of slopes into account, designed AI-based image analysis algorithms, and developed prediction and warning solutions and systems. We aimed to minimize errors in sensor data by integrating IoT sensor data with AI camera image analysis, ultimately enhancing the reliability of the data. Additionally, we evaluated the accuracy (reliability) of the system by applying it to actual slopes. As a result, sensor measurement errors were maintained within 0.1°, and the data transmission rate exceeded 95%. Moreover, the AI-based image analysis system demonstrated nighttime partial recognition rates of over 99%, indicating excellent performance even in low-light conditions. Through this research, it is anticipated that slope condition analysis and smart maintenance management can be applied across various fields of Social Overhead Capital (SOC) facilities.

An Application-Specific and Adaptive Power Management Technique for Portable Systems (휴대장치를 위한 응용프로그램 특성에 따른 적응형 전력관리 기법)

  • Egger, Bernhard;Lee, Jae-Jin;Shin, Heon-Shik
    • Journal of KIISE:Computer Systems and Theory / v.34 no.8 / pp.367-376 / 2007
  • In this paper, we introduce an application-specific and adaptive power management technique for portable systems that support dynamic voltage scaling (DVS). We exploit both the idle time of multitasking systems running soft real-time tasks and memory- or CPU-bound code regions. Detailed power and execution time profiles guide an adaptive power manager (APM) that is linked to the operating system. A post-pass optimizer marks candidate regions for DVS by inserting calls to the APM. At runtime, the APM monitors the CPU's performance counters to dynamically determine the affinity of each marked region. For each region, the APM computes the optimal voltage and frequency setting in terms of energy consumption and switches the CPU to that setting during the execution of the region. Idle time is exploited by monitoring system idle time and switching to the most energy-efficient setting that does not prolong execution. We show that our method is most effective for periodic workloads such as video or audio decoding. We have implemented our method in a multitasking operating system (Microsoft Windows CE) running on an Intel XScale processor. We achieved total system power savings of up to 9% over the standard power management policy, which puts the CPU in a low-power mode during idle periods.
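
The per-region choice of a voltage and frequency setting can be illustrated with a small sketch. This is not the paper's APM. It assumes a simple model in which the CPU-bound part of a region scales inversely with frequency while memory stall time stays constant, and in which energy is average power times execution time. The `Setting` values and the deadline are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Setting:
    freq_mhz: float
    power_mw: float  # hypothetical average CPU power at this setting


def region_time_ms(cpu_ms_at_fmax, mem_ms, setting, fmax_mhz):
    """Assumed model: CPU time stretches as frequency drops,
    memory stall time does not change."""
    return cpu_ms_at_fmax * (fmax_mhz / setting.freq_mhz) + mem_ms


def best_setting(cpu_ms_at_fmax, mem_ms, settings, deadline_ms=None):
    """Return (setting, energy) with minimum energy among settings that
    meet the deadline, or (None, inf) if none does."""
    fmax = max(s.freq_mhz for s in settings)
    best, best_energy = None, float("inf")
    for s in settings:
        t = region_time_ms(cpu_ms_at_fmax, mem_ms, s, fmax)
        if deadline_ms is not None and t > deadline_ms:
            continue
        energy = s.power_mw * t  # mW * ms = microjoules
        if energy < best_energy:
            best, best_energy = s, energy
    return best, best_energy


# Hypothetical operating points, not measured XScale values.
SETTINGS = [Setting(200.0, 100.0), Setting(400.0, 250.0), Setting(600.0, 500.0)]

# Memory-bound region: 2 ms of CPU work at full speed, 8 ms of stalls.
mem_bound = best_setting(2.0, 8.0, SETTINGS, deadline_ms=15.0)
# CPU-bound region: 9 ms of CPU work at full speed, 1 ms of stalls.
cpu_bound = best_setting(9.0, 1.0, SETTINGS, deadline_ms=15.0)
```

Under these assumed numbers, the memory-bound region can run at the lowest frequency within the 15 ms deadline, while the CPU-bound region needs the middle setting. That asymmetry is why distinguishing memory-bound from CPU-bound regions matters for DVS.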

4-way Search Window for Improving The Memory Bandwidth of High-performance 2D PE Architecture in H.264 Motion Estimation (H.264 움직임추정에서 고속 2D PE 아키텍처의 메모리대역폭 개선을 위한 4-방향 검색윈도우)

  • Ko, Byung-Soo;Kong, Jin-Hyeung
    • Journal of the Institute of Electronics Engineers of Korea SD / v.46 no.6 / pp.6-15 / 2009
  • In this paper, a new 4-way search window is designed for the high-performance 2D PE architecture in H.264 Motion Estimation (ME) to improve the memory bandwidth. While existing 2D PE architectures reuse the overlapped data of adjacent search windows scanned in 1 or 3 ways, the new window uses the overlapped data of adjacent search windows as well as of adjacent multiple scanning (window) paths to increase the reuse of retrieved search window data. To scan adjacent windows and multiple paths instead of using single raster or zigzag scanning of adjacent windows, bidirectional row and column window scanning is used, which results in the 4-way (up, down, left, right) search window. The proposed 4-way search window improves the reuse of overlapped window data and reduces the redundancy access factor to 3.1, whereas the 1/3-way search window redundantly requires $7.7{\sim}11$ times the data retrieval. Thus, the new 4-way search window scheme improves the memory bandwidth by $70{\sim}58\%$ compared with the 1/3-way search window. The 2D PE architecture in H.264 ME for the 4-way search window consists of a $16{\times}16$ PE array, which computes the absolute difference between current and reference frames, and a $5{\times}16$ reuse array, which stores the overlapped data of adjacent search windows and multiple scanning paths. The reference data can be loaded upward or downward into the new 2D PE array depending on the scanning direction, and the reuse array is combined with the PE array, rotating left as well as right, to use the overlapped data of adjacent multiple scan paths. In experiments, the new implementation of the 4-way search window on Magnachip 0.18 μm technology could process HD ($1280{\times}720$) video with 1 reference frame, a $48{\times}48$ search area and a $16{\times}16$ macroblock at 30 fps at 149.25 MHz.
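
The operation that the PE array accelerates, matching a macroblock against candidate positions by summed absolute differences (SAD), can be sketched in software. The code below is a plain full-search block matcher, not the 4-way search window architecture. Block size, search range, and the synthetic frames are illustrative assumptions. Note that for a 16×16 macroblock in a 48×48 search area there are (48 − 16 + 1)² = 1089 candidate positions, and neighbouring candidates read largely the same reference pixels, which is the overlap that data reuse schemes target.

```python
def sad(cur, ref, bx, by, rx, ry, n):
    """Sum of absolute differences between the n x n block of `cur`
    at (bx, by) and the n x n block of `ref` at (rx, ry)."""
    total = 0
    for dy in range(n):
        for dx in range(n):
            total += abs(cur[by + dy][bx + dx] - ref[ry + dy][rx + dx])
    return total


def full_search(cur, ref, bx, by, n, search_range):
    """Try every displacement within +/- search_range and return
    (motion_vector, cost) for the candidate with the lowest SAD."""
    height, width = len(ref), len(ref[0])
    best_mv, best_cost = (0, 0), None
    for my in range(-search_range, search_range + 1):
        for mx in range(-search_range, search_range + 1):
            rx, ry = bx + mx, by + my
            # Skip candidates that fall outside the reference frame.
            if rx < 0 or ry < 0 or rx + n > width or ry + n > height:
                continue
            cost = sad(cur, ref, bx, by, rx, ry, n)
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (mx, my), cost
    return best_mv, best_cost


# Synthetic 12 x 12 reference frame with a non-repeating pattern.
SIZE = 12
ref_frame = [[x * 7 + y * 13 for x in range(SIZE)] for y in range(SIZE)]
# Current frame: the reference content displaced by (2, 1), zero-padded.
cur_frame = [
    [ref_frame[y + 1][x + 2] if y + 1 < SIZE and x + 2 < SIZE else 0
     for x in range(SIZE)]
    for y in range(SIZE)
]
motion_vector, cost = full_search(cur_frame, ref_frame, bx=4, by=4, n=4, search_range=2)
```

In this toy example the search recovers the displacement (2, 1) with a SAD of 0. A hardware PE array computes the per-pixel absolute differences of one candidate in parallel rather than in nested loops.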

The post-epic characteristics in Jan Lauwers' theatre - Isabella's Room, The Lobster Shop, and Deer House - (얀 라우어스(Jan Lauwers) 공연의 탈서사적 특징들 -<이사벨라의 방(Isabella's Room)>, <랍스터 가게(The Lobster Shop)>, <사슴의 집(Deer House)>을 중심으로-)

  • Nam, Jisoo
    • Journal of Korean Theatre Studies Association / no.48 / pp.447-484 / 2012
  • This study analyzes the characteristics of post-epic theatre in the Belgian theatre director Jan Lauwers' trilogy "Happy Face/Sad Face": Isabella's Room (2004), The Lobster Shop (2006) and Deer House (2008). I regard the trilogy as a very important juncture at which he created his own theatrical style, distinct from his earlier years. From this period, Lauwers has tried to write original plays that concentrate on the story of our era, combining a variety of media such as dance, installation, video and singing. In this context, I study his theatricality from the three perspectives of dramaturgy, directing and acting, largely based on the theory of post-epic theatre of Hans-Thies Lehmann, who pointed out the significance of Lauwers' leading theatrical role very early. First, from the dramaturgical point of view, we need to pay attention to the theme of translunary death, in which the living and the dead coexist on the stage. In fact, death is a theme that Lauwers has long struggled to explore. In his trilogy, the dead never exit the stage. The dead, who are not representative tragic characters, even meddle in the affairs of the living and offer comments to people. As a consequence, dramaturgical tension is reduced, suspense is diminished, and a kind of humanism emerges. This approach helps to create his unique comical theatrical atmosphere even though he deals with contemporary tragic issues such as war, horror and death. Second, from the directing point of view, it is worth looking at the polyphonic strategy of applying various media. Above all, dancing and choral singing are actively used in Lauwers' trilogy. Dance is used in an individual and microscopic way, whereas singing has a collective and macroscopic quality. Dance is the representative medium of Lauwers' simultaneous microscopic mise-en-scene.
While the main plot takes place at center stage, actors dance around the off-center areas of the stage. Instead of exiting during the performance, the actors continue to dance, sometimes in ways closer to plain movement, around the off-center stage. This not only describes the narrative, but also shows how each character is engaged with the main plot or incident and how they regard it as a character. This simultaneous microscopic mise-en-scene serves functions such as showing a variety of moments of life, amplifying certain moments or incidents, revealing a character's emotion, and creating an illusionary theatrical atmosphere. Meanwhile, the singing of simple lyrics and tunes is an example of a medium that stimulates the audience's catharsis. As the simple melody lingers in the audience's mind, it ends up delivering a theatrical message or theme after the performance. The message conveyed by the choral singing functions as a sort of leitmotif that leaves an impression on the audience. This not only enriches their emotion but also creates an illusionary effect. Third, from the acting perspective, I would like to point out the aesthetic of "detachment" that Lehmann has identified. The actors never go deep into the drama, because they consistently acknowledge the theatrical illusion. The audience comes to pay attention to the actors' presence through their deliberate gesture, business, movement, rhythm, language, dance, etc. The actors resist forming a closed action by speaking in various languages, by deliberately revealing stage directions or acts, and by creating an expressive mise-en-scene with multiple media. As a consequence, the stage can be transformed into not a metaphoric but a metonymic place. These actions are ultimately intended to have a direct effect on the audience.
In short, Lauwers uses anti-illusionary theatrical methods: scenes of fantastic death, interruption of singing and dance, speaking in many languages, acting in a state of detachment, and so on. These strategies serve to make cracks in the spectators' desire to construct a linear narrative. I would say that they hold great potential to let reality penetrate through and collide with fiction. In doing so, they induce spectators to see the reality in the fiction. As Lehmann says, "when theatre presents itself as a sketch and not as a finished painting, the spectators are given the chance to feel their own presence, to reflect on it, and to contribute to the unfinished character themselves". In this sense, the spectators can engage in objective criticism of our society and world in Lauwers' theatre, because there are a number of gaps and cracks in his theatrical illusion through which reality can penetrate. This is also the point at which we can find the artist's responsibility in our era.

A Study on Movement Characteristics of Dalgubal Drum Dance (달구벌 북춤 춤사위의 특성에 대한 고찰)

  • Choi, Won-sun
    • (The) Research of the performance art and culture / no.42 / pp.147-181 / 2021
  • Dalgubal drum dance has been passed down in a recreated form based on the traditional drum dance of the Yeongnam region, incorporating regional symbolism and the dance philosophy and artistry of its creator, Young Hwangbo. Popular as a transformation of traditional Korean culture, this dance has been invited not only to venues in the Yeongnam region, including Daegu, but also to various international venues. This study explores the movement characteristics of Dalgubal drum dance and its unique charm and symbolic meaning. A video of the Dalgubal drum dance performed at the 89th Korean Myeongmujeon was analyzed using Laban Movement Analysis (LMA) as the research method. The special features of this dance found by the LMA analysis in the four categories of Body, Effort, Shape, and Space reveal the simple yet cheerful personalities and the strong yet patient character of the people of Daegu. The harmony of drum sounds (music) and movements (dance) creates the varied character of the dance and reveals the beauty and excitement unique to Korean dance. In particular, drum play and its related dance movements create curved linear spatial patterns of arm movements, a Spiral Shape in body posture, and diverse floor patterns occupying the whole stage space. These movements show three-dimensional spatial beauty and the artistic ideas behind the recreation of traditional drum dance, which takes into account the spatial structure of the proscenium stage. In addition, the well-organized structure and harmonious movements of this dance show traditional Korean philosophy, implying heaven, earth, and human beings, wholeness, and the harmony of yin and yang. The dance aims at communication between the audience and the dancers through sharing the excitement and aesthetic beauty of dance. This can be interpreted as a meaningful expression of traditional Korean philosophy developed with the unique value and characteristics of Korean dance.

A Mobile Landmarks Guide : Outdoor Augmented Reality based on LOD and Contextual Device (모바일 랜드마크 가이드 : LOD와 문맥적 장치 기반의 실외 증강현실)

  • Zhao, Bi-Cheng;Rosli, Ahmad Nurzid;Jang, Chol-Hee;Lee, Kee-Sung;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems / v.18 no.1 / pp.1-21 / 2012
  • In recent years, the mobile phone has evolved extremely fast. It is now equipped with a high-quality color display, a high-resolution camera, and real-time accelerated 3D graphics. Other features include a GPS sensor and a digital compass. This evolution significantly helps application developers use the power of smartphones to create a rich environment that offers a wide range of services and exciting possibilities. In outdoor mobile AR research to date, there are many popular location-based AR services, such as Layar and Wikitude. These systems have a major limitation: the AR contents are hardly overlaid on the real target. Another line of research is context-based AR services using image recognition and tracking, in which the AR contents are precisely overlaid on the real target. However, the real-time performance is restricted by the retrieval time, and such services are hard to implement over a large-scale area. In our work, we combine the advantages of location-based AR with those of context-based AR. The system first finds the surrounding landmarks and then performs recognition and tracking on them. The proposed system consists of two major parts: a landmark browsing module and an annotation module. In the landmark browsing module, users can view augmented virtual information (information media), such as text, pictures and video, on their smartphone viewfinder when they point their smartphone at a certain building or landmark. For this, a landmark recognition technique is applied. SURF point-based features are used in the matching process because of their robustness. To ensure that the image retrieval and matching processes are fast enough for real-time tracking, we exploit contextual device (GPS and digital compass) information. This is necessary to select from the database the nearest landmarks in the direction in which the device is pointed. The queried image is matched only against this selected data. Therefore, the matching speed is significantly increased.
The second part is the annotation module. Instead of only viewing the augmented information media, users can create virtual annotations based on linked data. Full knowledge of the landmark is not required. Users can simply look for the appropriate topic by searching the linked data with a keyword. This helps the system find the target URI in order to generate correct AR contents. In addition, in order to recognize target landmarks, images of the selected building or landmark are captured from different angles and distances. This procedure resembles building a connection between the real building and the virtual information that exists in the Linked Open Data. In our experiments, the search range in the database is reduced by clustering images into groups according to their coordinates. A grid-based clustering method and user location information are used to restrict the retrieval range. In existing research using cluster and GPS information, the retrieval time is around 70~80 ms. Experimental results show that our approach reduces the retrieval time to around 18~20 ms on average. Therefore, the total processing time is reduced from 490~540 ms to 438~480 ms. The performance improvement will become more obvious as the database grows. This demonstrates that the proposed system is efficient and robust in many cases.
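
The idea of restricting retrieval with a coordinate grid can be sketched as follows. This is an illustrative assumption about how such a grid might work, not the authors' implementation. The cell size `CELL_DEG`, the image tuples, and the choice of searching the user's cell plus its eight neighbours are all hypothetical.

```python
import math
from collections import defaultdict

CELL_DEG = 0.001  # hypothetical cell size in degrees of latitude/longitude


def cell_of(lat, lon, cell_deg=CELL_DEG):
    """Map a coordinate to the integer index of its grid cell."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


def build_grid(images, cell_deg=CELL_DEG):
    """Group (image_id, lat, lon) records by grid cell."""
    grid = defaultdict(list)
    for image_id, lat, lon in images:
        grid[cell_of(lat, lon, cell_deg)].append(image_id)
    return grid


def candidates(grid, user_lat, user_lon, cell_deg=CELL_DEG):
    """Return image ids in the user's cell and its eight neighbours,
    so only these need to go through feature matching."""
    row, col = cell_of(user_lat, user_lon, cell_deg)
    result = []
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            result.extend(grid.get((row + dr, col + dc), []))
    return result


# Hypothetical landmark images with capture coordinates.
images = [
    ("a", 37.45051, 126.65321),
    ("b", 37.45062, 126.65388),
    ("c", 37.46040, 126.70040),  # far from the user
]
grid = build_grid(images)
nearby = candidates(grid, 37.45055, 126.65350)
```

Because lookup cost depends on the number of images in nearby cells rather than on the size of the whole database, a scheme like this is consistent with the abstract's remark that the benefit becomes more obvious as the database grows.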