• Title/Summary/Keyword: Time-frequency image

Search Result 508, Processing Time 0.035 seconds

Comprehensive analysis of deep learning-based target classifiers in small and imbalanced active sonar datasets (소량 및 불균형 능동소나 데이터세트에 대한 딥러닝 기반 표적식별기의 종합적인 분석)

  • Geunhwan Kim;Youngsang Hwang;Sungjin Shin;Juho Kim;Soobok Hwang;Youngmin Choo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.4
    • /
    • pp.329-344
    • /
    • 2023
  • In this study, we comprehensively analyze the generalization performance of various deep learning-based active sonar target classifiers when applied to small and imbalanced active sonar datasets. To generate the active sonar datasets, we use data from two different oceanic experiments conducted at different times and ocean. Each sample in the active sonar datasets is a time-frequency domain image, which is extracted from audio signal of contact after the detection process. For the comprehensive analysis, we utilize 22 Convolutional Neural Networks (CNN) models. Two datasets are used as train/validation datasets and test datasets, alternatively. To calculate the variance in the output of the target classifiers, the train/validation/test datasets are repeated 10 times. Hyperparameters for training are optimized using Bayesian optimization. The results demonstrate that shallow CNN models show superior robustness and generalization performance compared to most of deep CNN models. The results from this paper can serve as a valuable reference for future research directions in deep learning-based active sonar target classification.

Spontaneous Speech Emotion Recognition Based On Spectrogram With Convolutional Neural Network (CNN 기반 스펙트로그램을 이용한 자유발화 음성감정인식)

  • Guiyoung Son;Soonil Kwon
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.6
    • /
    • pp.284-290
    • /
    • 2024
  • Speech emotion recognition (SER) is a technique that is used to analyze the speaker's voice patterns, including vibration, intensity, and tone, to determine their emotional state. There has been an increase in interest in artificial intelligence (AI) techniques, which are now widely used in medicine, education, industry, and the military. Nevertheless, existing researchers have attained impressive results by utilizing acted-out speech from skilled actors in a controlled environment for various scenarios. In particular, there is a mismatch between acted and spontaneous speech since acted speech includes more explicit emotional expressions than spontaneous speech. For this reason, spontaneous speech-emotion recognition remains a challenging task. This paper aims to conduct emotion recognition and improve performance using spontaneous speech data. To this end, we implement deep learning-based speech emotion recognition using the VGG (Visual Geometry Group) after converting 1-dimensional audio signals into a 2-dimensional spectrogram image. The experimental evaluations are performed on the Korean spontaneous emotional speech database from AI-Hub, consisting of 7 emotions, i.e., joy, love, anger, fear, sadness, surprise, and neutral. As a result, we achieved an average accuracy of 83.5% and 73.0% for adults and young people using a time-frequency 2-dimension spectrogram, respectively. In conclusion, our findings demonstrated that the suggested framework outperformed current state-of-the-art techniques for spontaneous speech and showed a promising performance despite the difficulty in quantifying spontaneous speech emotional expression.

Personalized Exhibition Booth Recommendation Methodology Using Sequential Association Rule (순차 연관 규칙을 이용한 개인화된 전시 부스 추천 방법)

  • Moon, Hyun-Sil;Jung, Min-Kyu;Kim, Jae-Kyeong;Kim, Hyea-Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.195-211
    • /
    • 2010
  • An exhibition is defined as market events for specific duration to present exhibitors' main product range to either business or private visitors, and it also plays a key role as effective marketing channels. Especially, as the effect of the opinions of the visitors after the exhibition impacts directly on sales or the image of companies, exhibition organizers must consider various needs of visitors. To meet needs of visitors, ubiquitous technologies have been applied in some exhibitions. However, despite of the development of the ubiquitous technologies, their services cannot always reflect visitors' preferences as they only generate information when visitors request. As a result, they have reached their limit to meet needs of visitors, which consequently might lead them to loss of marketing opportunity. Recommendation systems can be the right type to overcome these limitations. They can recommend the booths to coincide with visitors' preferences, so that they help visitors who are in difficulty for choices in exhibition environment. One of the most successful and widely used technologies for building recommender systems is called Collaborative Filtering. Traditional recommender systems, however, only use neighbors' evaluations or behaviors for a personalized prediction. Therefore, they can not reflect visitors' dynamic preference, and also lack of accuracy in exhibition environment. Although there is much useful information to infer visitors' preference in ubiquitous environment (e.g., visitors' current location, booth visit path, and so on), they use only limited information for recommendation. In this study, we propose a booth recommendation methodology using Sequential Association Rule which considers the sequence of visiting. Recent studies of Sequential Association Rule use the constraints to improve the performance. However, since traditional Sequential Association Rule considers the whole rules to recommendation, they have a scalability problem when they are adapted to a large exhibition scale. To solve this problem, our methodology composes the confidence database before recommendation process. To compose the confidence database, we first search preceding rules which have the frequency above threshold. Next, we compute the confidences of each preceding rules to each booth which is not contained in preceding rules. Therefore, the confidence database has two kinds of information which are preceding rules and their confidence to each booth. In recommendation process, we just generate preceding rules of the target visitors based on the records of the visits, and recommend booths according to the confidence database. Throughout these steps, we expect reduction of time spent on recommendation process. To evaluate proposed methodology, we use real booth visit records which are collected by RFID technology in IT exhibition. Booth visit records also contain the visit sequence of each visitor. We compare the performance of proposed methodology with traditional Collaborative Filtering system. As a result, our proposed methodology generally shows higher performance than traditional Collaborative Filtering. We can also see some features of it in experimental results. First, it shows the highest performance at one booth recommendation. It detects preceding rules with some portions of visitors. Therefore, if there is a visitor who moved with very a different pattern compared to the whole visitors, it cannot give a correct recommendation for him/her even though we increase the number of recommendation. Trained by the whole visitors, it cannot correctly give recommendation to visitors who have a unique path. Second, the performance of general recommendation systems increase as time expands. However, our methodology shows higher performance with limited information like one or two time periods. Therefore, not only can it recommend even if there is not much information of the target visitors' booth visit records, but also it uses only small amount of information in recommendation process. We expect that it can give real?time recommendations in exhibition environment. Overall, our methodology shows higher performance ability than traditional Collaborative Filtering systems, we expect it could be applied in booth recommendation system to satisfy visitors in exhibition environment.

A Study on the Dietary Behavior and Image and Preference of Japanese Foods of University Students in Daegu and Kyungbuk Area (대구, 경북지역 대학생의 식사행동 및 일본음식에 대한 인상 및 기호도 조사 연구)

  • 한재숙;이연정;최석현;최수근;권상용;최영희
    • Journal of the East Asian Society of Dietary Life
    • /
    • v.14 no.1
    • /
    • pp.1-10
    • /
    • 2004
  • This study was conducted to investigate the dietary behavior and image and preference of Japanese foods. The Subjects were consisted of 570 university students(243 males and 327 females) in Daegu and Kyungbuk area, Korea. The students responses to the 10 questions about image of Japanese foods were also measured on 5 point Likert scale. Data were presented by using frequency, percentage, chi-square test and T-test. The results of this study were as follows: (1) On the eating habits, 'the whole family has breakfast together with same foods everyday'scored high as 42.3% and 'foods put in a big platter by gathering everyday'as 35.8%. (2) About the eating customs, 53.5% of the subjects responded that the seat was fixed at meal time, 56.4% didn't start to eat before the patriarch started a meal and 30.9% responded that the head of a family had more foods in number and quantity. (3) On the table manners, 13.4% of the subjects were scolded about 'watching TV on eating', 11.5% about 'making left-over foods', 8.0% about 'misuse of spoon and chopsticks'. (4) The preferred ethnic foods by University students was in other of Korean, Chinese, Italian, Japanese and French foods. (5) Among subjects, 93.8% had no experience of visiting Japan and 92.6% wanted to visit Japan. Images on the Japanese foods were 'the price is too expensive' (mean 4.15) and 'the decoration is wonderful'(mean 4.05). But the subjects did not think Japanese foods as 'hot'(mean 2.21) and 'greasy'(mean 2.51). (6) The favorite Japanese food of subjects was Udon(mean 3.98), Sushi(mean 3.85) and Tempura(mean 3.69). So Udon turned out to be the most popular Japanese foods by university students in Daegu and Kyungbuk area, Korea. But they did not prefer Natto(mean 2.68), Ochazuke(mean 2.76), Okonomiyaki(mean 2.87) and Misosiru and did not eat. From the above results, Korean university students preferred Udon to Natto among Japanese traditional foods, and they estimated Japanese foods as 'too expensive'. Therefore, lowering the price and developing the cooking method for Korean taste were needed to increase the intake of Japanese traditional foods by Korean university students and.

  • PDF

An Analysis of the Cognition of Professionals Regarding the Validity of Planting Design Change that Occurred in the Landscape Construction of a Major Private Company (민간기업 조경공사에서 나타나는 식재설계 변경 타당성에 대한 전문가 인식 분석)

  • Park, Jae-Young;Cho, Se-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.42 no.6
    • /
    • pp.101-110
    • /
    • 2014
  • This study analyzes the validity of the type classification of the type and design changes of apartment landscaping planting construction design changes that were completed in the private sector, efficiently manages the design changes that are displayed over landscaping planting work in general in the future, and performs research by placing the object underlying the presentation. The results are as follows. First, the percentage that occurred in the planting construction of design changes that have occurred in the apartment landscaping construction was carried out in the private sector and accounted for 61.8%. This indicates that part of the planting is a major design change. Second, as the cause of such a design change to be those associated with the field conditions such as lack of main construction period. In particular, due to a change in oral, appeared 7-48 times design changes of one review design change approval is complex, design changes of planting construction had shown a feature that occurs in multiple simultaneous. Third, the 7 types of Design Changes in planting design were delineated as 'design changes for consideration of the user', 'design changes for image improvement', 'design changes for ease of maintenance', 'design changes due to the mismatch of design statement', 'design changes due to the relationship with the engineering species of other', 'design changes due to lack of field study', and 'design changes due to the consideration of feasibility.' Fourth, 'design changes for consideration of the user' and 'design changes for image improvement' were found in more than half of the frequency of the overall changes. This differed from the results shown in public corporations. Fifth, if planting construction design change process, private companies, it was found that is showing the approval of the practice after the previous construction of the construction cost savings due to construction time. However, in the case of a public corporation, these exhibited a different aspect from the private sector and show a design change procedure that reflects the changes after the design change events in the field have occurred. The above results, the type of landscaping works in planting design change of public enterprises, regardless of the private sector, is the same in the seven types, the main reason of and procedures for design changes, indicating that there are other respects. In design change, it may be desirable to apply becomes liquidity rationality and efficiency of the dimension, depending on the nature of the landscape construction.

Development of deep learning network based low-quality image enhancement techniques for improving foreign object detection performance (이물 객체 탐지 성능 개선을 위한 딥러닝 네트워크 기반 저품질 영상 개선 기법 개발)

  • Ki-Yeol Eom;Byeong-Seok Min
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.99-107
    • /
    • 2024
  • Along with economic growth and industrial development, there is an increasing demand for various electronic components and device production of semiconductor, SMT component, and electrical battery products. However, these products may contain foreign substances coming from manufacturing process such as iron, aluminum, plastic and so on, which could lead to serious problems or malfunctioning of the product, and fire on the electric vehicle. To solve these problems, it is necessary to determine whether there are foreign materials inside the product, and may tests have been done by means of non-destructive testing methodology such as ultrasound ot X-ray. Nevertheless, there are technical challenges and limitation in acquiring X-ray images and determining the presence of foreign materials. In particular Small-sized or low-density foreign materials may not be visible even when X-ray equipment is used, and noise can also make it difficult to detect foreign objects. Moreover, in order to meet the manufacturing speed requirement, the x-ray acquisition time should be reduced, which can result in the very low signal- to-noise ratio(SNR) lowering the foreign material detection accuracy. Therefore, in this paper, we propose a five-step approach to overcome the limitations of low resolution, which make it challenging to detect foreign substances. Firstly, global contrast of X-ray images are increased through histogram stretching methodology. Second, to strengthen the high frequency signal and local contrast, we applied local contrast enhancement technique. Third, to improve the edge clearness, Unsharp masking is applied to enhance edges, making objects more visible. Forth, the super-resolution method of the Residual Dense Block (RDB) is used for noise reduction and image enhancement. Last, the Yolov5 algorithm is employed to train and detect foreign objects after learning. Using the proposed method in this study, experimental results show an improvement of more than 10% in performance metrics such as precision compared to low-density images.

Integrated Rotary Genetic Analysis Microsystem for Influenza A Virus Detection

  • Jung, Jae Hwan;Park, Byung Hyun;Choi, Seok Jin;Seo, Tae Seok
    • Proceedings of the Korean Vacuum Society Conference
    • /
    • 2013.08a
    • /
    • pp.88-89
    • /
    • 2013
  • A variety of influenza A viruses from animal hosts are continuously prevalent throughout the world which cause human epidemics resulting millions of human infections and enormous industrial and economic damages. Thus, early diagnosis of such pathogen is of paramount importance for biomedical examination and public healthcare screening. To approach this issue, here we propose a fully integrated Rotary genetic analysis system, called Rotary Genetic Analyzer, for on-site detection of influenza A viruses with high speed. The Rotary Genetic Analyzer is made up of four parts including a disposable microchip, a servo motor for precise and high rate spinning of the chip, thermal blocks for temperature control, and a miniaturized optical fluorescence detector as shown Fig. 1. A thermal block made from duralumin is integrated with a film heater at the bottom and a resistance temperature detector (RTD) in the middle. For the efficient performance of RT-PCR, three thermal blocks are placed on the Rotary stage and the temperature of each block is corresponded to the thermal cycling, namely $95^{\circ}C$ (denature), $58^{\circ}C$ (annealing), and $72^{\circ}C$ (extension). Rotary RT-PCR was performed to amplify the target gene which was monitored by an optical fluorescent detector above the extension block. A disposable microdevice (10 cm diameter) consists of a solid-phase extraction based sample pretreatment unit, bead chamber, and 4 ${\mu}L$ of the PCR chamber as shown Fig. 2. The microchip is fabricated using a patterned polycarbonate (PC) sheet with 1 mm thickness and a PC film with 130 ${\mu}m$ thickness, which layers are thermally bonded at $138^{\circ}C$ using acetone vapour. Silicatreated microglass beads with 150~212 ${\mu}L$ diameter are introduced into the sample pretreatment chambers and held in place by weir structure for construction of solid-phase extraction system. Fig. 3 shows strobed images of sequential loading of three samples. Three samples were loaded into the reservoir simultaneously (Fig. 3A), then the influenza A H3N2 viral RNA sample was loaded at 5000 RPM for 10 sec (Fig. 3B). Washing buffer was followed at 5000 RPM for 5 min (Fig. 3C), and angular frequency was decreased to 100 RPM for siphon priming of PCR cocktail to the channel as shown in Figure 3D. Finally the PCR cocktail was loaded to the bead chamber at 2000 RPM for 10 sec, and then RPM was increased up to 5000 RPM for 1 min to obtain the as much as PCR cocktail containing the RNA template (Fig. 3E). In this system, the wastes from RNA samples and washing buffer were transported to the waste chamber, which is fully filled to the chamber with precise optimization. Then, the PCR cocktail was able to transport to the PCR chamber. Fig. 3F shows the final image of the sample pretreatment. PCR cocktail containing RNA template is successfully isolated from waste. To detect the influenza A H3N2 virus, the purified RNA with PCR cocktail in the PCR chamber was amplified by using performed the RNA capture on the proposed microdevice. The fluorescence images were described in Figure 4A at the 0, 40 cycles. The fluorescence signal (40 cycle) was drastically increased confirming the influenza A H3N2 virus. The real-time profiles were successfully obtained using the optical fluorescence detector as shown in Figure 4B. The Rotary PCR and off-chip PCR were compared with same amount of influenza A H3N2 virus. The Ct value of Rotary PCR was smaller than the off-chip PCR without contamination. The whole process of the sample pretreatment and RT-PCR could be accomplished in 30 min on the fully integrated Rotary Genetic Analyzer system. We have demonstrated a fully integrated and portable Rotary Genetic Analyzer for detection of the gene expression of influenza A virus, which has 'Sample-in-answer-out' capability including sample pretreatment, rotary amplification, and optical detection. Target gene amplification was real-time monitored using the integrated Rotary Genetic Analyzer system.

  • PDF

The Location and Landscape Composition of Yowol-pavilion Garden Interpreted from Tablet & Poetry (편액과 시문으로 본 요월정원림(邀月亭園林)의 입지 및 조영 해석)

  • Lee, Hyun-Woo;Kim, Sang-Wook;Ren, Qin-Hong
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.32 no.3
    • /
    • pp.32-45
    • /
    • 2014
  • The study attempts to interpret original location and landscape composition of Yowol-pavilion Garden under the premise that tablet and poetry are important criteria for inference of unique location and landscape composition of a pavilion garden. The study raises the meaning, status, and value of Yowol Pavilion Garden as a cultural asset. The results of the study are as follows. First, Yowol-pavilion Garden was a place where famous Confucius scholars in Joseon Dynasty in 16th Century, including Kim, Kyung-Woo, the owner of the garden, used to share the taste for the arts and poetries with their colleagues. Along with a main characteristic of Yowol Pavilion Garden as a hideout for the Confucius scholars who stayed away from a political turmoil, the new place characteristic of the garden, a bridgehead for the formation of regional identity, was discovered in the record of "Joseon-Hwanyeo-Seungram Honam-Eupji JangSeong-Eupji", As described in "The first creative poetry of Yowol-pavilion", the intention for the creation of Yowol-pavilion Garden and the motive for its landscape composition is interpreted as a space of rivalry where the world, reality and ideals are mixed up. Second, related to outstanding scenic factors and natural phenomena when taking a view from the pavilion, the name of the house 'Yowol', which means 'Greeting the moon rising on the Mt. Wolbong' is the provision of nature and taste for the arts, and is directly connected to the image of leaving the worldly. In other words, the name was identified to be the one that reflected the intention for landscape composition to follow the provision of nature separating from joy and sorrow of the mundane world. Third, as for the location, it was confirmed through "YeongGwang-Soksu-Yeoji-Seungram" that Yowol-pavilion Garden was a place where the person who made the pavilion prepared for relaxation after stepping down from a government post, and literature and various poetry show that it was also a place of outstanding scenic where Yellow-dragon River meandered facing Mt. Wolbong. Especially, according to an interview with a keeper, the visual perception frequency of the nightscape of Yowol-pavilion Garden is the highest when viewing by considering the east, the direction of Yellow-dragon River, as Suksigak[normal angle's view], towards Yowel-pavilion from the keeper's house. In addition, he said that the most beautiful landscape with high perception strength is when the moon came up from the left side of Yowol-pavilion, cuts across the Lagerstroemia india heal in front of Yowol-pavilion, and crosses the meridian between Mt. Wolbong peaks facing Yowol-pavilion. Currently, the exposure of Yowol-pavilion Garden is $SE\;141.2^{\circ}$, which is almost facing southeast. It is assumed that the exposure of Yowol-pavilion Garden was determined considering the optimized direction for appreciating the trace of the moon and the intention of securing the visibility as well as topographic conditions. Furthermore, it is presumed that the exposure of Yowol-pavilion Garden was determined so that the moon is reflected on the water of Yellow-dragon River and the moon and its reflection form a symmetry. Fourth, currently, Yowol-pavilion Garden is divided into 'inner garden sphere' composed of Yowol-pavilion, meeting place of the clan and administration building, and 'outer garden sphere' which is inclusive of entrance space, Crape Myrtle Community Garden and Pine Tree Forest in the back. Further, Yowol-pavilion Garden has been deteriorated as the edge was expanded to 'Small lake[Yong-so] and Gardens of aquatic plants sphere' and recently-created 'Yellow-dragon Pavilion and park sphere'. Fifth, at the time it was first made, Yowol-pavilion Garden was borderless gardens consisting of mountains and water taking a method of occupying a specific space of nearby nature centering around pavilion by embracing landscape viewed from the pavilion, but interpreted current complex landscapes are identified to be entirely different from landscapes of the original due to 'Different Changes', 'Fragmentation' and 'Apart piece' in many parts. Lastly, considering that Yowol-pavilion Garden belongs to the Cultural Properties Protection Zone, though not the restoration to the landscapes of the original described in tablet and literature record, at least taking a measure from the aspect of land use for minimizing adverse effect on landscape and visual damage is required.