• Title/Summary/Keyword: Deep Features

Search Result 1,096, Processing Time 0.028 seconds

Recognition of Overlapped Sound and Influence Analysis Based on Wideband Spectrogram and Deep Neural Networks (광역 스펙트로그램과 심층신경망에 기반한 중첩된 소리의 인식과 영향 분석)

  • Kim, Young Eon;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.23 no.3
    • /
    • pp.421-430
    • /
    • 2018
  • Many voice recognition systems use methods such as MFCC, HMM to acknowledge human voice. This recognition method is designed to analyze only a targeted sound which normally appears between a human and a device one. However, the recognition capability is limited when there is a group sound formed with diversity in wider frequency range such as dog barking and indoor sounds. The frequency of overlapped sound resides in a wide range, up to 20KHz, which is higher than a voice. This paper proposes the new recognition method which provides wider frequency range by conjugating the Wideband Sound Spectrogram and the Keras Sequential Model based on DNN. The wideband sound spectrogram is adopted to analyze and verify diverse sounds from wide frequency range as it is designed to extract features and also classify as explained. The KSM is employed for the pattern recognition using extracted features from the WSS to improve sound recognition quality. The experiment verified that the proposed WSS and KSM excellently classified the targeted sound among noisy environment; overlapped sounds such as dog barking and indoor sounds. Furthermore, the paper shows a stage by stage analyzation and comparison of the factors' influences on the recognition and its characteristics according to various levels of noise.

The semantic structure of the Russian humor in the works of Michael Zadornov (자도르노프 작품 속에 나라난 러시아 유머의 의미군조)

  • 안병팔
    • Lingua Humanitatis
    • /
    • v.6
    • /
    • pp.321-357
    • /
    • 2004
  • In this article the structure of modern Russian humor is analyzed on the basis of some theories: bi-sociation theory (Koestler 1964), semantic script theory of verbal humor, using the concept of semantic presupposition, pragmatic felicity condition (Searle 1969; Levinson 1983) and grammatical rules (Chomsky 1965). Up to now the listed former theories were not examined and less analyzed by the semantic structure in the study of the structure of Russian humor(HcaeBa 1969; 3 $a_{OPHOB}$ 1991; 1992). Kreps (1981), who analyzed the works of Zoschenko, presented 21 types of humor, using the term 'humoreme'(Kpenc 1981, 36-37). These types are the list of the available means of humor that work not in the base of semantic criteria, but in the base of means of literary rhetoric. Kreps presented types of humor means, such as contradiction, antonymic substitution, macaronic speech and correlation of humoremes in the various types of humor. Apart from Kreps, Manakov (MaHaKOB 1986, 61-79) also studied these problems. He also set the system of the basic types of humor. Manakov introduced the linguistic means of humor of some Russian writers: Gogol, Tchechov. The means that Manakov showed with detailed examples, are trope, epithet, comic comparison, comic metaphor, comic periphrasis, euphemism, pun, zeugma, comic toponym, comic onomatopoeia, mania of foreign vocabulary, folk etymology, dialect etc. But these studies don't explain why these means make the works humorous. An, B.p tried to answer this question (안병팔 1997 a; b). An B.p. explains contexts of humor through the Release theory, the Superiority theory and the Incongruity theory. An, B.p. explained the process of deviation from the grammatical norms through morpho-syntactic and lexical means. But in these studies the humor was not analyzed by the semantic criteria. In order to linguistically evaluate various means of humor formation, it is necessary to elicit its deep structure, which makes it possible to research the formation and interpretation of humor. For this purpose this article, being based on the Incongruity theory, defined the structure of humor as negation of presupposition. Of course the former traditional studies also well shared the concept of 'contradiction' and 'contrast' of humor structure, but they didn't explain the structure by semantic differential features. This study, analyzing the works of' Zadornov, M., tried to note that through the negation of semantic presupposition the structure of contradiction is formed with semantic differential features on the semantic, syntactic or lexical dimensions.

  • PDF

Visualization of Korean Speech Based on the Distance of Acoustic Features (음성특징의 거리에 기반한 한국어 발음의 시각화)

  • Pok, Gou-Chol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.3
    • /
    • pp.197-205
    • /
    • 2020
  • Korean language has the characteristics that the pronunciation of phoneme units such as vowels and consonants are fixed and the pronunciation associated with a notation does not change, so that foreign learners can approach rather easily Korean language. However, when one pronounces words, phrases, or sentences, the pronunciation changes in a manner of a wide variation and complexity at the boundaries of syllables, and the association of notation and pronunciation does not hold any more. Consequently, it is very difficult for foreign learners to study Korean standard pronunciations. Despite these difficulties, it is believed that systematic analysis of pronunciation errors for Korean words is possible according to the advantageous observations that the relationship between Korean notations and pronunciations can be described as a set of firm rules without exceptions unlike other languages including English. In this paper, we propose a visualization framework which shows the differences between standard pronunciations and erratic ones as quantitative measures on the computer screen. Previous researches only show color representation and 3D graphics of speech properties, or an animated view of changing shapes of lips and mouth cavity. Moreover, the features used in the analysis are only point data such as the average of a speech range. In this study, we propose a method which can directly use the time-series data instead of using summary or distorted data. This was realized by using the deep learning-based technique which combines Self-organizing map, variational autoencoder model, and Markov model, and we achieved a superior performance enhancement compared to the method using the point-based data.

Geomorphic Features of ${\check{O}}rumkol$(Frozen Valley) Area (Kyungnam Province, South Korea) - Mainly about Talus - (경남 밀양 얼음골 일대의 지형적 특성 -Talus를 중심으로-)

  • Jeon, Young-Gweon
    • Journal of the Korean association of regional geographers
    • /
    • v.3 no.1
    • /
    • pp.165-182
    • /
    • 1997
  • The aim of this paper is to clarify geomorphic features on talus within ${\check{O}}rumkol$ and the origin of ${\check{O}}rumkol$. ${\check{O}}rumkol$ is located in Milyang of Kyungnam province, in South Korea. ${\check{O}}rumkol$ is good area to study talus. because it is characterized by following three geomorphic landscapes : free face surrounding ${\check{O}}rumkol$ ; ${\check{O}}rumkol$ with deep and wide valley floor ; lots of taluses typically developing within ${\check{O}}rumkol$. The main results can be summarized as follows: 1) The origin of ${\check{O}}rumkol$ may be suggested two assumptions : one is that its origin have been resulted from intrusion structure(intrusive rock might capture less resistant rock as tuff) ; the other is that its origin have been resulted from volcanic depression after intrusion or eruption. But these assumptions are not obvious. therefore more geological evidences will be supplemented after this 2) The characteristics of ${\check{O}}rumkol$ talus (1) Pattern ${\check{O}}rumkol$ taluses are tongue-shaped or cone-shaped in appearance. They are $50{\sim}200m$ in length and the range of the maximum width from 25 to 115m and one of their mean slope gradient from 32 to $36^{\circ}$ (2) Origin ${\check{O}}rumkol$ taluses have been formed under periglacial environment in the last glacial age and they are classified into rock fall talus type, considering in conjunction with the shape, hardness, sorting, weathering conditions of constituent debris. (3) The stage of landform development ${\check{O}}rumkol$ talus slope profiles are mainly concave slope. This concave slope type was eventually caused by talus creep at the lower end of the talus. That means new additions of debris from the free face have virtually ceased and there is no evidence of recent motion in the deposit. Now it is predominant that vegetation cover is gradually increasingly. Therefore ${\check{O}}rumkol$ taluses appear to be relict form stage. at present.

  • PDF

Feature of Intertextuality Environmental Arts -Focusing on Feature of fantasy post-place, speciality of place as well as temporal-spatial expression method- (상호텍스트적인 환경예술의 특성 -환상성.탈 장소성, 장소의 특수성과 시공간 표현방법에 대한 특성을 중심으로-)

  • Jang, Il-Young;Kim, Jin-Seon
    • Archives of design research
    • /
    • v.18 no.3 s.61
    • /
    • pp.63-74
    • /
    • 2005
  • Modern society is diversified society and is under complicated situation as the boundary of each area has been disappeared. To understand and accept such complicated situation as widely as possible, it is required to understand interaction. of receiver with intertextual environmental arts as the structure of open text. This study examined interaction of environmental arts in terms of intertextual feature based on experience of receiver on combined element of different space and time, combination of genres. This is the concept of meaning personal experience or situation as receiver participates the process of completing art works, and set the fantasy, post-place and speciality of location and temporal-spatial expression method, as characteristics of intertextuality. Features of such experience elements are used as methodology of analyzing characteristics of each work. Feature of fantasy uses strategy of inducing spatial experience of receiver with dematerialization for post-place and expands the place where events occur with intervention of contingency and event situation. It suggests the spatial-temporal expression method as the features focusing on process and reflecting changes in spatial-temporal continuum and speciality of place emphasizing context of place. In conclusion, environmental arts needs to be deep rooted on complicated existence aspect of receiver beyond metaphysical dimension depending on presence and to accomplish conversion of awareness of supplying bisection of life from that place. By doing so, environmental arts can live textual life as it gets together with all other texts in terms of text dimension and creativity can be reborn as practical creativity in intertextuality rather than uniqueness. Such combination with other areas and acceptance of various aspects of receivers who see and experience this will result to creation of open works which can be create newly over and over again in multi-dimensional aspects.

  • PDF

A Study on the Historical Consciousness and View of the Three Religions of Won Cheon Seok (원천석(元天錫)의 역사의식과 유불도(儒佛道) 삼교관)

  • Jeong, Seong Sik
    • The Journal of Korean Philosophical History
    • /
    • no.35
    • /
    • pp.165-188
    • /
    • 2012
  • The purpose of this study is to examine the historical consciousness and view of the three religions (Buddhism, Taoism, and Confucianism) of Won Cheon Seok who lived a period of historical transition from the end of the Goryeo Dynasty to the early Joseon Dynasty. Actively speaking for the public in his time and having the same attitude as the Neo-Confucian scholars in the end of Goryeo Dynasty, he kept criticizing the abuse of the power by powerful families who made the people fall into a state of distress and misery. He believed the dispatch of troops to conquer the Yodong region as a great opportunity to boost the valiant spirit of his country; however, the reality was quite opposite to his expectation as Lee Seong Gye had withdrawn the army troops at the Wihwado causing a great risk to his country. He took a very hard line stance against what Lee Seong Gye did. Although he was a Confucian scholar, he did not ignore Buddhism and Taoism and understood that after all the three religions were based on the same principle. His deep understanding of Buddhism and Taoism as well as Confucianism helped him to make sense of Confucianism even further. He was able to sublimate the worldly anguish coming from the Confucian thinking system by indulging himself deeply into the world view of Buddhism and Taoism. In the end, his view on the three religions was based on the idea that they taught the same principle. His view of the three religions with transactional features has a huge implication for the contemporary society in which various values and multiple cultures coexist and have more common grounds.

Leision Detection in Chest X-ray Images based on Coreset of Patch Feature (패치 특징 코어세트 기반의 흉부 X-Ray 영상에서의 병변 유무 감지)

  • Kim, Hyun-bin;Chun, Jun-Chul
    • Journal of Internet Computing and Services
    • /
    • v.23 no.3
    • /
    • pp.35-45
    • /
    • 2022
  • Even in recent years, treatment of first-aid patients is still often delayed due to a shortage of medical resources in marginalized areas. Research on automating the analysis of medical data to solve the problems of inaccessibility for medical services and shortage of medical personnel is ongoing. Computer vision-based medical inspection automation requires a lot of cost in data collection and labeling for training purposes. These problems stand out in the works of classifying lesion that are rare, or pathological features and pathogenesis that are difficult to clearly define visually. Anomaly detection is attracting as a method that can significantly reduce the cost of data collection by adopting an unsupervised learning strategy. In this paper, we propose methods for detecting abnormal images on chest X-RAY images as follows based on existing anomaly detection techniques. (1) Normalize the brightness range of medical images resampled as optimal resolution. (2) Some feature vectors with high representative power are selected in set of patch features extracted as intermediate-level from lesion-free images. (3) Measure the difference from the feature vectors of lesion-free data selected based on the nearest neighbor search algorithm. The proposed system can simultaneously perform anomaly classification and localization for each image. In this paper, the anomaly detection performance of the proposed system for chest X-RAY images of PA projection is measured and presented by detailed conditions. We demonstrate effect of anomaly detection for medical images by showing 0.705 classification AUROC for random subset extracted from the PadChest dataset. The proposed system can be usefully used to improve the clinical diagnosis workflow of medical institutions, and can effectively support early diagnosis in medically poor area.

Classification of Diabetic Retinopathy using Mask R-CNN and Random Forest Method

  • Jung, Younghoon;Kim, Daewon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.12
    • /
    • pp.29-40
    • /
    • 2022
  • In this paper, we studied a system that detects and analyzes the pathological features of diabetic retinopathy using Mask R-CNN and a Random Forest classifier. Those are one of the deep learning techniques and automatically diagnoses diabetic retinopathy. Diabetic retinopathy can be diagnosed through fundus images taken with special equipment. Brightness, color tone, and contrast may vary depending on the device. Research and development of an automatic diagnosis system using artificial intelligence to help ophthalmologists make medical judgments possible. This system detects pathological features such as microvascular perfusion and retinal hemorrhage using the Mask R-CNN technique. It also diagnoses normal and abnormal conditions of the eye by using a Random Forest classifier after pre-processing. In order to improve the detection performance of the Mask R-CNN algorithm, image augmentation was performed and learning procedure was conducted. Dice similarity coefficients and mean accuracy were used as evaluation indicators to measure detection accuracy. The Faster R-CNN method was used as a control group, and the detection performance of the Mask R-CNN method through this study showed an average of 90% accuracy through Dice coefficients. In the case of mean accuracy it showed 91% accuracy. When diabetic retinopathy was diagnosed by learning a Random Forest classifier based on the detected pathological symptoms, the accuracy was 99%.

Lightening of Human Pose Estimation Algorithm Using MobileViT and Transfer Learning

  • Kunwoo Kim;Jonghyun Hong;Jonghyuk Park
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.9
    • /
    • pp.17-25
    • /
    • 2023
  • In this paper, we propose a model that can perform human pose estimation through a MobileViT-based model with fewer parameters and faster estimation. The based model demonstrates lightweight performance through a structure that combines features of convolutional neural networks with features of Vision Transformer. Transformer, which is a major mechanism in this study, has become more influential as its based models perform better than convolutional neural network-based models in the field of computer vision. Similarly, in the field of human pose estimation, Vision Transformer-based ViTPose maintains the best performance in all human pose estimation benchmarks such as COCO, OCHuman, and MPII. However, because Vision Transformer has a heavy model structure with a large number of parameters and requires a relatively large amount of computation, it costs users a lot to train the model. Accordingly, the based model overcame the insufficient Inductive Bias calculation problem, which requires a large amount of computation by Vision Transformer, with Local Representation through a convolutional neural network structure. Finally, the proposed model obtained a mean average precision of 0.694 on the MS COCO benchmark with 3.28 GFLOPs and 9.72 million parameters, which are 1/5 and 1/9 the number compared to ViTPose, respectively.

A Study on Interpretative Basis of Brain as a Place of Mental Function in Oriental Medicine (정신기능소재로서의 뇌에 대한 한의학적 해석근거 연구)

  • Kim Yong Hun;Kim In Rak;Chi Gyoo Yong
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.16 no.5
    • /
    • pp.881-887
    • /
    • 2002
  • This treatise is written in order to solve the important contradiction between the two theories; in oriental medicine psychological function is responsible for heart, but in western one it is responsible for brain. So we take the methods of studying in the aspects of morphological characteristics(MC) and visceral manifestation theory(VMT, 藏象論) and others about two organs-heart and brain. Brain(頭腦) is preferred to understand as a structure which is manifesting mental activity of heart. So the brain can be named with external heart(外心) corresponding to the relation of kidney(外 and external kidney. Saying conversely, the nutritional foundation of the mental function is the blood of heart, but the enlightening and insightful features of mentality make it's own residence move to the organ in the uppermost and positive site, that is head. And the close relationships on mental functions between heart and brain were discussed in various aspects, like investigation on east and west etymological literature, or Jiu gong and Taoist theory as well as Me and VMT, These understandings can make us know about the pathology of brain by itself. It has deep relations with heart fire and heart blood and kidney essence, and gastrointestinal function and liver with lung additionally. In another point, it makes the highly complicated psychological functions to be explained free from body relatively, and so can do a role in the complement of the strict 5 viscera theory.