Search | Korea Science

Convolutional Neural Network based Audio Event Classification

Lim, Minkyu;Lee, Donghyun;Park, Hosung;Kang, Yoseb;Oh, Junseok;Park, Jeong-Sik;Jang, Gil-Jin;Kim, Ji-Hwan
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.6
- /
- pp.2748-2760
- /
- 2018
This paper proposes an audio event classification method based on convolutional neural networks (CNNs). CNN has great advantages of distinguishing complex shapes of image. Proposed system uses the features of audio sound as an input image of CNN. Mel scale filter bank features are extracted from each frame, then the features are concatenated over 40 consecutive frames and as a result, the concatenated frames are regarded as an input image. The output layer of CNN generates probabilities of audio event (e.g. dogs bark, siren, forest). The event probabilities for all images in an audio segment are accumulated, then the audio event having the highest accumulated probability is determined to be the classification result. This proposed method classified thirty audio events with the accuracy of 81.5% for the UrbanSound8K, BBC Sound FX, DCASE2016, and FREESOUND dataset.
https://doi.org/10.3837/tiis.2018.06.017 인용 PDF KSCI

Proactive safety support system for vulnerable pedestrians using Deep learning method (보행취약자 보행안전을 위한 딥러닝 응용 기법)

Song, Hyok;Ko, Min-Soo;Yoo, Jisang;Choi, Byeongho
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2017.06a
- /
- pp.107-108
- /
- 2017
횡단보도 인근에서는 보행취약자의 사고가 끊이지 않고 있으며 사고예방 및 사고의 절감을 위하여 선제적안 안전시스템의 개발이 요구되고 있다. 선제적 안전시스템의 개발을 위하여 빅데이터를 이용한 안전 데이터 도출, 영상분석을 이용한 보행자 행동특성 모니터링 시스템의 개발 및 사고감소를 위한 안전 시스템 개발이 진행되고 있다. 보행취약자 위험상황 판단에 대한 정의를 빅데이터 분석을 통해 도출하고 횡단보도 주변 안전 시스템의 개발을 기존 시스템에 적용 및 새로운 시스템을 개발하며 이에 적합한 딥러닝 영상분석 시스템을 개발하였다. 본 논문에서는 딥러닝 모델을 이용하여 객체의 검출, 분석을 수행하는 객체 검출부, 객체의 포즈와 행동을 보여주는 영상 분석부로 구성되어 있으며 기존 모델을 응용하여 최적화한 모델을 적용하였다. 딥러닝 모델의 구동은 리눅스 서버에서 운용되고 있으며 딥러닝 모델 구동을 위한 여러 툴을 적용하였다. 본 연구를 통하여 보행취약자의 검출, 추적, 보행취약자의 포즈 및 위험상황을 인식하고 안전시스템과 연계할 수 있도록 구성하였다.
PDF

Broadcasting Software System for Interactive Service based on Deep Learning (차세대 딥러닝 인공지능을 이용한 양방향 서비스 방송 소프트웨어 시스템)

Yang, Geunseok;Shin, Yongwoo;Roh, Minchul;Kang, Seongho;Joo, Ingyu;Kwak, Jaechul;Ku, Jinwon
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2017.06a
- /
- pp.26-28
- /
- 2017
스마트폰 보유율과 모바일 이용 행태가 급변함에 따라 방송사에서는 양방향 서비스를 포함한 다양한 방송 서비스를 제공하려고 노력하고 있다. 양방향 서비스 방송에서 시청자가 보낸 문구를 실제 화면에 보여주기까지 PD 와 담당자들의 수작업이 필요하다. 하지만 하루 평균 약 7,200 건 (MBC 오늘아침 소통중계)의 양방향 서비스 관련 로그가 남게 되어, PD 가 일일이 판별하기에는 많은 노력이 따른다. 이러한 불필요한 노력을 줄이기 위해 본 논문에서는 감정 분석을 이용한 딥러닝 인공지능 기반 양방향 서비스 방송 소프트웨어 시스템을 제안한다. 첫째, 시청자들이 전송한 의견, 건의사항, 내용 등을 전처리 과정을 진행한다. 둘째, 감정 사전을 이용해 전처리 된 단어와 비교하여 시청자가 보낸 문구의 감정 점수를 계산한다. 셋째, 과거 실제 방송에 송출된 시청자 문구를 감정 점수와 함께 딥러닝을 이용하여 훈련시킨다. 본 논문의 성능을 평가하기 위해, 2017 년 생방송 오늘아침 소통중계에 사례연구를 진행하였고 효율성을 보였다. 앞으로 이러한 양방향 서비스 방송 소프트웨어 시스템 도입으로, PD 가 방송 제작에 더욱 집중 할 수 있도록 차별화된 방송을 준비하는데 크게 기여할 것이라 기대한다.
PDF

Korean Semantic Role Labeling using Stacked Bidirectional LSTM-CRFs (Stacked Bidirectional LSTM-CRFs를 이용한 한국어 의미역 결정)

Bae, Jangseong;Lee, Changki
- Journal of KIISE
- /
- v.44 no.1
- /
- pp.36-43
- /
- 2017
Syntactic information represents the dependency relation between predicates and arguments, and it is helpful for improving the performance of Semantic Role Labeling systems. However, syntax analysis can cause computational overhead and inherit incorrect syntactic information. To solve this problem, we exclude syntactic information and use only morpheme information to construct Semantic Role Labeling systems. In this study, we propose an end-to-end SRL system that only uses morpheme information with Stacked Bidirectional LSTM-CRFs model by extending the LSTM RNN that is suitable for sequence labeling problem. Our experimental results show that our proposed model has better performance, as compare to other models.
https://doi.org/10.5626/JOK.2017.44.1.36 인용 KSCI

A Study on the Recognition of Face Based on CNN Algorithms (CNN 알고리즘을 기반한 얼굴인식에 관한 연구)

Son, Da-Yeon;Lee, Kwang-Keun
- Korean Journal of Artificial Intelligence
- /
- v.5 no.2
- /
- pp.15-25
- /
- 2017
Recently, technologies are being developed to recognize and authenticate users using bioinformatics to solve information security issues. Biometric information includes face, fingerprint, iris, voice, and vein. Among them, face recognition technology occupies a large part. Face recognition technology is applied in various fields. For example, it can be used for identity verification, such as a personal identification card, passport, credit card, security system, and personnel data. In addition, it can be used for security, including crime suspect search, unsafe zone monitoring, vehicle tracking crime.In this thesis, we conducted a study to recognize faces by detecting the areas of the face through a computer webcam. The purpose of this study was to contribute to the improvement in the accuracy of Recognition of Face Based on CNN Algorithms. For this purpose, We used data files provided by github to build a face recognition model. We also created data using CNN algorithms, which are widely used for image recognition. Various photos were learned by CNN algorithm. The study found that the accuracy of face recognition based on CNN algorithms was 77%. Based on the results of the study, We carried out recognition of the face according to the distance. Research findings may be useful if face recognition is required in a variety of situations. Research based on this study is also expected to improve the accuracy of face recognition.
https://doi.org/10.24225/kjai.2017.5.2.15 인용 PDF

Image Recognition and Clustering for Virtual Reality based on Cognitive Rehabilitation Contents (가상현실 기반 인지재활 콘텐츠를 위한 영상 인식 및 군집화)

Choi, KwonTaeg
- Journal of Digital Contents Society
- /
- v.18 no.7
- /
- pp.1249-1257
- /
- 2017
Due to the 4th industrial revolution and an aged society, many studies are being conducted to apply virtual reality to medical field. Research on dementia is especially active. This paper proposes virtual reality based on cognitive rehabilitation contents using image recognition and clustering method to improve cognitive and physical disabilities caused by dementia. Unlike the existing cognitive rehabilitation system, this paper uses travel photos that reflect the memories of the subjects to be treated. In order to generate automated cognitive rehabilitation contents, we extract face information, food pictures, place information, and time information from photographs, and normalization is performed for clustering. And we present scenarios that can be used as cognitive rehabilitation contents using travel photos in virtual reality space.
https://doi.org/10.9728/dcs.2017.18.7.1249 인용 PDF KSCI

Design and Implementation of Mobile Communication System for Hearing- impaired Person (청각 장애인을 위한 모바일 통화 시스템 설계 및 구현)

Yun, Dong-Hee;Kim, Young-Ung
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.16 no.5
- /
- pp.111-116
- /
- 2016
According to the Ministry of Science, ICT and Future Planning's survey of information gap, smartphone retention rate of disabled people stayed in one-third of non-disabled people, the situation is significantly less access to information for people with disabilities than non-disabled people. In this paper, we develop an application, CallHelper, that helps to be more convenient to use mobile voice calls to the auditory disabled people. CallHelper runs automatically when a call comes in, translates caller's voice to text output on the mobile screen, and displays the emotion reasoning from the caller's voice to visualize emoticons. It also saves voice, translated text, and emotion data that can be played back.
https://doi.org/10.7236/JIIBC.2016.16.5.111 인용 PDF KSCI

Fault Diagnosis of Induction Motor using Linear Predictive Coding and Deep Neural Network (LPC와 DNN을 결합한 유도전동기 고장진단)

Ryu, Jin Won;Park, Min Su;Kim, Nam Kyu;Chong, Ui Pil;Lee, Jung Chul
- Journal of Korea Multimedia Society
- /
- v.20 no.11
- /
- pp.1811-1819
- /
- 2017
As the induction motor is the core production equipment of the industry, it is necessary to construct a fault prediction and diagnosis system through continuous monitoring. Many researches have been conducted on motor fault diagnosis algorithm based on signal processing techniques using Fourier transform, neural networks, and fuzzy inference techniques. In this paper, we propose a fault diagnosis method of induction motor using LPC and DNN. To evaluate the performance of the proposed method, the fault diagnosis was carried out using the vibration data of the induction motor in steady state and simulated various fault conditions. Experimental results show that the learning time of our proposed method and the conventional spectrum+DNN method is 139 seconds and 974 seconds each executed on the experimental PC, and our method reduces execution time by 1/8 compared with conventional method. And the success rate of the proposed method is 98.08%, which is similar to 99.54% of the conventional method.
https://doi.org/10.9717/kmms.2017.20.11.1811 인용 PDF KSCI

CNN-based Skip-Gram Method for Improving Classification Accuracy of Chinese Text

Xu, Wenhua;Huang, Hao;Zhang, Jie;Gu, Hao;Yang, Jie;Gui, Guan
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.12
- /
- pp.6080-6096
- /
- 2019
Text classification is one of the fundamental techniques in natural language processing. Numerous studies are based on text classification, such as news subject classification, question answering system classification, and movie review classification. Traditional text classification methods are used to extract features and then classify them. However, traditional methods are too complex to operate, and their accuracy is not sufficiently high. Recently, convolutional neural network (CNN) based one-hot method has been proposed in text classification to solve this problem. In this paper, we propose an improved method using CNN based skip-gram method for Chinese text classification and it conducts in Sogou news corpus. Experimental results indicate that CNN with the skip-gram model performs more efficiently than CNN-based one-hot method.
https://doi.org/10.3837/tiis.2019.12.016 인용 PDF KSCI HTML

Deep learning-based voice recognition product purchase system for efficient vehicle environment (효율적인 차량 환경을 위한 딥 러닝 기반의 음성인식 상품 구매 시스템)

Kwon, Byung Wook;Kang, Won Min;Park, Jong Hyuk
- Annual Conference of KIPS
- /
- 2017.11a
- /
- pp.330-332
- /
- 2017
최근 차량사고는 운전자의 운전 행동이 많은 비중을 차지하며 행동이 올바르지 못했을 경우 주의가 분산되어 사고가 발생하고 있다. 자동차 업계에서는 자율주행 기술의 출현으로 운전자의 운전환경이 변화되고 있다. 차량 서비스들은 차량에 부착된 센서들을 이용한 다양한 차량 서비스가 개발되고 있으며 차량 서비스는 도로주변 환경과 운전자의 안전에 집중된 서비스가 대부분이다. 하지만 차량에 부착된 센서들의 성능문제로 인한 기능적 문제점으로 상용화가 늦어지고 있다. 본 논문에서는 사용자에게 효율적인 차량 서비스를 제공하기 위해 사용자의 음성을 활용한 상품구매 시스템을 제안한다. 본 시스템은 딥 러닝 기술이 적용된 DB를 통해 사용자의 음성데이터 분류를 통해 상품을 검색 및 구매할 수 있는 시스템이다. 제안된 시스템은 음성인식을 활용하여 별도의 과정 없이 간편하게 상품을 구매할 수 있으며, 사고의 위험으로부터 벗어날 수 있다.
https://doi.org/10.3745/PKIPS.y2017m11a.330 인용 PDF

Search Result 1,738, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)