Global lifelog media cloud development and deployment

Song, Hyeok;Choe, In-Gyu;Lee, Yeong-Han;Go, Min-Su;O, Jin-Taek;Yu, Ji-Sang;

Broadcasting and Media Magazine (방송과미디어)

Volume 22 Issue 1
/
Pages.35-46
/
2017
/
2383-9708(pISSN)

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

Global lifelog media cloud development and deployment

글로벌 라이프로그 미디어 클라우드 개발 및 구축

송혁 (전자부품연구원) ;
최인규 (광운대학교) ;
이영한 (전자부품연구원) ;
고민수 (전자부품연구원) ;
오진택 (판도라티비) ;
유지상 (광운대학교)

Published : 2017.01.30

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

글로벌 라이프로그 미디어 클라우드 서비스를 위하여 네트워크 기술, 클라우드 기술 멀티미디어 App 기술 및 하이라이팅 엔진 기술이 요구된다. 본 논문에서는 미디어 클라우드 서비스를 위한 개발 기술 및 서비스 기술 개발 결과를 보였다. 하이라이팅 엔진은 표정인식기술, 이미지 분류기술, 주목도 지도 생성기술, 모션 분석기술, 동영상 분석 기술, 얼굴 인식 기술 및 오디오 분석기술 등을 포함하고 있다. 표정인식 기술로는 Alexnet을 최적화하여 Alexnet 대비 1.82% 우수한 인식 성능을 보였으며 처리속도면에서 28배 빠른 결과를 보였다. 행동 인식 기술에 있어서는 기존 2D CNN 및 LSTM에 기반한 인식 방법에 비하여 제안하는 3D CNN 기법이 0.8% 향상된 결과를 보였다. (주)판도라티비는 클라우드 기반 라이프로그 동영상 생성 서비스를 개발하여 현재 테스트 서비스를 진행하고 있다.

Keywords

References

Le, Quoc V. "Building high-level features using large scale unsupervised learning." 2013 IEEE international conference on acoustics, speech and signal processing. IEEE, 2013.
Sermanet, Pierre, et al. "Overfeat: Integrated recognition, localization and detection using convolutional networks." arXiv preprint arXiv:1312.6229 2013.
Deng, Li. "Deep learning: from speech recognition to language and multimodal processing." APSIPA Transactions on Signal and Information Processing 2016.
N.D.B. Bruce, J.K. Tsotsos, "Saliency Based on Information Maximization," Advances in Neural Information Processing Systems, 18, pp. 155-162, June 2006.
Tolias, Giorgos, Ronan Sicre, and Herve Jegou. "Particular object retrieval with integral max-pooling of CNN activations." arXiv preprint arXiv:1511.05879 (2015).
Zach, Christopher, Thomas Pock, and Horst Bischof. "A duality based approach for realtime TV-L 1 optical flow." Joint Pattern Recognition Symposium. Springer Berlin Heidelberg, 2007.
H. Kim, S. Lee and A. C. Bovik, "Saliency Prediction on Stereoscopic Videos," in IEEE Transactions on Image Processing, vol. 23, no. 4, pp. 1476-1490, April 2014. https://doi.org/10.1109/TIP.2014.2303640
Min Soo Ko, Hyok Song, "Video Analysis Algorithm based on Saliency Region Detection from Selected Key-frames", ITC-CSCC 2016.
In Kyu Choi, Hyok Song, Jisang Yoo, "Convolutional Neural Networks for Facial Expression Recognition", KOSBE, 11. 2016.
Donahue, Jeffrey, et al. "Long-term recurrent convolutional networks for visual recognition and description." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.

Broadcasting and Media Magazine (방송과미디어)

Global lifelog media cloud development and deployment

글로벌 라이프로그 미디어 클라우드 개발 및 구축

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)