Search | Korea Science

Subtitle Highlighting System for Video Streaming using Speech Interface STT (Speech to Text) (음성 인터페이스 STT(Speech to Text)를 활용한 동영상 스트리밍 자막 강조 시스템)

Lee, Kang-Chan;Cho, Dae-Soo
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2021.07a
- /
- pp.567-568
- /
- 2021
자막은 자막을 볼 수 있는 모든 사람들의 정보전달, 의사소통을 할 수 있는 유용한 도구로 사용 되고 있지만 자막은 평범한 텍스트로 있어 자막에서 강조된 부분, 감정 등을 전달 할 수 없다는 단점을 가지고 있다. 그러므로 청각 장애인들은 해당 컨텐츠의 감정, 강조 되는 부분을 알 수 없어 대화의 숨은 의미가 다른 방향으로 이해 할 수 있다는 위험성을 가지고 있다. 본 논문에서는 음성을 텍스트로 변환하는 STT(Speech To Text)를 이용하여 동영상 스트리밍 서비스를 실시간으로 음성을 텍스트로 변환과 동시에 강조하는 부분까지 개발하여 청각장애인 입장에서 기존 자막보다 효율적인 시각적 효과를 주는 미디어 접근을 위한 동영상 스트리밍 자막 서비스를 개발하고자 한다.
PDF

Real-time Face Detection based on PCA and LDA (PCA와 LDA를 이용한 실시간 얼굴 검출)

홍은혜;고병철;변혜란
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.10d
- /
- pp.538-540
- /
- 2002
본 논문에서는 실시간 카메라 입력 영상에 적합한 얼굴 검출을 위해 다양한 외부적 환경에 덜 민감한 새로운 알고리즘을 제안한다. 빛이나 조명의 영향에 의한 오류를 방지하기 위해 전처리 과정을 포함시키고 형판 정합방법의 단점을 개선하기 위해 얼굴 인식에서 주로 쓰이는 방법인 주성분 분석(PCA :Principal Component Analyses) 변환을 적용하고. 생성된 주성분(Principal Component)을 선형 판별 분석(LDA: Linear Discriminant Analysis)의 입력으로 사용하는 방법을 통해 얼굴을 검출하도록 하였다. 실험을 위해 실제 환경과 같은 6개 카테고리의 동영상을 중심으로 실험한 결과, 본 논문에서 제안하는 방법이 기존의 PCA만을 이용한 방법보다 좋은 성능을 보여줌을 알 수 있었다.
PDF

Video Stream Processing for Service of Heal-Time Road Traffic Scones on Mobile Phone (모바일폰에서의 실시간 도로교통상황 서비스를 위한 동영상 처리 방법)

고석민;낭종호
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.04a
- /
- pp.223-225
- /
- 2002
오늘날 실시간 하에서 자동적인 교통정보의 분석은 IVHS(Intelligent Vehicle High-way Systems)의 많은 분야에서 필수적으로 사용된다. 또한 바쁜 현대인들이 러시아워에서 교통이 다소 원활한 지역으로 이동하여 시간을 절약하고자 교통 정보를 이용하고자 한다. 하지만 모바일폰은 작은 디스플레이, 메모리, 전원 장치 등등의 제약 사항을 가지고 있다. 본 논문에서는 이러한 제약을 가지고 있는 도로 교통 영상 스트림을 모바일폰에서 서비스하기 위한 실시간 비디오 처리 방법을 제안한다. 영상 스트림의 시간적 정보를 바탕으로 프레임 율을 조절하는 시간적 처리 방안과 불필요한 영역제거, 이미지 크기 변환, 칼라 수 줄이기등의 공간적 활용 방안을 제안하고자 한다. 이와 더불어 모바일폰에서의 질 높은 서비스를 제공하기 위하여 비디오 스트림을 이루는 이미지 각각에 대한 이미지 질 향상에 대한 처리 방법들을 제안 하고자 한다. 본 연구의 실험으로 모바일폰에서 효율적인 도로 교통 영상 서비스를 제공할 수 있음을 알 수 있다.
PDF

Lane and Vehicle Distance Detection Using Camera Image (카메라 영상을 통한 실시간 차선·차간 인식에 관한 연구)

Kim, Yu-sin;Jeong, Dae-ryong;Song, Seong-geun;Song, Tae-hong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.318-321
- /
- 2011
도로 주행 시 운전을 보조하고 안전 운전을 지원하기 위한 기술인 도로상황인지 시스템에 있어 효율적인 차선 차간 검출 기법은 위의 핵심적인 기술이다. 실시간으로 수집되는 도로 상황 영상 데이터 분석에 대한 처리 시간을 단축하기 위하여 각각의 영상 프레임에 대해 관심 영역을 설정한 후 허프 변환을 적용하였다. 본 논문은 카메라로 수집되는 도로 상황 영상에 관심 영역 설정을 통한 실시간 차선 차간 인식에 관한 연구로서, 차선과 차간 인식을 위한 효율적인 알고리즘을 제안한다.
https://doi.org/10.3745/PKIPS.y2011m11a.318 인용 PDF

Automated Image Matching for Satellite Images with Different GSDs through Improved Feature Matching and Robust Estimation (특징점 매칭 개선 및 강인추정을 통한 이종해상도 위성영상 자동영상정합)

Ban, Seunghwan;Kim, Taejung
- Korean Journal of Remote Sensing
- /
- v.38 no.6_1
- /
- pp.1257-1271
- /
- 2022
Recently, many Earth observation optical satellites have been developed, as their demands were increasing. Therefore, a rapid preprocessing of satellites became one of the most important problem for an active utilization of satellite images. Satellite image matching is a technique in which two images are transformed and represented in one specific coordinate system. This technique is used for aligning different bands or correcting of relative positions error between two satellite images. In this paper, we propose an automatic image matching method among satellite images with different ground sampling distances (GSDs). Our method is based on improved feature matching and robust estimation of transformation between satellite images. The proposed method consists of five processes: calculation of overlapping area, improved feature detection, feature matching, robust estimation of transformation, and image resampling. For feature detection, we extract overlapping areas and resample them to equalize their GSDs. For feature matching, we used Oriented FAST and rotated BRIEF (ORB) to improve matching performance. We performed image registration experiments with images KOMPSAT-3A and RapidEye. The performance verification of the proposed method was checked in qualitative and quantitative methods. The reprojection errors of image matching were in the range of 1.277 to 1.608 pixels accuracy with respect to the GSD of RapidEye images. Finally, we confirmed the possibility of satellite image matching with heterogeneous GSDs through the proposed method.
https://doi.org/10.7780/kjrs.2022.38.6.1.21 인용 PDF KSCI HTML

Deep Learning-based Real-Time Super-Resolution Architecture Design (경량화된 딥러닝 구조를 이용한 실시간 초고해상도 영상 생성 기술)

Ahn, Saehyun;Kang, Suk-Ju
- Journal of Broadcast Engineering
- /
- v.26 no.2
- /
- pp.167-174
- /
- 2021
Recently, deep learning technology is widely used in various computer vision applications, such as object recognition, classification, and image generation. In particular, the deep learning-based super-resolution has been gaining significant performance improvement. Fast super-resolution convolutional neural network (FSRCNN) is a well-known model as a deep learning-based super-resolution algorithm that output image is generated by a deconvolutional layer. In this paper, we propose an FPGA-based convolutional neural networks accelerator that considers parallel computing efficiency. In addition, the proposed method proposes Optimal-FSRCNN, which is modified the structure of FSRCNN. The number of multipliers is compressed by 3.47 times compared to FSRCNN. Moreover, PSNR has similar performance to FSRCNN. We developed a real-time image processing technology that implements on FPGA.
https://doi.org/10.5909/JBE.2021.26.2.167 인용 PDF KSCI KPUBS

Real-time 3D Converting System using Stereoscopic Video (스테레오 비디오를 이용한 실시간 3차원 입체 변환 시스템)

Seo, Young-Ho;Choi, Hyun-Jun;Kim, Dong-Wook
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.33 no.10C
- /
- pp.813-819
- /
- 2008
In this paper, we implemented a real-time system which displays 3-dimensional (3D) stereoscopic image with stereo camera. The system consists of a set of stereo camera, FPGA board, and 3D stereoscopic LCD. Two CMOS image sensor were used for the stereo camera. FPGA which processes video data was designed with Verilog-HDL, and it can accommodate various resolutional videos. The stereoscopic image is configured by two methods which are side-by-side and up-down image configuration. After the left and right images are converted to the type for the stereoscopic display, they are stored into SDRAM. When the next frame is inputted into FPGA from two CMOS image sensors, the previous video data is output to the DA converter for displaying it. From this pipeline operation, the real-time operation is possible. After the proposed system was implemented into hardware, we verified that it operated exactly.
PDF KSCI

Seam-line Determination in Image Mosaicking using Adaptive Cost Transform and Dynamic Programming (동적계획법과 적응 비용 변환을 이용한 영상 모자이크의 seam-line 결정)

Chon, Jae-Choon;Suh, Yong-Cheol;Kim, Hyong-Suk
- Journal of the Korean Association of Geographic Information Studies
- /
- v.7 no.2
- /
- pp.16-28
- /
- 2004
A seam-line determination algorithm is proposed to determine image border-line in mosaicing using the transformation of gray value differences and dynamic programming. Since visually good border-line is the one along which pixel differences are as small as possible, it can be determined in association with an optimal path finding algorithm. A well-known effective optimal path finding algorithm is the Dynamic Programming (DP). Direct application of the dynamic programming to the seam-line determination causes the distance effect, in which seam-line is affected by its length as well as the gray value difference. In this paper, an adaptive cost transform algorithm with which the distance effect is suppressed is proposed in order to utilize the dynamic programming on the transformed pixel difference space. Also, a figure of merit which is the summation of fixed number of the biggest pixel difference on the seam-line (SFBPD) is suggested as an evaluation measure of seamlines. The performance of the proposed algorithm has been tested in both quantitively and visually on various kinds of images.
PDF

An Improved Fractal Color Image Decoding Based on Data Dependence and Vector Distortion Measure (데이터의존성과 벡터왜곡척도를 이용한 개선된 프랙탈 칼라영상 복호화)

서호찬;정태일;문광석;안상호;권기룡
- Proceedings of the Korea Multimedia Society Conference
- /
- 1998.04a
- /
- pp.116-121
- /
- 1998
본 논문에서는 데이터의존성과 벡터왜곡척도를 이용하여 개선된 칼라영상을 복호화하였다. 프랙탈 칼라영상의 복원방법은 Zhang과 Po의 벡터왜곡척도를 이용한 R, G, B 칼라 성분간의 상관관계를 고려하여 부호화한 압축파일을 사용하여 수렴될 복원영상을 독립적인 반복변환에 의해 수렴되는 영역과 데이터의존성을 갖는 영역으로 분류하여 데이터의존성 부분이 차지하는 만큼 복호화 과정에서 불필요한 계산량이 제거되었고, R 영역에서 검색한 데이터 의존영역을 G, B 영역에 그대로 사용하여 고속복호화가 가능하였다.
PDF

Partial Accessible JPEG for effective Transmission on Internet (인터넷상에서의 효과적인 전송을 위한 Partial Access 지원 JPEG)

정세윤;김규헌;이재연;배영래
- Proceedings of the IEEK Conference
- /
- 2000.11c
- /
- pp.77-80
- /
- 2000
본 논문에서는 JPEG 영상을 인터넷상에서 효율적으로 전송하기 위한 Partial Access를 지원하는 JPEG 변환 처리 기술을 제안한다. 네트웍 상에서 영상을 전부 전송하지 않고 클라이언트의 브라우져의 디스플레이에 필요한 부분만을 실시간으로 전송한다면 전체 네트웍 효율을 높일 수 있다. 이를 위해서는 영상을 Partial Access 할 수 있어야 한다. 또한, 본 논문에서 제안한 Partial Access 기능이 추가된 JPEG 영상은 기존의 JPEG과 완전 호환되며, 클라이언트는 기존의 일반 웹 브라우져를 그대로 사용할 수 있다.
PDF

Search Result 843, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)