• 제목/요약/키워드: music score images

검색결과 14건 처리시간 0.021초

Super-resolution in Music Score Images by Instance Normalization

  • Tran, Minh-Trieu;Lee, Guee-Sang
    • 스마트미디어저널
    • /
    • 제8권4호
    • /
    • pp.64-71
    • /
    • 2019
  • The performance of an OMR (Optical Music Recognition) system is usually determined by the characterizing features of the input music score images. Low resolution is one of the main factors leading to degraded image quality. In this paper, we handle the low-resolution problem using the super-resolution technique. We propose the use of a deep neural network with instance normalization to improve the quality of music score images. We apply instance normalization which has proven to be beneficial in single image enhancement. It works better than batch normalization, which shows the effectiveness of shifting the mean and variance of deep features at the instance level. The proposed method provides an end-to-end mapping technique between the high and low-resolution images respectively. New images are then created, in which the resolution is four times higher than the resolution of the original images. Our model has been evaluated with the dataset "DeepScores" and shows that it outperforms other existing methods.

Camera-based Music Score Recognition Using Inverse Filter

  • Nguyen, Tam;Kim, SooHyung;Yang, HyungJeong;Lee, GueeSang
    • International Journal of Contents
    • /
    • 제10권4호
    • /
    • pp.11-17
    • /
    • 2014
  • The influence of acquisition environment on music score images captured by a camera has not yet been seriously examined. All existing Optical Music Recognition (OMR) systems attempt to recognize music score images captured by a scanner under ideal conditions. Therefore, when such systems process images under the influence of distortion, different viewpoints or suboptimal illumination effects, the performance, in terms of recognition accuracy and processing time, is unacceptable for deployment in practice. In this paper, a novel, lightweight but effective approach for dealing with the issues caused by camera based music scores is proposed. Based on the staff line information, musical rules, run length code, and projection, all regions of interest are determined. Templates created from inverse filter are then used to recognize the music symbols. Therefore, all fragmentation and deformation problems, as well as missed recognition, can be overcome using the developed method. The system was evaluated on a dataset consisting of real images captured by a smartphone. The achieved recognition rate and processing time were relatively competitive with state of the art works. In addition, the system was designed to be lightweight compared with the other approaches, which mostly adopted machine learning algorithms, to allow further deployment on portable devices with limited computing resources.

Optical Music Score Recognition System for Smart Mobile Devices

  • Han, SeJin;Lee, GueeSang
    • International Journal of Contents
    • /
    • 제10권4호
    • /
    • pp.63-68
    • /
    • 2014
  • In this paper, we propose a smart system that can optically recognize a music score within a document and can play the music after recognition. Many historic handwritten documents have now been digitalized. Converting images of a music score within documents into digital files is particularly difficult and requires considerable resources because a music score consists of a 2D structure with both staff lines and symbols. The proposed system takes an input image using a mobile device equipped with a camera module, and the image is optimized via preprocessing. Binarization, music sheet correction, staff line recognition, vertical line detection, note recognition, and symbol recognition processing are then applied, and a music file is generated in an XML format. The Music XML file is recorded as digital information, and based on that file, we can modify the result, logically correct errors, and finally generate a MIDI file. Our system reduces misrecognition, and a wider range of music score can be recognized because we have implemented distortion correction and vertical line detection. We show that the proposed method is practical, and that is has potential for wide application through an experiment with a variety of music scores.

휴대폰 카메라로 촬영한 악보 영상 인식을 위한 의사트리 알고리즘 (Decision-Tree Algorithm for Recognition of Music Score Images Obtained by Mobile Phone Camera)

  • 박건희;오성열;손화정;유재명;김수형;이귀상
    • 한국콘텐츠학회논문지
    • /
    • 제8권6호
    • /
    • pp.16-25
    • /
    • 2008
  • 현대 사회에서 빼놓을 수 없는 기기인 휴대폰 카메라를 통하여 획득한 악보를 인식함으로써 누구나 손쉽게 전문적인 악보에 대한 지식이 없어도 악보를 연주할 수 있는 시스템을 제안한다. 본 실험은 휴대폰 카메라를 이용하여 촬영한 악보 영상을 전처리과정을 통하여 분리된 심볼들을 인식한 후 미디를 구성한다. 본 논문에서는 실험을 위하여 휴대폰 카메라로 촬영한 임의의 악보 영상 11종을 사용하였다. 전처리 과정을 거친 심볼을 대상으로 제안한 방법을 통하여 인식한 결과 평균 98%의 높은 인식률을 보였다. 본 시스템을 휴대폰에 포팅하여 수행시간을 측정한 결과, 영상의 입력 후 미디 생성까지 걸리는 시간이 평균 8.63초가 소요됨을 알 수 있었다.

A Lightweight and Effective Music Score Recognition on Mobile Phones

  • Nguyen, Tam;Lee, Gueesang
    • Journal of Information Processing Systems
    • /
    • 제11권3호
    • /
    • pp.438-449
    • /
    • 2015
  • Recognition systems for scanned or printed music scores that have been implemented on personal computers have received attention from numerous scientists and have achieved significant results over many years. A modern trend with music scores being captured and played directly on mobile devices has become more interesting to researchers. The limitation of resources and the effects of illumination, distortion, and inclination on input images are still challenges to these recognition systems. In this paper, we introduce a novel approach for recognizing music scores captured by mobile cameras. To reduce the complexity, as well as the computational time of the system, we grouped all of the symbols extracted from music scores into ten main classes. We then applied each major class to SVM to classify the musical symbols separately. The experimental results showed that our proposed method could be applied to real time applications and that its performance is competitive with other methods.

카메라 기반 악보 영상 인식을 위한 오선 검출 및 삭제 알고리즘 (Staff-line Detection and Removal Algorithm for Mobile Phone-based Recognition of Musical Images)

  • 손화정;김수형;오성열
    • 한국콘텐츠학회논문지
    • /
    • 제7권11호
    • /
    • pp.34-42
    • /
    • 2007
  • 본 논문에서는 휴대폰 카메라에서 촬영한 악보 영상에서 오선을 검출하고 삭제하는 방법을 제안한다. 오선 검출 및 삭제는 악보 영상을 인식하기 위한 전처리 기술로서, 기울어짐이나 휘어짐과 같은 왜곡에 손상된 영상에도 효과적으로 적용할 수 있어야 한다. 제안 방법은 기울어짐이나 휘어짐의 정도에 따라 보표를 분할하여 오선을 검출한다. 보표의 분할 개수는 분할 위치에서 좌표값의 차이에 대한 평균값이 임계값 이하가 될 때까지 반복하여 계산한다. 따라서, 분할 개수는 기울어짐의 정도에 따라 적응적으로 추정될 수 있다. 실험을 위해, 휴대폰 카메라로 촬영한 임의 악보 영상을 $1^{\circ}{\sim}3^{\circ}$로 기울어거나 강 약의 휘어짐을 주어 여러 가지 영상을 구성하였다. 실험 결과, 제안 방법이 실험 영상에 대해 정확한 오선 검출 및 삭제가 가능함을 보였다.

인쇄된 악보의 음표인식에 관한 연구 (A Study on the Printed Music Note Recognition)

  • 이창현;권호열;이상희;김백섭
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1992년도 하계학술대회 논문집 A
    • /
    • pp.427-430
    • /
    • 1992
  • In this paper, we proposed an algorithm for the musical note recognition. Firstly, a given bit-mapped music score image is converted to a set of individual note pattern images via vertical projection. Then, the pitch of a note is determinal by comparison in the note-head position with the reference five-lines. Also, the length of a note is found via leader clustering with a set of normalized note patterns. Finally, a datafile to play the music is obtained using the pitch and length of musical notes. Experimental results with a simple musical score image show that the proposed scheme is performed well.

  • PDF

부분적 템플릿 매칭을 활용한 악보인식 (Music Recognition by Partial Template Matching)

  • 유재명;김기홍;이귀상
    • 한국콘텐츠학회논문지
    • /
    • 제8권11호
    • /
    • pp.85-93
    • /
    • 2008
  • 악보인식기술에는 형상 매칭 방법, 통계적인 방법, 신경망을 이용한 방법, 구조적 방법 등이 있다. 본 논문에서는 핸드폰의 디지털 카메라로 얻은 저해상도 이미지를 인식하는 기술에 대해 접근한다. 이러한 저해상도 이미지에는 많은 왜곡이 포함되어 있어 기존 기술을 활용할 때 많은 문제점들을 나타난다. 문제점은 입력영상이 저해상도이며 조명 등의 촬영 상태가 좋지 않는 점이며, 인식 이전 단계 과정에서 음표 부분에 손실과 약간의 변형이 생긴다는 것이다. 이들 인식 방법들의 일반적인 흐름은 먼저, 디지털이미지를 확보하기 위해 카메라 기능을 이용하여 획득한다. 그런 후에 이진화, 오선 제거, 객체영역 분리가 이루어진 후 인식과정을 통해 악보 인식이 이루어진다. 본 연구에서는 특히 핸드폰이라는 제한적인 상황에서 탑재된 카메라를 통해 획득된 이미지를 대상으로 이러한 문제점을 극복하기 위한 인식 기술을 연구하였다. 먼저, 음표를 머리, 대, 꼬리 부분으로 분리하였다. 그리고 음표의 머리 부분에 템플릿을 적용하였고, 나머지 부분에는 패턴을 적용하여 단일 음표로 이루어진 악보에 대해서 100% 가까운 인식률을 얻을 수 있었다.

실시간 이차원 지휘운동의 해석 (On-Line Two-Dimensional Conducting Motion Analysis)

  • 김종성;유범재;오상록
    • 전자공학회논문지B
    • /
    • 제28B권11호
    • /
    • pp.876-885
    • /
    • 1991
  • This paper proposes an on-line method of understanding humans conducting action observed through a vision sensor. The vision system captures images of conducting action and extracts image coordinates of endpoint of the baton. A proposed algorithm based on the expert knowledge about conducting recognizes patterns of the conducting action from the extracted image coordimates and play the corresponding music score. Complementary algorithms are also proposed for identifying the first beat static point and dynamics through extensive experiments, this algorithm is found to detect lower edges and upper edges without error.

  • PDF

Improved Lexicon-driven based Chord Symbol Recognition in Musical Images

  • Dinh, Cong Minh;Do, Luu Ngoc;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • 제12권4호
    • /
    • pp.53-61
    • /
    • 2016
  • Although extensively developed, optical music recognition systems have mostly focused on musical symbols (notes, rests, etc.), while disregarding the chord symbols. The process becomes difficult when the images are distorted or slurred, although this can be resolved using optical character recognition systems. Moreover, the appearance of outliers (lyrics, dynamics, etc.) increases the complexity of the chord recognition. Therefore, we propose a new approach addressing these issues. After binarization, un-distortion, and stave and lyric removal of a musical image, a rule-based method is applied to detect the potential regions of chord symbols. Next, a lexicon-driven approach is used to optimally and simultaneously separate and recognize characters. The score that is returned from the recognition process is used to detect the outliers. The effectiveness of our system is demonstrated through impressive accuracy of experimental results on two datasets having a variety of resolutions.