• Title/Summary/Keyword: 영상 분류 및 검색

Search Result 98, Processing Time 0.028 seconds

Multi-Shape Retrieval Using Multi Curvature-Scale Space Descriptor (다중 곡률-단계 공간 기술자를 이용한 다중형상 검색)

  • Park, Sang Hyun;Lee, Soo-Chahn;Yun, Il-Dong
    • Journal of Broadcast Engineering
    • /
    • v.13 no.6
    • /
    • pp.962-965
    • /
    • 2008
  • 2-D shape descriptors, which are vectors representing characteristics of shapes, enable comparison and classification of shapes and are mainly applied to image and 3-D model retrieval. Existing descriptors have limitations that they only describe shapes of single closed contours or lack in precision, making it difficult to be applied to shapes with multiple contours. Therefore, in this paper, we propose a new shape descriptor called Multi-Curvature-Scale Space that can be applied to shapes with multiple contours. Specifically, we represent the topology of the sub-contours in the multi-contour along with Curvature-Scale Space descriptors to represent the shapes of each sub-contours. Also, by allowing the weight of each component to be controlled when computing the distance between descriptors the weight, we deal with ambiguities in measuring similarity between shapes. Results of various experiments that prove the effectiveness of proposed descriptor are presented.

Tag Recommendation Algorithms in Tagging System (태깅 시스템의 태그 추천 알고리즘)

  • Kim, Hyun-Woo;Lee, Kang-Pyo;Kim, Hyoung-Joo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.9
    • /
    • pp.927-935
    • /
    • 2010
  • In the era of Web 2.0, users create a number of their own Web contents. So, multimedia search becomes much more important than ever. A tag is a simple keyword which describes the Web contents including URL, pictures, and videos. Tags perform a role of descriptors of Web contents and Web metadata properly. If the number of tagged Web data increases, users are more likely to find the desired search result because the system includes the Web contents which have richer Web metadata. However, the number of users who use tags as Web metadata is relatively small. Because of the cumbersome process of adding tags, or users do not know what to add for the better accessibility from the public. Given situation, tag recommendation, which helps the process of adding tags, has been studied to solve these problems. When a user adds some Web contents, the tag recommendation system recommends relevant tags for the Web contents to the use, and the user selects recommended tags. We analyze and categorize various tag recommendation algorithms in tagging system.

Development of Demand Prediction Model for Video Contents Using Digital Big Data (디지털 빅데이터를 이용한 영상컨텐츠 수요예측모형 개발)

  • Song, Min-Gu
    • Journal of Industrial Convergence
    • /
    • v.20 no.4
    • /
    • pp.31-37
    • /
    • 2022
  • Research on what factors affect the success of the movie market is very important for reducing risks in related industries and developing the movie industry. In this study, in order to find out the degree of correlation of independent variables that affect movie performance, a survey was conducted on film experts using the AHP method and the importance of each measurement factor was evaluated. In addition, we hypothesized that factors derived from big data related to search portals and SNS will affect the success of movies due to the increase in the spread and use of smart phones. And a prediction model that reflects both the expert survey information and big data mentioned above was proposed. In order to check the accuracy of the prediction of the proposed model, it was confirmed that it was improved (10.5%) compared to the existing model as a result of verification with real data.Therefore, it is judged that the proposed model will be helpful in decision-making of film production companies and distributors.

Design and Construction of Spectral Library for the Korean Peninsular (한반도 지역의 지표특성을 고려한 분광라이브러리의 설계 및 구축)

  • Shin, Jung-Il;Kim, Sun-Hwa;Lee, Kyu-Sung
    • Korean Journal of Remote Sensing
    • /
    • v.26 no.5
    • /
    • pp.465-475
    • /
    • 2010
  • Spectral library is a database that archives spectral reflectance and related metadata of earth surface materials. Spectral library plays important role to assist analyzing several types of remote sensor data, to determine suitable wavelength band for detecting a certain material, and to classify hyperspectal image data. This paper describes the structure and content of a spectral library that is suitable for the environment of the Korea peninsula while existing spectral libraries have certain limitations to apply for surface materials covering the region. We designed a spectral library that includes vegetation and man-made materials indigenous to the region. The spectral library also includes spectra of mineral and rock, soil, liquid, and some man-made materials from existing spectral libraries. Newly augmented spectra of vegetation and man-made materials were obtained by spectral measurements in laboratory and field. The spectral library viewer was developed to increase efficiency of usage and searching.

The Ontology Based, the Movie Contents Recommendation Scheme, Using Relations of Movie Metadata (온톨로지 기반 영화 메타데이터간 연관성을 활용한 영화 추천 기법)

  • Kim, Jaeyoung;Lee, Seok-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.25-44
    • /
    • 2013
  • Accessing movie contents has become easier and increased with the advent of smart TV, IPTV and web services that are able to be used to search and watch movies. In this situation, there are increasing search for preference movie contents of users. However, since the amount of provided movie contents is too large, the user needs more effort and time for searching the movie contents. Hence, there are a lot of researches for recommendations of personalized item through analysis and clustering of the user preferences and user profiles. In this study, we propose recommendation system which uses ontology based knowledge base. Our ontology can represent not only relations between metadata of movies but also relations between metadata and profile of user. The relation of each metadata can show similarity between movies. In order to build, the knowledge base our ontology model is considered two aspects which are the movie metadata model and the user model. On the part of build the movie metadata model based on ontology, we decide main metadata that are genre, actor/actress, keywords and synopsis. Those affect that users choose the interested movie. And there are demographic information of user and relation between user and movie metadata in user model. In our model, movie ontology model consists of seven concepts (Movie, Genre, Keywords, Synopsis Keywords, Character, and Person), eight attributes (title, rating, limit, description, character name, character description, person job, person name) and ten relations between concepts. For our knowledge base, we input individual data of 14,374 movies for each concept in contents ontology model. This movie metadata knowledge base is used to search the movie that is related to interesting metadata of user. And it can search the similar movie through relations between concepts. We also propose the architecture for movie recommendation. The proposed architecture consists of four components. The first component search candidate movies based the demographic information of the user. In this component, we decide the group of users according to demographic information to recommend the movie for each group and define the rule to decide the group of users. We generate the query that be used to search the candidate movie for recommendation in this component. The second component search candidate movies based user preference. When users choose the movie, users consider metadata such as genre, actor/actress, synopsis, keywords. Users input their preference and then in this component, system search the movie based on users preferences. The proposed system can search the similar movie through relation between concepts, unlike existing movie recommendation systems. Each metadata of recommended candidate movies have weight that will be used for deciding recommendation order. The third component the merges results of first component and second component. In this step, we calculate the weight of movies using the weight value of metadata for each movie. Then we sort movies order by the weight value. The fourth component analyzes result of third component, and then it decides level of the contribution of metadata. And we apply contribution weight to metadata. Finally, we use the result of this step as recommendation for users. We test the usability of the proposed scheme by using web application. We implement that web application for experimental process by using JSP, Java Script and prot$\acute{e}$g$\acute{e}$ API. In our experiment, we collect results of 20 men and woman, ranging in age from 20 to 29. And we use 7,418 movies with rating that is not fewer than 7.0. In order to experiment, we provide Top-5, Top-10 and Top-20 recommended movies to user, and then users choose interested movies. The result of experiment is that average number of to choose interested movie are 2.1 in Top-5, 3.35 in Top-10, 6.35 in Top-20. It is better than results that are yielded by for each metadata.

Sound Visualization based on Emotional Analysis of Musical Parameters (음악 구성요소의 감정 구조 분석에 기반 한 시각화 연구)

  • Kim, Hey-Ran;Song, Eun-Sung
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.6
    • /
    • pp.104-112
    • /
    • 2021
  • In this study, emotional analysis was conducted based on the basic attribute data of music and the emotional model in psychology, and the result was applied to the visualization rules in the formative arts. In the existing studies using musical parameter, there were many cases with more practical purposes to classify, search, and recommend music for people. In this study, the focus was on enabling sound data to be used as a material for creating artworks and used for aesthetic expression. In order to study the music visualization as an art form, a method that can include human emotions should be designed, which is the characteristics of the arts itself. Therefore, a well-structured basic classification of musical attributes and a classification system on emotions were provided. Also, through the shape, color, and animation of the visual elements, the visualization of the musical elements was performed by reflecting the subdivided input parameters based on emotions. This study can be used as basic data for artists who explore a field of music visualization, and the analysis method and work results for matching emotion-based music components and visualizations will be the basis for automated visualization by artificial intelligence in the future.

The Hand Posture Recognition Using IR-Sensor Array (적외선센서 어레이를 이용한 손동작 검출 방법)

  • Song, Tae-Houn;Jeong, Soon-Mook;Jung, Hyun-Uk;Kwon, Key-Ho;Jeon, Jae-Wook
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.432-435
    • /
    • 2009
  • This paper proposes a hand posture recognition with pattern-matching method, embedding a simple paradigm using an Infrared sensor array. Our pattern-matching based hand posture recognition is specification supports fun and the user experience when communicating between humans and telecommunication devices, including robots. Our non-contact type input device (IR-Sensor Array) transmits commands to control mobile robots. It can also control Google Earth’s map searching programs, and other applications.

  • PDF

3D lattice information space for TV contents based on spatial metaphor : TV interface perspective (공간적 은유를 적용한 3D 격자구조의 TV 콘텐츠 정보공간 제안 : TV 인터페이스 사용성 관점에서)

  • Lee, Jae-Gil;Shin, Donghee
    • Journal of Digital Contents Society
    • /
    • v.15 no.5
    • /
    • pp.651-661
    • /
    • 2014
  • The complexity to search a specific content over TV platform is drastically increasing. Based on previous studies from computer data management system, we propose a new method that helps users to search and select content effectively. In general, contents over computers are represented by spatial metaphor, which replicates our physical environment and value systems about space. We suggest 3D lattice structure to construct information space for TV platform. Users can infer relevance between contents via special clue in information space, so as to select content more easily. Also, they can search contents through its temporal property that also represented in space. We make full use of our natural capability that can reduce additional overload to learning new interface. The results of this study can be significant and heuristic contributions, as they can be applied to diverse service areas utilizing video contents.

Multiresolutional Reconstruction from Contours (윤곽선을 이용한 다중해상도적 복원)

  • 민경하;이인권
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.11
    • /
    • pp.629-654
    • /
    • 2003
  • A new multiresolutional scheme that reconstructs a polygonal mesh from the set of contours is presented. In the first step, we apply a radial gradient method to extract the contours on the sampled slices from a volume data. After classifying the types of the edges on the contours, we represent the contour using the context-free grammar. The polygons between two neighboring contours are generated through the traversal of the derivation trees of the context-free grammar. The polygonal surface of the coarsest resolution is refined through the refinement of the contours, which is executed by casting more rays on the slices. The topologies between the polygonal surfaces of various resolutions are maintained from the fact that the radial gradient method preserves the topologies of the contours of various resolutions. The proposed scheme provides efficient computation and compression methods for the tiling procedure with the feature preservation.

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.1-19
    • /
    • 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.