• Title/Summary/Keyword: Product information extraction

Search Result 95, Processing Time 0.023 seconds

A Study on Design and Implementation of Automatic Product Information Indexing and Retrieval System for Online Comparison Shopping on the Web (웹 상의 온라인 비교 쇼핑을 위한 상품 정보 자동 색인 및 검색 시스템의 설계 및 구현에 대한 연구)

  • 강대기;이제선;함호상
    • The Journal of Society for e-Business Studies
    • /
    • v.3 no.2
    • /
    • pp.57-71
    • /
    • 1998
  • In this paper, we describe the approaches of shopping agents and directory services for online comparison shopping on the web, and propose an information indexing and retrieval system, named InfoEye, with a new method for automatic extraction of product information. The developed method is based on the knowledge about presentation of the product information on the Web. The method from the knowledge about presentation of the product information is derived from both the point that online stores display their products to customers in easy-to-browse ways and heuristics made of analyses of product information look-and-feel of domestic online stores. In indexing process, the method is applied to product information extraction from Hypertext Markup Language (HTML) documents collected by a mirroring robot from online stores. We have made InfoEye to a readily usable stage and transferred the technology to Webnara commercial shopping engine. The proposed system is a cutting-edge solution to help customers as a shopping expert by providing information about the reasonable price of a product from dozens of online stores, saving customers shopping time, giving information about new products, and comparing quality factors of products in a same category.

  • PDF

FEROM: Feature Extraction and Refinement for Opinion Mining

  • Jeong, Ha-Na;Shin, Dong-Wook;Choi, Joong-Min
    • ETRI Journal
    • /
    • v.33 no.5
    • /
    • pp.720-730
    • /
    • 2011
  • Opinion mining involves the analysis of customer opinions using product reviews and provides meaningful information including the polarity of the opinions. In opinion mining, feature extraction is important since the customers do not normally express their product opinions holistically but separately according to its individual features. However, previous research on feature-based opinion mining has not had good results due to drawbacks, such as selecting a feature considering only syntactical grammar information or treating features with similar meanings as different. To solve these problems, this paper proposes an enhanced feature extraction and refinement method called FEROM that effectively extracts correct features from review data by exploiting both grammatical properties and semantic characteristics of feature words and refines the features by recognizing and merging similar ones. A series of experiments performed on actual online review data demonstrated that FEROM is highly effective at extracting and refining features for analyzing customer review data and eventually contributes to accurate and functional opinion mining.

Automatic Product Feature Extraction for Efficient Analysis of Product Reviews Using Term Statistics (효율적인 상품평 분석을 위한 어휘 통계 정보 기반 평가 항목 추출 시스템)

  • Lee, Woo-Chul;Lee, Hyun-Ah;Lee, Kong-Joo
    • The KIPS Transactions:PartB
    • /
    • v.16B no.6
    • /
    • pp.497-502
    • /
    • 2009
  • In this paper, we introduce an automatic product feature extracting system that improves the efficiency of product review analysis. Our system consists of 2 parts: a review collection and correction part and a product feature extraction part. The former part collects reviews from internet shopping malls and revises spoken style or ungrammatical sentences. In the latter part, product features that mean items that can be used as evaluation criteria like 'size' and 'style' for a skirt are automatically extracted by utilizing term statistics in reviews and web documents on the Internet. We choose nouns in reviews as candidates for product features, and calculate degree of association between candidate nouns and products by combining inner association degree and outer association degree. Inner association degree is calculated from noun frequency in reviews and outer association degree is calculated from co-occurrence frequency of a candidate noun and a product name in web documents. In evaluation results, our extraction method showed an average recall of 90%, which is better than the results of previous approaches.

Product Information Extraction System Based on STEP in CPC Environment (협업적 제품 거래 환경에서 STEP 기반의 제품정보 추출 시스템)

  • Keem, Joon-Hyoung;Park, Sang-Ho;Kim, Hyun
    • Proceedings of the KSME Conference
    • /
    • 2003.11a
    • /
    • pp.1840-1845
    • /
    • 2003
  • Collaborative product commerce (CPC) supports a collaboration that a global enterprise and customer related to life cycle of product share product information and a collaboration process for the collaboration, and integrating applications. In this paper, we use common data schema in order to solve a interoperability problem about shared product information between enterprises. And we map to common data schema from each other different data format. Therefore we implement CPC Adaptor in order to integrate distributed product information.

  • PDF

Product Information Extraction System Based on STEP in CPC Environment (협업적 제품 거래 환경에서 STEP 기반의 제품정보 추출 시스템)

  • Park, Sang-Ho;Keem, Joon-Hyoung;Kim, Hyun
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.28 no.5
    • /
    • pp.648-653
    • /
    • 2004
  • Collaborative product commerce (CPC) supports a collaboration that a global enterprise and customer related to lift cycle of product share product information and a collaboration process for the collaboration, and integrating applications. In this paper, we use common data schema in order to solve a interoperability problem about shared product information between enterprises. And we map to common data schema from each other different data format. Therefore we implement CPC Adaptor in order to integrate distributed product information.

Extracting of Features in Code Changes of Existing System for Reengineering to Product Line

  • Yoon, Seonghye;Park, Sooyong;Hwang, Mansoo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.5
    • /
    • pp.119-126
    • /
    • 2016
  • Software maintenance becomes extremely difficult, especially caused by multiple versions in project-based or customer-oriented software development methodology. For reducing the maintenance cost, reengineering to software product line can be a solution to the software which either is a family of products nevertheless little different functionalities or are customized for each different customer's requirement. At an initial stage of the reengineering, the most important activity in software product line is feature extraction with respect to commonality and variability from the existing system due to verifying functional coverage. Several researchers have studied to extract features. They considered only a single version in a single product. However, this is an obstacle to classify the commonality and variability of features. Therefore, we propose a method for systematically extracting features from source code and its change history considering several versions of the existing system. It enables us to represent functionalities reflecting developer's intention, and to clarify the rationale of variation.

Automatic Extraction of Opinion Words from Korean Product Reviews Using the k-Structure (k-Structure를 이용한 한국어 상품평 단어 자동 추출 방법)

  • Kang, Han-Hoon;Yoo, Seong-Joon;Han, Dong-Il
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.6
    • /
    • pp.470-479
    • /
    • 2010
  • In relation to the extraction of opinion words, it may be difficult to directly apply most of the methods suggested in existing English studies to the Korean language. Additionally, the manual method suggested by studies in Korea poses a problem with the extraction of opinion words in that it takes a long time. In addition, English thesaurus-based extraction of Korean opinion words leaves a challenge to reconsider the deterioration of precision attributed to the one to one mismatching between Korean and English words. Studies based on Korean phrase analyzers may potentially fail due to the fact that they select opinion words with a low level of frequency. Therefore, this study will suggest the k-Structure (k=5 or 8) method, which may possibly improve the precision while mutually complementing existing studies in Korea, in automatically extracting opinion words from a simple sentence in a given Korean product review. A simple sentence is defined to be composed of at least 3 words, i.e., a sentence including an opinion word in ${\pm}2$ distance from the attribute name (e.g., the 'battery' of a camera) of a evaluated product (e.g., a 'camera'). In the performance experiment, the precision of those opinion words for 8 previously given attribute names were automatically extracted and estimated for 1,868 product reviews collected from major domestic shopping malls, by using k-Structure. The results showed that k=5 led to a recall of 79.0% and a precision of 87.0%; while k=8 led to a recall of 92.35% and a precision of 89.3%. Also, a test was conducted using PMI-IR (Pointwise Mutual Information - Information Retrieval) out of those methods suggested in English studies, which resulted in a recall of 55% and a precision of 57%.

Hidden Markov Model-based Extraction of Internet Information (은닉 마코브 모델을 이용한 인터넷 정보 추출)

  • Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.3
    • /
    • pp.8-14
    • /
    • 2009
  • A Hidden Markov Model(HMM)-based information extraction method is proposed in this paper. The proposed extraction method is applied to extraction of products' prices. The input of the proposed IESHMM is the URLs of a search engine's interface, which contains the names of the product types. The output of the system is the list of extracted slots of each product: name, price, image, and URL. With the observation data set Maximum Likelihood algorithm and Baum-Welch algorithm are used for the training of HMM and The Viterbi algorithm is then applied to find the state sequence of the maximal probability that matches the observation block sequence. When applied to practical problems, the proposed HMM-based system shows improved results over a conventional method, PEWEB, in terms of recall ration and accuracy.

An Information System Architecture for Extracting Key Performance Indicators from PDM Databases (PDM 데이터베이스로부터 핵심성과지표를 추출하기 위한 정보 시스템 아키텍쳐)

  • Do, Namchul
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.39 no.1
    • /
    • pp.1-9
    • /
    • 2013
  • The current manufacturers have generated tremendous amount of digitized product data to efficiently share and exchange it with other stakeholders or various software systems for product development. The digitized product data is a valuable asset for manufacturers, and has a potential to support high level strategic decision makings needed at many stages in product development. However, the lack of studies on extraction of key performance indicators(KPIs) from product data management(PDM) databases has prohibited manufacturers to use the product data to support the decision makings. Therefore this paper examines a possibility of an architecture that supports KPIs for evaluation of product development performances, by applying multidimensional product data model and on-line analytic processing(OLAP) to operational databases of product data management. To validate the architecture, the paper provides a prototype product data management system and OLAP applications that implement the multidimensional product data model and analytic processing.

Natural Product Extracts for Depression: Analysis of Patent Status in South Korea (우울증에 대한 천연물 추출물 국내 특허 동향 분석)

  • Ga-Young Jung;Celine Jang;Sang-Ho Kim
    • Journal of Oriental Neuropsychiatry
    • /
    • v.34 no.3
    • /
    • pp.247-257
    • /
    • 2023
  • Objectives: To analyze the trend of Korean patents of natural product extracts for depression. This study also aimed to enhance the development and application of natural product extracts for depression in Korean herbal medicine. Methods: We searched the Korea Intellectual Property Rights Information Service and Science ON to collect Korean patent data of natural product extracts for depression. Two authors independently screened and assessed full texts of screened patents for eligibility. Included patents were analyzed both quantitatively and qualitatively. Results: A total of 62 patents from 2002 to December 2021 were included for analysis. The number of patents has constantly increased since 2002. Glycyrrhiza uralensis Fischer, Perilla frutescens Britton var. acuta Kudo, Zizyphus jujuba Miller var. inermis Rehder were frequently used among 62 patents. Alcohol extraction was mostly used, followed by water extraction and ethyl extraction regarding extract methods. Eight patents were herbal medicine used for treating depression in Korean Medicine. Conclusions: Various Korean herbal medicine and minerals were used as natural-products for treating depression. These results provide fundamental data that can be used for inventing new patents using Korean herbal medicine, developing new natural product extracts for depression, and extending the range of application of these products in clinical setting.