• Title/Summary/Keyword: 웹사이트 검색

Search Result 208, Processing Time 0.023 seconds

Construction of Web-Based Information Retrieval System Using Old Maps :Focusing on Kyung Hee University Hyejung Museum (고지도를 이용한 웹기반 정보검색시스템 구축 방안 -경희대학교 혜정박물관 사례를 중심으로-)

  • Oh, Il-Whan;Lee, Seung-Gwan
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.3
    • /
    • pp.56-64
    • /
    • 2011
  • Old maps are a cultural heritage of recorded information with humanities materials and scientific value. However, there is no sufficient study on the development of web-based information system using old maps to provide the old map information. In this study, we analyze the status of the old map information on the website of the main institutions, construct the web-based information systems to provide the unified map information efficiently and systematically. It is not easy to standardize the data categories and information searching method because of the diversity and complexity of old map. So, the importance of engineering information management is growing. Therefore, the attempting to computerized humanistic old map information and the integrated approach is very important and necessary. This study provide an opportunity to combine the humanities and engineering through the convergence between information technology, humanities, and computational engineering.

Automatic Response and Conceptual Browsing of Internet FAQs Using Self-Organizing Maps (자기구성 지도를 이용한 인터넷 FAQ의 자동응답 및 개념적 브라우징)

  • Ahn, Joon-Hyun;Ryu, Jung-Won;Cho, Sung-Bae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.12 no.5
    • /
    • pp.432-441
    • /
    • 2002
  • Though many services offer useful information on internet, computer users are not so familiar with such services that they need an assistant system to use the services easily In the case of web sites, for example, the operators answer the users e-mail questions, but the increasing number of users makes it hard to answer the questions efficiently. In this paper, we propose an assistant system which responds to the users questions automatically and helps them browse the Hanmail Net FAQ (Frequently Asked Question) conceptually. This system uses two-level self-organizing map (SOM): the keyword clustering SOM and document classification SOM. The keyword clustering SOM reduces a variable length question to a normalized vector and the document classification SOM classifies the question into an answer class. Experiments on the 2,206 e-mail question data collected for a month from the Hanmail net show that this system is able to find the correct answers with the recognition rate of 95% and also the browsing based on the map is conceptual and efficient.

Development of an Automated ESG Document Review System using Ensemble-Based OCR and RAG Technologies

  • Eun-Sil Choi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.9
    • /
    • pp.25-37
    • /
    • 2024
  • This study proposes a novel automation system that integrates Optical Character Recognition (OCR) and Retrieval-Augmented Generation (RAG) technologies to enhance the efficiency of the ESG (Environmental, Social, and Governance) document review process. The proposed system improves text recognition accuracy by applying an ensemble model-based image preprocessing algorithm and hybrid information extraction models in the OCR process. Additionally, the RAG pipeline optimizes information retrieval and answer generation reliability through the implementation of layout analysis algorithms, re-ranking algorithms, and ensemble retrievers. The system's performance was evaluated using certificate images from online portals and corporate internal regulations obtained from various sources, such as the company's websites. The results demonstrated an accuracy of 93.8% for certification reviews and 92.2% for company regulations reviews, indicating that the proposed system effectively supports human evaluators in the ESG assessment process.

A Study on the Gaze Flow of Internet Portal Sites Utilizing Eye Tracking (아이트래킹을 활용한 인터넷 포털사이트의 시선 흐름에 관한 연구)

  • Hwang, Mi-Kyung;Kwon, Mahn-Woo;Lee, Sang-Ho;Kim, Chee-Yong
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.2
    • /
    • pp.177-183
    • /
    • 2022
  • This study investigated through eye tracking what gaze path the audience searches through portal sites (Naver, Daum, Zoom, and Nate). As a result of the layout analysis according to the gaze path of the search engine, the four main pages, which can be called to be the gateway to information search, appeared in the form of a Z-shaped layout. The news and search pages of each site use an F-shape, which means that when people's eyes move from top to right in an F-shape, they read while moving their eyes from left to right(LTR), which sequentially moves to the bottom. As a result of analyzing through the heat map, gaze plot, and cluster, which are the visual analysis indicators of eye tracking, the concentration of eyes on the photo and head copy was found the most in the heat map, and it can be said to be of high interest in the information. The flow of gaze flows downward from the top left to the right, and it can be seen that the cluster is most concentrated at the top of the portal site. The website designer should focus on improving the accessibility and readability of the information desired by the user in the layout design, and periodic interface changes are required by investigating and analyzing the tendencies and behavioral patterns of the main users.

A Design and Implementation of MathML-based Math Equation Generating Website (MathML에 기반한 수학식 생성 웹사이트의 설계 및 구현)

  • Park, Jeong-Hee;Lee, Mee-Jeong
    • The Journal of Korean Association of Computer Education
    • /
    • v.6 no.3
    • /
    • pp.173-183
    • /
    • 2003
  • E-learning education methodology using the web has been as much activated with the introduction of the internet to our society. As for the web-based education, there is no exception in case of mathematics. However, when it comes to representing math equations by using HTML image tags, a type of web marked-up language, it can be hard to represent math equations that have structural features, and to do the search, resulting in the difficulty in reusing math related applications. Therefore, based on MathML and using ActiveX control technology, a math equation generating website was designed and implemented in this study. Since this system employed ActiveX control technology, it is possible to generate math equations without the limit of time and place on the web, and to manage the program with the most up-to-dale version. And in this system, it is also possible to save the math equations generated in this system to be referred to for their reuse in the future.

  • PDF

A Study Comparing Public and Medical Librarians' Perceptions of Evaluation Guidelines for Health & Medical Information (건강정보원 평가기준에 대한 공공도서관 및 의학도서관 사서간 인식비교 연구)

  • Noh, Younghee
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.25 no.1
    • /
    • pp.107-129
    • /
    • 2014
  • Providing reliable and high quality information sources will be one of the basic skills of librarians in the future. Therefore, this study proposed evaluation criteria for health-related information sources based on a survey of public and medical librarians. As a result, a total of 21 items were selected as evaluation items, in three groups. The first, the health information content group, had 13 evaluation items, including accuracy, recency, medical expertise, regular updates, consideration of audience, objectivity, ease of understanding, plain (non-scientific or technical) language, completeness, relevance to the topic, verifiability, citation of information sources, and specification of precautions or warnings. The second group, the health-information sources group, had 5 evaluation items including clarity of health information for achieving its purpose, clarification of the responsibility of health information, compliance to the privacy policy, fairness of health information providers, and ethics of health information providers. The third group was the health-information website design group, and featured 4 evaluation criteria: ease of access, search capabilities, website ease of use, and query-response services.

Statistical Metadata for Users: A Case Study on the Level of Metadata Provision on Statistical Agency Websites (웹 이용자를 위한 통계 메타데이터: 통계정보 제공사이트의 메타데이터 제공 수준 평가 사례 연구)

  • Oh, Jung-Sun
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.2
    • /
    • pp.161-179
    • /
    • 2007
  • As increasingly diverse kinds of information materials are available on the Internet, it becomes a challenge to define an adequate level of metadata provision for each different type of material in the context of digital libraries. This study explores issues of metadata provision for a particular type of material, statistical tables. Statistical data always involves numbers and numeric values which should be interpreted with an understanding of underlying concepts and constructs. Because of the unique data characteristics, metadata in the statistical domain is essential not only for finding and discovering relevant data, but also for understanding and using the data found. However, in statistical metadata research, more emphasis has been put on the question of what metadata is necessary for processing the data and less on what metadata should be presented to users. In this study, a case study was conducted to gauge the status of metadata provision for statistical tables on the Internet. The websites of two federal statistical agencies in the United States were selected and a content analysis method was used for that purpose. The result showing insufficient and inconsistent provision of metadata demonstrate the need for more discussions on statistical metadata from the ordinary web users' perspective.

Ontology-based User Customized Search Service Considering User Intention (온톨로지 기반의 사용자 의도를 고려한 맞춤형 검색 서비스)

  • Kim, Sukyoung;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.129-143
    • /
    • 2012
  • Recently, the rapid progress of a number of standardized web technologies and the proliferation of web users in the world bring an explosive increase of producing and consuming information documents on the web. In addition, most companies have produced, shared, and managed a huge number of information documents that are needed to perform their businesses. They also have discretionally raked, stored and managed a number of web documents published on the web for their business. Along with this increase of information documents that should be managed in the companies, the need of a solution to locate information documents more accurately among a huge number of information sources have increased. In order to satisfy the need of accurate search, the market size of search engine solution market is becoming increasingly expended. The most important functionality among much functionality provided by search engine is to locate accurate information documents from a huge information sources. The major metric to evaluate the accuracy of search engine is relevance that consists of two measures, precision and recall. Precision is thought of as a measure of exactness, that is, what percentage of information considered as true answer are actually such, whereas recall is a measure of completeness, that is, what percentage of true answer are retrieved as such. These two measures can be used differently according to the applied domain. If we need to exhaustively search information such as patent documents and research papers, it is better to increase the recall. On the other hand, when the amount of information is small scale, it is better to increase precision. Most of existing web search engines typically uses a keyword search method that returns web documents including keywords which correspond to search words entered by a user. This method has a virtue of locating all web documents quickly, even though many search words are inputted. However, this method has a fundamental imitation of not considering search intention of a user, thereby retrieving irrelevant results as well as relevant ones. Thus, it takes additional time and effort to set relevant ones out from all results returned by a search engine. That is, keyword search method can increase recall, while it is difficult to locate web documents which a user actually want to find because it does not provide a means of understanding the intention of a user and reflecting it to a progress of searching information. Thus, this research suggests a new method of combining ontology-based search solution with core search functionalities provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferenceing the search intention of a user. To that end, we build an ontology which contains concepts and relationships among them in a specific domain. The ontology is used to inference synonyms of a set of search keywords inputted by a user, thereby making the search intention of the user reflected into the progress of searching information more actively compared to existing search engines. Based on the proposed method we implement a prototype search system and test the system in the patent domain where we experiment on searching relevant documents associated with a patent. The experiment shows that our system increases the both recall and precision in accuracy and augments the search productivity by using improved user interface that enables a user to interact with our search system effectively. In the future research, we will study a means of validating the better performance of our prototype system by comparing other search engine solution and will extend the applied domain into other domains for searching information such as portal.

User Perspective Website Clustering for Site Portfolio Construction (사이트 포트폴리오 구성을 위한 사용자 관점의 웹사이트 클러스터링)

  • Kim, Mingyu;Kim, Namgyu
    • Journal of Internet Computing and Services
    • /
    • v.16 no.3
    • /
    • pp.59-69
    • /
    • 2015
  • Many users visit websites every day to perform information retrieval, shopping, and community activities. On the other hand, there is intense competition among sites which attempt to profit from the Internet users. Thus, the owners or marketing officers of each site try to design a variety of marketing strategies including cooperation with other sites. Through such cooperation, a site can share customers' information, mileage points, and hyperlinks with other sites. To create effective cooperation, it is crucial to choose an appropriate partner site that may have many potential customers. Unfortunately, it is exceedingly difficult to identify such an appropriate partner among the vast number of sites. In this paper, therefore, we devise a new methodology for recommending appropriate partner sites to each site. For this purpose, we perform site clustering from the perspective of visitors' similarities, and then identify a group of sites that has a number of common customers. We then analyze the potential for the practical use of the proposed methodology through its application to approximately 140 million actual site browsing histories.

웹사이트 컨텐츠 개발을 위한 청소년의 사이버 영양 정보 및 상담 이용실태와 요구도 분석

  • 이정원;김경은;이선영
    • Korean Journal of Community Nutrition
    • /
    • v.7 no.5
    • /
    • pp.664-674
    • /
    • 2002
  • 청소년을 위한 영양 웹사이트의 컨텐츠 개발을 목적으로 사이버 영양정보 이용 현황과 요구도를 파악하고자 서울, 대전, 광주, 대구의 4개 대도시의 남녀 중고등학생 1262명을 임의로 선정하여 2000년 9월부터 10월에 걸쳐 설문지 조사를 실시하고 분석한 결과는 다음과 같다. 조사대상의 인터넷/PC통신의 이용시간은 전체 하루 평균137.0 $\pm$ 100.6분으로 나타났고 남학생이 여학생보다 평균 20.2분 길었다(p<0.05) 영양정보 급원의 이용 비중은 TV/라디오가 가장 컸고(3.69 $\pm$ 1.48) 인터넷/PC통신의 비중은 남녀, 중고생간의 차이 없이 매우 낮았다(1.30 $\pm$1.53). 인터넷/PC통신의 이용은 주로 게임 오락, 채팅이나 사교, 숙제자료를 찾기 위해서였으며 학교 공부 이외의 정보와 지식을 얻기 위한 활용도는 낮았다. 조사대상 중 인터넷/PC통신을 통해 영양정보 습득 경험이 있는 비율은 전차의 34.5%였으며 여학생(38.3%)이남학생(30.3%)보다 높았다(p<0.01). 영양정보 습득을 위해 인터넷/PC퉁신 이용 빈도는 응답자의 72%가 한 달에 1회 이하였으나, 한 달에 4회 이상 이용자도 10.9%나 되었다. 이용 목적은 ‘숙제를 위하여’가 가장 많았고 ‘자신의 건강’ 또는 ‘다이어트를 위함’이 그 다음 순이었다. 이용한 영양 웹사이트들의 전반적인 만족도는 ‘보통’의 수준(3.05 $\pm$ 0.92)이었으며 별로 또는 전혀 만족하지 않는 비율이 29.2%나 되었는데, 개선점으로 ‘정보의 빈약’(36.3%), ‘내용의 지루함과 흥미 부족’ (23.8%), ‘접속 속도의 느림’ (20.7%)의 순이었고, ‘내용의 난해도’는 전체의 13.0%가 지적하였다. 영양 컨텐츠의 내용에 대한 요구도를 조사한 결과 청소년 수준에 맞도록 쉽고 구체적이며 (71,8%), 새로운 정보의 빠른 업데이트(60.6%), 그리고 화면구성면에서는 쉽게 찾아 들어올 수 있고(88.6%) 복잡하지 않은 화면(61.9%)과 캐릭터를 많이 쓰는 것(51.1%)에 대한 요구도가 높았다. 한편 인터넷/PC통신을 통한 영양상담 경험은 남녀나 중고생간의 차이 없이 전반적으로 매우 적었으며(8.1%) 그중 91%가 한 달에 1회 이하로 저조한 상담 빈도를 보였다. 상담목적은 ‘자신의 건강’, ‘숙제’, ‘다이어트’, ‘가족의 건강을 위해’ 순으로 나타났으며, 상담결과에 대한 만족도는 ‘보통’보다 낮은 수준으로서(2.53 $\geq$ 1.01) ‘별로 또는 전혀 만족하지 않는’ 비율이 45.9%로 나타났다. 영양상담사이트의 개선점으로는 답변의 빈약함(43.2%), 난해성(22.7%), 느린 답변(22.7%) 등을 지적하였으며 친절하고 충분한 답변에 대한 요구도가 크게(48.7%) 나타났다. 조사대상들의 영양정보 습득이나 영양상담의 접속 경로는 주로 검색엔진을 통해서였으며 야후와 다음이 가장 많이 이용되었다.