Search | Korea Science

A Study on the Effectiveness of Bigrams in Text Categorization (바이그램이 문서범주화 성능에 미치는 영향에 관한 연구)

Lee, Chan-Do;Choi, Joon-Young
- Journal of Information Technology Applications and Management
- /
- v.12 no.2
- /
- pp.15-27
- /
- 2005
Text categorization systems generally use single words (unigrams) as features. A deceptively simple algorithm for improving text categorization is investigated here, an idea previously shown not to work. It is to identify useful word pairs (bigrams) made up of adjacent unigrams. The bigrams it found, while small in numbers, can substantially raise the quality of feature sets. The algorithm was tested on two pre-classified datasets, Reuters-21578 for English and Korea-web for Korean. The results show that the algorithm was successful in extracting high quality bigrams and increased the quality of overall features. To find out the role of bigrams, we trained the Na$\"{i}$ve Bayes classifiers using both unigrams and bigrams as features. The results show that recall values were higher than those of unigrams alone. Break-even points and F1 values improved in most documents, especially when documents were classified along the large classes. In Reuters-21578 break-even points increased by 2.1%, with the highest at 18.8%, and F1 improved by 1.5%, with the highest at 3.2%. In Korea-web break-even points increased by 1.0%, with the highest at 4.5%, and F1 improved by 0.4%, with the highest at 4.2%. We can conclude that text classification using unigrams and bigrams together is more efficient than using only unigrams.
PDF

Architectural Reference Model for Semantic Library (시맨틱 라이브러리를 위한 아키텍처 참조 모델)

Han, Sung-Kook;Lee, Hyun-Sil
- Journal of the Korean Society for information Management
- /
- v.24 no.1 s.63
- /
- pp.75-101
- /
- 2007
The current technological revolution pushes forward the innovation in the library information systems. This study proposes functional requirements and an architectural reference model of Semantic Library, recognized as a prototype of next-generation library information systems, that is a seamless convergence of the library information systems and the Internet technologies. Semantic Library can realize semantic interoperability and integration based on ontology and metadata, and also renovate information services for users with openness, sharing, participation and collaboration. Semantic Library will be effectively implemented by means of service-oriented architecture and the logical structure of FRBR. In this study, a reference model of Semantic Library consisting of 6 horizontal layers and 3 vertical elements is presented as a next-generation model of library information systems.
https://doi.org/10.3743/KOSIM.2007.24.1.075 인용 PDF

Development of RESTful Web Service for Loading Data focusing on Daily Meteorological Data (데이터 로딩 자동화를 위한 RESTful 웹서비스 개발 - 일별 기상자료 처리를 중심으로 -)

Kim, Taegon;Lee, JeongJae;Nam, Won-Ho;Suh, Kyo
- Journal of The Korean Society of Agricultural Engineers
- /
- v.56 no.6
- /
- pp.93-102
- /
- 2014
Generally data loading is a laborous job to develop models. Meteorological data is basic input data for hydrological models, it is provided through websites of Korea Meteorological Administration (KMA). The website of KMA provides daily meteorological observation data with tabular format classified by years, items, stations. It is cumbersome to manipulate tabular format for model inputs such as time series and multi-item or multi-station data. The provider oriented services which broadcast restricted formed information have caused inconvenient processes. Tim O'Reilly introduces "Web 2.0" which focuses on providing a service based on data. The top ranked IT companies such as google, yahoo, daum, and naver provide customer oriented services with Open API (Application Programming Interface). A RESTful web service, typical implementation for Open API, consists URI request and HTTP response which are simple and light weight protocol than SOAP (Simple Object Access Protocol). The aim of this study is to develop a web-based service that helps loading data for human use instead of machine use. In this study, the developed RESTful web service provides Open API for manipulating meteorological data. The proposed Open API can easily access from spreadsheet programs, web browsers, and various programming environments.
https://doi.org/10.5389/KSAE.2014.56.6.093 인용 PDF KSCI

Government Website Accessibility: Comparison between Korea and the United States (한국과 미국 정부기관의 웹사이트 접근성 평가)

Hong, Soon-Goo;Cho, Jae-Hyung;Lee, Dae-Hyung
- Information Systems Review
- /
- v.7 no.1
- /
- pp.81-96
- /
- 2005
Because the web sites are in common today, the access to the web for disabled people and old aging people, what we call accessibility, becomes more important. Even though efforts to reduce the informational gap resulted from the lack of the accessibility have been carried out, the studies in this field in Korea are not still in popular. In this study, previous research on the measurements for the accessibility is reviewed and then a new model measuring accessibility is suggested. To increase the validity of the measurement, both an automated tool and a manual test are employed. First we used the 'A-Prompt', one of the popular automated validation tools and analyzed web sources, and applied manual tests by HPR Screen Reader. With the error rates calculated, the accessibility of the government web sites between Korea and the United States was compared and finally the conclusions were drawn.
PDF KSCI

A Dynamic Management Method for FOAF Using RSS and OLAP cube (RSS와 OLAP 큐브를 이용한 FOAF의 동적 관리 기법)

Sohn, Jong-Soo;Chung, In-Jeong
- Journal of Intelligence and Information Systems
- /
- v.17 no.2
- /
- pp.39-60
- /
- 2011
Since the introduction of web 2.0 technology, social network service has been recognized as the foundation of an important future information technology. The advent of web 2.0 has led to the change of content creators. In the existing web, content creators are service providers, whereas they have changed into service users in the recent web. Users share experiences with other users improving contents quality, thereby it has increased the importance of social network. As a result, diverse forms of social network service have been emerged from relations and experiences of users. Social network is a network to construct and express social relations among people who share interests and activities. Today's social network service has not merely confined itself to showing user interactions, but it has also developed into a level in which content generation and evaluation are interacting with each other. As the volume of contents generated from social network service and the number of connections between users have drastically increased, the social network extraction method becomes more complicated. Consequently the following problems for the social network extraction arise. First problem lies in insufficiency of representational power of object in the social network. Second problem is incapability of expressional power in the diverse connections among users. Third problem is the difficulty of creating dynamic change in the social network due to change in user interests. And lastly, lack of method capable of integrating and processing data efficiently in the heterogeneous distributed computing environment. The first and last problems can be solved by using FOAF, a tool for describing ontology-based user profiles for construction of social network. However, solving second and third problems require a novel technology to reflect dynamic change of user interests and relations. In this paper, we propose a novel method to overcome the above problems of existing social network extraction method by applying FOAF (a tool for describing user profiles) and RSS (a literary web work publishing mechanism) to OLAP system in order to dynamically innovate and manage FOAF. We employed data interoperability which is an important characteristic of FOAF in this paper. Next we used RSS to reflect such changes as time flow and user interests. RSS, a tool for literary web work, provides standard vocabulary for distribution at web sites and contents in the form of RDF/XML. In this paper, we collect personal information and relations of users by utilizing FOAF. We also collect user contents by utilizing RSS. Finally, collected data is inserted into the database by star schema. The system we proposed in this paper generates OLAP cube using data in the database. 'Dynamic FOAF Management Algorithm' processes generated OLAP cube. Dynamic FOAF Management Algorithm consists of two functions: one is find_id_interest() and the other is find_relation (). Find_id_interest() is used to extract user interests during the input period, and find-relation() extracts users matching user interests. Finally, the proposed system reconstructs FOAF by reflecting extracted relationships and interests of users. For the justification of the suggested idea, we showed the implemented result together with its analysis. We used C# language and MS-SQL database, and input FOAF and RSS as data collected from livejournal.com. The implemented result shows that foaf : interest of users has reached an average of 19 percent increase for four weeks. In proportion to the increased foaf : interest change, the number of foaf : knows of users has grown an average of 9 percent for four weeks. As we use FOAF and RSS as basic data which have a wide support in web 2.0 and social network service, we have a definite advantage in utilizing user data distributed in the diverse web sites and services regardless of language and types of computer. By using suggested method in this paper, we can provide better services coping with the rapid change of user interests with the automatic application of FOAF.
https://doi.org/10.13088/jiis.2011.17.2.039 인용 PDF KSCI

Performance Analysis of QUIC Protocol for Web and Streaming Services (웹 및 스트리밍 서비스에 대한 QUIC 프로토콜 성능 분석)

Nam, Hye-Been;Jung, Joong-Hwa;Choi, Dong-Kyu;Koh, Seok-Joo
- KIPS Transactions on Computer and Communication Systems
- /
- v.10 no.5
- /
- pp.137-144
- /
- 2021
The IETF has recently been standardizing the QUIC protocol for HTTP/3 services. It is noted that HTTP/3 uses QUIC as the underlying protocol, whereas HTTP/1.1 and HTTP/2 are based on TCP. Differently from TCP, the QUIC uses 0-RTT or 1-RTT transmissions to reduce the connection establishment delays of TCP and SCTP. Moreover, to solve the head-of-line blocking problem, QUIC uses the multi-streaming feature. In addition, QUIC provides various features, including the connection migration, and it is available at the Chrome browser. In this paper, we analyze the performance of QUIC for HTTP-based web and streaming services by comparing with the existing TCP and Streaming Control Transmission Protocol (SCTP) in the network environments with different link delays and packet error rates. From the experimental results, we can see that QUIC provides better throughputs than TCP and SCTP, and the gaps of performances get larger, as the link delays and packet error rates increase.
https://doi.org/10.3745/KTCCS.2021.10.5.137 인용 PDF KSCI

Calibration of Portable Particulate Mattere-Monitoring Device using Web Query and Machine Learning

Loh, Byoung Gook;Choi, Gi Heung
- Safety and Health at Work
- /
- v.10 no.4
- /
- pp.452-460
- /
- 2019
Background: Monitoring and control of PM_2.5 are being recognized as key to address health issues attributed to PM_2.5. Availability of low-cost PM_2.5 sensors made it possible to introduce a number of portable PM_2.5 monitors based on light scattering to the consumer market at an affordable price. Accuracy of light scatteringe-based PM_2.5 monitors significantly depends on the method of calibration. Static calibration curve is used as the most popular calibration method for low-cost PM_2.5 sensors particularly because of ease of application. Drawback in this approach is, however, the lack of accuracy. Methods: This study discussed the calibration of a low-cost PM_2.5-monitoring device (PMD) to improve the accuracy and reliability for practical use. The proposed method is based on construction of the PM_2.5 sensor network using Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring station (GAMS) in the republic of Korea. Four machine learning (ML) algorithms such as support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting were used as regression models to calibrate the PMD measurements of PM_2.5. Performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of ML algorithms used, regression of the output of the PMD to PM_2.5 concentrations data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance with a mean coefficient of determination (R²) of 0.78 and standard error of 5.0 ㎍/㎥, corresponding to 8% increase in R² and 12% decrease in root mean square error in comparison with the linear regression model. Minimum 100 hours of calibration period was found required to calibrate the PMD to its full capacity. Calibration method proposed poses a limitation on the location of the PMD being in the vicinity of the GAMS. As the number of the PMD participating in the sensor network increases, however, calibrated PMDs can be used as reference devices to nearby PMDs that require calibration, forming a calibration chain through MQTT protocol. Conclusions: Calibration of a low-cost PMD, which is based on construction of PM_2.5 sensor network using MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of a PMD, thereby making practical use of the low-cost PMD possible.
https://doi.org/10.1016/j.shaw.2019.08.002 인용 PDF KSCI

An Empirical Analysis of Influential Factors for Widget Interface : Extended TAM Including Attributes (Widget 인터페이스 영향요인 분석 : 속성을 고려한 확장된 기술수용모형)

Han, Mi-Ran;Lee, Sung-Joo;Park, Peom
- Journal of Korea Society of Industrial Information Systems
- /
- v.15 no.2
- /
- pp.127-137
- /
- 2010
A Widget platform is acknowledged to be a next generation intelligent platform that is well suited to Web 2.0 and mobile convergence environments. With prospects of growth, examining users' perceptions of current widgets can be a valuable source of information in setting directions for Widget's future development. This study identifies user interface factors that affect widget usability and investigates a strategic approach to promoting the use of widgets by analyzing user's "intention to use" in connection with the identified interface factors. The experimental results show the consistency, intuition, minimal action, and personalization have a positive(+) effect on perceived ease of use and that personalization and design have a causal effect on perceived enjoyment. Inaddition, perceived ease of use has an influence on perceived enjoyment that, inturn, has a direct influence on intention to use. On the other hand, the hypothesis that perceived ease of use has a direct effect on intention to use was rejected.
https://doi.org/10.9723/jksiis.2010.15.2.127 인용 PDF KSCI

Design and Implementation of Distributed QoS Management Architecture for Real-time Negotiation and Adaptation Control on CORBA Environments (CORBA 환경에서 실시간 협약 및 작응 제어를 위한 분사 QoS 관리 구조의 설계 및 구현)

Lee, Won-Jung;Shin, Chang-Sun;Jeong, Chang-Won;Joo, Su-Chong
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.27 no.1C
- /
- pp.21-35
- /
- 2002
Nowadays, in accordance with increasing expectations of multimedia stream service on the internet, a lot of distributed applications are being required and developed. But the models of the existing systems have the problems that cannot support the extensibility and the reusability, when the QoS relating functions are being developed as an integrated modules which are suited on the centralized controlled specific-purpose application services. To cope with these problems, it is suggested in this paper to a distributed QoS management system on CORBA, an object-oriented middleware compliance. This systems we suggested can provides not only for efficient control of resources, various service QoS, and QoS control functions as the existing functions, but also QoS control real-time negotiation and dynamic adaptation in addition. This system consists of QoS Control Management Module(QoS CMM) in client side and QoS Management Module(QoS MM) in server side, respectively. These distributed modules are interfacing with each other via CORBA on different systems for distributed QoS management while serving distributed streaming applications. In phase of design of our system, we use UML(Unified Modeling Language) for designing each component in modules, their method calls and various detailed functions for controlling QoS of stream services. For implementation of our system, we used OrbixWeb 3.1c following CORBA specification on Solaris 2.5/2.7, Java language, Java Media Framework API 2.0 beta2, Mini-SQL 1.0.16 and the multimedia equipments, such as SunVideoPlus/Sun Video capture board and Sun Camera. Finally, we showed a numerical data controlled by real-time negotiation and adaptation procedures based on QoS map information to GUIs on client and server dynamically, while our distributed QoS management system is executing a given streaming service.
PDF KSCI

Comparison Shopping System Based on RSS with Ontology Matching (온톨로지 매칭을 이용한 RSS 기반의 비교쇼핑 시스템)

Park, Sang-Un
- The Journal of Information Systems
- /
- v.20 no.3
- /
- pp.41-61
- /
- 2011
In order to buy products through the Internet, consumers dissipate much time and efforts in collecting and comparing product information from various online shopping malls. Consumers can save their efforts by using price comparison sites, but there are some shortcomings in comparison shopping. Firstly, comparison sites do not show the lowest price of some products that are selling in shopping malls. Secondly, the product information provided by comparison sites is sometimes wrong. Thirdly, there are too many results. In order to overcome the shortcomings, we suggested a comparison shopping system based on RSS by using ontology matching. We used the current RSS standard for syntactic interoperability instead of suggesting new standards. Moreover, we used ontology matching for semantic interoperability to compare product information with different ontologies. The suggested ontology matching consists of three steps. The first step is finding exact sense from WordNet for a given product category, and the second step is searching for matching product category candidates from the products of RSS feeds. The final step is calculating similarities of the candidates with the target product category. From the experiments, we could get better recall rates that are suitable for e-commerce environments and the results show that our system is effective in product comparison.
https://doi.org/10.5859/KAIS.2011.20.3.41 인용 PDF

Search Result 178, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)