• Title/Summary/Keyword: StopWords


A Study on the Analysis of Reliability and Loss Cost by Appling k out of n System in Combined On-board Signaling System (차상통합신호시스템에서 k out of n 시스템 적용에 대한 신뢰도 및 손실비용 분석에 관한 연구)

  • Kim, Min-Kyu; Cha, Gi-Ho; Kim, Min-Seok; Lee, Jong-Woo
    • Journal of the Korean Society for Railway / v.15 no.1 / pp.42-47 / 2012
  • Train control systems include ATC (Automatic Train Control), ATP (Automatic Train Protection), ATS (Automatic Train Stop), and ATO (Automatic Train Operation). Because different train control systems are installed on different sections, each train carries the on-board signaling system that matches its section, which reduces operational flexibility: when a train runs on a section that uses a different train control system, its on-board signaling system must be changed. Combined on-board signaling systems have recently been studied to solve this problem. The combined on-board signaling system consists of ATC, ATP, and ATS devices. Because train control systems are vital, the combined on-board signaling system needs to be designed as a k out of n system. This paper analyzes the reliability and loss cost of the combined on-board signaling system under a k out of n design, using the failure rate of each device, and presents the ideal number of systems according to the number of outputs.
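
A minimal sketch of the kind of reliability calculation a k out of n analysis involves. The abstract does not give the paper's model, so the code assumes n identical, independent devices, each with a constant failure rate (exponential lifetime), and uses the standard binomial formula for a k out of n system. The function names, the failure rate, and the mission time are illustrative assumptions, and the loss cost analysis is not modeled.

```python
from math import comb, exp


def unit_reliability(failure_rate: float, hours: float) -> float:
    """Reliability of one device, assuming a constant failure rate (per hour)."""
    return exp(-failure_rate * hours)


def k_out_of_n_reliability(k: int, n: int, r: float) -> float:
    """Probability that at least k of n independent, identical devices work.

    r is the reliability of a single device.
    """
    if not 0 <= k <= n:
        raise ValueError("require 0 <= k <= n")
    if not 0.0 <= r <= 1.0:
        raise ValueError("r must be a probability")
    # Sum the binomial probabilities of exactly i working devices, i = k..n.
    return sum(comb(n, i) * r**i * (1.0 - r) ** (n - i) for i in range(k, n + 1))


if __name__ == "__main__":
    # Hypothetical failure rate and mission time, for illustration only.
    r = unit_reliability(failure_rate=1e-5, hours=1000.0)
    for n in (2, 3, 4):
        print(f"2 out of {n}: {k_out_of_n_reliability(2, n, r):.6f}")
```

For a fixed k, adding devices raises reliability but also raises cost, which is the trade-off a loss cost analysis would have to weigh.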

The Development of an Automatic Indexing System based on a Thesaurus (시소러스를 기반으로 하는 자동색인 시스템에 관한 연구)

  • 임형묵; 정상철
    • Korean Journal of Cognitive Science / v.4 no.1 / pp.213-242 / 1993
  • During the past decades, several automatic indexing systems have been developed, such as single term indexing, phrase indexing, and thesaurus based indexing systems. Among these, single term indexing has been known to be superior to the others despite its simplicity in extracting meaningful terms. Thesaurus based indexing, on the other hand, has been regarded as producing a low retrieval rate, mainly because thesauri do not usually have enough index terms, so much of the text data fails to be indexed if it does not match any index term in the thesaurus. This paper develops a thesaurus based indexing system, THINS, that yields a higher retrieval rate than other systems by analyzing text data syntactically and matching it partially with index terms in thesauri. First, the system analyzes the input text syntactically using the machine translation system MATES/EK and extracts noun phrases. After deleting stop words from the noun phrases and stemming the remaining words, it tries to index them with similar index terms in the thesaurus as much as possible. We conduct an experiment with the CACM data set that compares the retrieval effectiveness of THINS with that of single term based indexing under HYKIS, a thesaurus based information retrieval system. THINS yields about 10 percent higher precision than the single term based system, while showing 8 to 9 percent lower recall. This is a considerable improvement over previous thesaurus based systems, which yield 25 or 30 percent lower precision than the single term based one. We also argue that the relatively lower recall is caused by the fact that CRCS, the thesaurus included in the CACM data set, is very incomplete, having only slightly more than one thousand terms; THINS is therefore expected to produce a much higher rate if it is paired with a currently available large thesaurus.
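
The indexing steps described in this abstract (stop word deletion, stemming, partial matching against thesaurus terms) can be sketched roughly as follows. This is not the THINS implementation: the stop word list, the suffix-stripping stemmer, the overlap threshold, and the sample thesaurus are all assumptions made for illustration, and noun phrase extraction (done with MATES/EK in the paper) is taken as already performed.

```python
# Tiny illustrative stop word list (an assumption, not the paper's list).
STOP_WORDS = {"a", "an", "and", "for", "in", "of", "the", "to"}


def naive_stem(word):
    """Strip a few common suffixes; a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize(phrase):
    """Lowercase a phrase, drop stop words, and stem what remains."""
    return {naive_stem(t) for t in phrase.lower().split() if t not in STOP_WORDS}


def index_phrase(noun_phrase, thesaurus, min_overlap=0.5):
    """Return thesaurus terms that partially match the noun phrase.

    A term matches when at least min_overlap of its stems occur in the phrase.
    """
    phrase_stems = normalize(noun_phrase)
    matches = []
    for term in thesaurus:
        term_stems = normalize(term)
        if not term_stems:
            continue
        overlap = len(phrase_stems & term_stems) / len(term_stems)
        if overlap >= min_overlap:
            matches.append(term)
    return matches


if __name__ == "__main__":
    thesaurus = ["information retrieval", "machine translation", "operating systems"]
    print(index_phrase("the retrieval of documents", thesaurus))
```

Partial matching is what lets a phrase be indexed even when the thesaurus lacks an exact entry for it, at the price of some looser matches.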

An Exploratory Study on Construction of Electronic Government as Platform with Customized Public Services : to Improve Administrative Aspects of Administrative Processes and Information Systems (맞춤형 공공서비스제공을 위한 플랫폼 전자정부 구축방안에 대한 탐색적 연구: 행정프로세스와 행정정보시스템 개선측면에서)

  • Lee, Sang-Yun; Chung, Myungju
    • Journal of Digital Convergence / v.14 no.1 / pp.1-11 / 2016
  • The Korean government is currently rushing to introduce a new electronic government system, 'platform e-government', built on big data and cloud computing technologies, ultimately intending to provide customized public services for heterogeneous resident services from an integrated counter or window. In this regard, this study suggests how to design a new metadata information system that enables mutual integration of information systems and efficient sharing of heterogeneous services at the application and data level. The system is proposed as a separate application that can provide a single one-stop service for residents' petitions at the integrated level in the back office, based on the public data held by each government ministry and related organization. If the proposed system is implemented, customized public service can advance one step further in processing residents' petitions through an organic link between the 'Demand Chain' and the 'Supply Chain' at the integrated window. In other words, this becomes possible by unifying the 'Supply Chain' performed in the officials' office space at the back-office level with the 'Demand Chain' performed in the residents' living space at the front-office level.

Hwarang's journey and Hyangga (화랑의 순유(巡遊)와 향가)

  • Shin, Jae-hong
    • Journal of Korean Classical Literature and Education / no.15 / pp.67-88 / 2008
  • Although only a small number of Hyangga have been handed down to the present, their contents are diverse and abundant, so it is possible to survey Hyangga as journey literature of the Middle Ages. To this end, we can look into the Hwarang's group journeys, because the Hwarang were one of the main groups that enjoyed Hyangga. The Hwarang's group journeys show many aspects. They made journeys for public purposes, such as inspection tours of people's daily lives and of the fortresses in the country's peripheral areas. They also made journeys for personal purposes, such as enjoying attractive views of mountains and rivers or seeking pretty girls outside the palace. On these journeys, the Hwarang composed and enjoyed Hyangga. Among the Hyangga that remain today, Hyeseongga and Cheoyongga are directly related to the Hwarang's journeys. Hyeseongga was composed to eliminate the calamities that occurred at the start of a journey. The poem expresses that the Hwarang could travel peacefully on the condition that the celestial objects shed light on the earthly path. As such, the trip becomes a sacred ceremony. Cheoyongga reflects the fact that the foreigner Cheoyong became a Hwarang and toured the streets of Seorabeol, the capital of Shilla. Cheoyong's bitterness over broken love is expressed in this poem. SongSadahamga and MoJukjirangga fall under a broad category of the Hwarang's journey literature. SongSadahamga is a farewell poem for a Hwarang leaving to fight on the battlefield. Journeys to battle were common in the Shilla period, so many Hyangga would have been composed in such situations. MoJukjirangga tells of a Hwarang's trip to save his follower, who had been taken by another senior. It expresses the intimate relationship between the Hwarang and the follower. Though their lyrics have not survived, Hyeongeumpogok, Daedogok, and Mungungok were created during the Hwarang's journeys. These seem to be a series of poems with the characteristic features of the Hwarang's journey literature. The king's open mind and liberal political views are reflected in these poems. In short, the Hwarang created and enjoyed Hyangga on their journeys, so Hyangga has the features of journey literature in the Korean Middle Ages.

An exploratory study for the development of a education framework for supporting children's development in the convergence of "art activity" and "language activity": Focused on Text mining method ('미술'과 '언어' 활동 융합형의 아동 발달지원 교육 프레임워크 개발을 위한 탐색적 연구: 텍스트 마이닝을 중심으로)

  • Park, Yunmi; Kim, Sijeong
    • Journal of the Korea Convergence Society / v.12 no.3 / pp.297-304 / 2021
  • This study aims not only to draw on the visual thought-oriented approach used in established art therapy and education but also to integrate language education and a therapeutic approach to support the development of school-age children. To this end, a text mining technique was applied to search for areas where the distinct fields of language and art can be integrated. The research followed this procedure: basic research, preliminary DB construction, text screening, DB pre-processing and confirmation, stop word removal, text mining analysis, and deduction of the convergent areas. The results yielded convergence areas related to regional, communication, and learning functions; problem solving and sensory organs; art and intelligence; information and communication; home and disability; and topics, conceptualization, peer-related areas, integration, reorganization, and attitudes. In conclusion, this study is meaningful in that it established a framework for designing future activity-centered convergence programs of art and language and attempted a holistic approach to supporting child development.

Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach

  • Marwat, M. Irfan; Khan, Javed Ali; Alshehri, Dr. Mohammad Dahman; Ali, Muhammad Asghar; Hizbullah; Ali, Haider; Assam, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.3 / pp.830-860 / 2022
  • [Introduction] Many companies are now moving their businesses online because customers increasingly prefer to shop and purchase products online. [Problem] Users share a vast amount of information about products, which makes it difficult for end-users to make decisions. [Motivation] Therefore, we need a mechanism that automatically analyzes end-user opinions, thoughts, and feelings about products on social media platforms, which could help customers make or change their purchasing decisions. [Proposed Solution] For this purpose, we propose an automated SentiDeceptive approach, which classifies end-user reviews into negative, positive, and neutral sentiments and identifies deceptive crowd-user rating information on social media platforms to help users in decision-making. [Methodology] We first collected 11781 end-user comments from the Amazon store and the Flipkart web application covering distinct products, such as watches, mobiles, shoes, clothes, and perfumes. Next, we developed a coding guideline used as a base for the comment annotation process. We then applied the content analysis approach and the existing VADER library to annotate the end-user comments in the data set with the identified codes, which resulted in a labelled data set used as input to the machine learning classifiers. Finally, we applied the sentiment analysis approach to identify end-user opinions and overcome deceptive rating information on social media platforms: we preprocessed the input data to remove irrelevant content (stop words, special characters, etc.), balanced the data set with two standard resampling approaches (oversampling and under-sampling), extracted different features (TF-IDF and BOW) from the textual data, and then trained and tested the machine learning algorithms using standard cross-validation approaches (KFold and Shuffle Split). [Results/Outcomes] Furthermore, to support our research study, we developed an automated tool that analyzes each piece of customer feedback and displays the collective sentiment of customers about a specific product as a graph, which helps customers make decisions. In a nutshell, our proposed sentiment approach produces good results when identifying customer sentiments from online user feedback: it obtained an average of 94.01% precision, 93.69% recall, and 93.81% F-measure for classifying positive sentiments.
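
The preprocessing and feature extraction steps named in this abstract (stop word and special character removal, then BOW and TF-IDF features) can be illustrated with a small standard-library sketch. The abstract does not say which tools or TF-IDF variant the authors used, so everything here is an assumption: the stop word list is a toy one, term frequency is count divided by document length, and inverse document frequency is ln(N / document frequency). Resampling, classifier training, and cross-validation are left out.

```python
import math
import re
from collections import Counter

# Toy stop word list for illustration only.
STOP_WORDS = {"a", "an", "and", "is", "it", "the", "this", "to"}


def preprocess(text):
    """Lowercase, drop special characters and digits, remove stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def bag_of_words(tokens):
    """Bag-of-words (BOW) features: raw term counts."""
    return Counter(tokens)


def tf_idf(documents):
    """Return one {term: weight} dict per document.

    Assumed variant: tf = count / document length, idf = ln(N / doc frequency).
    """
    tokenized = [preprocess(d) for d in documents]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = bag_of_words(tokens)
        total = len(tokens)
        weights.append(
            {t: (c / total) * math.log(n_docs / doc_freq[t]) for t, c in counts.items()}
        )
    return weights


if __name__ == "__main__":
    reviews = ["Great watch, great price!", "The watch is bad."]
    for review, features in zip(reviews, tf_idf(reviews)):
        print(review, features)
```

Under this variant a term that appears in every review (here "watch") gets weight zero, which is why TF-IDF tends to highlight the words that distinguish one review from another.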

A Case Study of Improving Instruction by Utilizing Online Instruction Diagnosis Item Pool

  • SHIM, Mi-Ja
    • Educational Technology International / v.6 no.2 / pp.23-41 / 2005
  • One of the main factors that determine the quality of instruction is the teaching ability of the instructor administering the class. To evaluate teaching ability, methods such as peer review, student feedback, and teaching portfolios can be used. Among these, feedback from students is directly associated with how well the students feel they have learned, so it is essential to improving instruction and teaching ability. The principal aim of instruction evaluation lies in evaluating the instructor's qualifications and improving instruction quality by enhancing professionalism. However, the mandatory instruction evaluations currently carried out at the end of the term in universities have limitations in improving instruction because of their evaluation items and timing. To improve the quality of instruction and raise teaching abilities, instruction evaluations should not stop at simply being carried out but should also serve as useful data for students and teachers. In other words, they need to be used to develop teaching and improve instruction for teachers and, consequently, should also exert a positive influence on students' scholastic achievement and learning ability. The most important thing in evaluation is acquiring accurate information and knowing how to use it to improve instruction. The online instruction diagnosis item pool is a more realistic feedback device developed to improve instruction quality. It is a cafeteria-like collection of hundreds of feedback questions that enables instructors to diagnose their instruction through self-diagnosis or students' feedback; instructors can directly select the questions appropriate to the particular characteristics of their instruction and voluntarily use them whenever they are needed. To find out whether the online instruction diagnosis item pool is truly useful in reforming and improving instruction, the current study conducted pre- and post-tests with 256 undergraduate students from Y university as subjects and studied the effects of student feedback on instruction. Results showed that implementing instruction diagnosis improved students' sense of responsibility for their classes, and that students had positive opinions about the usefulness of the online instruction diagnosis item pool in instruction evaluation. In addition, analyzing the results after instruction diagnosis through consultations with education development specialists, and then establishing and carrying out instruction reforms, was shown to be more effective. To use the instruction diagnostic system more effectively, a systematic strategy is needed that runs from planning the instruction diagnosis to analyzing the results, consulting, and deciding how those results can be applied to instruction. In addition, professors and students need to develop a more active sense of ownership in order to raise the level of their instruction.

Latent topics-based product reputation mining (잠재 토픽 기반의 제품 평판 마이닝)

  • Park, Sang-Min; On, Byung-Won
    • Journal of Intelligence and Information Systems / v.23 no.2 / pp.39-70 / 2017
  • Data-driven analytics techniques have recently been applied to public surveys. Instead of simply gathering survey results or expert opinions to research the preference for a recently launched product, enterprises need a way to collect and analyze various types of online data and then accurately figure out customer preferences. In existing data-based survey methods, the sentiment lexicon for a particular domain is first constructed by domain experts, who judge the positive, neutral, or negative meanings of the frequently used words in the collected text documents. To research the preference for a particular product, the existing approach (1) collects review posts related to the product from several product review web sites; (2) extracts sentences (or phrases) from the collection after pre-processing steps such as stemming and stop word removal; (3) classifies the polarity (positive or negative) of each sentence (or phrase) based on the sentiment lexicon; and (4) estimates the positive and negative ratios of the product by dividing the numbers of positive and negative sentences (or phrases) by the total number of sentences (or phrases) in the collection. Furthermore, the existing approach automatically finds important sentences (or phrases) that carry positive or negative meaning about the product. As a motivating example, given a product like the Sonata made by Hyundai Motors, customers often want to see a summary note covering the positive points in the 'car design' aspect as well as the negative points in the same aspect. They also want more useful information regarding other aspects such as 'car quality', 'car performance', and 'car service.' Such information will enable customers to make good choices when they purchase brand-new vehicles. In addition, automobile makers will be able to figure out the preferences and the positive and negative points for new models on the market, and the weak points of the models can then be improved based on the sentiment analysis. For this, the existing approach computes the sentiment score of each sentence (or phrase) and then selects the top-k sentences (or phrases) with the highest positive and negative scores. However, the existing approach has several shortcomings that limit its use in real applications. Its main disadvantages are as follows: (1) The main aspects (e.g., car design, quality, performance, and service) of a product (e.g., Hyundai Sonata) are not considered. Because the sentiment analysis ignores aspects, customers and car makers receive only a summary note containing the positive and negative ratios of the product and the top-k sentences (or phrases) with the highest sentiment scores in the entire corpus. This is not enough; the main aspects of the target product need to be considered in the sentiment analysis. (2) In general, since the same word has different meanings across domains, a sentiment lexicon suited to each domain needs to be constructed. An efficient way to construct the sentiment lexicon per domain is required because sentiment lexicon construction is labor intensive and time consuming. To address these problems, this article proposes a novel product reputation mining algorithm that (1) extracts topics hidden in review documents written by customers; (2) mines main aspects based on the extracted topics; (3) measures the positive and negative ratios of the product using the aspects; and (4) presents a digest in which a few important sentences with positive and negative meanings are listed for each aspect. Unlike the existing approach, using hidden topics lets experts construct the sentiment lexicon easily and quickly. Furthermore, by reinforcing topic semantics, we can improve the accuracy of the product reputation mining algorithm well beyond that of the existing approach. In the experiments, we collected a large set of review documents on domestic vehicles such as the K5, SM5, and Avante; measured the positive and negative ratios of the three cars; showed top-k positive and negative summaries per aspect; and conducted statistical analysis. Our experimental results clearly show the effectiveness of the proposed method compared with the existing method.
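
Steps (3) and (4) of the existing lexicon-based approach described in this abstract, classifying each sentence by polarity and then computing positive and negative ratios, might look roughly like the sketch below. The lexicon entries, the scoring rule (sum of word scores), and the sample sentences are hypothetical; the abstract does not specify them, and stemming and stop word removal from step (2) are omitted for brevity.

```python
import re


def sentence_polarity(sentence, lexicon):
    """Label a sentence by the sign of its summed lexicon scores."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    score = sum(lexicon.get(token, 0) for token in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def polarity_ratios(sentences, lexicon):
    """Return (positive ratio, negative ratio) over all sentences."""
    if not sentences:
        return 0.0, 0.0
    labels = [sentence_polarity(s, lexicon) for s in sentences]
    total = len(labels)
    return labels.count("positive") / total, labels.count("negative") / total


if __name__ == "__main__":
    # Hypothetical domain lexicon: word -> sentiment score.
    lexicon = {"good": 1, "great": 1, "poor": -1, "noisy": -1}
    reviews = ["Good design", "Noisy engine", "Great service", "Average mileage"]
    print(polarity_ratios(reviews, lexicon))
```

Because the ratios are computed over the whole collection, this baseline cannot say which aspect of the product a sentence is about, which is the first shortcoming the abstract points out.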

The Current Status and Prospect of Presidential Records Management (대통령기록관리의 현황과 전망)

  • Zoh, Young-Sam
    • The Korean Journal of Archival Studies / no.21 / pp.283-322 / 2009
  • The legislation and enforcement of the Presidential Records Management Law was an important turning point in the history of Korean archival management. In the past, the notion of presidential records was vague. The law was the starting point for establishing presidential records management. The Presidential Records Management Law defines presidential records and their scope, and protects presidential records by restricting access to them. The key to the law is to enable a president to produce records freely and transfer them to the next administration without omission. In other words, it aims to stop the practice in which presidential records are produced but never left behind. However, 'disputes over the release of presidential records' and the disclosure of access-restricted presidential records presented a crisis for national records management as well as for the prospects of presidential records management, even if they were 'legal procedures.' The instability of presidential records management could have a serious impact on national records management and its operation. In this situation, the presidential records management system needs to be reviewed and recommendations for improvement provided, even though enforcement of the law has only just started. The most urgent tasks in improving presidential records management are to secure its independence and specialty and to complement the restrictions on access to presidential records. To secure independence, presidential records management should be handled by a separate organization other than the National Archives of Korea, while to promote specialty, a newly established organization could serve as a professional archive. To complement the restrictions on access to presidential records, access should be more limited. In other words, more discretion is needed in permitting access, and more specific regulations should be applied to the permitted records. However, these regulatory actions may have no effect unless independence is secured. Thus, more fundamentally, the independence of the National Archives of Korea should be established first.

E-Commerce in the Historical Approach to Usage and Practice of International Trade ("무역상무(貿易商務)에의 역사적(歷史的) 어프로치와 무역취인(貿易取引)의 전자화(電子化)")

  • Tsubaki, Koji
    • THE INTERNATIONAL COMMERCE & LAW REVIEW / v.19 / pp.224-242 / 2003
  • The author believes that the main task in the study of international trade usage and practice is the management of the transactional risks involved in the international sale of goods. These include foreign exchange risks, transportation risks, credit risk, and the risk of miscommunication. In most cases, these risks are more serious and larger than those involved in domestic sales. Historically, until around the mid-19th century, merchant adventurers organized voyages abroad, secured trade finance, and sailed the oceans with their own or consigned cargo. They did business face-to-face at trade fairs or at open ports where they maintained local offices, the so-called "Trading House" (商館). Therefore, the transactional risks might have fallen one-sidedly on either the seller or the buyer. Bottomry seems to have been a typical arrangement for sharing risk among the parties interested in the adventure. In this way, such organizational arrangements coped with or bore the transactional risks. With the advent of ocean liner services and wireless communication across national borders in the 19th century, the business of merchant adventurers developed toward a clear division of labor: sales by mercantile agents, and ocean transportation by steamship companies. International banking helped accelerate the process. Then, bills of lading backed by statute made it possible to conduct documentary sales with a foreign partner in a different country. Thus, FOB terms including ocean freight and CIF terms gradually emerged as standard trade terms in which transactional risks were allocated through negotiation between a seller and a buyer located in different countries. Neither of them had to go abroad with the cargo. Instead, documentation in compliance with the terms of the contract (plus an L/C in some cases) must be 'strictly' fulfilled. In other words, the set of contractual documents must be tendered before the goods arrive at the port of discharge. Trust or reliance is placed on such contractual paper documents. However, the container transport services introduced as international intermodal transport since the late 1960s frequently caused the goods to arrive at the destination before the presentation of the set of paper documents, which may take 5 to 10% of the amount of the transaction. In addition, the size of container vessels required speedy transport documentation before sailing from the port of loading. In these circumstances, computerized processing of transport-related documents became essential for low transaction costs and uninterrupted distribution of goods. Such computerization does not stop at the transportation phase but extends to cover the whole process of international trade, transforming documentary sales into less-paper trade and further into paperless trade, i.e., EDI or E-Commerce. Now we face the other side of the coin: data security and the paperless transfer of legal rights and obligations. Unfortunately, these issues are not effectively covered by a set of contracts alone. EDI or E-Commerce is based on a common business process and a harmonized system of various data codes as well as standard message formats. This essential feature of E-Commerce requires effective coordination among different divisions of a business and tight control over credit arrangements, in addition to the standard contract of sale. In short, information does not always invite "trust". Credit flows from people, or from close organizational tie-ups. It is our common understanding that, without well-orchestrated organizational arrangements made by leading companies, E-Commerce does not work well for paperless trade. With such arrangements well in place, participating E-business members do not need to worry seriously about credit risk. Finally, it is also clear that E-International Commerce must be linked with a set of government EDIs such as NACCS, Port EDI, and JETRAS in Japan. Therefore, there is still a long way to go before E-Commerce works in practice, not just on the information manager's desk.
