
Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan; Han, Nam-Gi; Song, Min
    • Journal of Intelligence and Information Systems / v.20 no.2 / pp.109-122 / 2014
  • People now create a tremendous amount of data on social network services (SNS). In particular, the integration of SNS into mobile devices has led to massive data generation that greatly influences society. This phenomenon is unprecedented, and we now live in the Age of Big Data. SNS data qualifies as Big Data because it satisfies the conditions of data amount (volume), data input and output speed (velocity), and diversity of data types (variety). Because SNS Big Data covers the whole of society, the trend of an issue discovered in it can serve as an important new source of value. In this study, a Twitter Issue Tracking System (TITS) is designed and built to meet the need for analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) provide the topic keyword set that corresponds to the daily ranking; (2) visualize the daily time-series graph of a topic over one month; (3) show the importance of a topic through a treemap based on the score system and frequency; and (4) visualize the daily time-series graph of a keyword retrieved by keyword search. The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including stop-word removal and noun extraction, to process unrefined, unstructured data. In addition, such analysis requires recent big data technology that can rapidly process large amounts of real-time data, such as the Hadoop distributed system or NoSQL databases, which are an alternative to relational databases. We built TITS on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from a single node to thousands of machines.
Furthermore, we use MongoDB, which is classified as a NoSQL database. MongoDB is an open-source, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike conventional relational databases, MongoDB has no fixed schema or tables, and its primary goals are data accessibility and data processing performance. In the Age of Big Data, visualization is attractive to the Big Data community because it helps analysts examine such data easily and clearly. Therefore, TITS uses the d3.js library as its visualization tool. This library is designed for creating Data-Driven Documents that bind arbitrary data to the Document Object Model (DOM); it makes interaction with data easy and is useful for managing real-time data streams with smooth animation. In addition, TITS uses Bootstrap, a set of pre-configured style sheets and JavaScript plug-ins, to build the web system. The TITS graphical user interface (GUI) is designed using these libraries and can detect issues on Twitter in an easy and intuitive manner. We demonstrate the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time-series analysis. Third, we develop a web-based system and make it available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.
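The daily keyword ranking and per-keyword time series described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: plain frequency counting stands in for the topic model, and a whitespace tokenizer with a tiny English stop-word list stands in for Korean noun extraction. The names `STOP_WORDS`, `tokenize`, `daily_keyword_counts`, `top_keywords`, and `keyword_series` are hypothetical, as is the `(day, text)` input format.

```python
from collections import Counter, defaultdict

# Hypothetical stop-word list; a real system would use a language-specific one.
STOP_WORDS = {"the", "a", "an", "is", "in", "on", "of", "and", "to", "rt"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop stop words."""
    tokens = []
    for raw in text.lower().split():
        word = raw.strip(".,!?:;\"'()#@")
        if word and word not in STOP_WORDS:
            tokens.append(word)
    return tokens


def daily_keyword_counts(tweets):
    """Map each day to a Counter of keyword frequencies.

    `tweets` is an iterable of (day, text) pairs (an assumed input format).
    """
    counts = defaultdict(Counter)
    for day, text in tweets:
        counts[day].update(tokenize(text))
    return counts


def top_keywords(counts, day, k=3):
    """Return the k most frequent keywords for a day; ties sort alphabetically."""
    day_counts = counts.get(day, Counter())
    ranked = sorted(day_counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def keyword_series(counts, keyword):
    """Return a day-sorted list of (day, frequency) pairs for one keyword."""
    return [(day, counts[day][keyword]) for day in sorted(counts)]


if __name__ == "__main__":
    sample = [
        ("2013-03-01", "The election is on the news"),
        ("2013-03-01", "Election results!"),
        ("2013-03-02", "News of the election"),
    ]
    counts = daily_keyword_counts(sample)
    print(top_keywords(counts, "2013-03-01", k=2))
    print(keyword_series(counts, "election"))
```

A series like the one returned by `keyword_series` is the kind of data a d3.js line chart would consume; at the scale the abstract describes, the counting step would run as a distributed job rather than in memory.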

Study on the effect of small and medium-sized businesses being selected as suitable business types, on the franchise industry (중소기업적합업종선정이 프랜차이즈산업에 미치는 영향에 관한 연구)

  • Kang, Chang-Dong;Shin, Geon-Chel;Jang, Jae Nam
    • Journal of Distribution Research / v.17 no.5 / pp.1-23 / 2012
  • Conflict between major corporations and small and medium-sized businesses is worsening, the trickle-down effect is not working properly, and controversy continues over the effectiveness of the business limiting system. Against this background, the suitable business type designation system for small and medium-sized businesses has been proposed to protect their business domain, resolve polarization between them and large corporations, and protect small family-run stores. Under the current system, applications are received from small and medium-sized manufacturing businesses for 234 items among the suitable business types and items, and the consultative group then selects items by analyzing and investigating actual conditions. Suitable business type designation in the service industry will give priority to business types that are experiencing social conflict. Three major classifications of the service industry related to the livelihood of small and medium-sized businesses will be designated first, and the scope will then be expanded sequentially. However, there is concern that designation as a suitable business type or item will weaken the motivation of small and medium-sized businesses to grow and will decrease consumer welfare. It is also highly likely to operate as a prior regulation, to cause side effects by systematically limiting competition, and to violate the main provisions of the FTA system. Moreover, it has been pointed out that the system does not sufficiently reflect the factor of reverse discrimination against large corporations.
Because conflict between small and medium-sized businesses and large corporations results from the expansion of corporations into service industries unrelated to their core business, it is necessary to introduce advanced contract methods such as a master franchise or local franchise system and to develop local small and medium-sized businesses through a franchise system in order to protect these businesses and dealers. However, although this approach may help strengthen the competitiveness of small and medium-sized franchise businesses by advancing their operational methods a step further, it also has many negative aspects. First, according to the Ministry of Knowledge Economy, the franchise industry strengthens competitiveness through economies of scale by organizing existing individual proprietors, and it raises the success rate of new businesses. The government also treats it as a measure for stabilizing the economy of ordinary people and emphasizes it as a 'useful way' to revitalize the service industry and improve the competitiveness of individual proprietors; it has also contributed to creating jobs and expanding the domestic market by providing various services to consumers. From this viewpoint, franchises fit the purpose of the suitable business type system and are not contrary to it. Second, designation as a suitable business type may decrease investment in overseas expansion, R&D, and food safety, and may negatively affect the expansion of overseas corporations that have entered the domestic market, due to the contraction and low morale of large domestic franchise corporations that are internationally competitive.
Also, because domestic franchise businesses are hard pressed to compete with multinational franchise corporations operating in Korea, the system may make it difficult for domestic franchise businesses to secure international competitiveness and may result in reverse discrimination with respect to these overseas franchise corporations. Third, the designation of suitable business types and items can limit the choices of consumers who have used those products until now and can thereby reduce consumer welfare. Also, because consumer choice may narrow when a few small and medium-sized businesses monopolize the market, causing reverse discrimination among these businesses, the role of determining the utility of products must be left to the consumer, not the government. Lastly, because fair trade is already secured through enforcement of the franchise trade law and the best trade standard of the Fair Trade Commission, it is desirable that the system be carried out only after its deficiencies are remedied. Overlapping regulation through suitable business type designation is an excessive restriction on the franchise industry. It is now necessary to establish an environment in the domestic franchise industry in which global franchise corporations that spread Korean culture around the world can grow, and active government support is needed. Therefore, systems that ignore the process or background of the growth of franchise businesses and harm these businesses solely because they are large corporations must be removed. Inhibiting the growth of franchise enterprises may decrease the sales of franchise stores, in some cases even bankrupt them, and cause other problems.
Therefore, the suitable business type system should not hinder large corporations. Because small dealers and small and medium-sized businesses both aim at improved competitiveness and shared growth, franchise corporations that maintain cooperative business relations with them should be excluded from this system.


Information Privacy Concern in Context-Aware Personalized Services: Results of a Delphi Study

  • Lee, Yon-Nim; Kwon, Oh-Byung
    • Asia Pacific Journal of Information Systems / v.20 no.2 / pp.63-86 / 2010
  • Personalized services directly and indirectly acquire personal data, in part to provide customers with higher-value services that are relevant to a specific context (such as place and time). Information technologies continue to mature and develop, providing greatly improved performance. Sensor networks and intelligent software can now obtain context data, which is the cornerstone of personalized, context-specific services. Yet the danger of personal information leakage is increasing because the data retrieved by sensors usually contains private information. Various technical characteristics of context-aware applications have even more troubling implications for information privacy. In parallel with the increasing use of context for service personalization, information privacy concerns, such as the unrestricted availability of context information, have also increased. These privacy concerns are consistently regarded as a critical issue for the success of context-aware personalized services. Information privacy is growing as an important area of research, with many new definitions and terminologies, because a better understanding of information privacy concepts is needed. In particular, the factors of information privacy should be revised according to the characteristics of new technologies. However, previously proposed information privacy factors for context-aware applications have at least two shortcomings. First, there has been little overview of the technology characteristics of context-aware computing. Existing studies have focused on only a small subset of these technical characteristics. Therefore, there has not been a mutually exclusive set of factors that uniquely and completely describes information privacy in context-aware applications.
Second, user surveys have been widely used to identify factors of information privacy in most studies, despite users' limited knowledge of and experience with context-aware computing technology. Because context-aware services have not yet been widely deployed on a commercial scale, very few people have prior experience with context-aware personalized services. It is difficult to build users' knowledge of context-aware technology even when their understanding is supported in various ways, such as scenarios, pictures, and Flash animations. Consequently, a survey that assumes participants have sufficient experience with or understanding of the technologies it presents may not be valid. Moreover, some surveys rest on simplifying and hence unrealistic assumptions (e.g., they consider only location information as context data). A better understanding of information privacy concern in context-aware personalized services is therefore needed. Hence, the purpose of this paper is to identify a generic set of factors for elemental information privacy concern in context-aware personalized services and to develop a rank-order list of these factors. We consider overall technology characteristics to establish a mutually exclusive set of factors. A Delphi survey, a rigorous data collection method, was used to obtain reliable opinions from experts and to produce a rank-order list. It therefore lends itself well to obtaining a set of universal factors of information privacy concern and their priority. An international panel of researchers and practitioners with expertise in privacy and context-aware systems took part in our research. The Delphi rounds followed the procedure for Delphi studies proposed by Okoli and Pawlowski.
This procedure involves three general rounds: (1) brainstorming for important factors; (2) narrowing down the original list to the most important ones; and (3) ranking the list of important factors. For this round only, experts were treated as individuals, not panels. Adapting Okoli and Pawlowski, we outlined the process of administering the study. We performed three rounds. In the first and second rounds of the Delphi questionnaire, we gathered a set of exclusive factors for information privacy concern in context-aware personalized services. In the first round, the respondents were asked to provide at least five main factors for the most appropriate understanding of the information privacy concern. To help them do so, some of the main factors found in the literature were presented to the participants. The second-round questionnaire presented the main factors provided in the first round, fleshed out with relevant sub-factors drawn from the literature survey. Respondents were then requested to evaluate each sub-factor's suitability against the corresponding main factor to determine the final sub-factors from the candidates. The final factors were those selected by over 50% of experts. In the third round, a list of factors with corresponding questions was provided, and the respondents were requested to assess the importance of each main factor and its corresponding sub-factors. Finally, we calculated the mean rank of each item to produce the final result. While analyzing the data, we focused on group consensus rather than individual insistence. To that end, a concordance analysis, which measures the consistency of the experts' responses over successive rounds of the Delphi, was adopted during the survey process. As a result, experts reported that context data collection and a highly identifiable level of identical data are the most important factors among the main factors and sub-factors, respectively.
Additional important sub-factors included diverse types of context data collected, tracking and recording functionalities, and embedded and disappearing sensor devices. The average score of each factor is useful for future context-aware personalized service development from the viewpoint of information privacy. The final factors differ from those proposed in other studies in the following ways. First, existing studies base their concern factors on privacy issues that may occur during the lifecycle of acquired user information. Our study, in contrast, helps clarify these sometimes vague issues by determining which privacy concern issues are viable based on specific technical characteristics of context-aware personalized services. Since a context-aware service differs from other services in its technical characteristics, we selected specific characteristics that had a higher potential to increase users' privacy concerns. Second, by introducing IPOS as the factor division, this study considered privacy issues in service delivery and display, which existing studies have largely overlooked. Lastly, for each factor, it correlated the level of importance with professionals' opinions on the extent to which users have privacy concerns. The traditional questionnaire method was not chosen because users currently lack understanding of and experience with this new technology. For understanding users' privacy concerns, the professionals in the Delphi questionnaire process selected context data collection, tracking and recording, and sensor networks as the most important factors among the technological characteristics of context-aware personalized services.
For the creation of context-aware personalized services, this study demonstrates the importance and relevance of determining an optimal methodology, including which technologies are needed, and in what sequence, to acquire which types of users' context information. Most studies, assuming the continued development of context-aware technology, focus on which services and systems should be provided and developed by utilizing context information. However, the results of this study show that, in terms of users' privacy, greater attention must be paid to the activities that acquire context information. Following up on the sub-factor evaluation results, additional studies are needed on approaches to reducing users' privacy concerns about technological characteristics such as a highly identifiable level of identical data, diverse types of context data collected, tracking and recording functionality, and embedded and disappearing sensor devices. The factor ranked next in importance after input is context-aware service delivery, which is related to output. The results show that delivering and displaying services to users under the anywhere-anytime-any-device concept is regarded as even more important in context-aware personalized services than in previous computing environments. Considering these concern factors when developing context-aware personalized services will help increase service success rates and, hopefully, user acceptance of those services. Our future work will apply these factors to qualifying context-aware service development projects, such as u-city development projects, in terms of service quality and hence user acceptance.
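The ranking step of this Delphi study, mean ranks per factor plus a concordance check on the experts' agreement, can be sketched in Python as follows. The abstract does not name the concordance statistic; Kendall's coefficient of concordance (W) is assumed here because it is a common choice for Delphi ranking rounds. The function names and the sample rankings are hypothetical, and the formula used applies only to complete rankings without ties.

```python
def mean_ranks(rankings):
    """Average rank of each item across experts.

    `rankings` is a list with one list per expert; rankings[e][i] is the
    rank (1 = most important) that expert e gave to item i.
    """
    n_experts = len(rankings)
    n_items = len(rankings[0])
    return [sum(r[i] for r in rankings) / n_experts for i in range(n_items)]


def kendalls_w(rankings):
    """Kendall's coefficient of concordance for untied, complete rankings.

    W = 12 * S / (m^2 * (n^3 - n)), where m is the number of experts,
    n the number of items, and S the sum of squared deviations of the
    per-item rank sums from their mean. W ranges from 0 (no agreement)
    to 1 (perfect agreement).
    """
    m = len(rankings)
    n = len(rankings[0])
    rank_sums = [sum(r[i] for r in rankings) for i in range(n)]
    mean_sum = m * (n + 1) / 2
    s = sum((rs - mean_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


if __name__ == "__main__":
    # Hypothetical third-round data: three experts ranking three factors.
    experts = [
        [1, 2, 3],
        [1, 3, 2],
        [2, 1, 3],
    ]
    print(mean_ranks(experts))
    print(kendalls_w(experts))
```

Sorting the factors by mean rank gives the final rank-order list, and a value of W close to 1 indicates that the panel has reached consensus and further rounds are unlikely to change the order.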