• Title/Summary/Keyword: Open Source Community

Search Result 75, Processing Time 0.028 seconds

Model Interpretation through LIME and SHAP Model Sharing (LIME과 SHAP 모델 공유에 의한 모델 해석)

  • Yong-Gil Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.2
    • /
    • pp.177-184
    • /
    • 2024
  • In the situation of increasing data at fast speed, we use all kinds of complex ensemble and deep learning algorithms to get the highest accuracy. It's sometimes questionable how these models predict, classify, recognize, and track unknown data. Accomplishing this technique and more has been and would be the goal of intensive research and development in the data science community. A variety of reasons, such as lack of data, imbalanced data, biased data can impact the decision rendered by the learning models. Many models are gaining traction for such interpretations. Now, LIME and SHAP are commonly used, in which are two state of the art open source explainable techniques. However, their outputs represent some different results. In this context, this study introduces a coupling technique of LIME and Shap, and demonstrates analysis possibilities on the decisions made by LightGBM and Keras models in classifying a transaction for fraudulence on the IEEE CIS dataset.

The Mediating effect of Public Transportation Satisfaction on Body Mass Index according to Walking days in Korea middle aged (한국 중년의 대중교통 만족도에 따른 체질량지수에 대한 걷기 일수의 매개효과)

  • Kim, Myung-Gwan;Lee, Eun-Ju
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.6
    • /
    • pp.493-499
    • /
    • 2018
  • This study was conducted to investigate the mediating effects of walking days on body mass index (BMI) according to the public transportation satisfaction of middle aged Koreans aged 40-59 in Korea using the 2015 community health survey data. The purpose of this study was to provide basic data for provision of the neighborhood environment and support programs for the walking activity and promotion of daily walking activities among middle aged people. Among 228,558 individuals, 85,344 middle aged people aged 40-59 years were selected as final subjects for analysis. The data were analyzed with the open source statistics program R 3.4.1 to determine whether the number of walking days had a mediating effect on body mass index (BMI) as an independent variable. The Sobel test revealed that the number of walking days increased by B=0.010(p=.010), and that when the satisfaction with public transportation increased, B=-0.052 (p=.021), the number of walking days decreased by B=-0.038 (p=.001). To increase the number of walking days and decrease the body mass index by increasing public transportation satisfaction by increasing the use of public transportation, public transportation fare adequacy and access convenience among the public transportation satisfaction mentioned above should be improved more than the current level. It is not easy for individuals that live in small cities to reach their destination by public transportation after leaving the metropolitan area; therefore, improvement of public transportation systems is necessary to improve health.

Production of Biodiesel and Nutrient Removal of Municipal Wastewater using a Small Scale Raceway Pond (미세조류 옥외 배양시스템을 이용한 바이오디젤 생산 및 도시하수 영양 염류 제거)

  • Kang, Zion;Kim, Byung-Hyuk;Oh, Hee-Mock;Kim, Hee-Sik
    • Microbiology and Biotechnology Letters
    • /
    • v.41 no.2
    • /
    • pp.207-214
    • /
    • 2013
  • A concerted effort to develop alternative forms of energy is underway due to fossil fuel shortages and its deleterious effects. Recently, bioenergy from microalgae has gained prominence and the use of municipal wastewater as a low cost alternative for a nutrient source has significant advantages. In this study, we have employed municipal wastewater directly after primary treatment (primary settling basin) in a small scale raceway pond (SSRP) for microalgal growth. Indigenous microalgae in the wastewater were encouraged to grow in the SSRP under optimal conditions. The mean removal efficiencies of TN, TP, and $NH_3-N$ after 6 days were 77.77%, 63.55%, and 89.02%, respectively. The average lipid content of the microalgae was 19.51% of dry cell weight, and linolenate and linoleate (18:n) were the predominant fatty acids. The 18S rRNA gene analysis and microscopic observations of the indigenous microalgae community revealed the presence of Chlorella vulgaris and Scenedesmus obliquus as the dominant microalgae. These results indicate that untreated municipal wastewater, serving as an excellent nitrogen and phosphate source for microalgal growth, could be treated using microalgae in open raceway ponds. Moreover, microalgal biomass could be further profitable by the extraction of biodiesel.

A Philosophical Analysis and Design of a New Paradigm of the Rural Policies in Korea (한국 농정(農政)의 철학적 분석과 새로운 패러다임(paradigm)의 설계)

  • Kim, Sun-Yo
    • Journal of Agricultural Extension & Community Development
    • /
    • v.3 no.1
    • /
    • pp.17-41
    • /
    • 1996
  • In the situation of rapid industrialization based on the lopsided development of economy since 1960, Korean rural society has faced a crisis of disruption. As a result, the civilian government has tried a few actions to change the circumstance. However, it is said that the coral polices were not satisfactory. Those who were concerned with the rural problems of these days argue that it is necessary to adopt new policies and further to change the policymakers` philosophies concerning the matter. The arguments are certainly based on the beliefs that the sound policies come from the sound philosophies. This study aims to analyze the existing rural polices and their policymakers` philosophies and to design of a new paradigm. For the purpose, this study was set there specific objectives: First, to overview the moor points of Quantitative Utilitarianism of Jeremy Bentham and the Social Justice Theory of John Rawls, the contrasting frameworks of the moral philosophies; Second, to trace the major or trade of the rural policies since 1960s in Korea; Third, to analyze the policymakers` philosophies reflected on the rural policies; Fourth, to design a new paradigm of the rural policies. This study mainly adopted descriptive method based on the various source of government and non-government statistics, white papers and other researches. The major findings of this study may be summarized as follows: 1. The historical epochs of the rural policies in Korea was divided into the periods: (1) An organizational and institutional establishment for self-reliance of main crops and the New Village Movement $(1969{\sim}70)$; (2) An initiation of `open-door` policies to the foreign farm products $(1970{\sim}80)$; (3) Completion of the UR meetings and the recommendations of the Rural and Fishery Development Commission (1980-present). 2. It was found that the philosophical foundations of coral policies were directly reflected from the utilitarianism of the national development. Under the philosophy it was the modem sector of economy that was to spearhead the national development, and the rural sector was situated to the peripheral position and hardly in the spot-light. Therefore, it may be said that the present situation of the rural society was largely rooted in the model of economic development. 3. As a new direction of the coral policies, many studies were focussing on the NTC (non-trade concerns) functions of agriculture for the present and future society. The researchers argue that the cost of protecting and supporting agriculture and rural society may be higher than that of the burden which the nation should be bear in the case of failure of agriculture. Although it tray be true, however, it should be noted that the argument is another type of utilitarianism which prevailed in the past. As a philosophy of rural policies, utilitarianism is straight forward and persuasive, however, it has also limitations in terms of relativism in broad sense or social justice in specific manna. 4. This study suggests to set the philosophical foundations of rural policies on the basis of Rawl`s Theory of Justice mentioned earlier. It emphasizes the inviolability of social justice which was neglected for the national benefits timing the period of development dictatorship in 1960s and 1970s. The principles of social justice for coral people were identified as twofold; (1) The principle of the t equal liberty; (2) (a) Difference principle, (b) The principle of fair equality of opportunity.

  • PDF

Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan;Han, Nam-Gi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.109-122
    • /
    • 2014
  • People are nowadays creating a tremendous amount of data on Social Network Service (SNS). In particular, the incorporation of SNS into mobile devices has resulted in massive amounts of data generation, thereby greatly influencing society. This is an unmatched phenomenon in history, and now we live in the Age of Big Data. SNS Data is defined as a condition of Big Data where the amount of data (volume), data input and output speeds (velocity), and the variety of data types (variety) are satisfied. If someone intends to discover the trend of an issue in SNS Big Data, this information can be used as a new important source for the creation of new values because this information covers the whole of society. In this study, a Twitter Issue Tracking System (TITS) is designed and established to meet the needs of analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) Provide the topic keyword set that corresponds to daily ranking; (2) Visualize the daily time series graph of a topic for the duration of a month; (3) Provide the importance of a topic through a treemap based on the score system and frequency; (4) Visualize the daily time-series graph of keywords by searching the keyword; The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. In addition, such analysis requires the latest big data technology to process rapidly a large amount of real-time data, such as the Hadoop distributed system or NoSQL, which is an alternative to relational database. We built TITS based on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from single node computing to thousands of machines. Furthermore, we use MongoDB, which is classified as a NoSQL database. In addition, MongoDB is an open source platform, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike existing relational database, there are no schema or tables with MongoDB, and its most important goal is that of data accessibility and data processing performance. In the Age of Big Data, the visualization of Big Data is more attractive to the Big Data community because it helps analysts to examine such data easily and clearly. Therefore, TITS uses the d3.js library as a visualization tool. This library is designed for the purpose of creating Data Driven Documents that bind document object model (DOM) and any data; the interaction between data is easy and useful for managing real-time data stream with smooth animation. In addition, TITS uses a bootstrap made of pre-configured plug-in style sheets and JavaScript libraries to build a web system. The TITS Graphical User Interface (GUI) is designed using these libraries, and it is capable of detecting issues on Twitter in an easy and intuitive manner. The proposed work demonstrates the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time series analysis. Third, we develop a web-based system, and make the system available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.