• Title/Summary/Keyword: Management Systems

Search Result 18,915, Processing Time 0.049 seconds

A Real-Time Stock Market Prediction Using Knowledge Accumulation (지식 누적을 이용한 실시간 주식시장 예측)

  • Kim, Jin-Hwa;Hong, Kwang-Hun;Min, Jin-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.109-130
    • /
    • 2011
  • One of the major problems in the area of data mining is the size of the data, as most data set has huge volume these days. Streams of data are normally accumulated into data storages or databases. Transactions in internet, mobile devices and ubiquitous environment produce streams of data continuously. Some data set are just buried un-used inside huge data storage due to its huge size. Some data set is quickly lost as soon as it is created as it is not saved due to many reasons. How to use this large size data and to use data on stream efficiently are challenging questions in the study of data mining. Stream data is a data set that is accumulated to the data storage from a data source continuously. The size of this data set, in many cases, becomes increasingly large over time. To mine information from this massive data, it takes too many resources such as storage, money and time. These unique characteristics of the stream data make it difficult and expensive to store all the stream data sets accumulated over time. Otherwise, if one uses only recent or partial of data to mine information or pattern, there can be losses of valuable information, which can be useful. To avoid these problems, this study suggests a method efficiently accumulates information or patterns in the form of rule set over time. A rule set is mined from a data set in stream and this rule set is accumulated into a master rule set storage, which is also a model for real-time decision making. One of the main advantages of this method is that it takes much smaller storage space compared to the traditional method, which saves the whole data set. Another advantage of using this method is that the accumulated rule set is used as a prediction model. Prompt response to the request from users is possible anytime as the rule set is ready anytime to be used to make decisions. This makes real-time decision making possible, which is the greatest advantage of this method. Based on theories of ensemble approaches, combination of many different models can produce better prediction model in performance. The consolidated rule set actually covers all the data set while the traditional sampling approach only covers part of the whole data set. This study uses a stock market data that has a heterogeneous data set as the characteristic of data varies over time. The indexes in stock market data can fluctuate in different situations whenever there is an event influencing the stock market index. Therefore the variance of the values in each variable is large compared to that of the homogeneous data set. Prediction with heterogeneous data set is naturally much more difficult, compared to that of homogeneous data set as it is more difficult to predict in unpredictable situation. This study tests two general mining approaches and compare prediction performances of these two suggested methods with the method we suggest in this study. The first approach is inducing a rule set from the recent data set to predict new data set. The seocnd one is inducing a rule set from all the data which have been accumulated from the beginning every time one has to predict new data set. We found neither of these two is as good as the method of accumulated rule set in its performance. Furthermore, the study shows experiments with different prediction models. The first approach is building a prediction model only with more important rule sets and the second approach is the method using all the rule sets by assigning weights on the rules based on their performance. The second approach shows better performance compared to the first one. The experiments also show that the suggested method in this study can be an efficient approach for mining information and pattern with stream data. This method has a limitation of bounding its application to stock market data. More dynamic real-time steam data set is desirable for the application of this method. There is also another problem in this study. When the number of rules is increasing over time, it has to manage special rules such as redundant rules or conflicting rules efficiently.

Extracting Beginning Boundaries for Efficient Management of Movie Storytelling Contents (스토리텔링 콘텐츠의 효과적인 관리를 위한 영화 스토리 발단부의 자동 경계 추출)

  • Park, Seung-Bo;You, Eun-Soon;Jung, Jason J.
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.279-292
    • /
    • 2011
  • Movie is a representative media that can transmit stories to audiences. Basically, a story is described by characters in the movie. Different from other simple videos, movies deploy narrative structures for explaining various conflicts or collaborations between characters. These narrative structures consist of 3 main acts, which are beginning, middle, and ending. The beginning act includes 1) introduction to main characters and backgrounds, and 2) conflicts implication and clues for incidents. The middle act describes the events developed by both inside and outside factors and the story dramatic tension heighten. Finally, in the end act, the events are developed are resolved, and the topic of story and message of writer are transmitted. When story information is extracted from movie, it is needed to consider that it has different weights by narrative structure. Namely, when some information is extracted, it has a different influence to story deployment depending on where it locates at the beginning, middle and end acts. The beginning act is the part that exposes to audiences for story set-up various information such as setting of characters and depiction of backgrounds. And thus, it is necessary to extract much kind information from the beginning act in order to abstract a movie or retrieve character information. Thereby, this paper proposes a novel method for extracting the beginning boundaries. It is the method that detects a boundary scene between the beginning act and middle using the accumulation graph of characters. The beginning act consists of the scenes that introduce important characters, imply the conflict relationship between them, and suggest clues to resolve troubles. First, a scene that the new important characters don't appear any more should be detected in order to extract a scene completed the introduction of them. The important characters mean the major and minor characters, which can be dealt as important characters since they lead story progression. Extra should be excluded in order to extract a scene completed the introduction of important characters in the accumulation graph of characters. Extra means the characters that appear only several scenes. Second, the inflection point is detected in the accumulation graph of characters. It is the point that the increasing line changes to horizontal line. Namely, when the slope of line keeps zero during long scenes, starting point of this line with zero slope becomes the inflection point. Inflection point will be detected in the accumulation graph of characters without extra. Third, several scenes are considered as additional story progression such as conflicts implication and clues suggestion. Actually, movie story can arrive at a scene located between beginning act and middle when additional several scenes are elapsed after the introduction of important characters. We will decide the ratio of additional scenes for total scenes by experiment in order to detect this scene. The ratio of additional scenes is gained as 7.67% by experiment. It is the story inflection point to change from beginning to middle act when this ratio is added to the inflection point of graph. Our proposed method consists of these three steps. We selected 10 movies for experiment and evaluation. These movies consisted of various genres. By measuring the accuracy of boundary detection experiment, we have shown that the proposed method is more efficient.

Issue tracking and voting rate prediction for 19th Korean president election candidates (댓글 분석을 통한 19대 한국 대선 후보 이슈 파악 및 득표율 예측)

  • Seo, Dae-Ho;Kim, Ji-Ho;Kim, Chang-Ki
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.199-219
    • /
    • 2018
  • With the everyday use of the Internet and the spread of various smart devices, users have been able to communicate in real time and the existing communication style has changed. Due to the change of the information subject by the Internet, data became more massive and caused the very large information called big data. These Big Data are seen as a new opportunity to understand social issues. In particular, text mining explores patterns using unstructured text data to find meaningful information. Since text data exists in various places such as newspaper, book, and web, the amount of data is very diverse and large, so it is suitable for understanding social reality. In recent years, there has been an increasing number of attempts to analyze texts from web such as SNS and blogs where the public can communicate freely. It is recognized as a useful method to grasp public opinion immediately so it can be used for political, social and cultural issue research. Text mining has received much attention in order to investigate the public's reputation for candidates, and to predict the voting rate instead of the polling. This is because many people question the credibility of the survey. Also, People tend to refuse or reveal their real intention when they are asked to respond to the poll. This study collected comments from the largest Internet portal site in Korea and conducted research on the 19th Korean presidential election in 2017. We collected 226,447 comments from April 29, 2017 to May 7, 2017, which includes the prohibition period of public opinion polls just prior to the presidential election day. We analyzed frequencies, associative emotional words, topic emotions, and candidate voting rates. By frequency analysis, we identified the words that are the most important issues per day. Particularly, according to the result of the presidential debate, it was seen that the candidate who became an issue was located at the top of the frequency analysis. By the analysis of associative emotional words, we were able to identify issues most relevant to each candidate. The topic emotion analysis was used to identify each candidate's topic and to express the emotions of the public on the topics. Finally, we estimated the voting rate by combining the volume of comments and sentiment score. By doing above, we explored the issues for each candidate and predicted the voting rate. The analysis showed that news comments is an effective tool for tracking the issue of presidential candidates and for predicting the voting rate. Particularly, this study showed issues per day and quantitative index for sentiment. Also it predicted voting rate for each candidate and precisely matched the ranking of the top five candidates. Each candidate will be able to objectively grasp public opinion and reflect it to the election strategy. Candidates can use positive issues more actively on election strategies, and try to correct negative issues. Particularly, candidates should be aware that they can get severe damage to their reputation if they face a moral problem. Voters can objectively look at issues and public opinion about each candidate and make more informed decisions when voting. If they refer to the results of this study before voting, they will be able to see the opinions of the public from the Big Data, and vote for a candidate with a more objective perspective. If the candidates have a campaign with reference to Big Data Analysis, the public will be more active on the web, recognizing that their wants are being reflected. The way of expressing their political views can be done in various web places. This can contribute to the act of political participation by the people.

A Study on Problems with the ROK's Bioterrorism Response System and Ways to Improve it (생물테러 대응체제의 문제점과 개선방안 연구)

  • Jung, Yook-Sang
    • Korean Security Journal
    • /
    • no.22
    • /
    • pp.113-144
    • /
    • 2010
  • Bioterrorism is becoming more attractive to terrorist groups owing to the dramatic increase in the utility and lethality of biological weapons in line with today's cutting-edge biological science and technology. The Republic of Korea is facing both internal and external terrorist threats, as well as the possible biological warfare by North Korea. Therefore, it is essential to establish an effective bioterrorism response system in the ROK. In order to come up with the adequate response system for the ROK, an in-depth study has been conducted on the current bioterrorism response system of the U.S. whose preparedness is considered relatively adamant. As a result, the following facts have been found: (1)the legislation with regard to bioterrorism has been established or amended according to the current situation in the U.S., (2)the counter terrorism activities have been integrated with the Department of the Homeland Security as the central agency in order to maximize the national CT capacity, (3)Specific procedures and instructions to cope with bioterrorism have been made into manuals so as to enhance the working-level response capabilities. Next, the analysis on the ROK's bioterrorism response system has been performed in various categories, including the legislation system, task role distribution, cooperative relations, and resource application. It turned out that the ROK's legislation basis is relatively weak and it lacks the apparatus to integrate the bioterrorism response activities on the national level. The shortage of the adequate response facilities and resources, as well as the poor management of manpower have also emerged as problems that hinder the effective CT implementations. Through an analytical and comparative study of the U.S. and the ROK systems, this paper presents several ways to ameliorate improve the current system in the ROK as follows: (1)establish the anti-terrorism law, which would be the basic legal basis for the bioterrorism-related matters; and make revisions to the disaster-related legislation, relevant to bioterrorism response activities, (2)establish an integrated body that has a powerful authority to coordinate the relevant CT agencies; and converge the decentralized functions to maximize the overall response capacity, (3)install the laboratories with a high biosafety level and secure enough of the strategic medical stock-pile, (4)enhance the ability of the inexperienced response personnel by providing with a manual that has detailed instructions.

  • PDF

Improvement of Shelf-life and Quality in Fresh-Cut Tomato Slices:

  • Hong Ji Heun
    • Proceedings of the Korean Society of Postharvest Science and Technology of Agricultural Products Conference
    • /
    • 2004.10a
    • /
    • pp.67-72
    • /
    • 2004
  • Quality of fresh-cut tomato slices was compared during cold storage under various modified atmosphere packaging conditions. Chilling injury of slices in containers sealed with Film A was higher than with Film B; these films had oxygen transmission rates of 87.4 and 60.0 ml $h^{-1}\;m^{-2}\;atm^{-1}$ at $5^{\circ}C\;and\;99\%$ RH, respectively. While slices in containers with an initial atmospheric composition of air, $4\%\;CO_2+1\;or\;20\%\;O_2,\;8\%\;CO_2+1\;or\;20\%\;O_2,\;or\;12\%\;CO_2+20\%\;O_2$ showed fungal growth, slices in containers with $12\%\;CO_2+1\%\;O_2$ did not. Low ethylene in containers enhanced chilling injury. Modified atmosphere packaging provided good quality tomato slices with a shelf-life of 2 weeks or more at $5^{\circ}C$. Experiments were conducted to compare changes in quality of slices of red tomato (Lycopersicon esculentum Mill. 'Sunbeam') fruit from plants grown using black polyethylene or hairy vetch mulches under various foliar disease management systems including: no fungicide applications (NF), a disease forecasting model (Tom-Cast), and weekly fungicide applications (WF), during storage at $5^{\circ}C$ under a modified atmosphere. Slices were analyzed for firmness, soluble solids content (SSC), titratable acidity (TA), pH, electrolyte leakage, fungi, yeasts, and chilling injury. With both NF and Tom-Cast fungicide treatments, slices from tomato fruit grown with hairy vetch (Vicia villosa Roth) mulch were firmer than those from tomato fruit grown with black polyethylene mulch after 12 days storage. Ethylene production of slices from fruit grown using hairy vetch mulch under Tom-Cast was about 1.5- and 5-fold higher than that of slices from WF and NF fungicide treatments after 12 days, respectively. The percentage of water-soaked areas (chilling injury) for slices from tomato fruit grown using black polyethylene mulch under NF was over 7-fold that of slices from tomato fruit grown using hairy vetch under Tom-Cast. When stored at $20^{\circ}C$, slices from light-red tomato fruit grown with black polyethylene or hairy vetch mulches both showed a rapid increase in electrolyte leakage beginning 6 hours after slicing. However, slices from tomato fruit grown using the hairy vetch mulch tended to have lower electrolyte leakage than those grown with black polyethylene mulch. These results suggest that tomato fruit from plants grown using hairy vetch mulch may be more suitable for fresh-cut slices than those grown using black polyethylene mulch. Also, use of the disease forecasting model Tom-Cast, which can result in lower fungicide application than is currently used commercially, resulted in high quality fruit for fresh-cut processing. Experiments were conducted to determine if ethylene influences chilling injury, as measured by percentage of slices exhibiting water-soaked areas in fresh-cut tomato slices of 'Mountain Pride' and 'Sunbeam' tomato (Lycopersicon esculentum Mill.). Ethylene concentration in containers without ventilation significantly increased during storage at $5^{\circ}C$, whereas little or no accumulation of ethylene occurred in containers with one or six perforations. Chilling injury was greatest for slices in containers with six perforations, compared to slices in containers with one perforation, and was over 13-fold greater than that of slices in control containers with no perforations. An experiment was also performed to investigate the effectiveness of including an ethylene absorbent pad in containers on subsequent ethylene accumulation and chilling injury. While ethylene in the no-pad controls increased continually during storage of both 'Mountain Pride' and 'Sunbeam' tomatoes at $5^{\circ}C$ under modified atmosphere conditions, no increase in accumulation of ethylene was observed in containers containing ethylene absorbent pads throughout storage. The ethylene absorbent pad treatment resulted in a significantly higher percentage of chilling injury compared with the no-pad control. In studies aimed at inhibiting ethylene production using AVG during storage of slices, the concentration of ethylene in control containers (no AVG) remained at elevated levels throughout storage, compared to containers with slices treated with AVG. Chilling injury in slices treated with AVG was 5-fold greater than that of controls. Further, we tested the effect of ethylene pretreatment of slices on subsequent slice shelf-life and quality. In slices treated with ethylene (0, 0.1, 1, or $10\;{mu}L\;L^{-1}$) immediately after slicing, ethylene production in non-treated controls was greater than that of all other ethylene pre-treatments. However, pretreatment of slices 3 days after slicing resulted in a different pattern of ethylene production during storage. Ihe rate of ethylene production by slices treated with 1 L $L^{-1}$ ethylene 3 days after slicing was greater during storage than any of the other ethylene treatments. With slices pre-treated with ethylene, both immediately and 3 days after slicing, the rate of ethylene production tended to show an negative correlation with chilling injury. Chemical name used: 1-aminoethoxyvinylglycine (AVG).

  • PDF

Landscape Analysis of the Hallasan National Park in a Jeju Island Biosphere Reserve: Fragmentation Pattern (제주 생물권보전지역 내 한라산국립공원의 경관분석 : 단편화 현상)

  • Kang, Hye-Soon;Kim, Hyun-Jung;Chang, Eun-Mi
    • Korean Journal of Environment and Ecology
    • /
    • v.22 no.3
    • /
    • pp.309-319
    • /
    • 2008
  • Roads are an indicator of anthropogenic activity causing ecosystem disturbances and often lead to habitat fragmentation, habitat loss, and habitat isolation. The Hallasan National Park(153.4$km^2$) on Jeju Island being distinguished for its unique geology, topography, and biota has also been designated as a core area of UNESCO Man and the Biosphere(MAB) Reserve. Although the high conservation value of this park has contributed to a rapid growth of tourists and road construction, landscape changes due to roads have not been examined yet. We used GIS systems to examine the fragmentation pattern caused by roads, in relation to its zonation, elevation, and vegetation. When a buffer was applied to roads(112m width for paved roads and 60m width for both legal and illegal trails), the park consisted of 100 fragments. The ten fragments generated after applying buffer to only paved roads and legal trails ranged from $0.002km^2$ to $38.2km^2$ with a mean of $14.2km^2$, and about 7% of both nature conservation zone and nature environment zone of the park were edge. Fragments in both east and west ends of the park and around the summit exhibited relatively high shape indices with means of 5.19(for 100 fragments) and 7.22(for 10 fragments). All five legal trails are connected to the pit crater of the mountain and vegetation changed from broadleaf forests and conifer forests to grasslands with elevation, consequently resulting in dramatic fragment size reduction in grasslands at high elevation, in particular above 1,400m, where endemic and alpine plants are abundant. These results show that in Hallasan National Park the risks of habitat deterioration and habitat loss due to fragmentation may be more severe in the nature conservation zone dominated by Baengnokdam than in the nature environment zone. Therefore, current road networks of the park appear to fall short of the goal of the national park for ecosystem conservation and protection. Considering that the entire Hallasan National Park also serves as a MAB core area, conservation efforts should focus, first of all, on park rezoning and road management to mitigate habitat fragmentation.

Habitat Characteristics of Benthic Macroinvertebrates at a Headwater Stream in the Yeonyeopsan (Mt.) (연엽산 산지계류에 있어서 저서성 대형무척추동물의 서식특성)

  • Jang, Su-Jin;Nam, Sooyoun;Kim, Suk-Woo;Koo, Hyo-Bin;Kim, Ji-Hyeon;Lee, Youn-Tae;Chun, Kun-Woo
    • Korean Journal of Environment and Ecology
    • /
    • v.34 no.4
    • /
    • pp.334-344
    • /
    • 2020
  • A total of 24 families, 44 species, and 658 benthic macroinvertebrates were identified, and Ecdyonurus dracon Kluge (13%) was the dominant species in forested streams within the Yeonyeopsan (Mt.). A total of four habit categories (i.e., clingers (56%), burrowers (19%), swimmers (14%), and sprawlers (56%)) were identified, and clingers were the dominant habit at all survey points except point one (UP1). Habitat characteristics were depended on the hydraulic factors (e.g., flow velocity, depth, and substrates), water quality (e.g., DO and water temperature), and the habitat characteristics were differed in the riffle, which has a faster the flow velocity, compared by in the stagnant pool. In other words, in riffles, the clingers dominated in high flow velocity with the large maximum and median grain size for substrates in the habitats regardless of depth, but the burrowers and sprawlers were dominant in low flow velocity with the small maximum and median grain size for substrates in the habitats. Moreover, DO and flow velocity were in positive correlation (y = 0.6666x - 0.659, R2 = 0.0851), and the habitat for burrowers was wider than that for sprawlers or clingers. The water depth was negatively correlated with water temperature (y = -26.397x + 283.87, R2 = 0.1802) since the water temperature is more sensitive to insolation in shallow depth. pH was positively correlated with water temperature. The investigation of the habitat characteristics by separating the relations between pH and DO in upstream and downstream showed the low pH and high DO in the upstream with a high crown density of 68%, regardless of community composition. On the other hand, high pH and low DO in the downstream with a relatively low crown density of 51%. It was considered that the riparian forest played a role in suppressing the growth of attached algae and the controlling water temperature in headwater streams. Our findings identified the habitat characteristics of benthic macroinvertebrates in a headwater stream. We expected that the finding can provide reference data for suggesting conservation and management plans in a headwater stream and increasing academic value.

Review on the Protected Areas Issues within Mid-Long Term National Plans for Territory and Environment of Korea; Focus on the "Biodiversity 2011-2020 Strategic Targets" and "Protected Areas Decision" (우리나라 국토 및 환경 분야 중장기 국가계획의 보호지역 관련 내용 고찰 - "생물다양성협약 2011~2020 전략목표" 및 "보호지역 결정문" 내용을 중심으로-)

  • Heo, Hag Young
    • Journal of Environmental Policy
    • /
    • v.11 no.4
    • /
    • pp.3-37
    • /
    • 2012
  • In perspective of biodiversity conservation and protected areas (PAs), the aims of the study are to review the mid-long term national plans, which deal with national territory and environment in Korea, and to find out the way to improve this issue. Key issues were drawn by referring "Biodiversity 2011-2020 Strategic Targets" and "Protected Area Decision" in CBD CoP-10 and 7 National comprehensive or basic Plans were reviewed. Quoting Biodiversity 2011-2020 Strategic Target 5, "By 2020, the rate of loss of all natural habitats, including forests, is at least halved and where feasible brought close to zero, and degradation and fragmentation is significantly reduced", most of national plans included various methods such as "No Net Loss of Green", "No Net Loss of Wetlands", and so on. Regarding the target 11, "By 2020, at least 17% of terrestrial and inland water, and 10% of coastal and marine areas, ecologically representative and well connected systems of PAs and other effective area-based conservation measures, and integrated into the wider landscape and seascapes", 15% by 2015 was set up as a target of total PAs in Korea and 13% by 2015 or 2020 was set up as a target of coastal and marine PAs. CBD CoP-10 Decision X/31 (Protected Areas) invites parties to develop a national long-term action plan for the implementation of PoWPA and describes 10 issues that need greater attention. National action plan for the implementation of PoWPA doesn't be mentioned at any national plans even PoWPA. Regarding the 10 issues, most of issues were well reflected within various national plans, however there is still a need to improve the details and corelation between plans. Particularly, in terms of management effectiveness evaluation (MEE), there was no national plan to directly deal with MEE even though CBD invites parties to work towards assessing 60% of the total PAs by 2015. Based on the review results, below 4 items were suggested; (1)"The Comprehensive Plan of the National Territory" needs more attention on the Biodiversity Conservation and PAs, (2)Consider to establish "National PA System Plan" embedded into "the Comprehensive Plan of National Environment", (3)Establish a "National Action Plan for the implementation of PoWPA", (4)Improve the National Plans through linking with Biodiversity 2011-2020 Strategic Targets and relevant PA key issues.

  • PDF

Building battery deterioration prediction model using real field data (머신러닝 기법을 이용한 납축전지 열화 예측 모델 개발)

  • Choi, Keunho;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.243-264
    • /
    • 2018
  • Although the worldwide battery market is recently spurring the development of lithium secondary battery, lead acid batteries (rechargeable batteries) which have good-performance and can be reused are consumed in a wide range of industry fields. However, lead-acid batteries have a serious problem in that deterioration of a battery makes progress quickly in the presence of that degradation of only one cell among several cells which is packed in a battery begins. To overcome this problem, previous researches have attempted to identify the mechanism of deterioration of a battery in many ways. However, most of previous researches have used data obtained in a laboratory to analyze the mechanism of deterioration of a battery but not used data obtained in a real world. The usage of real data can increase the feasibility and the applicability of the findings of a research. Therefore, this study aims to develop a model which predicts the battery deterioration using data obtained in real world. To this end, we collected data which presents change of battery state by attaching sensors enabling to monitor the battery condition in real time to dozens of golf carts operated in the real golf field. As a result, total 16,883 samples were obtained. And then, we developed a model which predicts a precursor phenomenon representing deterioration of a battery by analyzing the data collected from the sensors using machine learning techniques. As initial independent variables, we used 1) inbound time of a cart, 2) outbound time of a cart, 3) duration(from outbound time to charge time), 4) charge amount, 5) used amount, 6) charge efficiency, 7) lowest temperature of battery cell 1 to 6, 8) lowest voltage of battery cell 1 to 6, 9) highest voltage of battery cell 1 to 6, 10) voltage of battery cell 1 to 6 at the beginning of operation, 11) voltage of battery cell 1 to 6 at the end of charge, 12) used amount of battery cell 1 to 6 during operation, 13) used amount of battery during operation(Max-Min), 14) duration of battery use, and 15) highest current during operation. Since the values of the independent variables, lowest temperature of battery cell 1 to 6, lowest voltage of battery cell 1 to 6, highest voltage of battery cell 1 to 6, voltage of battery cell 1 to 6 at the beginning of operation, voltage of battery cell 1 to 6 at the end of charge, and used amount of battery cell 1 to 6 during operation are similar to that of each battery cell, we conducted principal component analysis using verimax orthogonal rotation in order to mitigate the multiple collinearity problem. According to the results, we made new variables by averaging the values of independent variables clustered together, and used them as final independent variables instead of origin variables, thereby reducing the dimension. We used decision tree, logistic regression, Bayesian network as algorithms for building prediction models. And also, we built prediction models using the bagging of each of them, the boosting of each of them, and RandomForest. Experimental results show that the prediction model using the bagging of decision tree yields the best accuracy of 89.3923%. This study has some limitations in that the additional variables which affect the deterioration of battery such as weather (temperature, humidity) and driving habits, did not considered, therefore, we would like to consider the them in the future research. However, the battery deterioration prediction model proposed in the present study is expected to enable effective and efficient management of battery used in the real filed by dramatically and to reduce the cost caused by not detecting battery deterioration accordingly.

An Embedding /Extracting Method of Audio Watermark Information for High Quality Stereo Music (고품질 스테레오 음악을 위한 오디오 워터마크 정보 삽입/추출 기술)

  • Bae, Kyungyul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.21-35
    • /
    • 2018
  • Since the introduction of MP3 players, CD recordings have gradually been vanishing, and the music consuming environment of music users is shifting to mobile devices. The introduction of smart devices has increased the utilization of music through music playback, mass storage, and search functions that are integrated into smartphones and tablets. At the time of initial MP3 player supply, the bitrate of the compressed music contents generally was 128 Kbps. However, as increasing of the demand for high quality music, sound quality of 384 Kbps appeared. Recently, music content of FLAC (Free License Audio Codec) format using lossless compression method is becoming popular. The download service of many music sites in Korea has classified by unlimited download with technical protection and limited download without technical protection. Digital Rights Management (DRM) technology is used as a technical protection measure for unlimited download, but it can only be used with authenticated devices that have DRM installed. Even if music purchased by the user, it cannot be used by other devices. On the contrary, in the case of music that is limited in quantity but not technically protected, there is no way to enforce anyone who distributes it, and in the case of high quality music such as FLAC, the loss is greater. In this paper, the author proposes an audio watermarking technology for copyright protection of high quality stereo music. Two kinds of information, "Copyright" and "Copy_free", are generated by using the turbo code. The two watermarks are composed of 9 bytes (72 bits). If turbo code is applied for error correction, the amount of information to be inserted as 222 bits increases. The 222-bit watermark was expanded to 1024 bits to be robust against additional errors and finally used as a watermark to insert into stereo music. Turbo code is a way to recover raw data if the damaged amount is less than 15% even if part of the code is damaged due to attack of watermarked content. It can be extended to 1024 bits or it can find 222 bits from some damaged contents by increasing the probability, the watermark itself has made it more resistant to attack. The proposed algorithm uses quantization in DCT so that watermark can be detected efficiently and SNR can be improved when stereo music is converted into mono. As a result, on average SNR exceeded 40dB, resulting in sound quality improvements of over 10dB over traditional quantization methods. This is a very significant result because it means relatively 10 times improvement in sound quality. In addition, the sample length required for extracting the watermark can be extracted sufficiently if the length is shorter than 1 second, and the watermark can be completely extracted from music samples of less than one second in all of the MP3 compression having a bit rate of 128 Kbps. The conventional quantization method can extract the watermark with a length of only 1/10 compared to the case where the sampling of the 10-second length largely fails to extract the watermark. In this study, since the length of the watermark embedded into music is 72 bits, it provides sufficient capacity to embed necessary information for music. It is enough bits to identify the music distributed all over the world. 272 can identify $4*10^{21}$, so it can be used as an identifier and it can be used for copyright protection of high quality music service. The proposed algorithm can be used not only for high quality audio but also for development of watermarking algorithm in multimedia such as UHD (Ultra High Definition) TV and high-resolution image. In addition, with the development of digital devices, users are demanding high quality music in the music industry, and artificial intelligence assistant is coming along with high quality music and streaming service. The results of this study can be used to protect the rights of copyright holders in these industries.