• Title/Summary/Keyword: 계층 알고리즘

Search Result 1,086, Processing Time 0.022 seconds

A Study on the Performance Measurement and Analysis on the Virtual Memory based FTL Policy through the Changing Map Data Resource (멥 데이터 자원 변화를 통한 가상 메모리 기반 FTL 정책의 성능 측정 및 분석 연구)

  • Hyun-Seob Lee
    • Journal of Internet of Things and Convergence
    • /
    • v.9 no.1
    • /
    • pp.71-76
    • /
    • 2023
  • Recently, in order to store and manage big data, research and development of a high-performance storage system capable of stably accessing large data have been actively conducted. In particular, storage systems in data centers and enterprise environments use large amounts of SSD (solid state disk) to manage large amounts of data. In general, SSD uses FTL(flash transfer layer) to hide the characteristics of NAND flash memory, which is a medium, and to efficiently manage data. However, FTL's algorithm has a limitation in using DRAM more to manage the location information of NAND where data is stored as the capacity of SSD increases. Therefore, this paper introduces FTL policies that apply virtual memory to reduce DRAM resources used in FTL. The virtual memory-based FTL policy proposed in this paper manages the map data by using LRU (least recently used) policy to load the mapping information of the recently used data into the DRAM space and store the previously used information in NAND. Finally, through experiments, performance and resource usage consumed during data write processing of virtual memory-based FTL and general FTL are measured and analyzed.

A Study on the Classification Model of Overseas Infringing Websites based on Web Hierarchy Similarity Analysis using GNN (GNN을 이용한 웹사이트 Hierarchy 유사도 분석 기반 해외 침해 사이트 분류 모델 연구)

  • Ju-hyeon Seo;Sun-mo Yoo;Jong-hwa Park;Jin-joo Park;Tae-jin Lee
    • Convergence Security Journal
    • /
    • v.23 no.2
    • /
    • pp.47-54
    • /
    • 2023
  • The global popularity of K-content(Korean Wave) has led to a continuous increase in copyright infringement cases involving domestic works, not only within the country but also overseas. In response to this trend, there is active research on technologies for detecting illegal distribution sites of domestic copyrighted materials, with recent studies utilizing the characteristics of domestic illegal distribution sites that often include a significant number of advertising banners. However, the application of detection techniques similar to those used domestically is limited for overseas illegal distribution sites. These sites may not include advertising banners or may have significantly fewer ads compared to domestic sites, making the application of detection technologies used domestically challenging. In this study, we propose a detection technique based on the similarity comparison of links and text trees, leveraging the characteristic of including illegal sharing posts and images of copyrighted materials in a similar hierarchical structure. Additionally, to accurately compare the similarity of large-scale trees composed of a massive number of links, we utilize Graph Neural Network (GNN). The experiments conducted in this study demonstrated a high accuracy rate of over 95% in classifying regular sites and sites involved in the illegal distribution of copyrighted materials. Applying this algorithm to automate the detection of illegal distribution sites is expected to enable swift responses to copyright infringements.

Performance Comparison of Automatic Classification Using Word Embeddings of Book Titles (단행본 서명의 단어 임베딩에 따른 자동분류의 성능 비교)

  • Yong-Gu Lee
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.4
    • /
    • pp.307-327
    • /
    • 2023
  • To analyze the impact of word embedding on book titles, this study utilized word embedding models (Word2vec, GloVe, fastText) to generate embedding vectors from book titles. These vectors were then used as classification features for automatic classification. The classifier utilized the k-nearest neighbors (kNN) algorithm, with the categories for automatic classification based on the DDC (Dewey Decimal Classification) main class 300 assigned by libraries to books. In the automatic classification experiment applying word embeddings to book titles, the Skip-gram architectures of Word2vec and fastText showed better results in the automatic classification performance of the kNN classifier compared to the TF-IDF features. In the optimization of various hyperparameters across the three models, the Skip-gram architecture of the fastText model demonstrated overall good performance. Specifically, better performance was observed when using hierarchical softmax and larger embedding dimensions as hyperparameters in this model. From a performance perspective, fastText can generate embeddings for substrings or subwords using the n-gram method, which has been shown to increase recall. The Skip-gram architecture of the Word2vec model generally showed good performance at low dimensions(size 300) and with small sizes of negative sampling (3 or 5).

SVC Based Multi-channel Transmission of High Definition Multimedia and Its Improved Service Efficiency (SVC 적용에 의한 다매체 멀티미디어 지원 서비스 효율 향상 기법)

  • Kim, Dong-Hwan;Cho, Min-Kyu;Moon, Seong-Pil;Lee, Jae-Yeal;Jun, Jun-Gil;Chang, Tae-Gyu
    • Journal of IKEEE
    • /
    • v.15 no.2
    • /
    • pp.179-189
    • /
    • 2011
  • This paper presents an SVC based multi-channel transmission technique. Transmission of high definition multimedia and its service efficiency can be significantly improved by the proposed method. In this method, the HD stream is divided into the two layer streams, i.e., a base layer stream and an enhancement layer stream. The divided streams are transmitted through a primary channel and an auxiliary channel, respectively. The proposed technique provides a noble mode switching technique which enables a seamless service of HD multimedia even under the conditions of abrupt and intermittent deterioration of the auxiliary channel. When the enhancement layer stream is disrupted by the channel monitoring in the mode switching algorithm, the algorithm works further to maintain the spatial and time resolution of the HD multimedia by upsampling and interpolating the base layer stream, consequently serving for the non disrupted play of the media. Moreover, the adoption of an adaptive switching algorithm significantly reduces the frequency of channel disruption avoiding the unnecessary switching for the short period variations of the channel. The feasibility of the proposed technique is verified through the simulation study with an example application to the simultaneous utilization of both Ku and Ka bands for HD multimedia broadcasting service. The rainfall modeling and the analysis of the satellite channel attenuation characteristics are performed to simulate the quality of service performance of the proposed HD broadcasting method. The simulation results obtained under a relatively poor channel (weather) situations show that the average lasting period of enhancement layer service is extended from 9.48[min] to 23.12[min] and the average switching frequency is reduced from 3.84[times/hour] to 1.68[times/hour]. It is verified in the satellite example that the proposed SVC based transmission technique best utilizes the Ka band channel for the service of HD broadcasting, although it is characterized by its inherent weather related poor reliability causing severe limitations in its independent application.

Development of a complex failure prediction system using Hierarchical Attention Network (Hierarchical Attention Network를 이용한 복합 장애 발생 예측 시스템 개발)

  • Park, Youngchan;An, Sangjun;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.127-148
    • /
    • 2020
  • The data center is a physical environment facility for accommodating computer systems and related components, and is an essential foundation technology for next-generation core industries such as big data, smart factories, wearables, and smart homes. In particular, with the growth of cloud computing, the proportional expansion of the data center infrastructure is inevitable. Monitoring the health of these data center facilities is a way to maintain and manage the system and prevent failure. If a failure occurs in some elements of the facility, it may affect not only the relevant equipment but also other connected equipment, and may cause enormous damage. In particular, IT facilities are irregular due to interdependence and it is difficult to know the cause. In the previous study predicting failure in data center, failure was predicted by looking at a single server as a single state without assuming that the devices were mixed. Therefore, in this study, data center failures were classified into failures occurring inside the server (Outage A) and failures occurring outside the server (Outage B), and focused on analyzing complex failures occurring within the server. Server external failures include power, cooling, user errors, etc. Since such failures can be prevented in the early stages of data center facility construction, various solutions are being developed. On the other hand, the cause of the failure occurring in the server is difficult to determine, and adequate prevention has not yet been achieved. In particular, this is the reason why server failures do not occur singularly, cause other server failures, or receive something that causes failures from other servers. In other words, while the existing studies assumed that it was a single server that did not affect the servers and analyzed the failure, in this study, the failure occurred on the assumption that it had an effect between servers. In order to define the complex failure situation in the data center, failure history data for each equipment existing in the data center was used. There are four major failures considered in this study: Network Node Down, Server Down, Windows Activation Services Down, and Database Management System Service Down. The failures that occur for each device are sorted in chronological order, and when a failure occurs in a specific equipment, if a failure occurs in a specific equipment within 5 minutes from the time of occurrence, it is defined that the failure occurs simultaneously. After configuring the sequence for the devices that have failed at the same time, 5 devices that frequently occur simultaneously within the configured sequence were selected, and the case where the selected devices failed at the same time was confirmed through visualization. Since the server resource information collected for failure analysis is in units of time series and has flow, we used Long Short-term Memory (LSTM), a deep learning algorithm that can predict the next state through the previous state. In addition, unlike a single server, the Hierarchical Attention Network deep learning model structure was used in consideration of the fact that the level of multiple failures for each server is different. This algorithm is a method of increasing the prediction accuracy by giving weight to the server as the impact on the failure increases. The study began with defining the type of failure and selecting the analysis target. In the first experiment, the same collected data was assumed as a single server state and a multiple server state, and compared and analyzed. The second experiment improved the prediction accuracy in the case of a complex server by optimizing each server threshold. In the first experiment, which assumed each of a single server and multiple servers, in the case of a single server, it was predicted that three of the five servers did not have a failure even though the actual failure occurred. However, assuming multiple servers, all five servers were predicted to have failed. As a result of the experiment, the hypothesis that there is an effect between servers is proven. As a result of this study, it was confirmed that the prediction performance was superior when the multiple servers were assumed than when the single server was assumed. In particular, applying the Hierarchical Attention Network algorithm, assuming that the effects of each server will be different, played a role in improving the analysis effect. In addition, by applying a different threshold for each server, the prediction accuracy could be improved. This study showed that failures that are difficult to determine the cause can be predicted through historical data, and a model that can predict failures occurring in servers in data centers is presented. It is expected that the occurrence of disability can be prevented in advance using the results of this study.

Development of Analytic Hierarchy Process or Solving Dependence Relation between Multicriteria (다기준 평가항목간 중복도를 반영한 AHP 기법 개발)

  • 송기한;홍상연;정성봉;전경수
    • Journal of Korean Society of Transportation
    • /
    • v.20 no.7
    • /
    • pp.15-22
    • /
    • 2002
  • Transportation project appraisal should be precise in order to increase the social welfare and efficiency, and it has been evaluated by only a single criterion analysis such as benefit/cost analysis. However, this method cannot assess some qualitative items, and cannot get a proper solution for the clash of interests among various groups. Therefore, the multi-criteria analysis, which can control these problems, is needed, and then Saaty has developed one of these methods, AHP(Analytic Hierarchy Process) method. In AHP, the project is evaluated through weighted score of the criteria and the alternatives, which is surveyed by a questionnaire of specialists. It is based on some strict suppositions such as reciprocal comparison, homogeneity, expectation, independence relationship between multi-criteria, but supposing that each criterion has independence relation with others is too difficult in two reasons. First, in real situation, there cannot be perfect independence relationship between standards. Second, individuals, even though they are specialists of that area, do not feel the degree of independence relation as same as others. This paper develops a modified AHP method for solving this dependence relationship between multi-criteria. First of all. in this method, the degree of dependence relationship between multi-criteria that the specialist feels is surveyed and included to the weighted score of multi-criteria This study supposes three methods to implement this idea. The first model products the degree of dependence relationship in the first step for calculating the weighted score, and the others adjust the result of weighted score from the basic AHP method to the dependence relationship. One of the second methods distributes the cross weighted score to each standard by constant ratio, and the other splits them using Fuzzy measure such as Bel and Pl. Finally, in order to validate these methods, this paper applies them to evaluate the alternatives which can control public resentments against Korean rail path in a city area.