• Title/Summary/Keyword: gram-scale

Search Result 66, Processing Time 0.024 seconds

Implementation of Search Method based on Sequence and Adjacency Relationship of User Query (사용자 검색 질의 단어의 순서 및 단어간의 인접 관계에 기반한 검색 기법의 구현)

  • So, Byung-Chul;Jung, Jin-Woo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.6
    • /
    • pp.724-729
    • /
    • 2011
  • Information retrieval is a method to search the needed data by users. Generally, when a user searches some data in the large scale data set like the internet, ranking-based search is widely used because it is not easy to find the exactly needed data at once. In this paper, we propose a novel ranking-based search method based on sequence and adjacency relationship of user query by the help of TF-IDF and n-gram. As a result, it was possible to find the needed data more accurately with 73% accuracy in more than 19,000 data set.

A Study on the Effect of Food size and Washing rate on the Mixed Digestion of food Waste and Sewage Sludge (음식물쓰레기의 슬러지 병합처리에 있어서 입경 및 세척율의 영향)

  • 최성문;김은호;성낙창;김정권;윤태경;임영석
    • Journal of environmental and Sanitary engineering
    • /
    • v.15 no.4
    • /
    • pp.84-90
    • /
    • 2000
  • 5L laboratory-scale anaerobic digester was used to study the effects of food size and washing rate of food waste on the mixed digestion with sewage sludge. Food waste was crushed with particle diameters of 4mm and 2mm and washed two to three times, and seven to eight times before feeding the batch digester. The digester with crushed of washed food waste showed better performance than that with uncrushed of unwashed to produce methane gas of reduce volatile solids. The digester with 2mm food waste showed 17.4% higher VS/TS reduction rate and 18ml higher methane production rate per gram VS input than that with uncrushed food waste, where VS and TS are volatile solid and total solids in the liquid effluent, respectively. Also food waste crushed eight times gave 8% higher VS/TS reduction rate and 11ml higher methane production rate per gram VS input than unwashed food waste.

  • PDF

Classification Protein Subcellular Locations Using n-Gram Features (단백질 서열의 n-Gram 자질을 이용한 세포내 위치 예측)

  • Kim, Jinsuk
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2007.11a
    • /
    • pp.12-16
    • /
    • 2007
  • The function of a protein is closely co-related with its subcellular location(s). Given a protein sequence, therefore, how to determine its subcellular location is a vitally important problem. We have developed a new prediction method for protein subcellular location(s), which is based on n-gram feature extraction and k-nearest neighbor (kNN) classification algorithm. It classifies a protein sequence to one or more subcellular compartments based on the locations of top k sequences which show the highest similarity weights against the input sequence. The similarity weight is a kind of similarity measure which is determined by comparing n-gram features between two sequences. Currently our method extract penta-grams as features of protein sequences, computes scores of the potential localization site(s) using kNN algorithm, and finally presents the locations and their associated scores. We constructed a large-scale data set of protein sequences with known subcellular locations from the SWISS-PROT database. This data set contains 51,885 entries with one or more known subcellular locations. Our method show very high prediction precision of about 93% for this data set, and compared with other method, it also showed comparable prediction improvement for a test collection used in a previous work.

  • PDF

Distributed Processing System Design and Implementation for Feature Extraction from Large-Scale Malicious Code (대용량 악성코드의 특징 추출 가속화를 위한 분산 처리 시스템 설계 및 구현)

  • Lee, Hyunjong;Euh, Seongyul;Hwang, Doosung
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.8 no.2
    • /
    • pp.35-40
    • /
    • 2019
  • Traditional Malware Detection is susceptible for detecting malware which is modified by polymorphism or obfuscation technology. By learning patterns that are embedded in malware code, machine learning algorithms can detect similar behaviors and replace the current detection methods. Data must collected continuously in order to learn malicious code patterns that change over time. However, the process of storing and processing a large amount of malware files is accompanied by high space and time complexity. In this paper, an HDFS-based distributed processing system is designed to reduce space complexity and accelerate feature extraction time. Using a distributed processing system, we extract two API features based on filtering basis, 2-gram feature and APICFG feature and the generalization performance of ensemble learning models is compared. In experiments, the time complexity of the feature extraction was improved about 3.75 times faster than the processing time of a single computer, and the space complexity was about 5 times more efficient. The 2-gram feature was the best when comparing the classification performance by feature, but the learning time was long due to high dimensionality.

DGA-based Botnet Detection Technology using N-gram (N-gram을 활용한 DGA 기반의 봇넷 탐지 방안)

  • Jung Il Ok;Shin Deok Ha;Kim Su Chul;Lee Rock Seok
    • Convergence Security Journal
    • /
    • v.22 no.5
    • /
    • pp.145-154
    • /
    • 2022
  • Recently, the widespread proliferation and high sophistication of botnets are having serious consequences not only for enterprises and users, but also for cyber warfare between countries. Therefore, research to detect botnets is steadily progressing. However, the DGA-based botnet has a high detection rate with the existing signature and statistics-based technology, but also has a high limit in the false positive rate. Therefore, in this paper, we propose a detection model using text-based n-gram to detect DGA-based botnets. Through the proposed model, the detection rate, which is the limit of the existing detection technology, can be increased and the false positive rate can also be minimized. Through experiments on large-scale domain datasets and normal domains used in various DGA botnets, it was confirmed that the performance was superior to that of the existing model. It was confirmed that the false positive rate of the proposed model is less than 2 to 4%, and the overall detection accuracy and F1 score are both 97.5%. As such, it is expected that the detection and response capabilities of DGA-based botnets will be improved through the model proposed in this paper.

STUDY OF THE TENSILE BOND STRENGTH OF COMPOSITES RESINS APPLIED TO ACID-ECHED ENAMEL (산처리(酸處理)된 Enamel표면(表面)에 대(對)한 Composite resin의 인장접착강도(引張接着强度)에 관(關)한 연구(硏究))

  • Lee, Young-Kun;Min, Byung-Soon;Choi, Ho-Young;Park, Sung-Jin
    • Restorative Dentistry and Endodontics
    • /
    • v.12 no.2
    • /
    • pp.45-53
    • /
    • 1987
  • The purpose of this study was to evaluate the tensile bond strength between composite resin and the human enamel. Three composite resin systems, two chemical (Clearfil Posterior, and Clearfil Posterior-3) and one light cure (Photo Clearfil-A), used with and without an intermediate resin (clearfil bonding agent), were evaluated under different amounts of load (10g, 200g and 200g for a moment) for in vitro tensile bond strength to acid-eched human enamel. Clinically intact buccal or lingual surfaces of 144 freshly extracted human permanent molars, embedded in acrylic were flattened with No #600 carborundum discs. Samples were randomly assigned to the different materials and treatments using a table of random numbers. Eight samples were thus prepared for each group(Table 2) these surfaces were etched with an acid etchant (Kurarey Co. Japan) in a mode of etching for 30 seconds, washing for 15 seconds, and drying for 30-seconds. During the polymerization of composite resin on the acid-etched enamel surfaces with and without bonding agent 10-gram, 200 gram and temporary 200 gram of load were applied. The specimens were stored in 50% relation humidity at $37^{\circ}C$ for 24 hours before testing. An universal Testing machine (Intesco model No. 2010, Tokyo, Japan) was used to apply tensile loads in the vertical directed (fig 5), and the force required for separation was recorded with a cross head speed of 0.25 mm/min and 20 kg in full scale. The results were as follow: 1. The tensile bond strength was much greater in applying a bonding agent than in not doing that. 2. The tensile bond strength of chemical cure composite resin was higher than that of light cure composite resin with applying on bonding agent on the acid-etched enamel. 3. In case of not applying a bonding agents on the acid-etching enamel, the highest tensile bond strength under 200 gram of load was measured in light cure composite resin. 4. The tensile bond strength under 200-gram of load has no relation with applying the bonding agent. 5. Under the load of 10-gram, There was significant difference in tensile bond strength as applying the bonding agent.

  • PDF

Synthesis of Novel Chiral Diamino Alcohols and Their Application in Copper-Catalyzed Asymmetric Allylic Oxidation of Cycloolefins

  • Faraji, Laleh;Samadi, Saadi;Jadidi, Khosrow;Notash, Behrouz
    • Bulletin of the Korean Chemical Society
    • /
    • v.35 no.7
    • /
    • pp.1989-1995
    • /
    • 2014
  • The small library of new enantiomerically pure (S,S)-diamino alcohols 1 and their hydroxyldiamide precursors 2 were conveniently synthesized on a gram scale from inexpensive and commercially chiral pool amino acids. The catalytic and induced asymmetric effects of the chiral ligands 1 in the asymmetric allylic oxidation of cycloolefins were investigated.

Application of in-situ Flushing to the Soil Contaminated by Organic Compounds (유기물질에 의해 오염된 토양에 대한 in-situ세척기법의 적용성 연구)

  • 최상일;류두현;김형수
    • Journal of Korea Soil Environment Society
    • /
    • v.1 no.2
    • /
    • pp.61-72
    • /
    • 1996
  • A series of batch and lab-scale continuous tests were conducted to optimize the design parameters for the full-scale in-situ soil flushing experiments. The cleaning abilities of the surfactant solutions of Tween 80, Triton X-100 and SDS were compared for the soil artificially contaminated by hydrophobic organic contaminants: n-dodecane, naphthalene and anthracene. Tween 80 and Triton X-100 were shown to be efficient for n-dodecane. SDS and Tween 80 were shown to be efficient for naphthalene and anthracene. At the end of each column test, the sorbed amount of surfactant to soil was also measured. Tween 80 was found to be the least adsorbed surfactant to soil. The flushing ability at flowrate of 7 ml/min, was hampered comparing to flowrate of 3 and 5 ml/min. Initial pH of the soil did not significantly affect the flushing efficiencies. Tween 80 was determined as the most harmless surfactant for the Gram(+) and Gram(-) bacteria.

  • PDF

The Effects of Personality Traits on Stress Level, Stressor, and Stress Coping Style : Focused on S High School Students (성격특성이 스트레스 수준, 발생요인, 대처방식에 미치는 영향 : S고등학교 학생들을 중심으로)

  • Jo, Eun-Hyeon;Lee, Dong-Hyung
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.40 no.3
    • /
    • pp.107-115
    • /
    • 2017
  • The purpose of this study is to find and compare the differences in stress level, stressor, and coping style of high school students according to their personality traits. And to identify the personality traits contributing to stress reduction. The results of the study can be used as basic data for reducing the stress of adolescents to improve their academic achievement and maintain mental and physical stability. The questionnaire survey was conducted for 10 days from June 7 to June 16, 2017 for S high school students in Chungbuk Province. The study employed four measuring instruments as Ego-gram, Ok-gram, stressor scale, and stress coping type scale. Individual personality traits were classified into ego-states and ok-states in TA (Transactional Analysis). Stressors are classified into 5 categories (school life, home life, human relationship, myself, environment) and stress coping styles are classified into 4 categories (problem-focused coping, social-support coping, feeling-focused coping, hope-thinking coping) used in the previous studies. The collected data were analyzed using T-test, ANOVA, and multiple regression analysis. As a result, the stress level of high school students showed significant differences according to sex, year, grade, sleeping time, personality traits, and coping styles. Personality traits that contributed to the stress increase were NP, AC, 'you positive', 'i negative' and personality trait that contributes to stress reduction is 'you negative.' Therefore, we can judge that individual personality traits have significant effects on stress level, stressor, and coping style and need to find the effective stress management method suitable for individual personality traits using counseling, repetition training, self-suggestion etc.

Overproduction of Xanthophyll Pigment in Flavobacterium sp. JSWR-1 under Optimized Culture Conditions

  • Jegadeesh Raman;Young-Joon Ko;Jeong-Seon Kim;Da-Hye Kim;Soo-Jin Kim
    • Journal of Microbiology and Biotechnology
    • /
    • v.34 no.3
    • /
    • pp.710-724
    • /
    • 2024
  • Flavobacterium can synthesize xanthophyll, particularly the pigment zeaxanthin, which has significant economic value in nutrition and pharmaceuticals. Recently, the use of carotenoid biosynthesis by bacteria and yeast fermentation technology has shown to be very efficient and offers significant advantages in large-scale production, cost-effectiveness, and safety. In the present study, JSWR-1 strain capable of producing xanthophyll pigment was isolated from a freshwater reservoir in Wanju-gun, Republic of Korea. Based on the morphological, physiological, and molecular characteristics, JSWR-1 classified as belonging to the Flavobacterium species. The bacterium is strictly aerobic, Gram-negative, rod-shaped, and psychrophilic. The completed genome sequence of the strain Flavobacterium sp. JSWR-1 is predicted to be a single circular 3,425,829-bp chromosome with a G+C content of 35.2% and 2,941 protein-coding genes. The optimization of carotenoid production was achieved by small-scale cultivation, resulting in zeaxanthin being identified as the predominant carotenoid pigment. The enhancement of zeaxanthin biosynthesis by applying different light-irradiation, variations in pH and temperature, and adding carbon and nitrogen supplies to the growth medium. A significant increase in intracellular zeaxanthin concentrations was also recorded during fed-batch fermentation achieving a maximum of 16.69 ± 0.71 mg/l, corresponding to a product yield of 4.05 ± 0.15 mg zeaxanthin per gram cell dry weight. Batch and fed-batch culture extracts exhibit significant antioxidant activity. The results demonstrated that the JSWR-1 strain can potentially serve as a source for zeaxanthin biosynthesis.