• Title/Summary/Keyword: Deviation Tree

Search Result 63, Processing Time 0.022 seconds

Genetic characterization and population structure of six brown layer pure lines using microsatellite markers

  • Karsli, Taki;Balcioglu, Murat Soner
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.1
    • /
    • pp.49-57
    • /
    • 2019
  • Objective: The first stage in both breeding and programs for the conservation of genetic resources are the identification of genetic diversity in the relevant population. The aim of the present study is to identify genetic diversity of six brown layer pure chicken lines (Rhode Island Red [RIRI, RIRII], Barred Rock [BARI, BARII], Columbian Rock [COL], and line 54 [L-54]) with microsatellite markers. Furthermore, the study aims to employ its findings to discuss the possibilities for the conservation and sustainable use of these lines that have been bred as closed populations for a long time. Methods: In the present study, a total number of 180 samples belonging to RIRI (n = 30), RIRII (n = 30), BARI (n = 30), BARII (n = 30), L-54 (n = 30), and COL (n = 30) lines were genotyped using 22 microsatellite loci. Microsatellite markers are extremely useful tools in the identification of genetic diversity since they are distributed throughout the eukaryotic genome in multitudes, demonstrate co-dominant inheritance and they feature a high rate of polymorphism and repeatability. Results: In this study, we found all loci to be polymorphic and identified the average number of alleles per locus to be in the range between 4.41 (BARI) and 5.45 (RIRI); the observed heterozygosity to be in the range between 0.31 (RIRII) and 0.50 (BARII); and $F_{IS}$ (inbreeding coefficient) values in the range between 0.16 (L-54) and 0.46 (RIRII). The $F_{IS}$ values obtained in this context points out to a deviation from Hardy-Weinberg equilibrium due to heterozygote deficiency in six different populations. The Neighbour-Joining tree, Factorial Correspondence Analysis and STRUCTURE clustering analyzes showed that six brown layer lines were separated according to their genetic origins. Conclusion: The results obtained from the study indicate a medium level of genetic diversity, high level inbreeding in chicken lines and high level genetic differentiation between chicken lines.

Comparative characteristic of ensemble machine learning and deep learning models for turbidity prediction in a river (딥러닝과 앙상블 머신러닝 모형의 하천 탁도 예측 특성 비교 연구)

  • Park, Jungsu
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.35 no.1
    • /
    • pp.83-91
    • /
    • 2021
  • The increased turbidity in rivers during flood events has various effects on water environmental management, including drinking water supply systems. Thus, prediction of turbid water is essential for water environmental management. Recently, various advanced machine learning algorithms have been increasingly used in water environmental management. Ensemble machine learning algorithms such as random forest (RF) and gradient boosting decision tree (GBDT) are some of the most popular machine learning algorithms used for water environmental management, along with deep learning algorithms such as recurrent neural networks. In this study GBDT, an ensemble machine learning algorithm, and gated recurrent unit (GRU), a recurrent neural networks algorithm, are used for model development to predict turbidity in a river. The observation frequencies of input data used for the model were 2, 4, 8, 24, 48, 120 and 168 h. The root-mean-square error-observations standard deviation ratio (RSR) of GRU and GBDT ranges between 0.182~0.766 and 0.400~0.683, respectively. Both models show similar prediction accuracy with RSR of 0.682 for GRU and 0.683 for GBDT. The GRU shows better prediction accuracy when the observation frequency is relatively short (i.e., 2, 4, and 8 h) where GBDT shows better prediction accuracy when the observation frequency is relatively long (i.e. 48, 120, 160 h). The results suggest that the characteristics of input data should be considered to develop an appropriate model to predict turbidity.

Feature Extraction and Evaluation for Classification Models of Injurious Falls Based on Surface Electromyography

  • Lim, Kitaek;Choi, Woochol Joseph
    • Physical Therapy Korea
    • /
    • v.28 no.2
    • /
    • pp.123-131
    • /
    • 2021
  • Background: Only 2% of falls in older adults result in serious injuries (i.e., hip fracture). Therefore, it is important to differentiate injurious versus non-injurious falls, which is critical to develop effective interventions for injury prevention. Objects: The purpose of this study was to a. extract the best features of surface electromyography (sEMG) for classification of injurious falls, and b. find a best model provided by data mining techniques using the extracted features. Methods: Twenty young adults self-initiated falls and landed sideways. Falling trials were consisted of three initial fall directions (forward, sideways, or backward) and three knee positions at the time of hip impact (the impacting-side knee contacted the other knee ("knee together") or the mat ("knee on mat"), or neither the other knee nor the mat was contacted by the impacting-side knee ("free knee"). Falls involved "backward initial fall direction" or "free knee" were defined as "injurious falls" as suggested from previous studies. Nine features were extracted from sEMG signals of four hip muscles during a fall, including integral of absolute value (IAV), Wilson amplitude (WAMP), zero crossing (ZC), number of turns (NT), mean of amplitude (MA), root mean square (RMS), average amplitude change (AAC), difference absolute standard deviation value (DASDV). The decision tree and support vector machine (SVM) were used to classify the injurious falls. Results: For the initial fall direction, accuracy of the best model (SVM with a DASDV) was 48%. For the knee position, accuracy of the best model (SVM with an AAC) was 49%. Furthermore, there was no model that has sensitivity and specificity of 80% or greater. Conclusion: Our results suggest that the classification model built upon the sEMG features of the four hip muscles are not effective to classify injurious falls. Future studies should consider other data mining techniques with different muscles.

HTML Text Extraction Using Tag Path and Text Appearance Frequency (태그 경로 및 텍스트 출현 빈도를 이용한 HTML 본문 추출)

  • Kim, Jin-Hwan;Kim, Eun-Gyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1709-1715
    • /
    • 2021
  • In order to accurately extract the necessary text from the web page, the method of specifying the tag and style attributes where the main contents exist to the web crawler has a problem in that the logic for extracting the main contents. This method needs to be modified whenever the web page configuration is changed. In order to solve this problem, the method of extracting the text by analyzing the frequency of appearance of the text proposed in the previous study had a limitation in that the performance deviation was large depending on the collection channel of the web page. Therefore, in this paper, we proposed a method of extracting texts with high accuracy from various collection channels by analyzing not only the frequency of appearance of text but also parent tag paths of text nodes extracted from the DOM tree of web pages.

Predicting rock brittleness indices from simple laboratory test results using some machine learning methods

  • Davood Fereidooni;Zohre Karimi
    • Geomechanics and Engineering
    • /
    • v.34 no.6
    • /
    • pp.697-726
    • /
    • 2023
  • Brittleness as an important property of rock plays a crucial role both in the failure process of intact rock and rock mass response to excavation in engineering geological and geotechnical projects. Generally, rock brittleness indices are calculated from the mechanical properties of rocks such as uniaxial compressive strength, tensile strength and modulus of elasticity. These properties are generally determined from complicated, expensive and time-consuming tests in laboratory. For this reason, in the present research, an attempt has been made to predict the rock brittleness indices from simple, inexpensive, and quick laboratory test results namely dry unit weight, porosity, slake-durability index, P-wave velocity, Schmidt rebound hardness, and point load strength index using multiple linear regression, exponential regression, support vector machine (SVM) with various kernels, generating fuzzy inference system, and regression tree ensemble (RTE) with boosting framework. So, this could be considered as an innovation for the present research. For this purpose, the number of 39 rock samples including five igneous, twenty-six sedimentary, and eight metamorphic were collected from different regions of Iran. Mineralogical, physical and mechanical properties as well as five well known rock brittleness indices (i.e., B1, B2, B3, B4, and B5) were measured for the selected rock samples before application of the above-mentioned machine learning techniques. The performance of the developed models was evaluated based on several statistical metrics such as mean square error, relative absolute error, root relative absolute error, determination coefficients, variance account for, mean absolute percentage error and standard deviation of the error. The comparison of the obtained results revealed that among the studied methods, SVM is the most suitable one for predicting B1, B2 and B5, while RTE predicts B3 and B4 better than other methods.

Weather Conditions Drive the Damage Area Caused by Armillaria Root Disease in Coniferous Forests across Poland

  • Pawel Lech;Oksana Mychayliv;Robert Hildebrand;Olga Orman
    • The Plant Pathology Journal
    • /
    • v.39 no.6
    • /
    • pp.548-565
    • /
    • 2023
  • Armillaria root disease affects forests around the world. It occurs in many habitats and causes losses in the infested stands. Weather conditions are important factors for growth and development of Armillaria species. Yet, the relation between occurrence of damage caused by Armillaria disease and weather variables are still poorly understood. Thus, we used generalized linear mixed models to determine the relationship between weather conditions of current and previous year (temperature, precipitation and their deviation from long-term averages, air humidity and soil temperature) and the incidence of Armillaria-induced damage in young (up to 20 years old) and older (over 20 years old) coniferous stands in selected forest districts across Poland. We used unique data, gathered over the course of 23 years (1987-2009) on tree damage incidence from Armillaria root disease and meteorological parameters from the 24-year period (1986-2009) to reflect the dynamics of damage occurrence and weather conditions. Weather parameters were better predictors of damage caused by Armillaria disease in younger stands than in older ones. The strongest predictor was soil temperature, especially that of the previous year growing season and the current year spring. We found that temperature and precipitation of different seasons in previous year had more pronounced effect on the young stand area affected by Armillaria. Each stand's age class was characterized by a different set of meteorological parameters that explained the area of disease occurrence. Moreover, forest district was included in all models and thus, was an important variable in explaining the stand area affected by Armillaria.

Estimation of forest Site Productivity by Regional Environment and Forest Soil Factors (권역별 입지$\cdot$토양 환경 요인에 의한 임지생산력 추정)

  • Won Hyong-kyu;Jeong Jin-Hyun;Koo Kyo-Sang;Song Myung Hee;Shin Man Yong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.7 no.2
    • /
    • pp.132-140
    • /
    • 2005
  • This study was conducted to develop regional site index equations for main tree species in Gangwon, Gyunggi-Chungcheong, Gyungsang, and Jeolla area of Korea, using environmental and soil factors obtained from a digital forest site map. Using the large data set obtained from the digital forest map, a total of 28 environmental and soil factors were regressed on site index by tree species for developing the best site index equations for each of the regions. The selected main tree species were Larix 1eptolepis, Pinus koraiensis, Pinus densiflora, Pinus thunbergii, and Quercus acutissima. Finally, four to five environmental and soil factors by species were chosen as independent variables in defining the best regional site index equations with the highest coefficients of determination $(R^2)$. For those site index equations, three evaluation statistics such as mean difference, standard deviation of difference and standard error of difference were applied to the data sets independently collected from fields within the region. According to the evaluation statistics, it was found that the regional site index equations by species developed in this study conformed well to the independent data set, having relatively low bias and variation. It was concluded that the regional site index equations by species had sufficient capability for the estimation of site productivity.

Estimation of Forest Productivity for Post-Wild-fire Restoration in East Coastal Areas (동해안 산불피해지 복구를 위한 산림생산력의 추정)

  • Koo, Kyo-Sang;Lee, Myung-Jong;Shin, Man-Yong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.12 no.1
    • /
    • pp.36-44
    • /
    • 2010
  • In order to rehabilitate forest sites damaged by wildfire via natural or artificial restoration, it is important to determine right tree species, which can acclimate to biogeoclimatic environment at the sites. The objectives of this study were to develop site index equation of different tree species for estimating forest productivity and to provide information on species selection for post-wildfire restoration. Site index equation was developed based on environmental information from wildfire damaged areas in Gangneung, Goseong, Donghae, and Samcheok, where were located in east coastal areas of South Korea. Despite the small numbers (4~5) of environmental variables used for the development of the site index equations, statistical analysis (e.g. mean difference, standard deviation of difference, and standard error of difference) showed relatively low bias and variation, suggesting that those equations can provide relatively high capability of estimation and practical applicability with high effectiveness. The small numbers of the variables enabled the model to be applied in a wide range of usages including determination of appropriate tree species for post-wildfire restoration. The estimation of forest site productivity showed the possibility of large distribution in east coastal region as the best site for Korean ash (Fraxinus rhynchophylla) and original oak (Quercus variabilis) that can be used for firebreak in the region. These results imply that damages by forest fire can be reduced significantly by replacing existing pure coniferous forests in the area with ones dominated by broad-leaved deciduous stands, which can play an important role as fire break and/or prevent a transition from surface fire to crown fire.

Wavelet Image Coding according to the Activity Regions (활성 영역에 따른 웨이브렛 영상 부호화)

  • Park, Jeong-Ho;Kim, Dae-Jung;Gwak, Hun-Seong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.2
    • /
    • pp.30-38
    • /
    • 2002
  • In this paper, we propose a new method for image coding which efficiently use the relationship between the properties of spatial image and its wavelet transform. Firstly, an original image is decomposed into several layers by the wavelet transform, and simultaneously decomposed into 2$^n$$\times$2$^n$blocks. Each block is classified into two regions according to their standard deviation, i.e., low activity region(LAR) and high activity region(HAR). The region with low frequency in spatial domain does not only appears as zero regions in wavelet frequency domain like HL, LH, and HH but also gives little influence to the quality of reconstructed image. The other side, the high frequency regions are related to significant coefficients which gives much influence to image reconstruction. In this paper, we propose a image coding method to obtain high compression rate at low bit rate by these properties. The LAR region is encoded by LAR coding method which is proposed in this paper, the HAR by a technique similar to bitplane coding in hierarchical tree. Simulation results show that th,$\boxUl$ proposed coding method has better performance than EZW and SPIHT schemes in terms of image quality and transmitted bit rates, can be successfully applied to the application areas that require of progressive transmission.

LECSEN : Link Exchanged Chain in SEnsor Networks (링크 교환을 이용한 무선 센서 네트워크용 체인 토폴로지 : LECSEN)

  • Shin, Ji-Soo;Suh, Chang-Jin
    • The KIPS Transactions:PartC
    • /
    • v.15C no.4
    • /
    • pp.273-280
    • /
    • 2008
  • In WSN(Wireless Sensor Network) many routing algorithms such as LEACH, PEGASIS and PEDEP consisting of sensor nodes with limited energy have been proposed to extend WSN lifetime. Under the assumption of perfect fusion, these algorithms used convergecast that periodically collects sensed data from all sensor nodes to a base station. But because these schemes studied less energy consumption for a convergecast as well as fairly energy consumption altogether, the minimum energy consumption for a convergecast was not focused enough nor how topology influences to energy consumption. This paper deals with routing topology and energy consumption for a single convergecast in the following ways. We chose major WSN topology as MSC(Minimum Spanning Chain)s, MSTs, PEGASIS chains and proposed LECSEN chains. We solved the MSC length by Linear Programming(LP) and propose the LECSEN chain to compete with MST and MSC. As a result of simulation by Monte Carlo method for calculation of the topology length and standard deviation of link length, we learned that LECSEN is competitive with MST in terms of total energy consumption and shows the best with the view of even energy consumption at the sensor nodes. Thus, we concluded LECSEN is a very useful routing topology in WSN.