• 제목/요약/키워드: dataset records

검색결과 98건 처리시간 0.023초

Genetic parameter analysis of reproductive traits in Large White pigs

  • Yu, Guanghui;Wang, Chuduan;Wang, Yuan
    • Animal Bioscience
    • /
    • 제35권11호
    • /
    • pp.1649-1655
    • /
    • 2022
  • Objective: The primary objective of this study was to determine the genetic parameters for reproductive traits among Large White pigs, including the following traits: total number born (TNB), number born alive (NBA), litter birth weight (LBW), average birth weight (ABW), gestation length (GL), age at first service (AFS) and age at first farrowing (AFF). Methods: The dataset consisted of 19,036 reproductive records from 4,986 sows, and a multi-trait animal model was used to estimate genetic variance components of seven reproductive traits. Results: The heritability estimates for these reproductive traits ranged from 0.09 to 0.26, with the highest heritability for GL and AFF, and the lowest heritability for NBA. The repeatabilities for TNB, NBA, LWB, ABW, and GL were ranged from 0.16 to 0.34. Genetic and phenotypic correlations ranged from -0.41 to 0.99, and -0.34 to 0.98, respectively. In particular, the correlations between TNB, NBA and LBW, between AFS and AFF, exhibited a strong positive correlation. Furthermore, for TNB, NBA, LBW, ABW, and GL, genetic correlations of the same trait between different parities were moderately to strongly correlated (0.32 to 0.97), and the correlations of adjacent parities were higher than those of nonadjacent parities. Conclusion: All the results in the present study can be used as a basis for the genetic assessment of the target population. In the formulation of dam line selection index, AFS or AFF can be considered to combine with TNB in a multiple trait swine breeding value estimation system. Moreover, breeders are encouraged to increase the proportion of sows at parity 3-5 and reinforce the management of sows at parity 1 and parity ≥8.

사립대학 행정정보 데이터세트 운영 개선 방안 (A Study on the Improvement of Administrative Information Data Set Operation of Private Universities)

  • 김현정;배성중
    • 기록학연구
    • /
    • 제74호
    • /
    • pp.187-222
    • /
    • 2022
  • 본 연구의 목적은 사립대학의 행정정보 데이터세트 운영 현황을 분석하고 개선방안을 제시하는 것이다. 이를 위해 사립대학 178개교의 총 820개 시스템에 대해 시스템의 기능, 개발 유형, 데이터의 생성·정정·삭제 시기 등의 정량적인 분석을 실시하였다. 분석 결과, 통상 1개이상의 행정정보시스템을 보유하고 학사관리시스템을 공통적으로 사용하고 있으며 대학의 인프라를 통해 시스템을 자체적으로 개발한 사례가 많고 데이터를 수시로 생성하고 정정하며 업무담당자에 의해 데이터가 삭제되고 있으나 데이터의 삭제나 정정에 대한 규정은 명확하지 않다는 문제점이 도출되었다. 이러한 문제점들을 해결하기 위한 개선 방안으로 범정부 EA포털을 현행화하여 사립대학의 행정정보시스템에 대한 보유 현황을 제대로 파악하고, 데이터의 정정이 이루어지지 않는 시스템을 중심으로 기록관리하며 데이터의 임의 삭제가 이루어지지 않도록 내부 규정의 개정과 교육을 실시할 것 등을 제안하였다.

Northern distribution limits and future suitable habitats of warm temperate evergreen broad-leaved tree species designated as climate-sensitive biological indicator species in South Korea

  • Sookyung, Shin;Jung-Hyun, Kim;Duhee, Kang;Jin-Seok, Kim;Hong Gu, Kang;Hyun-Do, Jang;Jongsung, Lee;Jeong Eun, Han;Hyun Kyung, Oh
    • Journal of Ecology and Environment
    • /
    • 제46권4호
    • /
    • pp.292-303
    • /
    • 2022
  • Background: Climate change significantly influences the geographical distribution of plant species worldwide. Selecting indicator species allows for better-informed and more effective ecosystem management in response to climate change. The Korean Peninsula is the northernmost distribution zone of warm temperate evergreen broad-leaved (WTEB) species in Northeast Asia. Considering the ecological value of these species, we evaluated the current distribution range and future suitable habitat for 13 WTEB tree species designated as climate-sensitive biological indicator species. Results: Up-to-date and accurate WTEB species distribution maps were constructed using herbarium specimens and citizen science data from the Korea Biodiversity Observation Network. Current northern limits for several species have shifted to higher latitudes compared to previous records. For example, the northern latitude limit for Stauntonia hexaphylla is higher (37° 02' N, Deokjeokdo archipelago) than that reported previously (36° 13' N). The minimum temperature of the coldest month (Bio6) is the major factor influencing species distribution. Under future climate change scenarios, suitable habitats are predicted to expand toward higher latitudes inland and along the western coastal areas. Conclusions: Our results support the suitability of WTEB trees as significant biological indicators of species' responses to warming. The findings also suggest the need for consistent monitoring of species distribution shifts. This study provides an important baseline dataset for future monitoring and management of indicator species' responses to changing climate conditions in South Korea.

A Client-Side App Model for Classifying and Storing Documents

  • Elhussein, Bahaeldein;Karrar, Abdelrahman Elsharif;Khalifa, Mahmoud;Alsharani, Mohammed Mujib
    • International Journal of Computer Science & Network Security
    • /
    • 제22권5호
    • /
    • pp.225-233
    • /
    • 2022
  • Due to the large number of documents that are important to people and many of their requests from time to time to perform an essential official procedure, this requires a practical arrangement and organization for them. When necessary, many people struggle with effectively arranging official documents that enable display, which takes a lot of time and effort. Also, no mobile apps specialize in professionally preserving essential electronic records and displaying them when needed. Dataset consisting of 10,841 rows and 13 columns was analyzed using Anaconda, Python, and Mito Data Science new tool obtained from Google Play. The research was conducted using the quantitative descriptive approach. The presented solution is a model specialized in saving essential documents, categorizing according to the user's desire, and displaying them when needed. It is possible to send in an image or a pdf file. Aside from identifying file kinds like PDFs and pictures, the model also looks for and verifies specific file extensions. The file extension and its properties are checked before sharing or saving it by applying the similarity algorithm (Levenshtein). Our method effectively and efficiently facilitated the search process, saving the user time and effort. In conclusion, such an application is not available, which facilitates the process of classifying documents effectively and displaying them quickly and easily for people for printing or sending to some official procedures, and it is considered one of the applications that greatly help in preserving time, effort, and money for people.

개인정보 비식별화를 위한 개체명 유형 재정의와 학습데이터 생성 방법 (Re-defining Named Entity Type for Personal Information De-identification and A Generation method of Training Data)

  • 최재훈;조상현;김민호;권혁철
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2022년도 춘계학술대회
    • /
    • pp.206-208
    • /
    • 2022
  • 최근 빅데이터 산업이 큰 폭으로 발전하는 만큼 개인정보 유출로 인한 사생활 침해 문제의 관심도 높아졌다. 자연어 처리 분야에서는 이를 개체명 인식을 통해 자동화하려는 시도들이 있었다. 본 논문에서는 한국어 위키피디아 문서의 본문에서 비식별화 정보를 지닌 문장을 식별해 반자동으로 개체명 인식 데이터를 구축한다. 이는 범용적인 개체명 인식 데이터에 반해 비식별화 대상이 아닌 정보에 대해 학습되는 비용을 줄일 수 있다. 또한, 비식별화 정보를 분류하기 위해 규칙 및 통계 기반의 추가적인 시스템을 최소화할 수 있는 장점을 가진다. 본 논문에서 제안하는 개체명 인식 데이터는 총 12개의 범주로 분류하며 의료 기록, 가족 관계와 같은 비식별화 대상이 되는 정보를 포함한다. 생성된 데이터셋을 이용한 실험에서 KoELECTRA는 0.87796, RoBERTa는 0.88575의 성능을 보였다.

  • PDF

Negative association between high temperature-humidity index and milk performance and quality in Korean dairy system: big data analysis

  • Dongseok Lee;Daekyum Yoo;Hyeran Kim;Jakyeom Seo
    • Journal of Animal Science and Technology
    • /
    • 제65권3호
    • /
    • pp.588-595
    • /
    • 2023
  • The aim of this study was to investigate the effects of heat stress on milk traits in South Korea using comprehensive data (dairy production and climate). The dataset for this study comprised 1,498,232 test-day records for milk yield, fat- and protein-corrected milk, fat yield, protein yield, milk urea nitrogen (MUN), and somatic cell score (SCS) from 215,276 Holstein cows (primiparous: n = 122,087; multiparous: n = 93,189) in 2,419 South Korean dairy herds. Data were collected from July 2017 to April 2020 through the Dairy Cattle Improvement Program, and merged with meteorological data from 600 automatic weather stations through the Korea Meteorological Administration. The segmented regression model was used to estimate the effects of the temperature-humidity index (THI) on milk traits and elucidate the break point (BP) of the THI. To acquire the least-squares mean of milk traits, the generalized linear model was applied using fixed effects (region, calving year, calving month, parity, days in milk, and THI). For all parameters, the BP of THI was observed; in particular, milk production parameters dramatically decreased after a specific BP of THI (p < 0.05). In contrast, MUN and SCS drastically increased when THI exceeded BP in all cows (p < 0.05) and primiparous cows (p < 0.05), respectively. Dairy cows in South Korea exhibited negative effects on milk traits (decrease in milk performance, increase in MUN, and SCS) when the THI exceeded 70; therefore, detailed feeding management is required to prevent heat stress in dairy cows.

Assessing reproductive performance and predictive models for litter size in Landrace sows under tropical conditions

  • Praew Thiengpimol;Skorn Koonawootrittriron;Thanathip Suwanasopee
    • Animal Bioscience
    • /
    • 제37권8호
    • /
    • pp.1333-1344
    • /
    • 2024
  • Objective: Litter size and piglet loss at birth significantly impact piglet production and are closely associated with sow parity. Understanding how these traits vary across different parities is crucial for effective herd management. This study investigates the patterns of the number of born alive piglets (NBA), number of piglet losses (NPL), and the proportion of piglet losses (PPL) at birth in Landrace sows under tropical conditions. Additionally, it aims to identify the most suitable model for describing these patterns. Methods: A dataset comprising 2,322 consecutive reproductive records from 258 Landrace sows, spanning parities from 1 to 9, was analyzed. Modeling approaches including 2nd and 3rd degree polynomial models, the Wood gamma function, and a longitudinal model were applied at the individual level to predict NBA, NPL, and PPL. The choice of the best-fitting model was determined based on the lowest mean and standard deviation of the difference between predicted and actual values, Akaike information criterion (AIC), and Bayesian information criterion (BIC). Results: Sow parity significantly influenced NBA, NPL, and PPL (p<0.0001). NBA increased until the 4th parity and then declined. In contrast, NPL and PPL decreased until the 2nd parity and then steadily increased until the 8th parity. The 2nd and 3rd degree polynomials, and longitudinal models showed no significant differences in predicting NBA, NPL, and PPL (p>0.05). The 3rd degree polynomial model had the lowest prediction standard deviation and yielded the smallest AIC and BIC. Conclusion: The 3rd degree polynomial model offers the most suitable description of NBA, NPL, and PPL patterns. It holds promise for applications in genetic evaluations to enhance litter size and reduce piglet loss at birth in sows. These findings highlight the importance of accounting for sow parity effects in swine breeding programs, particularly in tropical conditions, to optimize piglet production and sow performance.

ADVANTAGES OF USING ARTIFICIAL NEURAL NETWORKS CALIBRATION TECHNIQUES TO NEAR-INFRARED AGRICULTURAL DATA

  • Buchmann, Nils-Bo;Ian A.Cowe
    • 한국근적외분광분석학회:학술대회논문집
    • /
    • 한국근적외분광분석학회 2001년도 NIR-2001
    • /
    • pp.1032-1032
    • /
    • 2001
  • Artificial Neural Network (ANN) calibration techniques have been used commercially for agricultural applications since the mid-nineties. Global models, based on transmission data from 850 to 1050 nm, are used routinely to measure protein and moisture in wheat and barley and also moisture in triticale, rye, and oats. These models are currently used commercially in approx. 15 countries throughout the world. Results concerning earlier European ANN models are being published elsewhere. Some of the findings from that study will be discussed here. ANN models have also been developed for coarsely ground samples of compound feed and feed ingredients, again measured in transmission mode from 850 to 1050 nm. The performance of models for pig- and poultry feed will be discussed briefly. These models were developed from a very large data set (more than 20,000 records), and cover a very broad range of finished products. The prediction curves are linear over the entire range for protein, fat moisture, fibre, and starch (measured only on poultry feed), and accuracy is in line with the performance of smaller models based on Partial Least Squares (PLS). A simple bias adjustment is sufficient for calibration transfer across instruments. Recently, we have investigated the possible use of ANN for a different type of NIR spectrometer, based on reflectance data from 1100 to 2500 nm. In one study, based on data for protein, fat, and moisture measured on unground compound feed samples, dedicated ANN models for specific product classes (cattle feed, pig feed, broiler feed, and layers feed) gave moderately better Standard Errors of Prediction (SEP) compared to modified PLS (MPLS). However, if the four product classes were combined into one general calibration model, the performance of the ANN model deteriorated only slightly compared to the class-specific models, while the SEP values for the MPLS predictions doubled. Brix value in molasses is a measure of sugar content. Even with a huge dataset, PLS models were not sufficiently accurate for commercial use. In contrast an ANN model based on the same data improved the accuracy considerably and straightened out non-linearity in the prediction plot. The work of Mr. David Funk (GIPSA, U. S. Department of Agriculture) who has studied the influence of various types of spectral distortions on ANN- and PLS models, thereby providing comparative information on the robustness of these models towards instrument differences, will be discussed. This study was based on data from different classes of North American wheat measured in transmission from 850 to 1050 nm. The distortions studied included the effect of absorbance offset pathlength variation, presence of stray light bandwidth, and wavelength stretch and offset (either individually or combined). It was shown that a global ANN model was much less sensitive to most perturbations than class-specific GIPSA PLS calibrations. It is concluded that ANN models based on large data sets offer substantial advantages over PLS models with respect to accuracy, range of materials that can be handled by a single calibration, stability, transferability, and sensitivity to perturbations.

  • PDF

젖소군의 유생산 수준별 이질성 분산 보정 (Adjustment of heterogeneous variance by milk production level of dairy herd)

  • 조광현;이준호;박경도
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권4호
    • /
    • pp.737-743
    • /
    • 2014
  • 본 연구는 젖소의 검정성적, 총 502,228개를 이용하여 젖소군 분산에 대한 이질성을 비교 검증하고 동질적인 분산을 유도하기 위하여 수행되었다. 유량, 유지방량과 유단백량에 대한 유전력은 각각 0.28, 0.26과 0.24로 추정되었으며, 출생년도별 평균 육종가는 HV (heterogenous variance) 모형의 추정치가 animal 모형의 것보다 일괄적으로 낮게 나타났다. 국제유전능력평가 기준에 적용된 씨수소 545두에 대한 유량, 유지방량, 유단백량의 평균 육종가는 각각 453.54kg, 10.75kg과 14.33kg이었으며, 이질성이 보정된 경우의 평균 육종가는 각각 432.06kg, 10.15kg과 13.40kg으로서 모든 유생산형질에서 보정된 자료의 육종가 평균이 일괄적으로 낮게 나타났다. animal 모형에서 데이터세트 I과 II사이의 표현형 상관계수는 유량 0.839, 유지방량 0.821, 유단백량 0.837이었으며, HV 모형에서는 유량 0.841, 유지방량 0.820, 유단백량 0.836으로 두 모형에서 거의 유사한 결과를 나타내었다. animal 모형과 HV 모형으로 비교한 결과 유량에서 ${\theta}$값은 15.157에서 16.105, 유지방량은 -0.227에서 -0.196으로 증가하였으나 유단백량의 경우 0.630에서 0.586으로 감소하였다.

주색도 분석을 적용한 비음수 행렬 분해 기반의 광원 추정 (Illumination Estimation Based on Nonnegative Matrix Factorization with Dominant Chromaticity Analysis)

  • 이지헌;김대철;하영호
    • 전자공학회논문지
    • /
    • 제52권8호
    • /
    • pp.89-96
    • /
    • 2015
  • 인간의 시각은 색순응을 통해서 사물의 색을 광원의 색에 영향 없이 인지 할 수 있다. 반면에, 카메라는 입력 값을 그대로 기록하기 때문에, 광원에 따라 물체의 색이 다르게 나타난다. 최근에 희박성 제약조건의 비음수 행렬 분해(nonnegative matrix factorization with sparseness constraint; NMFsc)를 이용한 광원추정 방법이 제안되었다. 이 방법은 낮은 희박성 제약조건을 사용해서 광원을 추정하고, 높은 희박성 제약조건을 사용해서 반사율을 추정한다. 하지만, 희박성 제약조건의 비음수 행렬분해를 이용한 광원 추정 방법은, 영상의 전역적인 정보를 사용하므로, 영상에서 동일한 색이 넓은 영역에 존재하는 경우, 추정된 광원이 큰 오차를 가진다. 이러한 단점을 보완하기 위해, 영상에서 주색도 분석과 희박성 제약조건의 비음수 행렬 분해를 이용한 광원 추정 방법을 제안하였다. 먼저 주색도를 분석하기 위해 영상을 색도 좌표계로 옮기고 색도 히스토그램을 이용하여 유사한 색도를 가지는 영역들로 영상을 분할한다. 다음으로 영상의 주색도는 분할된 영상들 중 색도의 표준편차가 가장 적은 영상의 색도로 선택한다. 마지막으로 주색도 분석 결과와 희박성 제약조건의 비음수 행렬 분해를 이용해 입력 영상에서 주색도 성분을 제거하고 최종적인 광원을 추정한다. 실제 촬영 영상에 대한 평균 각오차를 사용하여 기존의 방법과의 성능을 비교하였고, 그 결과 제안하는 방법의 평균 각 오차는 5.5를 나타내어 영상의 주 색도를 포함하여 광원을 추정한 기존 방법의 평균 각 오차 5.7 보다 우수한 성능을 나타내었다.