• Title/Abstract/Keywords: quantitative annotation

27 search results, processing time 0.022 seconds

Fillers in the Hong Kong Corpus of Spoken English (HKCSE)

  • Seto, Andy
    • 아시아태평양코퍼스연구
    • /
    • Vol. 2, No. 1
    • /
    • pp.13-22
    • /
    • 2021
  • The present study employed an analytical framework characterised by a synthesis of quantitative and qualitative analyses, together with a specially designed computer program, SpeechActConc, to examine speech acts in business communication. The naturally occurring data from the audio recordings and the prosodic transcriptions of the business sub-corpora of the HKCSE (prosodic) were manually annotated with a speech act taxonomy to determine the frequency of fillers, the co-occurring patterns of fillers with other speech acts, and the linguistic realisations of fillers. The discoursal function of fillers, to sustain the discourse or to hold the floor, has diverse linguistic realisations, ranging from a sound (e.g. 'uhuh') and a word (e.g. 'well') to sounds (e.g. 'um er') and words, namely phrases (e.g. 'sort of') and clauses (e.g. 'you know'). Some are even combinations of sound(s) and word(s) (e.g. 'and um', 'yes er um', 'sort of erm'). Among the top five most frequent linguistic realisations of fillers, 'er' and 'um' are the most common and are found in all six genres with relatively higher percentages of occurrence. The remaining frequent realisations consist of a clause ('you know'), a word ('yeah') and a sound ('erm'). These common forms are syntactically simpler than the less frequent realisations found in the genres. The co-occurring patterns of fillers and other speech acts are diverse. The speech acts that most commonly co-occur with fillers include informing and answering. The findings show that fillers are not only frequently used by speakers in spontaneous conversation but also mostly represented by sounds or non-linguistic realisations.

Mask Region-Based Convolutional Neural Network (R-CNN) Based Image Segmentation of Rays in Softwoods

  • Hye-Ji, YOO;Ohkyung, KWON;Jeong-Wook, SEO
    • Journal of the Korean Wood Science and Technology
    • /
    • Vol. 50, No. 6
    • /
    • pp.490-498
    • /
    • 2022
  • The current study aimed to verify how well rays in tangential thin sections of conifers can be segmented using artificial intelligence technology. The applied model was the Mask region-based convolutional neural network (Mask R-CNN), and softwoods (viz. Picea jezoensis, Larix gmelinii, Abies nephrolepis, Abies koreana, Ginkgo biloba, Taxus cuspidata, Cryptomeria japonica, Cedrus deodara, Pinus koraiensis) were selected for the study. To take digital pictures, thin sections 10-15 μm thick were cut using a microtome and then stained using a 1:1 mixture of 0.5% astra blue and 1% safranin. In the digital images, rays were selected as detection objects, and the Computer Vision Annotation Tool was used to annotate the rays in the training images taken from the tangential sections of the woods. The Mask R-CNN applied to select rays achieved a mean average precision as high as 0.837 and saved more than half of the time required to produce the ground truth. During the image analysis process, however, some rays were divided into two or more rays. This caused some errors in the measurement of the ray height. To improve the image processing algorithms, further work is required on combining the fragments of a ray into one ray segment and on increasing the precision of the boundary between rays and the neighboring tissues.

A bioinformatic approach to identify pathogenic variants for Stevens-Johnson syndrome

  • Muhammad Ma'ruf;Justitia Cahyani Fadli;Muhammad Reza Mahendra;Lalu Muhammad Irham;Nanik Sulistyani;Wirawan Adikusuma;Rockie Chong;Abdi Wira Septama
    • Genomics & Informatics
    • /
    • Vol. 21, No. 2
    • /
    • pp.26.1-26.9
    • /
    • 2023
  • Stevens-Johnson syndrome (SJS) produces a severe hypersensitivity reaction caused by Herpes simplex virus or mycoplasma infection, vaccination, systemic disease, or other agents. Several studies have investigated the genetic susceptibility involved in SJS. To provide further genetic insights into the pathogenesis of SJS, this study prioritized high-impact, SJS-associated pathogenic variants by integrating bioinformatic and population genetic data. First, we identified SJS-associated single nucleotide polymorphisms from the genome-wide association studies catalog, followed by genome annotation with HaploReg and variant validation with Ensembl. Subsequently, expression quantitative trait locus (eQTL) data from GTEx identified human genetic variants with differential gene expression across human tissues. Our results indicate that two variants, namely rs2074494 and rs5010528, which are encoded by the HLA-C (human leukocyte antigen C) gene, were found to be differentially expressed in skin. The allele frequencies for rs2074494 and rs5010528 also appear to differ significantly across continents. We highlight the utility of these population-specific HLA-C genetic variants for genetic association studies and for aiding early prognosis and disease treatment of SJS.

Predicting Learning Achievements with Indicators of Perceived Affordances Based on Different Levels of Content Complexity in Video-based Learning

  • Dasom KIM;Gyeoun JEONG
    • Educational Technology International
    • /
    • Vol. 25, No. 1
    • /
    • pp.27-65
    • /
    • 2024
  • The purpose of this study was to identify differences in learning patterns according to content complexity in video-based learning environments and to derive variables that have an important effect on learning achievement within particular learning contexts. To achieve our aims, we observed and collected data on learners' cognitive processes through perceived affordances, using behavioral logs and eye movements as specific indicators. These two types of reaction data were collected from 67 male and female university students who watched two learning videos, classified according to their task complexity, through the video learning player. The results showed that when the content complexity level was low, learners tended to navigate using other learners' digital logs, but when it was high, students tended to control the learning process and directly generate their own logs. In addition, using prediction models derived for each level of content complexity, we identified the important variables influencing learning achievement in the low content complexity group as those related to video playback and annotation. In comparison, in the high content complexity group, the important variables were related to active navigation of the learning video. This study attempted not only to apply novel variables in the field of educational technology, but also to provide qualitative observations on the learning process based on a quantitative approach.

Locating QTLs controlling overwintering seedling rate in perennial glutinous rice 89-1 (Oryza sativa L.)

  • Deng, Xiaoshu;Gan, Lu;Liu, Yan;Luo, Ancai;Jin, Liang;Chen, Jiao;Tang, Ruyu;Lei, Lixia;Tang, Jianghong;Zhang, Jiani;Zhao, Zhengwu
    • Genes and Genomics
    • /
    • Vol. 40, No. 12
    • /
    • pp.1351-1361
    • /
    • 2018
  • A new cold tolerant germplasm resource named glutinous rice 89-1 (Gr89-1, Oryza sativa L.) can overwinter using axillary buds, with these buds being ratooned the following year. The overwintering seedling rate (OSR) is an important factor for evaluating cold tolerance. Many quantitative trait loci (QTLs) controlling cold tolerance at different growth stages in rice have been identified, with some of these QTLs being successfully cloned. However, no QTLs conferring the OSR trait have been located in the perennial O. sativa L. To identify QTLs associated with OSR and to evaluate cold tolerance, 286 $F_{12}$ recombinant inbred lines (RILs) derived from a cross between the cold tolerant variety Gr89-1 and the cold sensitive variety Shuhui527 (SH527) were used. A total of 198 polymorphic simple sequence repeat (SSR) markers that were distributed uniformly on 12 chromosomes were used to construct the linkage map. The gene ontology (GO) annotation of the major QTL was performed through the rice genome annotation project system. Three main-effect QTLs (qOSR2, qOSR3, and qOSR8) were detected and mapped on chromosomes 2, 3, and 8, respectively. These QTLs were located in the intervals RM14208 (35,160,202 base pairs (bp))-RM208 (35,520,147 bp), RM218 (8,375,236 bp)-RM232 (9,755,778 bp), and RM5891 (24,626,930 bp)-RM23608 (25,355,519 bp), and explained 19.6%, 9.3%, and 11.8% of the phenotypic variation, respectively. The qOSR2 QTL displayed the largest effect, with a logarithm of odds (LOD) score of 5.5. A total of 47 candidate genes on the qOSR2 locus were associated with 219 GO terms. Among these candidate genes, 11 were related to cell membrane, 7 were associated with cold stress, and 3 were involved in response to stress and biotic stimulus. OsPIP1;3 was the only candidate gene related to stress, biotic stimulus, and cold stress that also encodes a cell membrane protein.
After QTL mapping, a total of three main-effect QTLs (qOSR2, qOSR3, and qOSR8) were detected on chromosomes 2, 3, and 8, respectively. Among these, qOSR2 explained the highest phenotypic variance. The elite traits of all the QTLs came from the cold-resistant parent Gr89-1. OsPIP1;3 might be a candidate gene of qOSR2.
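
The interval figures quoted in this abstract can be checked with simple arithmetic. The following is a minimal sketch, not the authors' analysis: it computes the physical length of each marker interval from the reported positions and converts the reported LOD score into the likelihood ratio it represents (a LOD score is the base-10 logarithm of a likelihood ratio). The variable names are illustrative assumptions.

```python
# Marker interval positions (bp) as reported in the abstract.
qtl_intervals = {
    "qOSR2": (35_160_202, 35_520_147),  # RM14208 to RM208
    "qOSR3": (8_375_236, 9_755_778),    # RM218 to RM232
    "qOSR8": (24_626_930, 25_355_519),  # RM5891 to RM23608
}

# Physical interval length = right marker position - left marker position.
interval_sizes = {
    name: end - start for name, (start, end) in qtl_intervals.items()
}

# LOD = log10(likelihood ratio), so the ratio is 10 ** LOD.
lod_qosr2 = 5.5  # reported LOD score for qOSR2
likelihood_ratio = 10 ** lod_qosr2

for name, size in interval_sizes.items():
    print(f"{name}: {size:,} bp")
print(f"qOSR2 likelihood ratio: {likelihood_ratio:,.0f}")
```

By this calculation, qOSR2 spans the narrowest of the three intervals even though it explains the largest share of the phenotypic variation.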

Detection of Urban Trees Using YOLOv5 from Aerial Images

  • 박채원;정형섭
    • 대한원격탐사학회지
    • /
    • Vol. 38, No. 6_2
    • /
    • pp.1633-1641
    • /
    • 2022
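  • Population concentration and indiscriminate development in cities cause various environmental problems, such as air pollution and the urban heat island effect, and contribute to man-made disasters by worsening the damage caused by natural disasters. Urban trees have been proposed as a solution to these urban problems and play important roles, such as providing environmental improvement functions. Accordingly, quantitative measurement and analysis of individual urban trees are required to understand the effect of trees on the urban environment. However, the complexity and diversity of urban trees lower the accuracy of single-tree detection. Therefore, this study detected urban trees using high-resolution aerial images, which allow effective detection of individual trees, and the You Only Look Once Version 5 (YOLOv5) model, which has shown excellent performance in object detection. Labeling guidelines were created for building an AI training dataset of trees, and box annotation was performed on trees in Dongjak-gu based on these guidelines. YOLOv5 models of various scales were tested on the constructed dataset, and the optimal model was adopted for efficient urban tree detection, yielding a meaningful result of mean Average Precision (mAP) 0.663.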
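
Box annotation and mAP both rest on comparing a predicted box with an annotated one, usually through intersection over union (IoU). The sketch below illustrates that overlap measure only; it is not the authors' evaluation code, and the function name `box_iou` and the example coordinates are hypothetical.

```python
def box_iou(box_a, box_b):
    """Return the IoU of two axis-aligned boxes (x_min, y_min, x_max, y_max)."""
    # Corners of the overlapping rectangle, if the boxes overlap at all.
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Clamp at zero so disjoint boxes contribute no intersection area.
    inter_w = max(0.0, ix_max - ix_min)
    inter_h = max(0.0, iy_max - iy_min)
    intersection = inter_w * inter_h

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection

    return intersection / union if union > 0 else 0.0


# Hypothetical annotated tree box and predicted box, in pixel coordinates.
annotated = (0, 0, 2, 2)
predicted = (1, 1, 3, 3)
print(box_iou(annotated, predicted))
```

A detection is typically counted as correct when its IoU with an annotated box exceeds a chosen threshold; average precision is then computed from the resulting precision-recall curve and averaged over classes to give mAP.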

An Integrated Genomic Resource Based on Korean Cattle (Hanwoo) Transcripts

  • Lim, Da-Jeong;Cho, Yong-Min;Lee, Seung-Hwan;Sung, Sam-Sun;Nam, Jung-Rye;Yoon, Du-Hak;Shin, Youn-Hee;Park, Hye-Sun;Kim, Hee-Bal
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 23, No. 11
    • /
    • pp.1399-1404
    • /
    • 2010
  • We have created a Bovine Genome Database, an integrated genomic resource for Bos taurus, by merging bovine data from various databases and our own data. We produced 55,213 Korean cattle (Hanwoo) ESTs from cDNA libraries from three tissues. We concentrated on genomic information based on Hanwoo transcripts and provided user-friendly search interfaces within the Bovine Genome Database. The genome browser supported alignment results for the various types of data: Hanwoo EST, consensus sequence, human gene, and predicted bovine genes. The database also provides transcript data information, gene annotation, genomic location, sequence and tissue distribution. Users can also explore bovine disease genes based on comparative mapping of homologous genes and can conduct searches centered on genes within user-selected quantitative trait loci (QTL) regions. The Bovine Genome Database can be accessed at http://bgd.nabc.go.kr.

Quantitative Proteogenomics and the Reconstruction of the Metabolic Pathway in Lactobacillus mucosae LM1

  • Pajarillo, Edward Alain B.;Kim, Sang Hoon;Lee, Ji-Yoon;Valeriano, Valerie Diane V.;Kang, Dae-Kyung
    • 한국축산식품학회지
    • /
    • Vol. 35, No. 5
    • /
    • pp.692-702
    • /
    • 2015
  • Lactobacillus mucosae is a natural resident of the gastrointestinal tract of humans and animals and a potential probiotic bacterium. To understand the global protein expression profile and metabolic features of L. mucosae LM1 in the early stationary phase, the QExactive™ Hybrid Quadrupole-Orbitrap Mass Spectrometer was used. Characterization of the intracellular proteome identified 842 proteins, accounting for approximately 35% of the 2,404 protein-coding sequences in the complete genome of L. mucosae LM1. Proteome quantification using QExactive™ Orbitrap MS detected 19 highly abundant proteins (> 1.0% of the intracellular proteome), including CysK (cysteine synthase, 5.41%) and EF-Tu (elongation factor Tu, 4.91%), which are involved in cell survival against environmental stresses. Metabolic pathway annotation of the LM1 proteome using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database showed that half of the expressed proteins are important for basic metabolic and biosynthetic processes, and the other half might be structurally important or involved in basic cellular processes. In addition, glycogen biosynthesis was activated in the early stationary phase, which is important for energy storage and maintenance. The proteogenomic data presented in this study provide a suitable reference for understanding the protein expression pattern of lactobacilli under standard conditions.
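
The coverage figure in this abstract follows directly from the two counts it reports. As a minimal sketch using only numbers stated above (the variable names are illustrative assumptions), the calculation is:

```python
# Counts and abundances as reported in the abstract.
identified_proteins = 842
protein_coding_sequences = 2404

# Proteome coverage: identified proteins as a share of predicted coding sequences.
coverage_percent = 100 * identified_proteins / protein_coding_sequences

# Combined share of the two named highly abundant proteins.
abundance_percent = {"CysK": 5.41, "EF-Tu": 4.91}
combined_abundance = sum(abundance_percent.values())

print(f"Proteome coverage: {coverage_percent:.1f}%")
print(f"CysK + EF-Tu: {combined_abundance:.2f}% of the intracellular proteome")
```

The result is consistent with the "approximately 35%" stated in the abstract.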

Validation of exercise-response genes in skeletal muscle cells of Thoroughbred racing horses

  • Kim, Doh Hoon;Lee, Hyo Gun;Sp, Nipin;Kang, Dong Young;Jang, Kyoung-Jin;Lee, Hak Kyo;Cho, Byung-Wook;Yang, Young Mok
    • Animal Bioscience
    • /
    • Vol. 34, No. 1
    • /
    • pp.134-142
    • /
    • 2021
  • Objective: To understand the athletic characteristics of Thoroughbreds, high-throughput analysis has been conducted using horse muscle tissue. However, an in vitro system has been lacking for studying and validating genes from in silico data. The aim of this study was to validate in vitro genes selected from the differentially expressed genes (DEGs) of our previous RNA-sequencing data. We also investigated the effects of exercise-induced stress, including heat, oxidative, hypoxic and cortisol stress, on horse skeletal muscle derived cells using the top six upregulated DEGs. Methods: Enriched pathway analysis was conducted using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) tool with genes upregulated in horse skeletal muscle tissue after exercise. Among the candidates, the top six genes were analysed through geneMANIA to investigate gene networks. Muscle cells derived from neonatal horse skeletal tissue were maintained and subjected to exercise-related stressors. Transcriptional changes in the top six genes following exposure to the stressors were investigated using quantitative reverse transcription-polymerase chain reaction (qRT-PCR). Results: The inflammation response pathway was the most commonly upregulated pathway after horse exercise. Under non-cytotoxic conditions of exercise-related stressors, the transcriptional response of the top six genes differed among types of stress. Oxidative stress yielded the expression pattern most similar to that of the DEGs. Conclusion: Our results indicate that transcriptional change after horse exercise in skeletal muscle tissue strongly relates to the stress response. The qRT-PCR results showed that stressors contribute differently to transcriptional regulation. These results provide valuable information for understanding horse exercise from the perspective of stress.

Expression and tissue distribution analysis of vimentin and transthyretin proteins associated with coat colors in sheep (Ovis aries)

  • Zhihong Yin;Zhisheng Ma;Siting Wang;Shitong Hao;Xinyou Liu;Quanhai Pang;Xinzhuang Wang
    • Animal Bioscience
    • /
    • Vol. 36, No. 9
    • /
    • pp.1367-1375
    • /
    • 2023
  • Objective: Pigment production and distribution are controlled through multiple proteins, resulting in different coat color phenotypes of sheep. Methods: The expression distribution of vimentin (VIM) and transthyretin (TTR) in white and black sheep skins was detected by liquid chromatography-electrospray ionization tandem MS (LC-ESI-MS/MS), gene ontology (GO) statistics, immunohistochemistry, Western blot, and quantitative real time polymerase chain reaction (qRT-PCR) to evaluate their role in the coat color formation of sheep. Results: LC-ESI-MS/MS results showed VIM and TTR proteins in white and black skin tissues of sheep. Meanwhile, GO functional annotation analysis suggested that VIM and TTR proteins were mainly concentrated in cellular components and biological processes, respectively. Western blot further confirmed that VIM and TTR proteins were expressed at significantly higher levels in black sheep skins than in white sheep skins. Immunohistochemistry notably detected VIM and TTR in the hair follicle, dermal papilla, and outer root sheath of white and black sheep skins. qRT-PCR results also revealed that the expression of VIM and TTR mRNAs was higher in black sheep skins than in white sheep skins. Conclusion: The expression of VIM and TTR was higher in black sheep skins than in white sheep skins, and the transcription and translation results were consistent in this study. VIM and TTR proteins were expressed in hair follicles of white and black sheep skins. These results suggest that VIM and TTR are involved in the coat color formation of sheep.