• 제목/요약/키워드: information retrieval.

Search Result 3,681, Processing Time 0.03 seconds

A study on Korean multi-turn response generation using generative and retrieval model (생성 모델과 검색 모델을 이용한 한국어 멀티턴 응답 생성 연구)

  • Lee, Hodong;Lee, Jongmin;Seo, Jaehyung;Jang, Yoonna;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.13-21
    • /
    • 2022
  • Recent deep learning-based research shows excellent performance in most natural language processing (NLP) fields with pre-trained language models. In particular, the auto-encoder-based language model proves its excellent performance and usefulness in various fields of Korean language understanding. However, the decoder-based Korean generative model even suffers from generating simple sentences. Also, there is few detailed research and data for the field of conversation where generative models are most commonly utilized. Therefore, this paper constructs multi-turn dialogue data for a Korean generative model. In addition, we compare and analyze the performance by improving the dialogue ability of the generative model through transfer learning. In addition, we propose a method of supplementing the insufficient dialogue generation ability of the model by extracting recommended response candidates from external knowledge information through a retrival model.

Using Roots and Patterns to Detect Arabic Verbs without Affixes Removal

  • Abdulmonem Ahmed;Aybaba Hancrliogullari;Ali Riza Tosun
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.4
    • /
    • pp.1-6
    • /
    • 2023
  • Morphological analysis is a branch of natural language processing, is now a rapidly growing field. The fundamental tenet of morphological analysis is that it can establish the roots or stems of words and enable comparison to the original term. Arabic is a highly inflected and derivational language and it has a strong structure. Each root or stem can have a large number of affixes attached to it due to the non-concatenative nature of Arabic morphology, increasing the number of possible inflected words that can be created. Accurate verb recognition and extraction are necessary nearly all issues in well-known study topics include Web Search, Information Retrieval, Machine Translation, Question Answering and so forth. in this work we have designed and implemented an algorithm to detect and recognize Arbic Verbs from Arabic text.The suggested technique was created with "Python" and the "pyqt5" visual package, allowing for quick modification and easy addition of new patterns. We employed 17 alternative patterns to represent all verbs in terms of singular, plural, masculine, and feminine pronouns as well as past, present, and imperative verb tenses. All of the verbs that matched these patterns were used when a verb has a root, and the outcomes were reliable. The approach is able to recognize all verbs with the same structure without requiring any alterations to the code or design. The verbs that are not recognized by our method have no antecedents in the Arabic roots. According to our work, the strategy can rapidly and precisely identify verbs with roots, but it cannot be used to identify verbs that are not in the Arabic language. We advise employing a hybrid approach that combines many principles as a result.

Cryopreservation of mesenchymal stem cells derived from dental pulp: a systematic review

  • Sabrina Moreira Paes;Yasmine Mendes Pupo;Bruno Cavalini Cavenago;Thiago Fonseca-Silva;Carolina Carvalho de Oliveira Santos
    • Restorative Dentistry and Endodontics
    • /
    • v.46 no.2
    • /
    • pp.26.1-26.15
    • /
    • 2021
  • Objectives: The aim of the present systematic review was to investigate the cryopreservation process of dental pulp mesenchymal stromal cells and whether cryopreservation is effective in promoting cell viability and recovery. Materials and Methods: This systematic review was developed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement and the research question was determined using the population, exposure, comparison, and outcomes strategy. Electronic searches were conducted in the PubMed, Cochrane Library, Science Direct, LILACS, and SciELO databases and in the gray literature (dissertations and thesis databases and Google Scholar) for relevant articles published up to March 2019. Clinical trial studies performed with dental pulp of human permanent or primary teeth, containing concrete information regarding the cryopreservation stages, and with cryopreservation performed for a period of at least 1 week were included in this study. Results: The search strategy resulted in the retrieval of 185 publications. After the application of the eligibility criteria, 21 articles were selected for a qualitative analysis. Conclusions: The cryopreservation process must be carried out in 6 stages: tooth disinfection, pulp extraction, cell isolation, cell proliferation, cryopreservation, and thawing. In addition, it can be inferred that the use of dimethyl sulfoxide, programmable freezing, and storage in liquid nitrogen are associated with a high rate of cell viability after thawing and a high rate of cell proliferation in both primary and permanent teeth.

A Study on Constructing a Digital Archive System of the Modern Korean Christian Collections (근대 한국기독교 자료의 디지털 아카이브 시스템 구축에 관한 연구)

  • Yang, Ji-Ann
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.8
    • /
    • pp.681-691
    • /
    • 2022
  • The purpose of this study is to construct a digital archive system by analyzing the collections of the Korean Christian Museum at S University, which has a large number of materials related to Korean Christianity published in the modern period from the time of Korea's enlightenment until liberation. In order to construct a digital archive system, indexes and metadata for the collection are complied according to the pre-defined format. After digitizing the selected collection, a database is built using metadata information, and the actual system is divided into a web standard-based management system and a user service system. Also a content-based search system is constructed, which provides the matching value of retrieval results in units of one character and an automatic search term completion function to enhance user convenience. Therefore, collections in the museum, which are difficult to access the original text, are digitized and provided so that they can be easily used, laying the foundation for the long-term development of humanities contents for improving the accessibility and availability of collections for both researchers and the public.

Percutaneous Transhepatic Removal of Migrated Biliary Stent from a Chronic Biloma Cavity (만성 담즙종 공동 내로 이동한 담도 스텐트의 경피경간적 제거)

  • Hyoung Nam Lee
    • Journal of the Korean Society of Radiology
    • /
    • v.81 no.2
    • /
    • pp.442-447
    • /
    • 2020
  • Iatrogenic foreign bodies are a challenging complication to both the interventional radiologist and patient, resulting in impaired quality of life and substantial financial cost. The case report describes a successful percutaneous transhepatic removal of an intra-abdominal foreign body. A 72-year-old man underwent surgery for placement of a retrievable covered stent for refractory bile leakage after left hemihepatectomy. Three days after placement, stent folding and migration into a chronic biloma cavity occurred via the bile leakage site. By using a balloon catheter technique, the folded stent could be straightened and repositioned into the bile duct to minimize stent-strut injury during retrieval. The interventional approach could be a valid treatment option for intra-abdominal foreign bodies, as well as intravascular foreign bodies. A thorough understanding of devices and techniques can provide the interventional radiologist with valuable information regarding procedural planning and the management of iatrogenic foreign bodies.

Performance Comparison and Error Analysis of Korean Bio-medical Named Entity Recognition (한국어 생의학 개체명 인식 성능 비교와 오류 분석)

  • Jae-Hong Lee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.4
    • /
    • pp.701-708
    • /
    • 2024
  • The advent of transformer architectures in deep learning has been a major breakthrough in natural language processing research. Object name recognition is a branch of natural language processing and is an important research area for tasks such as information retrieval. It is also important in the biomedical field, but the lack of Korean biomedical corpora for training has limited the development of Korean clinical research using AI. In this study, we built a new biomedical corpus for Korean biomedical entity name recognition and selected language models pre-trained on a large Korean corpus for transfer learning. We compared the name recognition performance of the selected language models by F1-score and the recognition rate by tag, and analyzed the errors. In terms of recognition performance, KlueRoBERTa showed relatively good performance. The error analysis of the tagging process shows that the recognition performance of Disease is excellent, but Body and Treatment are relatively low. This is due to over-segmentation and under-segmentation that fails to properly categorize entity names based on context, and it will be necessary to build a more precise morphological analyzer and a rich lexicon to compensate for the incorrect tagging.

The Effect of Users' Individual characteristics and Social Influence on Cyberethics and Usage in Web 2.0 - Comparing South Korea and U.S.A. - (웹 2.0 환경에서 사용자의 개인특성과 사회적 영향이 사이버윤리성과 사용성에 미치는 영향 - 한국과 미국의 비교연구 -)

  • Moon, Yun-Ji
    • Management & Information Systems Review
    • /
    • v.33 no.2
    • /
    • pp.101-118
    • /
    • 2014
  • In the mid-2000s, Web 2.0 appears and is becoming a general cultural code with the keyword of participation, sharing, and openness. Web 2.0, in which consumption is being transformed by the participatory web culture, has evolved. However, associated with the evolution of Web 2.0, several significant concerns appears in a society. Among them, this study will focuses on the cyber-ethics issues. There are limitations to solve the cyber-ethics problems only in the technical and legal approaches. Therefore, the current article intends to consider comprehensively the antecedents of cyber-ethics such as individual characteristics, social influence, and cultural characteristics. Specifically, (1) Do individual characteristics(i.e., self-efficacy, locus of control) affect cyber-ethics in the Web 2.0 environment?, (2) Do social influence(i.e., subjective norm) have an effect on cyber-ethics?, (3) Do cyber -ethics have an impact on user participation in the Web 2.0 services(i.e., retrieval and creation)?, finally (4) Do international cultural difference have a moderation effect on the relationship between cyber-ethics and user participation? For testing empirically the hypothesized research model, this study collected questionnaires in South Korea as well as U.S.A. The results showed that individual characteristics and social influence affect cyber-ethics toward user's creative activities in Web 2.0 sites.

  • PDF

Elicitation of Collective Intelligence by Fuzzy Relational Methodology (퍼지관계 이론에 의한 집단지성의 도출)

  • Joo, Young-Do
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.17-35
    • /
    • 2011
  • The collective intelligence is a common-based production by the collaboration and competition of many peer individuals. In other words, it is the aggregation of individual intelligence to lead the wisdom of crowd. Recently, the utilization of the collective intelligence has become one of the emerging research areas, since it has been adopted as an important principle of web 2.0 to aim openness, sharing and participation. This paper introduces an approach to seek the collective intelligence by cognition of the relation and interaction among individual participants. It describes a methodology well-suited to evaluate individual intelligence in information retrieval and classification as an application field. The research investigates how to derive and represent such cognitive intelligence from individuals through the application of fuzzy relational theory to personal construct theory and knowledge grid technique. Crucial to this research is to implement formally and process interpretatively the cognitive knowledge of participants who makes the mutual relation and social interaction. What is needed is a technique to analyze cognitive intelligence structure in the form of Hasse diagram, which is an instantiation of this perceptive intelligence of human beings. The search for the collective intelligence requires a theory of similarity to deal with underlying problems; clustering of social subgroups of individuals through identification of individual intelligence and commonality among intelligence and then elicitation of collective intelligence to aggregate the congruence or sharing of all the participants of the entire group. Unlike standard approaches to similarity based on statistical techniques, the method presented employs a theory of fuzzy relational products with the related computational procedures to cover issues of similarity and dissimilarity.

A Semantic Classification Model for Educational Resource Repositories (교육용 자원 저장소를 위한 의미적 분류 모델)

  • Choi, Myoung-Hoi;Jeong, Dong-Won
    • Journal of KIISE:Databases
    • /
    • v.34 no.1
    • /
    • pp.35-45
    • /
    • 2007
  • This paper proposes a classification model for systematical management of resources in educational repositories. A classification scheme should be provided to systematically store and manage, precisely retrieve, and maximize the usability of the resources. However, there is little research result on the classification scheme and classification model for educational repository resources. It causes several issues such as inefficient management of educational resources, incorrect retrieval, and low usability. However, there are different characteristics between the educational resource information and information of the previous fields. Therefore, a novel research on the classification scheme and classification model for the resources in educational repositories is required. To achieve the goal for efficient and easy use of the educational resources, we should manage consistently the resources according to the classification scheme accepting several views. This paper proposes a classification model to systematically manage and increase the usability of the educational resources. In other words, the proposed classification model can manages dynamically the classification scheme for the resources in educational repositories according to various views. To achieve the objectives, we first define a proper classification scheme for the implementation resources based on the classification scheme in relevant scientific technology fields. Especially, we define a novel classification model to dynamically manage the defined classification scheme. The proposed classification scheme and classification model enable more precise and systematic management of implementation resources and also increase the ease of usability.

Determining the number of Clusters in On-Line Document Clustering Algorithm (온라인 문서 군집화에서 군집 수 결정 방법)

  • Jee, Tae-Chang;Lee, Hyun-Jin;Lee, Yill-Byung
    • The KIPS Transactions:PartB
    • /
    • v.14B no.7
    • /
    • pp.513-522
    • /
    • 2007
  • Clustering is to divide given data and automatically find out the hidden meanings in the data. It analyzes data, which are difficult for people to check in detail, and then, makes several clusters consisting of data with similar characteristics. On-Line Document Clustering System, which makes a group of similar documents by use of results of the search engine, is aimed to increase the convenience of information retrieval area. Document clustering is automatically done without human interference, and the number of clusters, which affect the result of clustering, should be decided automatically too. Also, the one of the characteristics of an on-line system is guarantying fast response time. This paper proposed a method of determining the number of clusters automatically by geometrical information. The proposed method composed of two stages. In the first stage, centers of clusters are projected on the low-dimensional plane, and in the second stage, clusters are combined by use of distance of centers of clusters in the low-dimensional plane. As a result of experimenting this method with real data, it was found that clustering performance became better and the response time is suitable to on-line circumstance.