• Title/Summary/Keyword: Entity-based

Search Result 753, Processing Time 0.029 seconds

Lectotypification of Anaphalis morii Nakai (Asteraceae) with Brief Discussions of its Taxonomic Entity

  • Dong Hyuk Lee;Jun Gi Byeon;Tae Im Heo;Byeong Joo Park;Ji Dong Kim;Jun Woo Lee;Byeong Kwon Lee;Byeong-Hee Choi
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2020.08a
    • /
    • pp.16-16
    • /
    • 2020
  • Systematic studies for genus Anaphalis in Koreaare relatively scarce. As a fundamental step to further taxonomic studies of this genus, we here propose lectotype for A. morii, endemic taxa in Korea with a brief re-evaluation of its taxonomic entity. During the examination of herbarium specimen for A. morii, we found nine sheets of five collections in three herbaria. Among the original materials, we selected a specimen in TI which was first cited by the author and include an additional annotation, matching with his own description. Also, we were able to determine the taxonomic relationships between A. morii and its relatives, A. yakusimensis. Based on our observation, we identified that several morphological characters are clearly differ from A. yakusimensis occurring only Isl. Yakushima in Japan.

  • PDF

A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)

  • Yang, Yunseok;Lee, Hyun Jun;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.25-38
    • /
    • 2019
  • Selecting high-quality information that meets the interests and needs of users among the overflowing contents is becoming more important as the generation continues. In the flood of information, efforts to reflect the intention of the user in the search result better are being tried, rather than recognizing the information request as a simple string. Also, large IT companies such as Google and Microsoft focus on developing knowledge-based technologies including search engines which provide users with satisfaction and convenience. Especially, the finance is one of the fields expected to have the usefulness and potential of text data analysis because it's constantly generating new information, and the earlier the information is, the more valuable it is. Automatic knowledge extraction can be effective in areas where information flow is vast, such as financial sector, and new information continues to emerge. However, there are several practical difficulties faced by automatic knowledge extraction. First, there are difficulties in making corpus from different fields with same algorithm, and it is difficult to extract good quality triple. Second, it becomes more difficult to produce labeled text data by people if the extent and scope of knowledge increases and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, problem definition for automatic knowledge extraction is not easy because of ambiguous conceptual characteristics of knowledge. So, in order to overcome limits described above and improve the semantic performance of stock-related information searching, this study attempts to extract the knowledge entity by using neural tensor network and evaluate the performance of them. Different from other references, the purpose of this study is to extract knowledge entity which is related to individual stock items. Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous researches and to enhance the effectiveness of the model. From these processes, this study has the following three significances. First, A practical and simple automatic knowledge extraction method that can be applied. Second, the possibility of performance evaluation is presented through simple problem definition. Finally, the expressiveness of the knowledge increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and objective performance evaluation method are also presented. The empirical study to confirm the usefulness of the presented model, experts' reports about individual 30 stocks which are top 30 items based on frequency of publication from May 30, 2017 to May 21, 2018 are used. the total number of reports are 5,600, and 3,074 reports, which accounts about 55% of the total, is designated as a training set, and other 45% of reports are designated as a testing set. Before constructing the model, all reports of a training set are classified by stocks, and their entities are extracted using named entity recognition tool which is the KKMA. for each stocks, top 100 entities based on appearance frequency are selected, and become vectorized using one-hot encoding. After that, by using neural tensor network, the same number of score functions as stocks are trained. Thus, if a new entity from a testing set appears, we can try to calculate the score by putting it into every single score function, and the stock of the function with the highest score is predicted as the related item with the entity. To evaluate presented models, we confirm prediction power and determining whether the score functions are well constructed by calculating hit ratio for all reports of testing set. As a result of the empirical study, the presented model shows 69.3% hit accuracy for testing set which consists of 2,526 reports. this hit ratio is meaningfully high despite of some constraints for conducting research. Looking at the prediction performance of the model for each stocks, only 3 stocks, which are LG ELECTRONICS, KiaMtr, and Mando, show extremely low performance than average. this result maybe due to the interference effect with other similar items and generation of new knowledge. In this paper, we propose a methodology to find out key entities or their combinations which are necessary to search related information in accordance with the user's investment intention. Graph data is generated by using only the named entity recognition tool and applied to the neural tensor network without learning corpus or word vectors for the field. From the empirical test, we confirm the effectiveness of the presented model as described above. However, there also exist some limits and things to complement. Representatively, the phenomenon that the model performance is especially bad for only some stocks shows the need for further researches. Finally, through the empirical study, we confirmed that the learning method presented in this study can be used for the purpose of matching the new text information semantically with the related stocks.

Named Entity Recognition based on CRF reflecting relative weight (상대적 가중치 자질을 반영한 CRF 기반의 개체명 인식)

  • Jeong, Jin-Wook
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.338-339
    • /
    • 2017
  • 본 논문은 개체명 인식을 위해 CRF 모델을 이용해 분류를 수행했다. 개체명 후보를 개체명으로 식별에서 중의성 문제가 필요하다. 본 논문에서는 이러한 중의성 문제 해결을 위해 학습 셋으로부터 패턴과 형태적 특성을 고려해 개체명 후보를 최대로 선택하고 선택된 개체명 후보의 중의성과 정확도를 높이기 위해 주변의 문맥 자질과 분별 확률 모델인 CRF를 이용해 중의성 문제를 해결한다.

  • PDF

Exploration of the Path Model among Goal Orientation, Self-efficacy, Achievement Need, Entity Theory of Intelligence, Learning Strategy, and Self-handicapping Tendency in Chemistry Education (화학교육의 목표지향성, 자기효능감, 성취욕구, 지능신념, 자기핸디캡경향 및 학습전략 간의 경로모형 탐색)

  • Ko, Young Chun
    • Journal of the Korean Chemical Society
    • /
    • v.57 no.1
    • /
    • pp.147-158
    • /
    • 2013
  • This study is to search an optimal model on causal relationships of the motivations to learn and motivation strategy in chemistry education. The participants in this study are consisted of G and I high schools students (487) in Gwangju. They all answered to the questionnaire. Model I is hypothesized to be path model of the mediation between 'self-efficacy, achievement need, and entity theory of intelligence' and 'learning strategy and self-handicapping tendency of motivation strategy' by goal orientation to explore variables of study effecting the motivation strategy. And Model II is hypothesized path model of the mediation between goal orientation and 'learning strategy and self-handicapping tendency' by 'self-efficacy, achievement need, and entity theory' to explore variables of study effecting the motivation strategy. Based on these models, structural equation modeling techniques are used to evaluate for the path model among goal orientation(learning, performance approach, and performance approach goal orientation), self-efficacy, achievement need, entity theory of intelligence, self-handicapping tendency, and learning strategy in chemistry education. As the results, Model II is considered. Goodness-of-fit indexes of this model related modification models are identified and analyzed in phases. And this model is accomplished by correcting the model the fifth time to enhance goodness-of-fit indexes. In this optimal model II-5 (Fig. 3) on causal relationships of the motivations to learn and learning strategy (p

A Comparative Research on End-to-End Clinical Entity and Relation Extraction using Deep Neural Networks: Pipeline vs. Joint Models (심층 신경망을 활용한 진료 기록 문헌에서의 종단형 개체명 및 관계 추출 비교 연구 - 파이프라인 모델과 결합 모델을 중심으로 -)

  • Sung-Pil Choi
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.1
    • /
    • pp.93-114
    • /
    • 2023
  • Information extraction can facilitate the intensive analysis of documents by providing semantic triples which consist of named entities and their relations recognized in the texts. However, most of the research so far has been carried out separately for named entity recognition and relation extraction as individual studies, and as a result, the effective performance evaluation of the entire information extraction systems was not performed properly. This paper introduces two models of end-to-end information extraction that can extract various entity names in clinical records and their relationships in the form of semantic triples, namely pipeline and joint models and compares their performances in depth. The pipeline model consists of an entity recognition sub-system based on bidirectional GRU-CRFs and a relation extraction module using multiple encoding scheme, whereas the joint model was implemented with a single bidirectional GRU-CRFs equipped with multi-head labeling method. In the experiments using i2b2/VA 2010, the performance of the pipeline model was 5.5% (F-measure) higher. In addition, through a comparative experiment with existing state-of-the-art systems using large-scale neural language models and manually constructed features, the objective performance level of the end-to-end models implemented in this paper could be identified properly.

Change Acceptable In-Depth Searching in LOD Cloud for Efficient Knowledge Expansion (효과적인 지식확장을 위한 LOD 클라우드에서의 변화수용적 심층검색)

  • Kim, Kwangmin;Sohn, Yonglak
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.171-193
    • /
    • 2018
  • LOD(Linked Open Data) cloud is a practical implementation of semantic web. We suggested a new method that provides identity links conveniently in LOD cloud. It also allows changes in LOD to be reflected to searching results without any omissions. LOD provides detail descriptions of entities to public in RDF triple form. RDF triple is composed of subject, predicates, and objects and presents detail description for an entity. Links in LOD cloud, named identity links, are realized by asserting entities of different RDF triples to be identical. Currently, the identity link is provided with creating a link triple explicitly in which associates its subject and object with source and target entities. Link triples are appended to LOD. With identity links, a knowledge achieves from an LOD can be expanded with different knowledge from different LODs. The goal of LOD cloud is providing opportunity of knowledge expansion to users. Appending link triples to LOD, however, has serious difficulties in discovering identity links between entities one by one notwithstanding the enormous scale of LOD. Newly added entities cannot be reflected to searching results until identity links heading for them are serialized and published to LOD cloud. Instead of creating enormous identity links, we propose LOD to prepare its own link policy. The link policy specifies a set of target LODs to link and constraints necessary to discover identity links to entities on target LODs. On searching, it becomes possible to access newly added entities and reflect them to searching results without any omissions by referencing the link policies. Link policy specifies a set of predicate pairs for discovering identity between associated entities in source and target LODs. For the link policy specification, we have suggested a set of vocabularies that conform to RDFS and OWL. Identity between entities is evaluated in accordance with a similarity of the source and the target entities' objects which have been associated with the predicates' pair in the link policy. We implemented a system "Change Acceptable In-Depth Searching System(CAIDS)". With CAIDS, user's searching request starts from depth_0 LOD, i.e. surface searching. Referencing the link policies of LODs, CAIDS proceeds in-depth searching, next LODs of next depths. To supplement identity links derived from the link policies, CAIDS uses explicit link triples as well. Following the identity links, CAIDS's in-depth searching progresses. Content of an entity obtained from depth_0 LOD expands with the contents of entities of other LODs which have been discovered to be identical to depth_0 LOD entity. Expanding content of depth_0 LOD entity without user's cognition of such other LODs is the implementation of knowledge expansion. It is the goal of LOD cloud. The more identity links in LOD cloud, the wider content expansions in LOD cloud. We have suggested a new way to create identity links abundantly and supply them to LOD cloud. Experiments on CAIDS performed against DBpedia LODs of Korea, France, Italy, Spain, and Portugal. They present that CAIDS provides appropriate expansion ratio and inclusion ratio as long as degree of similarity between source and target objects is 0.8 ~ 0.9. Expansion ratio, for each depth, depicts the ratio of the entities discovered at the depth to the entities of depth_0 LOD. For each depth, inclusion ratio illustrates the ratio of the entities discovered only with explicit links to the entities discovered only with link policies. In cases of similarity degrees with under 0.8, expansion becomes excessive and thus contents become distorted. Similarity degree of 0.8 ~ 0.9 provides appropriate amount of RDF triples searched as well. Experiments have evaluated confidence degree of contents which have been expanded in accordance with in-depth searching. Confidence degree of content is directly coupled with identity ratio of an entity, which means the degree of identity to the entity of depth_0 LOD. Identity ratio of an entity is obtained by multiplying source LOD's confidence and source entity's identity ratio. By tracing the identity links in advance, LOD's confidence is evaluated in accordance with the amount of identity links incoming to the entities in the LOD. While evaluating the identity ratio, concept of identity agreement, which means that multiple identity links head to a common entity, has been considered. With the identity agreement concept, experimental results present that identity ratio decreases as depth deepens, but rebounds as the depth deepens more. For each entity, as the number of identity links increases, identity ratio rebounds early and reaches at 1 finally. We found out that more than 8 identity links for each entity would lead users to give their confidence to the contents expanded. Link policy based in-depth searching method, we proposed, is expected to contribute to abundant identity links provisions to LOD cloud.

Implementation and Performance Analysis of SOA Model using Service Platform for .NET Framework (.NET Framework를 서비스 플랫폼으로 사용한 SOA모델 구현 및 성능분석)

  • Lee, Seong-Kyu;Jin, Chan-Uk;Kim, Tai-Suk
    • Journal of the Korea Society for Simulation
    • /
    • v.16 no.4
    • /
    • pp.33-41
    • /
    • 2007
  • Service-Oriented Architecture(SOA) define the interaction method between two computing entities that one entity performs a unit task instead of another entity. This, unit task, is called "Service" and interaction of these services should have independency and loosely coupled task. The effect of SOA's main functions such as loosely coupled task and independent interoperability with influence the possibility of flexible message communication between different way and different users. In this article, we analyzed the performance about system stabilization between general web service and SOA based application that implemented through WCF based messaging framework using .NET Framework and integrated data presentation method. As the result of test, we confirmed that SOA environment using WCF have more advantages.

  • PDF

An Implementation of FRBR Model by Using Topic Maps (Topic Maps를 이용한 MARC데이터의 FRBR모델 구현에 관한 연구)

  • Lee, Hyun-Sil;Han, Sung-Kook
    • Journal of the Korean Society for information Management
    • /
    • v.22 no.3 s.57
    • /
    • pp.289-306
    • /
    • 2005
  • As FRBR defines structural framework based on ER modeling for bibliographic data elements, an effective tool is required to implement FRBR model. In this paper, we present the implementation of FRBR model based on Topic Maps. To show the effectiveness of Topic Maps as the implantation language of FRBR, we implement FRBR model of MyongSungHwangHu by means of Topic Maps. We can ascertain that topic-association of Topic Maps conceptually harmonize with entity-relation of FRBR, which means that Topic Maps is suitable for the implementation of FRBR model.

A Study on the development of Construction Field Management Model based on BIM (BIM기반 건설현장 관리모델 개발에 관한 연구)

  • Jun, Young-Woong;Lee, Myoung-Sik
    • Journal of The Korean Digital Architecture Interior Association
    • /
    • v.9 no.3
    • /
    • pp.69-76
    • /
    • 2009
  • The importance of efficient construction site management has been growing as the amount of construction information increases which is used in the growing construction site. Accordingly in this study, we are trying to find out the application situation and possibility of BIM through theoretical examination and domestic & overseas case study of BIM and we are trying to suggest the way of efficient construction site management formulation through implementation phase-oriented and cooperation entity-oriented analysis in the construction site. We found out that it was possible to minimize time loss and financial loss by visualizing 2D drawings through 3D modeling of target building by applying BIM and that it was possible to improve accuracy of budget planning with quantitative information of 3D model, to plan construction process with more confidence due to accurate architectural information of drawings and quantitative information, and to manage cost and quality through process management based on construction information acquired by BIM including object information by part. It is concluded that we can improve efficiency of construction management between field and each cooperating entity by integrating and linking BIM information through this process.

  • PDF

Conceptual Data Modeling and Information Retrieval System Design (개념적 데이터 모델링과 정보검색 시스템 디자인)

  • Oh Sam-Gyun
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.33 no.4
    • /
    • pp.133-156
    • /
    • 1999
  • The purpose of this paper is to show how conceptual data modeling can enhance current information retrieval (IR) systems. The conceptual database design provides for: 1) data mining capability to discover new knowledge based on the relationships between entities, and 2) integrating current separate databases into one IR system (e.g., integrating ISI Citation, a thesaurus, and bibliographic databases into one retrieval system) . Further, as new user requirements are unfolded, modifications of IR systems based on conceptual data modeling will be much easier to make than they were in the current IR systems because conceptual modeling facilitates flexible modifications. The enhanced Entity-Relationship (ER) model was employed in this study to develop conceptual schemas of IR data.

  • PDF