• Title/Summary/Keyword: Language modeling

Dependency Structure Applied to Language Modeling for Information Retrieval

  • Lee, Chang-Ki;Lee, Gary Geun-Bae;Jang, Myung-Gil
    • ETRI Journal / v.28 no.3 / pp.337-346 / 2006
  • In this paper, we propose a new language model, the dependency structure language model, for information retrieval to compensate for the weaknesses of unigram and bigram language models. The dependency structure language model is based on the first-order dependency model and the dependency parse tree generated by a linguistic parser, so it captures long-distance dependencies naturally. We carried out extensive experiments to verify the proposed model. The dependency structure model performs better than recently proposed language models and the Okapi BM25 method, and the dependency structure is more effective than unigrams and bigrams in language modeling for information retrieval.

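
For orientation, the unigram baseline that the abstract above compares against can be sketched as a query-likelihood scorer. This is a minimal sketch, not the paper's dependency structure model: the Jelinek-Mercer smoothing, the weight `lam`, the function name, and the toy documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def unigram_query_log_likelihood(query, doc, collection, lam=0.5):
    """Return log P(query | doc) under a smoothed unigram language model.

    Assumption: Jelinek-Mercer smoothing, which interpolates the document
    model with the collection model so that a query term missing from the
    document does not force the score to zero probability.
    """
    doc_counts = Counter(doc)
    coll_counts = Counter(collection)
    doc_len = len(doc)
    coll_len = len(collection)
    score = 0.0
    for term in query:
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_coll = coll_counts[term] / coll_len if coll_len else 0.0
        p = lam * p_doc + (1.0 - lam) * p_coll
        if p == 0.0:
            # The term occurs nowhere in the collection.
            return float("-inf")
        score += math.log(p)
    return score


# Hypothetical toy collection, tokenized by whitespace.
docs = {
    "d1": "the parser builds a dependency tree".split(),
    "d2": "the retrieval model ranks documents".split(),
}
collection = [term for doc in docs.values() for term in doc]
query = "dependency parser".split()

# Rank documents by descending query log-likelihood.
ranked = sorted(
    docs,
    key=lambda name: unigram_query_log_likelihood(query, docs[name], collection),
    reverse=True,
)
print(ranked)
```

Each query term is scored independently here, which is the weakness the abstract points to: a unigram model cannot tell that "dependency" and "parser" are related, whereas a model built on a parse tree can condition one term on another.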

Formal Modeling and Verification of an Enhanced Variant of the IEEE 802.11 CSMA/CA Protocol

  • Hammal, Youcef;Ben-Othman, Jalel;Mokdad, Lynda;Abdelli, Abdelkrim
    • Journal of Communications and Networks / v.16 no.4 / pp.385-396 / 2014
  • In this paper, we present a formal method for modeling and checking an enhanced version of the carrier sense multiple access with collision avoidance protocol of the IEEE 802.11 MAC layer, which has been proposed as the standard protocol for wireless local area networks. We deal mainly with the distributed coordination function (DCF) procedure of this protocol through a sequence of transformation steps. First, we use Unified Modeling Language state machines to thoroughly capture the behavior of wireless stations implementing a DCF, and then translate them into the input language of the UPPAAL model checking tool, which is a network of communicating timed automata. Finally, we use this tool to check some safety and liveness properties, such as deadlock-freedom.

Improving SoC Design Flow with Unified Modeling Language and HDL (UML과 HDL을 이용한 SoC 설계 개선)

  • Kim, Chang-Hoon;Hwang, Sang-Joon;Hong, Seung-Woo;Sung, Man-Young
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference / 2005.07a / pp.135-138 / 2005
  • HDL (Hardware Description Language) is one of the most important modern tools for describing hardware, and it becomes more important as we move to higher levels of abstraction. HDL has been actively used in analog design, MEMS devices [1-2], and process-related fields as well as in digital design. The most important characteristic of HDL is abstraction, the strongest tool for greatly extending a designer's design ability. In this paper, following the Modelling Continuum with its hierarchical structure of abstraction, we apply UML (Unified Modeling Language) to SoC design with HDL. UML gives an easy, visual description of the various levels of abstraction and gives designers flexible modeling capability for SoC design.

RAiSE : A Graphical Process Modeling Language Providing Semantic Richness and Ease of Use (RAiSE :다양한 의미론과 사용의 용이성을 제공하는 그래픽 프로세스 모델링 언어)

  • Lee, Hyung-Won
    • The KIPS Transactions:PartD / v.12D no.7 s.103 / pp.1007-1016 / 2005
  • A key issue in process language design is balancing the need for semantic richness with the need for ease of use. Most process modeling languages fail to satisfy these two conflicting requirements, which has impeded the widespread adoption of process modeling languages in the software industry despite a variety of studies on software process languages. This paper describes RAiSE, a process modeling language that attempts to resolve this problem, and presents the result of applying RAiSE to a well-known benchmark process, the ISPW-6 software process example. RAiSE provides rigorous yet clear semantics by combining essential features of various modeling paradigms and defining them in a well-structured graphical notation. Process models represented in RAiSE are interpreted and enacted by a process engine implemented using CLiPS, a rule-based expert system tool.

Statistical Analysis Between Size and Balance of Text Corpus by Evaluation of the effect of Interview Sentence in Language Modeling (언어모델 인터뷰 영향 평가를 통한 텍스트 균형 및 사이즈간의 통계 분석)

  • Jung Eui-Jung;Lee Youngjik
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.87-90 / 2002
  • This paper statistically analyzes the relationship between the size and balance of a text corpus by evaluating the effect of interview sentences on the language model of a Korean broadcast news transcription system. The ultimate purpose of our system is to recognize not interview speech but the anchor's and reporters' speech in broadcast news shows. However, interview sentences make up approximately 15% of the text corpus gathered for constructing the language model. Interview sentences differ in character from the anchor's and reporters' sentences, and therefore they disturb anchor- and reporter-oriented language modeling. In this paper, we evaluate the effect of interview sentences on the language model and statistically analyze the relationship between the size and balance of the text corpus by repeating the same experimental procedure while varying the size of the corpus.

Building a Korean-English Parallel Corpus by Measuring Sentence Similarities Using Sequential Matching of Language Resources and Topic Modeling (언어 자원과 토픽 모델의 순차 매칭을 이용한 유사 문장 계산 기반의 위키피디아 한국어-영어 병렬 말뭉치 구축)

  • Cheon, JuRyong;Ko, YoungJoong
    • Journal of KIISE / v.42 no.7 / pp.901-909 / 2015
  • In this paper, we propose a method for building a Korean-English parallel corpus from Wikipedia by finding similar sentences based on language resources and topic modeling. We first apply language resources (a Wiki-dictionary, numbers, and the Daum online dictionary) sequentially to match words. We construct the Wiki-dictionary from titles in Wikipedia. To take advantage of Wikipedia, we use the translation probabilities in the Wiki-dictionary for word matching. In addition, we improve the accuracy of the sentence similarity measure by using word distributions based on topic modeling. In the experiment, a previous study showed an F1-score of 48.4% with only language resources based on linear combination, and 51.6% when additionally considering entire word distributions from topic modeling. Our proposed method, with sequential matching and translation probability added to the language resources, achieved 58.3%, which is 9.9 percentage points better than the previous study. When using the proposed sequential matching of language resources together with topic modeling that considers important word distributions, the proposed system achieved 59.1%, which is 7.5 percentage points better than the previous study.

Modeling and Validation of Semantic Constraints for ebXML Business Process Specifications (ebXML 비즈니스 프로세스 명세를 위한 의미 제약의 모델링과 검증)

  • Kim, Jong-Woo;Kim, Hyoung-Do
    • Asia Pacific Journal of Information Systems / v.14 no.1 / pp.79-100 / 2004
  • As a part of the ebXML (Electronic Business using eXtensible Markup Language) framework, BPSS (Business Process Specification Schema) has been provided to support the direct specification of the set of elements required to configure a runtime system in order to execute a set of ebXML business transactions. The BPSS is available in two stand-alone representations, a UML version and an XML version. Due to the limitations of UML notations and XML syntax, however, the current ebXML BPSS specification fails to specify formal semantic constraints completely. In this study, we propose a constraint classification scheme for the BPSS specification and describe how to formally represent those semantic constraints using OCL (Object Constraint Language). As a way to validate a Business Process Specification (BPS) against the formal semantic constraints, we suggest a rule-based approach to representing the formal constraints and demonstrate its detailed mechanism for applying the rule-based constraints to the BPS with a prototype implementation.

The Formalization of Business Process Modeling Language for Business Process Management (비즈니스 프로세스 관리를 위한 BPML의 형식화)

  • Lee, Kang-Bae;Yu, Sung-Yeol
    • Journal of Korean Society of Industrial and Systems Engineering / v.29 no.3 / pp.119-127 / 2006
  • In this paper, we present a systematic approach to translating BPML (Business Process Modeling Language) into the $\pi$-calculus. BPML is an executable business process modeling language, like BPEL4WS (Business Process Execution Language for Web Services). It is difficult to find a systematic approach to formalizing these languages, but formalizing them allows the behavior of processes to be analyzed and compared so that optimal processes can be designed. For this formalization, we analyzed the activity types and contexts of BPML and proposed semantic definitions for each type and context using the $\pi$-calculus. In addition, we have shown the usefulness of our formalization scheme by translating a typical order fulfillment process represented in BPML into the $\pi$-calculus.

Generic ADL Definition based on UML2.0 (UML 2.0 기반의 Generic ADL 정의)

  • Roh Sunghwan;Kim Kyungrae;Jeon Taewoong;Yoon Seokjin
    • Journal of KIISE:Software and Applications / v.33 no.2 / pp.167-185 / 2006
  • Software architecture, which is the high-level model of a software system, should be specified with ADLs (Architecture Description Languages) for clarity and precision. Most ADLs, such as ACME, however, have not come into extensive use in industry, since ADL users must learn a distinct notation specific to architecture. On the other hand, UML is a de facto standard general modeling language for software development. UML provides a consistent notation and various supporting tools throughout the whole software development cycle. UML, being a general modeling language, does not provide all the concepts that are important to architecture description, so it should be extended in order to model architecture precisely. A number of studies on architecture modeling based on UML have been carried out. All of them, however, are based on UML1.x. UML2.0 embraces many more concepts that are important to architecture modeling than UML1.x. In this paper, we defined an architecture modeling language based on UML2.0. We defined Generic ADL by extending UML2.0 and applied the defined Generic ADL to a restaurant reservation system.

An Analysis of the Applications of the Language Models for Information Retrieval (정보검색에서의 언어모델 적용에 관한 분석)

  • Kim Heesop;Jung Youngmi
    • Journal of Korean Library and Information Science Society / v.36 no.2 / pp.49-68 / 2005
  • The purpose of this study is to examine research trends and experimental results on the application of language models to information retrieval. We reviewed previous studies in the following categories: (1) the first generation of language modeling information retrieval (LMIR) experiments, which mainly focused on comparing the retrieval performance of language modeling information retrieval with that of traditional retrieval models, and (2) the second generation of LMIR experiments, which focused on comparing the retrieval performance of expanded language modeling information retrieval with that of the basic language models. Through the analysis of the previous experimental results, we found that (1) language models outperformed the probabilistic model and vector space model approaches, and (2) the expanded language models demonstrated better retrieval performance than the basic language models.