• Title/Summary/Keyword: Space Sequence

Search Result 960, Processing Time 0.027 seconds

Restoring Omitted Sentence Constituents in Encyclopedia Documents Using Structural SVM (Structural SVM을 이용한 백과사전 문서 내 생략 문장성분 복원)

  • Hwang, Min-Kook;Kim, Youngtae;Ra, Dongyul;Lim, Soojong;Kim, Hyunki
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.131-150
    • /
    • 2015
  • Omission of noun phrases for obligatory cases is a common phenomenon in sentences of Korean and Japanese, which is not observed in English. When an argument of a predicate can be filled with a noun phrase co-referential with the title, the argument is more easily omitted in Encyclopedia texts. The omitted noun phrase is called a zero anaphor or zero pronoun. Encyclopedias like Wikipedia are major source for information extraction by intelligent application systems such as information retrieval and question answering systems. However, omission of noun phrases makes the quality of information extraction poor. This paper deals with the problem of developing a system that can restore omitted noun phrases in encyclopedia documents. The problem that our system deals with is almost similar to zero anaphora resolution which is one of the important problems in natural language processing. A noun phrase existing in the text that can be used for restoration is called an antecedent. An antecedent must be co-referential with the zero anaphor. While the candidates for the antecedent are only noun phrases in the same text in case of zero anaphora resolution, the title is also a candidate in our problem. In our system, the first stage is in charge of detecting the zero anaphor. In the second stage, antecedent search is carried out by considering the candidates. If antecedent search fails, an attempt made, in the third stage, to use the title as the antecedent. The main characteristic of our system is to make use of a structural SVM for finding the antecedent. The noun phrases in the text that appear before the position of zero anaphor comprise the search space. The main technique used in the methods proposed in previous research works is to perform binary classification for all the noun phrases in the search space. The noun phrase classified to be an antecedent with highest confidence is selected as the antecedent. However, we propose in this paper that antecedent search is viewed as the problem of assigning the antecedent indicator labels to a sequence of noun phrases. In other words, sequence labeling is employed in antecedent search in the text. We are the first to suggest this idea. To perform sequence labeling, we suggest to use a structural SVM which receives a sequence of noun phrases as input and returns the sequence of labels as output. An output label takes one of two values: one indicating that the corresponding noun phrase is the antecedent and the other indicating that it is not. The structural SVM we used is based on the modified Pegasos algorithm which exploits a subgradient descent methodology used for optimization problems. To train and test our system we selected a set of Wikipedia texts and constructed the annotated corpus in which gold-standard answers are provided such as zero anaphors and their possible antecedents. Training examples are prepared using the annotated corpus and used to train the SVMs and test the system. For zero anaphor detection, sentences are parsed by a syntactic analyzer and subject or object cases omitted are identified. Thus performance of our system is dependent on that of the syntactic analyzer, which is a limitation of our system. When an antecedent is not found in the text, our system tries to use the title to restore the zero anaphor. This is based on binary classification using the regular SVM. The experiment showed that our system's performance is F1 = 68.58%. This means that state-of-the-art system can be developed with our technique. It is expected that future work that enables the system to utilize semantic information can lead to a significant performance improvement.

Near-IR TRGB Distance to Nearby Dwarf Irregular Galaxy NGC 6822

  • Sohn, Y.J.;Kang, A.;Han, W.;Park, J.H.;Kim, H.I.;Kim, J.W.;Shin, I.G.;Chun, S.H.
    • Journal of Astronomy and Space Sciences
    • /
    • v.25 no.3
    • /
    • pp.249-254
    • /
    • 2008
  • We report the distance modulus of nearby dwarf irregular galaxy NGC 6822 estimated from the so-called Tip of Red-giant Branch (TRGB) method. To detect the apparent magnitudes of the TRGB we use the color-magnitude diagrams (CMDs) and luminosity functions (LFs) in the near-infrared JHK bands. Foreground stars, main-sequence stars, and supergiant stars have been classified on the (g - K, g) plane and removed on the near-infrared CMDs, from which only RGB and AGB stars are remained on the CMDs and LFs. By applying the Savitzky-Golay filter to the obtained LFs and detecting the peak in the second derivative of the observed LFs, we determined the apparent magnitudes of the TRGB. Theoretical absolute magnitudes of the TRGB are estimated from Yonsei-Yale isochrones with the age of 12Gyr and the metallicity range of -2.0 <[Fe/H]< -0.5. The derived values of distance modulus to NGC 6822 are (m - M) = $23.35{\pm}0.26$, $23.20{\pm}0.42$, and $23.27{\pm}0.50$ for J, H, and K bands, respectively. Distance modulus in bolometric magnitude is also derived as (m - M) = $23.41{\pm}0.17$. We compare the derived values of the TRGB distance modulus to NGC 6822 in the near-infrared bands with the previous results in other bands.

SEJONG OPEN CLUSTER SURVEY (SOS). 0. TARGET SELECTION AND DATA ANALYSIS

  • Sung, Hwankyung;Lim, Beomdu;Bessell, Michael S.;Kim, Jinyoung S.;Hur, Hyeonoh;Chun, Moo-Young;Park, Byeong-Gon
    • Journal of The Korean Astronomical Society
    • /
    • v.46 no.3
    • /
    • pp.103-123
    • /
    • 2013
  • Star clusters are superb astrophysical laboratories containing cospatial and coeval samples of stars with similar chemical composition. We initiate the Sejong Open cluster Survey (SOS) - a project dedicated to providing homogeneous photometry of a large number of open clusters in the SAAO Johnson-Cousins' UBV I system. To achieve our main goal, we pay much attention to the observation of standard stars in order to reproduce the SAAO standard system. Many of our targets are relatively small sparse clusters that escaped previous observations. As clusters are considered building blocks of the Galactic disk, their physical properties such as the initial mass function, the pattern of mass segregation, etc. give valuable information on the formation and evolution of the Galactic disk. The spatial distribution of young open clusters will be used to revise the local spiral arm structure of the Galaxy. In addition, the homogeneous data can also be used to test stellar evolutionary theory, especially concerning rare massive stars. In this paper we present the target selection criteria, the observational strategy for accurate photometry, and the adopted calibrations for data analysis such as color-color relations, zero-age main sequence relations, Sp - MV relations, Sp - $T_{eff}$ relations, Sp - color relations, and $T_{eff}$ - BC relations. Finally we provide some data analysis such as the determination of the reddening law, the membership selection criteria, and distance determination.

Radiative transfer analysis for Amon-Ra instrument

  • Seong, Se-Hyun;Ryu, Dong-Ok;Lee, Jae-Min;Hong, Jin-Suk;Kim, Seong-Hui;Yoon, Jee-Yeon;Park, Won-Hyun;Lee, Han-Shin;Park, Jong-Soo;Yu, Ji-Woong;Kim, Sug-Whan
    • Bulletin of the Korean Space Science Society
    • /
    • 2009.10a
    • /
    • pp.28.4-29
    • /
    • 2009
  • The 'Amon-Ra' instrument of the proposed 'EARTHSHINE' satellite is a dual (i.e. imaging and energy) channel instrument for monitoring the total solar irradiance (TSI) and the Earth's irradiance at around the L1 halo orbit. Earlier studies for this instrument include, but not limited to, design and construction of breadboard Amon-Ra imaging channel, stray light suppression and system performance computation using Integrated Ray Tracing (IRT) technique. The Amon-Ra instrument is required to produce 0.3% in uncertainty for both Sunlight and Earthlight measurement. In this study, we report accurate estimation of the output electric signal derived from the orbital variation of radiant exitance from the Sun and the Earth arriving at the aperture and detector plane of the Amon-Ra. For this, orbital irradiance are computed analytically first and then confirmed by simulation using Integrated Ray Tracing (IRT) model. Specially, the results show the arriving power at the bolometer detector surface is $1.24{\mu}W$ for the Sunlight and $1.28{\mu}W$ for the Earthlight, producing the output signal pulses of 34.31 mV and 35.47 mV respectively. These results demonstrate successfully that the arriving radiative power is well within the bolometer detector dynamic range and, therefore, the proposed detector can be used for the in-orbit measurement sequence. We discuss the computational details and implications as well as the simulation results.

  • PDF

A* Algorithm for Optimal Intra-bay Container Pre-marshalling Plan (컨테이너 터미널에서 베이 내 컨테이너의 최적 재정돈을 위한 A* 알고리즘)

  • Ha, Byung-Hyun;Kim, Sang-Su
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.38 no.2
    • /
    • pp.157-172
    • /
    • 2012
  • In most container terminals, containers are piled up and stored in a yard in order to utilize the space efficiently. Hence, it requires unproductive container-handling operations to retrieve a container that is not placed on the top of a container stack. As a result, to streamline container-loading operations by which containers are transferred from a yard to a vessel, it is necessary to pre-marshal (i.e., shuffle in advance) containers in accordance with container-loading plan. We propose $A^*$ algorithm to find the optimal container-relocation sequence for the intra-bay container pre-marshalling problem. To work out the heuristic estimate for the proposed $A^*$ algorithm, we introduce the container rearrangement problem and obtain the lower bound of the length of the optimal relocation sequence. The performance of the algorithm is validated extensively by the numerical experiments on the problem instances that are given in the previous studies and generated randomly with various parameters.

ON H$\grave{a}$JEK-R$\grave{e}$NYI-TYPE INEQUALITY FOR CONDITIONALLY NEGATIVELY ASSOCIATED RANDOM VARIABLES AND ITS APPLICATIONS

  • Seo, Hye-Young;Baek, Jong-Il
    • Journal of applied mathematics & informatics
    • /
    • v.30 no.3_4
    • /
    • pp.623-633
    • /
    • 2012
  • Let {${\Omega}$, $\mathcal{F}$, P} be a probability space and {$X_n|n{\geq}1$} be a sequence of random variables defined on it. A finite sequence of random variables {$X_n|n{\geq}1$} is said to be conditionally negatively associated given $\mathcal{F}$ if for every pair of disjoint subsets A and B of {1, 2, ${\cdots}$, n}, $Cov^{\mathcal{F}}(f_1(X_i,i{\in}A),\;f_2(X_j,j{\in}B)){\leq}0$ a.s. whenever $f_1$ and $f_2$ are coordinatewise nondecreasing functions. We extend the H$\grave{a}$jek-R$\grave{e}$nyi-type inequality from negative association to conditional negative association of random variables. In addition, some corollaries are given.

A Satellite Navigation Signal Scheme Using Zadoff-Chu Sequence for Reducing the Signal Acquisition Space

  • Park, Dae-Soon;Kim, Jeong-Been;Lee, Je-Won;Kim, Kap-Jin;Song, Kiwon;Ahn, Jae Min
    • Journal of Positioning, Navigation, and Timing
    • /
    • v.2 no.1
    • /
    • pp.1-8
    • /
    • 2013
  • A signal system for improving the code acquisition complexity of Global Navigation Satellite System (GNSS) receiver is proposed and the receiving correlator scheme is presented accordingly. The proposed signal system is a hierarchical code type with a duplexing configuration which consists of the Zadoff-Chu (ZC) code having a good auto-correlation characteristic and the Pseudo Random Noise (PRN) code for distinguishing satellites. The receiving correlator has the scheme that consists of the primary correlator for the ZC code and the secondary correlator which uses the PRN code for the primary correlation results. The simulation results of code acquisition using the receiving correlator of the proposed signal system show that the proposed signal scheme improves the complexity of GNSS receiver and has the code acquisition performance comparable to the existing GNSS signal system using Coarse/Acquisition (C/A) code.

Image Encryption using Non-linear FSR and 2D CAT (벼선형 FSR과 2D CAT을 이용한 영상 암호화)

  • Nam, Tae-Hee;Cho, Sung-Jin;Kim, Seok-Tae
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.7C
    • /
    • pp.663-670
    • /
    • 2009
  • In this paper, we propose the image encryption method which gradually uses NFSR(Non-linear Feedback Shift Register) and 20 CAT(Two-Dimensional Cellular Automata Transform). The encryption method is processed in the following order. First, NFSR is used to create a PN(pseudo noise) sequence, which matches the size of the original image. Then, the created sequence goes through a XOR operation with the original image and process the encipherment. Next, the gateway value is set to produce a 20 CAT basis function. The produced basis function is multiplied by encryption image that has been converted to process the 20 CAT encipherment. Lastly, the results of the experiment which are key space analysis, entropy analysis, and sensitivity analysis verify that the proposed method is efficient and very secure.

Prediction of Genes Lacking in an Ammonia Oxidizing Archaeon for Independent Growth (암모니아 산화 고세균의 독립성장에 필요한 결손 유전자 예측)

  • Han, Sang-Soo;Lee, Jin-Young;Rhee, Sung-Keun;Kim, Geun-Joong
    • KSBB Journal
    • /
    • v.26 no.3
    • /
    • pp.237-242
    • /
    • 2011
  • As a number of archaea are ubiquitously found in non-extreme habitats, elucidation of their functional roles becomes currently an emerging issue. However, most of them are unable to grow in pure culture and so it remains to be established. In order to find genes lacking in the genome of an ammonia-oxidizing archaeon (AOA), we here report on the comparative analyses of an AOA genome with those of experimentally or theoretically established minimal genomes for independent growth. We assessed the genes lacking in AOA using logic of clusters of orthologous groups (COG), remote homology, consensus sequence weight matrix, function-based motif or domain, and then further excluded genes encoding hypothetical orarchaea-specific proteins. The results of these combination analyses revealed 19 candidate genes lacking in the genome of an AOA. Thus, our results provide a possibility of inducing independent growth of AOA when supplemented with product (s) of the lacking gene (s), and also give a chance for finding new proteins with novel sequence or structure space even if the predicted lacking-genes will be found using another algorithms or biochemical studies.

STRONG CONVERGENCE OF COMPOSITE ITERATIVE METHODS FOR NONEXPANSIVE MAPPINGS

  • Jung, Jong-Soo
    • Journal of the Korean Mathematical Society
    • /
    • v.46 no.6
    • /
    • pp.1151-1164
    • /
    • 2009
  • Let E be a reflexive Banach space with a weakly sequentially continuous duality mapping, C be a nonempty closed convex subset of E, f : C $\rightarrow$C a contractive mapping (or a weakly contractive mapping), and T : C $\rightarrow$ C a nonexpansive mapping with the fixed point set F(T) ${\neq}{\emptyset}$. Let {$x_n$} be generated by a new composite iterative scheme: $y_n={\lambda}_nf(x_n)+(1-{\lambda}_n)Tx_n$, $x_{n+1}=(1-{\beta}_n)y_n+{\beta}_nTy_n$, ($n{\geq}0$). It is proved that {$x_n$} converges strongly to a point in F(T), which is a solution of certain variational inequality provided the sequence {$\lambda_n$} $\subset$ (0, 1) satisfies $lim_{n{\rightarrow}{\infty}}{\lambda}_n$ = 0 and $\sum_{n=0}^{\infty}{\lambda}_n={\infty}$, {$\beta_n$} $\subset$ [0, a) for some 0 < a < 1 and the sequence {$x_n$} is asymptotically regular.