Search | Korea Science

Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems (질의응답 시스템에서 처음 보는 단어의 역문헌빈도 기반 단어 임베딩 기법)

Lee, Wooin;Song, Gwangho;Shim, Kyuseok
- Journal of KIISE
- /
- v.43 no.8
- /
- pp.902-909
- /
- 2016
Question answering system (QA system) is a system that finds an actual answer to the question posed by a user, whereas a typical search engine would only find the links to the relevant documents. Recent works related to the open domain QA systems are receiving much attention in the fields of natural language processing, artificial intelligence, and data mining. However, the prior works on QA systems simply replace all words that are not in the training data with a single token, even though such unseen words are likely to play crucial roles in differentiating the candidate answers from the actual answers. In this paper, we propose a method to compute vectors of such unseen words by taking into account the context in which the words have occurred. Next, we also propose a model which utilizes inverse document frequencies (IDF) to efficiently process unseen words by expanding the system's vocabulary. Finally, we validate that the proposed method and model improve the performance of a QA system through experiments.
https://doi.org/10.5626/JOK.2016.43.8.902 인용 KSCI

Analysis of Science Items of the Japanese National Center Test for University Admissions (일본 대학입시센터시험 이과 문항 분석)

Kim, Hyun-Kyung;Kim, Dong-Young;Choi, Hyuk-Joon;Ku, Ja-Ok;Dong, Hyo-Kwan;Shin, Il-Yong;Lee, Yang-Rak
- Journal of The Korean Association For Science Education
- /
- v.30 no.4
- /
- pp.452-471
- /
- 2010
As the Korean College scholastic Ability Test (CSAT) has been implemented for 17 years since 1994, it is becoming more and more difficult to make new items that haven't been previously used to measure students' thinking ability. Therefore, it is necessary to keep conducting research on making new test items that can measure students' scholastic ability reliably. For this reason, multiple choice items on the Japanese university entrance exam, which is a Japanese National Center Test for University Admissions (NCTUA) equivalent of CSAT, were analyzed in order to draw implications for CSAT item development. In this study, we analyzed the Japanese NCTUA administered in January 2009 to investigate the structure of its science test. We also analyzed the NCTUA items by the domains of contents and behaviors, and tried to predict item difficulty from the perspective of Korean applicants. Major findings are as follows: Most NCTUA items measure understanding knowledge or low level thinking ability. Also the alloted time for each item is longer than CSAT. The number of test items, and the number of choice and alloted points for each item are diverse, unlike CSAT. The number of items using real-life materials are much more, but the items are not rigorous in sentence expression compared to CSAT. And the difference of difficulty level among science tests were larger with reference to CSAT. Also science score is required for most applicants regardless whether they are taking liberal arts or going onto the science track.
https://doi.org/10.14697/jkase.2010.30.4.452 인용 PDF KSCI

Search Result 2, Processing Time 0.016 seconds

Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems (질의응답 시스템에서 처음 보는 단어의 역문헌빈도 기반 단어 임베딩 기법)

Analysis of Science Items of the Japanese National Center Test for University Admissions (일본 대학입시센터시험 이과 문항 분석)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)