Search | Korea Science

Implementation of Information Retrieval System by Table-parsing (Table parsing을 이용한 정보검색시스템의 효율향상)

김영순;권혁철
- Proceedings of the Korea Multimedia Society Conference
- /
- 2001.11a
- /
- pp.413-416
- /
- 2001
인터넷 문서에서 구조정보의 대표적인 예라 할 수 있는 표(table)는 의미있는 정보를 가지고 있는 경우가 많다. 하지만 인터넷상의 표는 여러 가지 형태이며, 이것에 맞게 표를 효과적으로 parsing하는 방법이 필요하다. 이렇게 parsing한 표의 정보를 이용하여, 인터넷 문서, 특히 전자상거래 문서에 있는 표를 표준화한 틀에 따라 개념화하여, 의미있는 정보를 추출해 낼 수 있다.
PDF

How Do Elementary School Students Understand Tables? : From Functional Thinking Perspective (초등학생들은 표를 어떻게 이해할까? : 함수적 사고의 관점에서)

Kim, JeongWon
- Education of Primary School Mathematics
- /
- v.20 no.1
- /
- pp.53-68
- /
- 2017
Although the table, as one of the representations for helping mathematics understanding, steadily has been shown in the mathematics textbooks, there have been little studies that focus on the table and analyze how the table may be used in understanding students' functional thinking. This study investigated the elementary school 5th graders' abilities to design function tables. The results showed that about 75% of the students were able to create tables for themselves, which shaped horizontal and included information only from the problem contexts. And the students had more difficulties in solving geometric growing pattern problems than story problems. Building on these results, this paper is expected to provide implications of instructional directions of how to use the table as 'function table'.
https://doi.org/10.7468/jksmec.2017.20.1.53 인용 PDF KSCI

Redesigning Retention Schedules for Accurate Documentation of Government Activities (공공업무의 체계적 기록화를 위한 보유일정표 설계 방안)

Seol, Moon-Won
- Journal of the Korean Society for Library and Information Science
- /
- v.40 no.4
- /
- pp.199-219
- /
- 2006
Retention schedule is the most essential tool for accurate documentation of government activities. The purpose of this study is to provide a guideline for redesigning retention schedules to support systematic documentation of government activities, that can replace the existing retention schedule ('bunryukijunpyo'). This present paper begins with articulating the role of retention schedules in life cycle management of records based on ISO 15489. And it compares and analyses 'disposal authorities' of Australia, 'records schedules' of United States. and the existing records retention schedules of Korea, in terms of types, structure and components of retention schedules. Based on these analyses, it suggests directives to redesign the retention schedules at the state level.
https://doi.org/10.4275/KSLIS.2006.40.4.199 인용 PDF

Dewey for Windows vs. Electronic Dewey Decimal Classification (전자 듀이십진분류표의 비교 연구)

정연경
- Proceedings of the Korean Society for Information Management Conference
- /
- 1997.08a
- /
- pp.91-94
- /
- 1997
Dewey for Windows는 1997년 여론에 나온 듀이십진분류표 제 21판의 전자본이다. 1994년에 Electronic Dewey Decimal Classification이 최초의 전자 분류표로 등장한 후, 보다 나아진 이용자 인터훼이스와 다양한 접근방법을 사용할 수 있는 전자 듀이십진분류표로 개발되었다. 본고에서는 새로 나온 전자분류표의 기능을 살펴보고 최초의 전자분류표와 비교한 후, 개선점을 제시하였으며 한국의 전자십진분류표 개발을 제안하였다.
PDF

넙치(Paralichthys olivaceus) 표지방류에 적합한 표지표 연구

오택윤;김주일;백철인;손호선;고정락;차병렬
- Proceedings of the Korean Society of Fisheries Technology Conference
- /
- 2002.10a
- /
- pp.279-280
- /
- 2002
우리나라에서는 연안어장의 수산자원을 증강시키기 위해 매년 인공종묘 넙치 치어방류를 실시하지만 이에 대한 효과조사가 미미한 것을 해결하기 위하여 넙치치어에 적합한 표지표를 찾고자 본 연구에서는 여러종류 표지표를 사용하여 시험어에게 표지표를 표지하여 표지어의 생존을, 표지표 부착을 그리고 표지어의 성장등을 검토하여, 재포시 어업인의 눈에 쉽게 표지어가 발견되고, 표지어의 생존율에 영향을 미치지 않고, 높은 표지표 부착율을 유지하면서 성장에도 영향을 미치지 않은 표지표를 찾고자 한다. (중략)
PDF

TabQA : Question Answering Model for Table Data (TabQA : 표 양식의 데이터에 대한 질의응답 모델)

Park, Soyoon;Lim, Seungyoung;Kim, Myungji;Lee, Jooyoul
- Annual Conference on Human and Language Technology
- /
- 2018.10a
- /
- pp.263-269
- /
- 2018
본 논문에서는 실생활에서 쓰이는 다양한 구조를 갖는 문서에 대해서도 자연어 질의응답이 가능한 모델을 만들고자, 그 첫걸음으로 표에 대해 자연어 질의응답이 가능한 End-to-End 인공신경망 모델 TabQA를 제안한다. TabQA는 기존 연구들과는 달리 표의 형식에 구애받지 않고 여러 가지 형태의 표를 처리할 수 있으며, 다양한 정보의 인코딩으로 풍부해진 셀의 feature를 통해, 표의 row와 column 객체를 직관적이고도 효과적으로 추상화한다. 우리는 본 연구의 결과를 검증하기 위해 다채로운 어휘를 가지는 표 데이터에 대한 질의응답 쌍을 자체적으로 생성하였으며, 이에 대해 단일 모델 EM 스코어 96.0%에 이르는 결과를 얻었다. 이로써 우리는 추후 더 넓은 범위의 양식이 있는 데이터에 대해서도 자연어로 질의응답 할 수 있는 가능성을 확인하였다.
PDF

Extracting Web-Table Information Using Decision Tree and Rule Based Approach (기계학습과 규칙 기반 접근 방법을 결합한 의미 있는 표 구분과 헤드 영역 추출)

Jung, Sung-Won;Park, Dae-Won;Kwon, Hyuk-Chul
- Annual Conference on Human and Language Technology
- /
- 2004.10d
- /
- pp.5-11
- /
- 2004
일반적으로 HTML문서는 크게 내용과 구조로 이루어져 있다. HTML은 일반 문서와 달리 태그라는 것으로 문서에 추가 정보를 주며, 문서의 내용을 더욱 명확하게 한다. 따라서 태그를 이용하면 일반 문서보다 정보를 쉽게 구별하고 추출할 수 있다. 이러한 여러 가지 태그들 중에서 본 연구는 표를 중점적으로 연구한다. 표는 행과 열을 이용하여 어떤 사실을 조직하여 전달하는 것으로, 다른 구조적 특성들 보다 정보를 조직하는데 매우 유용하며, 글로 기술할 많은 분량을 간단히 줄이는 역할을 한다. 이와 같은 표의 특성에 주목하여 표에서 정보를 추출하는 분야를 기존 연구자들은 Web Table Mining 명명하였다. 본 연구는 기존 연구자들이 간과한 표의 구조적인 특성을 이용하여 전체 인터넷 문서에 적용할 수 있는 방법과 함께, 표에서 의미 있는 정보 추출을 위한 단계적인 모형을 제시한다.
PDF

Extraction of Meaningful Tables from The Web Documents (웹 문서 중 의미 있는 표의 추출)

Jung, Sung-Won;Lee, Won-Hee;Kim, Young-Gi;Kwon, Hyuck-Chul
- Annual Conference on Human and Language Technology
- /
- 2002.10e
- /
- pp.332-339
- /
- 2002
현재까지 정보 검색 시스템은 색인어 위주로써 문서의 구조적 정보를 고려하지 알았다. 글자의 크기나 글자체, 들여 쓰기, 표 등은 저자의 의도를 구체화하며, 문서를 명확하게 하는 주요한 수단이다. 이 연구에서는 특히 표에 주목한다. 표는 많은 문서에 일반적으로 쓰이며, 글을 명확하게 해 준다. 일반 문서에 비해서 웹 문서는 태그를 이용하여 정보를 추가할 수 있어 표를 쉽게 구분할 수 있다. 하지만, 웹 상의 표는 지식을 구조화하는 근본적인 목적이외에, 단순히 화면을 정렬하려고 하는 목적으로도 많이 쓰인다. 이 연구에서는 정보 검색시스템에 표 정보를 사용하기 위한 전처리 단계로 의미 있는 표를 추출하는 방법을 제시하며, 이를 위하여 결정 트리를 사용한다.
PDF

Table Question Answering based on Pre-trained Language Model using TAPAS (TAPAS를 이용한 사전학습 언어 모델 기반의 표 질의응답)

Cho, Sanghyun;Kim, Minho;Kwon, Hyuk-Chul
- Annual Conference on Human and Language Technology
- /
- 2020.10a
- /
- pp.87-90
- /
- 2020
표 질의응답은 반-정형화된 표 데이터에서 질문에 대한 답을 찾는 문제이다. 본 연구에서는 한국어 표 질의응답을 위한 표 데이터에 적합한 TAPAS를 이용한 언어모델 사전학습 방법과 표에서 정답이 있는 셀을 예측하고 선택된 셀에서 정확한 정답의 경계를 예측하기 위한 표 질의응답 모형을 제안한다. 표 사전학습을 위해서 약 10만 개의 표 데이터를 활용했으며, 텍스트 데이터에 사전학습된 BERT 모델을 이용하여 TAPAS를 사전학습한 모델이 가장 좋은 성능을 보였다. 기계독해 모델을 적용했을 때 EM 46.8%, F1 63.8%로 텍스트 텍스트에 사전학습된 모델로 파인튜닝한 것과 비교하여 EM 6.7%, F1 12.9% 향상된 것을 보였다. 표 질의응답 모델의 경우 TAPAS를 통해 생성된 임베딩을 이용하여 행과 열의 임베딩을 추출하고 TAPAS 임베딩, 행과 열의 임베딩을 결합하여 기계독해 모델을 적용했을 때 EM 63.6%, F1 76.0%의 성능을 보였다.
PDF

Test Dataset for validating the meaning of Table Machine Reading Language Model (표 기계독해 언어 모형의 의미 검증을 위한 테스트 데이터셋)

YU, Jae-Min;Cho, Sanghyun;Kwon, Hyuk-Chul
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.10a
- /
- pp.164-167
- /
- 2022
In table Machine comprehension, the knowledge required for language models or the structural form of tables changes depending on the domain, showing a greater performance degradation compared to text data. In this paper, we propose a pre-learning data construction method and an adversarial learning method through meaningful tabular data selection for constructing a pre-learning table language model robust to these domain changes in table machine reading. In order to detect tabular data sed for decoration of web documents without structural information from the extracted table data, a rule through heuristic was defined to identify head data and select table data was applied. An adversarial learning method between tabular data and infobax data with knowledge information about entities was applied. When the data was refined compared to when it was trained with the existing unrefined data, F1 3.45 and EM 4.14 increased in the KorQuAD table data, and F1 19.38, EM 4.22 compared to when the data was not refined in the Spec table QA data showed increased performance.
PDF

Search Result 8,025, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)