• Title/Summary/Keyword: Content Generation Model

Search Result 151, Processing Time 0.024 seconds

A Model of Natural Language Information Retrieval Using Main Keywords and Sub-keywords (주 키워드와 부 키워드를 이용한 자연언어 정보 검색 모델)

  • Kang, Hyun-Kyu;Park, Se-Young
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.12
    • /
    • pp.3052-3062
    • /
    • 1997
  • An Information Retrieval (IR) is to retrieve relevant information that satisfies user's information needs. However a major role of IR systems is not just the generation of sets of relevant documents, but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods for the generation of complex construction that are essential for content identification in various automatic text analysis systems. Unfortunately, it is known that methods based on syntactic understanding alone are not sufficiently powerful to Produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level can be defined as nouns and/or compound nouns that possess good document discrimination powers. The sub-keywords used in the second level can be also defined as adjectives, adverbs, and/or verbs that are not main keywords, and function words. An empirical study was conducted from a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected by end users. 850% of the natural language queries contained sub-keywords. The two-level document ranking methods provides significant improvement in retrieval effectiveness over traditional ranking methods.

  • PDF

A Study on the Introduction of Library Services Based on Blockchain (블록체인 기반의 도서관 서비스 도입 및 활용방안에 관한 연구)

  • Ro, Ji-Yoon;Noh, Younghee
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.1
    • /
    • pp.371-401
    • /
    • 2022
  • If the blockchain means storing information in a distributed environment that cannot be forged or altered, it is mentioned that this is similar to what librarians collect, preserve, and share authoritative information. In this way, this study examined blockchain technology as a way to collect and provide reliable information, increase work efficiency inside and outside the library, and strengthen cooperative networks. This study attempted to propose various ways to utilize blockchain technology in book relations based on literature surveys and case studies in other fields. To this end, this study first analyzed the field and cases of blockchain application to confirm the possibility and value of blockchain application in the library field, and proposed 12 ways to utilize it based on this. The utilization model was proposed by dividing it into operation and service sectors. In the operation sector, it is a digital identity-based user record storage and authentication function, transparent management and traceable monitoring function, voting-based personnel and recruitment system, blockchain governance-based network efficiency function, and blockchain-based next-generation device management and information integration function. The service sector includes improved book purchase and sharing efficiency due to simplification of intermediaries, digital content copyright protection and management functions, customized service provision based on customer behavior analysis, blockchain-based online learning platforms, sharing platforms, and P2P-based reliable information sharing platforms.

A Case Study of Middle School Students' Abductive Inference during a Geological Field Excursion (야외 지질 학습에서 나타난 중학생들의 귀추적 추론 사례 연구)

  • Maeng, Seung-Ho;Park, Myeong-Sook;Lee, Jeong-A;Kim, Chan-Jong
    • Journal of The Korean Association For Science Education
    • /
    • v.27 no.9
    • /
    • pp.818-831
    • /
    • 2007
  • Recognizing the importance of abductive inquiry in Earth science, some theoretical approaches that deploy abduction have been researched. And, it is necessary that the abductive inquiry in a geological field excursion as a vivid locale of Earth science inquiry should be researched. We developed a geological field trip based on the abductive learning model, and investigated students' abductive inference, thinking strategies used in those inferences, and the impact of a teacher's pedagogical intervention on students' abductive inference. Results showed that students, during the field excursion, could accomplish abductive inference about rock identification, process of different rock generation, joints generation in metamorpa?ic rocks, and terrains at the field trip area. They also used various thinking strategies in finding appropriate rules to construe the facts observed at outcrops. This means that it is significant for the enhancement of abductive reasoning skills that students experience such inquiries as scientists do. In addition, a teacher's pedagogical interventions didn't ensure the content of students' inference while they helped students perform abductive reasoning and guided their use of specific thinking strategies. Students had found reasoning rules to explain the 01: served facts from their wrong prior knowledge. Therefore, during a geological field excursion, teachers need to provide students with proper background knowledge and information in order that students can reason rues for persuasive abductive inference, and construe the geological features of the field trip area by the establishment of appropriate hypotheses.

Performance Analysis of Shell Coal Gasification Combined Cycle systems (Shell 석탄가스화 복합발전 시스템의 성능해석 연구)

  • Kim, Jong-Jin;Park, Moung-Ho;Song, Kyu-So;Cho, Sang-Ki;Seo, Seok-Bin;Kim, Chong-Young
    • Journal of Energy Engineering
    • /
    • v.6 no.1
    • /
    • pp.104-113
    • /
    • 1997
  • This study aims to develop an analysis model using a commercial process simulator-ASPEN PLUS for an IGCC (Integrated Gasification Combined Cycle) system consisting a dry coal feeding, oxygen-blown entrained gasification process by Shell, a low temperature gas clean up process, a General Electric MS7001FA gas turbine, a three pressure, natural recirculation heat recovery steam generator, a regenerative, condensing steam turbine and a cryogenic air separation unit. The comparison between those results of this study and reference one done by other engineer at design conditions shows consistency which means the soundness of this model. The greater moisture contents in Illinois#6 coal causes decreasing gasifier temperature and the greater ash and sulfur content hurt system efficiency due to increased heat loss. As the results of sensitivity analysis using developed model for the parameters of gasifier operating pressure, steam/coal ratio and oxygen/coal ratio, the gasifier temperature increases while combustible gases (CO+H2) decreases throughout the pressure going up. In the steam/coal ratio analysis, when the feeding steam increases the maximum combustible gas generation point moves to lower oxygen/coal ratio feeding condition. Finally, for the oxygen/coal ratio analysis, it shows oxygen/coal ratio 0.77 as a optimum operating condition at steam/coal feeding ratio 0.2.

  • PDF

A CF-based Health Functional Recommender System using Extended User Similarity Measure (확장된 사용자 유사도를 이용한 CF-기반 건강기능식품 추천 시스템)

  • Sein Hong;Euiju Jeong;Jaekyeong Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.1-17
    • /
    • 2023
  • With the recent rapid development of ICT(Information and Communication Technology) and the popularization of digital devices, the size of the online market continues to grow. As a result, we live in a flood of information. Thus, customers are facing information overload problems that require a lot of time and money to select products. Therefore, a personalized recommender system has become an essential methodology to address such issues. Collaborative Filtering(CF) is the most widely used recommender system. Traditional recommender systems mainly utilize quantitative data such as rating values, resulting in poor recommendation accuracy. Quantitative data cannot fully reflect the user's preference. To solve such a problem, studies that reflect qualitative data, such as review contents, are being actively conducted these days. To quantify user review contents, text mining was used in this study. The general CF consists of the following three steps: user-item matrix generation, Top-N neighborhood group search, and Top-K recommendation list generation. In this study, we propose a recommendation algorithm that applies an extended similarity measure, which utilize quantified review contents in addition to user rating values. After calculating review similarity by applying TF-IDF, Word2Vec, and Doc2Vec techniques to review content, extended similarity is created by combining user rating similarity and quantified review contents. To verify this, we used user ratings and review data from the e-commerce site Amazon's "Health and Personal Care". The proposed recommendation model using extended similarity measure showed superior performance to the traditional recommendation model using only user rating value-based similarity measure. In addition, among the various text mining techniques, the similarity obtained using the TF-IDF technique showed the best performance when used in the neighbor group search and recommendation list generation step.

Generation of Ionospheric Delay in Time Comparison for a Specific GEO Satellite by Using Bernese Software

  • Jeong, Kwang Seob;Lee, Young Kyu;Yang, Sung Hoon;Hwang, Sang-wook;Kim, Sanhae;Song, Kyu-Ha;Lee, Wonjin;Ko, Jae Heon
    • Journal of Positioning, Navigation, and Timing
    • /
    • v.6 no.3
    • /
    • pp.125-133
    • /
    • 2017
  • Time comparison is necessary for the verification and synchronization of the clock. Two-way satellite time and frequency (TWSTFT) is a method for time comparison over long distances. This method includes errors such as atmospheric effects, satellite motion, and environmental conditions. Ionospheric delay is one of the significant time comparison error in case of the carrier-phase TWSTFT (TWCP). Global Ionosphere Map (GIM) from Center for Orbit Determination in Europe (CODE) is used to compare with Bernese. Thin shell model of the ionosphere is used for the calculation of the Ionosphere Pierce Point (IPP) between stations and a GEO satellite. Korea Research Institute of Standards and Science (KRISS) and Koganei (KGNI) stations are used, and the analysis is conducted at 29 January 2017. Vertical Total Electron Content (VTEC) which is generated by Bernese at the latitude and longitude of the receiver by processing a Receiver Independent Exchange (RINEX) observation file that is generated from the receiver has demonstrated adequacy by showing similar variation trends with the CODE GIM. Bernese also has showed the capability to produce high resolution IONosphere map EXchange (IONEX) data compared to the CODE GIM. At each station IPP, VTEC difference in two stations showed absolute maximum 3.3 and 2.3 Total Electron Content Unit (TECU) in Bernese and GIM, respectively. The ionospheric delay of the TWCP has showed maximum 5.69 and 2.54 ps from Bernese and CODE GIM, respectively. Bernese could correct up to 6.29 ps in ionospheric delay rather than using CODE GIM. The peak-to-peak value of the ionospheric delay for TWCP in Bernese is about 10 ps, and this has to be eliminated to get high precision TWCP results. The $10^{-16}$ level uncertainty of atomic clock corresponds to 10 ps for 1 day averaging time, so time synchronization performance needs less than 10 ps. Current time synchronization of a satellite and ground station is about 2 ns level, but the smaller required performance, like less than 1 ns, the better. In this perspective, since the ionospheric delay could exceed over 100 ps in a long baseline different from this short baseline case, the elimination of the ionospheric delay is thought to be important for more high precision time synchronization of a satellite and ground station. This paper showed detailed method how to eliminate ionospheric delay for TWCP, and a specific case is applied by using this technique. Anyone could apply this method to establish high precision TWCP capability, and it is possible to use other software such as GIPSYOASIS and GPSTk. This TWCP could be applied in the high precision atomic clocks and used in the ground stations of the future domestic satellite navigation system.

Analysis of the Time-dependent Relation between TV Ratings and the Content of Microblogs (TV 시청률과 마이크로블로그 내용어와의 시간대별 관계 분석)

  • Choeh, Joon Yeon;Baek, Haedeuk;Choi, Jinho
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.163-176
    • /
    • 2014
  • Social media is becoming the platform for users to communicate their activities, status, emotions, and experiences to other people. In recent years, microblogs, such as Twitter, have gained in popularity because of its ease of use, speed, and reach. Compared to a conventional web blog, a microblog lowers users' efforts and investment for content generation by recommending shorter posts. There has been a lot research into capturing the social phenomena and analyzing the chatter of microblogs. However, measuring television ratings has been given little attention so far. Currently, the most common method to measure TV ratings uses an electronic metering device installed in a small number of sampled households. Microblogs allow users to post short messages, share daily updates, and conveniently keep in touch. In a similar way, microblog users are interacting with each other while watching television or movies, or visiting a new place. In order to measure TV ratings, some features are significant during certain hours of the day, or days of the week, whereas these same features are meaningless during other time periods. Thus, the importance of features can change during the day, and a model capturing the time sensitive relevance is required to estimate TV ratings. Therefore, modeling time-related characteristics of features should be a key when measuring the TV ratings through microblogs. We show that capturing time-dependency of features in measuring TV ratings is vitally necessary for improving their accuracy. To explore the relationship between the content of microblogs and TV ratings, we collected Twitter data using the Get Search component of the Twitter REST API from January 2013 to October 2013. There are about 300 thousand posts in our data set for the experiment. After excluding data such as adverting or promoted tweets, we selected 149 thousand tweets for analysis. The number of tweets reaches its maximum level on the broadcasting day and increases rapidly around the broadcasting time. This result is stems from the characteristics of the public channel, which broadcasts the program at the predetermined time. From our analysis, we find that count-based features such as the number of tweets or retweets have a low correlation with TV ratings. This result implies that a simple tweet rate does not reflect the satisfaction or response to the TV programs. Content-based features extracted from the content of tweets have a relatively high correlation with TV ratings. Further, some emoticons or newly coined words that are not tagged in the morpheme extraction process have a strong relationship with TV ratings. We find that there is a time-dependency in the correlation of features between the before and after broadcasting time. Since the TV program is broadcast at the predetermined time regularly, users post tweets expressing their expectation for the program or disappointment over not being able to watch the program. The highly correlated features before the broadcast are different from the features after broadcasting. This result explains that the relevance of words with TV programs can change according to the time of the tweets. Among the 336 words that fulfill the minimum requirements for candidate features, 145 words have the highest correlation before the broadcasting time, whereas 68 words reach the highest correlation after broadcasting. Interestingly, some words that express the impossibility of watching the program show a high relevance, despite containing a negative meaning. Understanding the time-dependency of features can be helpful in improving the accuracy of TV ratings measurement. This research contributes a basis to estimate the response to or satisfaction with the broadcasted programs using the time dependency of words in Twitter chatter. More research is needed to refine the methodology for predicting or measuring TV ratings.

Development a Measurement Scale for Analysis on Factors Influencing College Choice of College Freshman's (전문대학 신입생들의 대학선택 영향 요인 분석을 위한 측정도구 개발)

  • Kim, Myung-Eun;Jang, Won-Seok
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.7
    • /
    • pp.50-62
    • /
    • 2016
  • The aim of this study was to develop scale of test taker to affect on process of selection of college. The study procedures were performed as follow : establish of study model, item generation, verification of content validity, 1st and 2nd pilot test and verification of construct validity and reliability. In order to verification of validity and reliability, 230 freshman were surveyed and the data were analyzed by mean, standard deviation, skewness. kurtosis, Pearson correlation, Verimax factor analysis and Cronbach's ${\alpha}$. As result of factor analysis, the 15 factors(61 items) that explain 68.37% of the total variance were extracted and each factors were classified by attribution as follow : Information collection-mass media(4items), college activity (3items), support and environment(4items), education(5items), Influential person on decision making(3items), education quality(4items), education service(5items), reputation(3items), accessibility(2items), personal circumstances(2items), college's mass media(5items), promotion (4items), on-line(4items), person's PR (3items), College image(10items). Cronbach's ${\alpha}$ of total items was 0.916 and Cronbach's ${\alpha}$ of each factors were showed range between 0.694~0.878. Measurement scale of this study may be utilized to collect basic data be required to establish policy strategy of local college be faced with difficulty of admission recruitment.

Validity and Reliability of Professional Identity of Dental Hygienists (치과위생사 전문직 정체성 측정도구의 타당도와 신뢰도)

  • Kim, Hyeong-Mi;Cho, Young-Sik
    • Journal of dental hygiene science
    • /
    • v.17 no.6
    • /
    • pp.533-542
    • /
    • 2017
  • This study aimed to develop a measure of professional identity for Korean dental hygienists and to analyze the factors and characteristics of professional identity of these dental hygienists. In this study, 890 dental hygienists completed a self-administered survey. The final analysis was conducted with a total of 880 responses, excluding 10 questionnaires with unreliable responses and partial responses. A systematic literature review was conducted to establish the definition of professional identity, while content validity, exploratory factor, confirmatory factor, and reliability analysis were conducted to establish the constructive factors. Descriptive statistics, independent sample t-test, and one-way ANOVA were used to identify the level of each factor. Authors conceptualized the professional identity for Korean dental hygienists and item generation, item reduction, and questionnaire formatting. The developed measure of professional identity for dental hygienists consisted of five factors, with 15 items. "Sense of calling," "need for academic capacity building," "performance of delegated authority," "compliance of code of ethics," and "usage of professional organizations" were the constituent factors. Based on these factors, the level of professional identity was the highest in the "performance of delegated authority" and lowest in the "usage of professional organizations." These finding indicated that dental hygienists with higher education levels have higher professional identity levels. The dental hygienists working at the general/university hospital had the highest level of professional identity. Thus, further research is needed to establish a model that consists of factors affecting and influencing the professional identity of dental hygienists.

A Numerical Study for Effective Operation of MSW Incinerator for Waste of High Heating Value by the Addition of Moisture Air (함습공기를 이용한 고발열량 도시폐기물 소각로의 효율적 운전을 위한 수치 해석적 연구)

  • Shin, Mi-Soo;Shin, Na-Ra;Jang, Dong-Soon
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.35 no.2
    • /
    • pp.115-123
    • /
    • 2013
  • Stoker type incinerator is one of the most popular one used as municipal solid waste (MSW) incineration because, in general, it is quite suitable for large capacity and need no preprocessing facility. Nowadays, however, since the combustible portion of incoming MSW increases together with the decrease of the moisture content due to prohibition of directly burying food waste in landfill, the heating value of waste is remarkably increasing in comparison with the early stage of incinerator installation. Consequently, the increased heating value in incinerator operation causes a number of serious problems such as reduction of waste amount to be burned due to the boiler heat capacity together with the significant NO generation in high temperature environment. Therefore, in this study, a series of numerical simulation have been made as parameters of waste amount and the fraction of moisture in air stream in order to investigate optimal operating condition for the resolution of the problems associated with the high heating value of waste mentioned above. In specific, a detailed turbulent reaction flow field calculation with NO model was made for the full scale incinerator of D city. To this end, the injection method of moisturized air as oxidizer was intensively reviewed by the addition of moisture water amount from 10% and 20%. The calculation result, in general, showed that the reduction of maximum flame temperature appears consistently due to the combined effects of the increased specific heat of combustion air and vaporization heat by the addition of water moisture. As a consequence, the generation of NOx concentration was substantially reduced. Further, for the case of 20% moisture amount stream, the afterburner region is quite appropriate in temperature range for the operation of SNCR. This suggests the SNCR facility can be considered for reoperation. which is not in service at all due to the increased heating value of MSW.