• Title/Summary/Keyword: search result

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some online documents are accompanied by a list of keywords specified by the authors to guide users through the filtering process. A set of keywords is thus often considered a condensed version of the whole document and plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, implementation is the obstacle: manually assigning keywords to all documents is a daunting, even impractical, task because it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to this aim: keyword assignment and keyword extraction. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document. 
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. In the latter approach, on the other hand, the aim is to extract keywords according to their relevance in the text, without a prior vocabulary. Here, automatic keyword generation is treated as a classification task, and keywords are commonly extracted using supervised learning techniques: keyword extraction algorithms classify candidate keywords in a document as positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as its keywords, so keyword extraction is limited to terms that appear in the document and cannot generate implicit keywords that are not included in it. According to Turney's experimental results, about 64% to 90% of author-assigned keywords can be found in the full text of an article. Conversely, 10% to 36% of author-assigned keywords do not appear in the article and therefore cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of author-assigned keywords are not included in the full text. This is why we adopted the keyword assignment approach. In this paper, we propose a new approach to automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, a conventional information retrieval model that represents documents and queries as vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. 
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented using IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The former is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers and has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. In our experiment, the precision of IVSM applied to the Web-based community service and to academic journals was 0.75 and 0.71, respectively. Both systems perform much better than baseline systems that generate keywords based on simple probability. IVSM also shows performance comparable to Extractor, a representative keyword extraction system developed by Turney. As the number of electronic documents increases, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
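
The five steps listed in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how keyword sets are represented, so the sketch assumes each candidate keyword comes with a hypothetical profile of term weights (`keyword_profiles`), tokenizes the target document by whitespace, and uses raw term frequency as the document vector. The names `cosine` and `assign_keywords` are made up for this example.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    len_a = math.sqrt(sum(w * w for w in a.values()))  # vector length of a
    len_b = math.sqrt(sum(w * w for w in b.values()))  # vector length of b
    return dot / (len_a * len_b) if len_a and len_b else 0.0


def assign_keywords(doc_text, keyword_profiles, top_k=2):
    """Rank candidate keywords by similarity to the target document.

    keyword_profiles maps keyword -> {term: weight}; this representation
    is an assumption made for the sketch.
    """
    # Steps (2)-(3): parse the document and build its term-frequency vector.
    tf = Counter(doc_text.lower().split())
    # Step (4): cosine similarity between each keyword profile and the document.
    scored = sorted(
        ((cosine(tf, profile), kw) for kw, profile in keyword_profiles.items()),
        reverse=True,
    )
    # Step (5): keep the highest-scoring keywords with a non-zero score.
    return [kw for score, kw in scored[:top_k] if score > 0]
```

Because the keywords come from the profiles rather than from the document text, a keyword can be returned even when the keyword itself never appears in the document, which is the property the abstract attributes to keyword assignment.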

Impact of Semantic Characteristics on Perceived Helpfulness of Online Reviews (온라인 상품평의 내용적 특성이 소비자의 인지된 유용성에 미치는 영향)

  • Park, Yoon-Joo;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.29-44
    • /
    • 2017
  • In Internet commerce, consumers are heavily influenced by product reviews written by other users who have already purchased the product. However, as product reviews accumulate, it takes a lot of time and effort for consumers to check the massive number of reviews individually. Moreover, carelessly written product reviews actually inconvenience consumers. Thus many online vendors provide mechanisms to identify the reviews that customers perceive as most helpful (Cao et al. 2011; Mudambi and Schuff 2010). For example, some online retailers, such as Amazon.com and TripAdvisor, allow users to rate the helpfulness of each review and use this feedback to rank and re-order the reviews. However, many reviews have little or no feedback, which makes it hard to identify their helpfulness. Feedback also takes time to accumulate, so newly authored reviews do not have enough of it. For example, only 20% of the reviews in the Amazon Review Dataset (Mcauley and Leskovec, 2013) have more than 5 feedback ratings (Yan et al, 2014). The purpose of this study is to analyze the factors affecting the usefulness of online product reviews and to derive a forecasting model that selectively provides product reviews that can be helpful to consumers. To do this, we extracted the various linguistic, psychological, and perceptual elements included in product reviews using text-mining techniques and identified the determinants among these elements that affect the usefulness of product reviews. In particular, considering that the characteristics of product reviews and the determinants of usefulness can differ between apparel products (which are experiential products) and electronic products (which are search goods), the characteristics of the product reviews were compared within each product group and the determinants were established for each. This study used 7,498 apparel product reviews and 106,962 electronic product reviews from Amazon.com. 
In order to understand a review text, we first extract linguistic and psychological characteristics from the review texts, such as word count and the level of emotional tone and analytical thinking embedded in the text, using the widely adopted text analysis software LIWC (Linguistic Inquiry and Word Count). We then explore the descriptive statistics of the review text for each category and statistically compare their differences using a t-test. Lastly, we perform regression analysis using the data mining software RapidMiner to find the determinant factors. Comparing the review characteristics of electronic products and apparel products, we found that reviewers used more words as well as longer sentences when writing reviews for electronic products. As for content characteristics, the electronic product reviews included many analytic words, carried more clout, and related to cognitive processes (CogProc) more than the apparel product reviews, in addition to including many words expressing negative emotions (NegEmo). On the other hand, the apparel product reviews included more personal, authentic, positive emotions (PosEmo) and perceptual processes (Percept) than the electronic product reviews. Next, we analyzed the determinants of the usefulness of the product reviews in the two product groups. In both product groups, reviews perceived as useful had high product ratings from reviewers and contained a larger number of total words, many expressions involving perceptual processes, and fewer negative emotions. In addition, apparel product reviews with a large number of comparative expressions, a low expertise index, and concise content with fewer words in each sentence were perceived to be useful. 
In the case of electronic product reviews, those that were analytical with a high expertise index, along with containing many authentic expressions, cognitive processes, and positive emotions (PosEmo) were perceived to be useful. These findings are expected to help consumers effectively identify useful product reviews in the future.

Comparison of Results According to Reaction Conditions of Thyroglobulin Test (Thyroglobulin 검사의 반응조건에 따른 결과 비교 분석)

  • Joung, Seung-Hee;Lee, Young-Ji;Moon, Hyung-Ho;Yoo, So-yoen;Kim, Nyun-Ok
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.21 no.1
    • /
    • pp.39-43
    • /
    • 2017
  • Purpose Thyroglobulin (Tg) is a biologic marker of differentiated thyroid carcinoma (DTC), produced by normal thyroid tissue or thyroid cancer tissue. Therefore, the Tg value of DTC patients is the most specific indicator for judging whether recurrence has occurred or whether residual thyroid cancer is present. Thyroid cancer is currently the most common cancer in Korea, and 90% of cases are differentiated thyroid cancer. The number of thyroid disease patients for whom this test is requested has also increased, and accurate and prompt results are required. However, the incubation for the Tg test commonly takes about 24 hours in our hospital, which delays result reporting, and we could not satisfy the requirements of clinical departments and patients. To fulfill these requirements, experiments were conducted with shortened incubation times for company B's kit, currently in use, and company C's kit, used in other hospitals. Through these experiments, we aimed to examine the correlation between the original and shortened methods, find the optimum reaction time that satisfies the needs of the departments and patients, and improve competitiveness with the EIA test. Materials and Methods In September 2016, we tested samples from 65 patients with company B's kit and company C's kit, using three incubation methods for each kit. First method: $37^{\circ}C$ shaking 2hr/2hr. Second method: RT shaking 3hr/2hr. Third method: $37^{\circ}C$ shaking 1hr/1hr. Fourth method: RT shaking 3hr, which is the original method of company C's kit. Fifth method: RT shaking 2hr, a shortened incubation time. Sixth method: $37^{\circ}C$ shaking 2hr. We then calculated and compared the correlation coefficients of the methods. 
Results For the shortened methods applied to company B's kit currently in use, compared with company B's original method, the first method ($37^{\circ}C$ shaking 2hr/2hr) gave $R^2=0.5906$ for Tg values below 1.0 ng/mL and $R^2=0.9597$ for values above 1.0 ng/mL. The second method (RT shaking 3hr/2hr) gave $R^2=0.7262$ below 1.0 ng/mL and $R^2=0.9566$ above 1.0 ng/mL. The third method ($37^{\circ}C$ shaking 1hr/1hr) gave $R^2=0.7728$ below 1.0 ng/mL and $R^2=0.8904$ above 1.0 ng/mL. The fourth method, company C's original method (RT shaking 3hr), gave $R^2=0.7542$ below 1.0 ng/mL and $R^2=0.9711$ above 1.0 ng/mL. The fifth method (RT shaking 2hr) gave $R^2=0.5477$ below 1.0 ng/mL and $R^2=0.9231$ above 1.0 ng/mL. The sixth method ($37^{\circ}C$ shaking 2hr) gave $R^2=0.2848$ below 1.0 ng/mL and $R^2=0.9028$ above 1.0 ng/mL. Conclusion With all six methods, samples with values of 1.0 ng/mL or higher showed relatively high correlation, but the correlation was relatively low for values below 1.0 ng/mL. In particular, the $37^{\circ}C$ shaking 2hr method of company C showed sharp fluctuation at low concentrations of 1.0 ng/mL or less. Therefore, we plan to continue testing time, equipment, incubation temperature, and so on for the room temperature shaking 2hr method and the $37^{\circ}C$ shaking 1hr/1hr method of company C, which showed a relatively high correlation. After that, we can search for an appropriate shortened method through additional experiments such as recovery, dilution, and sensitivity tests, and provide more accurate and prompt results to the clinical departments, making the test competitive with the EIA test.
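
The per-range $R^2$ values reported in this abstract can be reproduced in spirit with a short calculation. The sketch below is a hedged illustration only: it assumes paired Tg readings from the original and shortened methods, computes $R^2$ as the squared Pearson correlation, and splits the pairs at the 1.0 ng/mL cutoff using the original-method value (the abstract does not say which reading defines the split). The function names and the data layout are hypothetical.

```python
def r_squared(x, y):
    """Squared Pearson correlation of two equal-length sequences.

    Needs at least two pairs and non-zero variance in both x and y.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy)


def r_squared_by_range(original, shortened, cutoff=1.0):
    """Return (R^2 below cutoff, R^2 at or above cutoff).

    Pairs are assigned to a range by the original-method value,
    which is an assumption of this sketch.
    """
    pairs = list(zip(original, shortened))
    low = [(o, s) for o, s in pairs if o < cutoff]
    high = [(o, s) for o, s in pairs if o >= cutoff]
    return (
        r_squared([o for o, _ in low], [s for _, s in low]),
        r_squared([o for o, _ in high], [s for _, s in high]),
    )
```

Reporting the two ranges separately matters here because a single $R^2$ over all samples would be dominated by the high-concentration values and would hide the weak agreement below 1.0 ng/mL.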

Color Analyses on Digital Photos Using Machine Learning and KSCA - Focusing on Korean Natural Daytime/nighttime Scenery - (머신러닝과 KSCA를 활용한 디지털 사진의 색 분석 -한국 자연 풍경 낮과 밤 사진을 중심으로-)

  • Gwon, Huieun;KOO, Ja Joon
    • Trans-
    • /
    • v.12
    • /
    • pp.51-79
    • /
    • 2022
  • This study investigates methods for deriving colors that can serve as a reference for users, such as designers and content creators, who search web portal sites for online images using specific words for color planning and similar purposes. Two experiments were conducted to accomplish this. Digital scenery photos within the geographic scope of Korea were downloaded from web portal sites and studied to find out what colors were used to describe daytime and nighttime. Machine learning was used to classify colors into daytime and nighttime, and KSCA was used to derive the color frequency of daytime and nighttime photos; the two results were then compared and analyzed. The results of classifying the colors of daytime and nighttime photos using machine learning show that, when classifying colors at 51-100%, the area of daytime colors was approximately 2.45 times greater than that of nighttime colors. The colors of the daytime class were distributed by brightness with white at the center, while those of the nighttime class were distributed with black at the center. There were 647 colors that accounted for over 70% of the daytime class, 252 colors that accounted for over 70% of the nighttime class, and 101 colors in the remaining range (31-69%). The number of colors in the middle area was low, while the other colors were classified relatively clearly into day and night. The resulting color distributions in the daytime and nighttime classes provided the borderline color values of the two classes, which are classified by brightness. In the frequency analysis of digital photos using KSCA, colors around yellow were expressed in generally bright daytime photos, while colors around blue were expressed in dark night photos. For the frequency of daytime photos, colors in the upper 40% had low chroma and were almost achromatic. 
Also, colors that are close to white and black showed the highest frequency, indicating a large difference in brightness. Meanwhile, for colors with frequency from top 5 to 10, yellow green was expressed darkly, and navy blue was expressed brightly, partially composing a complex harmony. When examining the color band, various colors, brightness, and chroma including light blue, achromatic colors, and warm colors were shown, failing to compose a generally harmonious arrangement of colors. For the frequency of nighttime photos, colors in approximately the upper 50% are dark colors with a brightness value of 2 (Munsell signal). In comparison, the brightness of middle frequency (50-80%) is relatively higher (brightness values of 3-4), and the brightness difference of various colors was large in the lower 20%. Colors that are not cool colors could be found intermittently in the lower 8% of frequency. When examining the color band, there was a general harmonious arrangement of colors centered on navy blue. As the results of conducting the experiment using two methods in this study, machine learning could classify colors into two or more classes, and could evaluate how close an image was with certain colors to a certain class. This method cannot be used if an image cannot be classified into a certain class. The result of such color distribution would serve as a reference when determining how close a certain color is to one of the two classes when the color is used as a dominant color in the base or background color of a certain design. Also, when dividing the analyzed images into several classes, even colors that have not been used in the analyzed image can be determined to find out how close they are to a certain class according to the color distribution properties of each class. Nevertheless, the results cannot be used to find out whether a specific color was used in the class and by how much it was used. 
To investigate such an issue, frequency analysis was conducted using KSCA. The color frequency could be measured within the range of images used in the experiment. The resulting values of color distribution and frequency from this study would serve as references for color planning of digital design regarding natural scenery in the geographic scope of Korea. Also, the two experiments are meaningful attempts for searching the methods for deriving colors that can be a useful reference among numerous images for content creator users of the relevant field.

A Descriptive Study of Oral Health Knowledge & Behaviors in Middle School Students (일부지역 중학생의 구강건강 지식 및 행동에 관한 조사연구)

  • Yoo, Jung-Sook;Kim, Jung-Hee;Han, Su-Jin;Sim, Sang-Hyo;Kim, Yoon-Shin
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.9 no.1
    • /
    • pp.85-97
    • /
    • 2008
  • Objectives: This study was designed to understand the oral health knowledge and behavior of middle-school students, to identify learning objectives and educational methods suited to the subjects, and to provide basic data for an effective oral health-care program. Methods: The sample consisted of 139 middle-school students (64 male and 75 female) in OO county, Chungcheongbuk-do. Data were statistically analyzed by frequency analysis and the $\chi^2$-test or Fisher's exact test using SPSS WIN Ver. 12.0. Results: Among the items on oral-health knowledge, awareness of the causes and preventive methods of oral disease was lower than awareness of the symptoms of oral disease. When knowledge and behavior regarding the timing of tooth brushing were compared, both awareness and behavior were at a level of 50% or less. In particular, 46.2% were aware that teeth should be brushed after lunch, but only 33.0% practiced it. The most common tooth brushing frequency was twice a day (47.5%). However, 5.8% responded that they brushed once a day or not at all. Thus, although they were a minority, some students practiced tooth brushing very little. The frequency of snacking was less than twice a day for 68.8% of students. However, 9.8% of students snacked more than five times a day. Within whole-body health, oral health was perceived to be very important by over 50% of students (59.9%). Compared with the level of awareness of the importance of teeth, the rate of dental visits was very low. 
Conclusions: The study results suggest that the school oral-health project needs to be expanded and carried out in middle and high schools as well, and that a specific oral-health promotion program including oral-health education should be developed for this period.

A study on the incremental oral health care of C pediatric clinic using a Dentocult-SM test (C소아치과의원의 개량형 Dentocult-SM검사를 이용한 계속관리에 관한 조사 연구)

  • Woo, Hee-Sun
    • Journal of Korean society of Dental Hygiene
    • /
    • v.8 no.2
    • /
    • pp.39-51
    • /
    • 2008
  • The study was conducted on 100 child patients, selected by random sampling from the patients of a pediatric clinic located in Gyeonggi-do, who received a Dentocult-SM test at their first visit and were managed continuously thereafter. The research period was one year, from June 2007 to May 2008. Using the Dentocult-SM test, we measured the distribution of Streptococcus mutans and analyzed the correlation between the distribution of dental plaque, Streptococcus mutans in saliva, and the state of dental caries in the teeth of the child patients. According to the SM score, we applied incremental oral health care to the child patients in clinical practice and obtained the following results. 1. By age, 3-year-old patients were the most numerous (29), followed by 2-year-old patients (28). 2. By SM score, female patients outnumbered male patients in the negative group (52.0%), male patients outnumbered female patients in the mild group (52.0%), female patients outnumbered male patients in the moderate group (68.2%), and male patients outnumbered female patients in the severe group (57.1%). 3. At the first visit, the SM score showed a statistically significant difference in the dt and dmft indices, and the mean was highest in the severe group. 4. At the second visit, the SM score showed a statistically significant difference in the dt, ft, and dmft indices, and the mean was highest in the severe group. 5. At the third visit, the SM score showed a statistically significant difference in the dt, ft, and dmft indices, and the mean was highest in the severe group. 6. In the comparison of dmft index differences by SM score, incremental oral health care made no statistically significant difference in the negative and mild groups, but made a statistically significant difference in the moderate and severe groups. 7. 
In the comparison of dt index differences by SM score, incremental oral health care made no statistically significant difference in the negative, mild, and moderate groups, but made a statistically significant difference in the severe group. 8. In the comparison of mt index differences by SM score, incremental oral health care made no statistically significant difference in the mild and moderate groups, but made a statistically significant difference in the severe group. 9. In the comparison of ft index differences by SM score, incremental oral health care made no statistically significant difference in the mild group, but made a statistically significant difference in the negative, moderate, and severe groups. 10. In the comparison of the dmft index by age, 4-year-old patients showed the highest values (5.50 at the first visit and 6.08 at the second). At the third visit, 6-year-old patients showed the highest value (7.00). These results show that incremental oral health care based on the SM score improves the results of oral care. Therefore, improving or maintaining the oral health of child patients requires continuing personal oral health management and regular, systematic, prevention-focused management by a specialist.

The Efficacy of Aspirin in Preventing the Recurrence of Colorectal Adenoma: a Renewed Meta-Analysis of Randomized Trials

  • Zhao, Tai-Yun;Tu, Jing;Wang, Yin;Cheng, Da-Wei;Gao, Xian-Kui;Luo, Hao;Yan, Bi-Chun;Xu, Xiao-Li;Zhang, Hong-Ling;Lu, Xing-Jun;Wang, Yao-Jun
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.5
    • /
    • pp.2711-2717
    • /
    • 2016
  • Background: Through a search of available randomized controlled trials, we performed an updated meta-analysis to assess the effect of aspirin in preventing the recurrence of colorectal adenoma. Materials and Methods: The Medline/PubMed, Embase, Cochrane Central Register of Controlled Trials (CENTRAL), and Chinese biomedical literature service system (SinoMed) databases were searched for related randomized controlled trials up to April 2016. Three authors separately evaluated the quality of the studies and extracted data, and we used the STATA software to analyze the data, investigate heterogeneity, and pool the data using the fixed-effects model. Results: Seven papers were included in the updated meta-analysis; among these, two pairs were identified as representing the same study population, differing only in the duration of follow-up. Thus only five papers were included in our meta-analysis, and one Chinese paper was also included in the work. Results were categorized by length of follow-up, population, and dose of oral aspirin. The relative risk (RR) of adenoma in patients taking aspirin vs placebo was 0.73 (95% CI 0.55-0.98, P=0.039) with 1 year of follow-up and 0.84 (95% CI 0.72-0.98, P=0.484) with more than 1 year of follow-up; for advanced adenoma, the RR was 0.68 (95% CI 0.49-0.94, P=0.582) for 1 year and 0.75 (95% CI 0.52-1.07, P=0.552) for more than 1 year. Furthermore, the white population could be divided into two subgroups according to the length of follow-up. When the length of follow-up was less than 3 years, the RRs of the two subgroups were 0.86 (95% CI 0.76-0.98, P=0.332), $I^2=0\%$, and 0.68 (95% CI 0.47-0.98, P=0.552), $I^2=64.6\%$, respectively. But with follow-up extended beyond 2 years, oral aspirin, regardless of dose, had no efficacy in preventing the recurrence of any adenoma in the white population: the RR was 0.86 (95% CI 0.71-1.05, P=0.302), $I^2=16.4\%$. 
Conclusions: This meta-analysis indicated that oral aspirin, regardless of dose, is associated with a marked decrease in the recurrence of any adenoma and of advanced adenoma in patients followed up for 1 year. With follow-up extended beyond 1 year, oral aspirin remained effective in preventing the recurrence of any adenoma, but for advanced adenoma the result indicated no efficacy. According to the ethnic groups included, we also divided the relevant papers into two subgroups, the yellow and white groups. When the follow-up time was less than 3 years, oral aspirin, regardless of dose, had a significant efficacy in preventing the recurrence of any adenoma. But with follow-up greater than 2 years, oral aspirin had no effect in the white group.
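
To make the fixed-effects pooling step in this abstract concrete, here is a minimal sketch. It is not the authors' STATA procedure: the abstract does not say which fixed-effects estimator was used, so this example substitutes generic inverse-variance weighting on the log relative risk, and it recovers each study's standard error from its reported 95% CI. The function name and the input tuples are hypothetical.

```python
import math

Z_95 = 1.959964  # two-sided 95% normal quantile


def pool_fixed_effect(studies):
    """Inverse-variance fixed-effect pooling of relative risks.

    studies: list of (rr, ci_low, ci_high) tuples, one per trial.
    Returns (pooled_rr, pooled_ci_low, pooled_ci_high).
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for rr, ci_low, ci_high in studies:
        # Standard error of log(RR), derived from the CI width on the log scale.
        se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
        weight = 1.0 / (se * se)
        weighted_sum += weight * math.log(rr)
        total_weight += weight
    log_rr = weighted_sum / total_weight
    se_pooled = math.sqrt(1.0 / total_weight)
    return (
        math.exp(log_rr),
        math.exp(log_rr - Z_95 * se_pooled),
        math.exp(log_rr + Z_95 * se_pooled),
    )
```

A pooled interval that excludes 1 corresponds to the kind of result the abstract describes as effective, while an interval that spans 1 (such as 0.52-1.07) corresponds to the results it describes as showing no efficacy.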

A study on the readability of web interface for the elderly user -Focused on readability of Typeface- (고령사용자를 위한 웹 인터페이스에서의 가독성에 관한 연구 -Typeface의 가독성을 중심으로-)

  • Lee, Hyun-Ju;Woo, Seo-Hye;Park, Eun-Young;Suh, Hye-Young;Back, Seung-Chul
    • Archives of design research
    • /
    • v.20 no.3 s.71
    • /
    • pp.315-324
    • /
    • 2007
  • The rapid development of information technology has made Korea one of the most advanced countries in information and communication in a short period of time. However, the gap between the aged and the young has widened seriously. Fewer than 10% of older adults currently use the internet. This suggests that the elderly have many difficulties in using the internet because of their physical and cognitive differences. The purpose of this study is to help the aged easily obtain and use information by developing guidelines for Korean typography in the web interface. A literature search was conducted on web interface design guidelines for older adults. These guidelines were classified by interface component, and the study subjects needed for the Korean internet environment were selected. The subjects are the more comfortably readable typeface at each size, the proper text size for Gulim and Batang, the more comfortably readable leading, the appropriate letter spacing, the proper line length of body text, the suitable size proportion between title and body, and the more comfortably readable text alignment. Survey questions were written and then improved after a pretest. Both online and offline survey programs were written, and the aged and the young were tested with these programs. The results of the survey show that the aged and the young differ in their satisfaction with the readability and legibility of web content. Based on these results, universal guidelines for the Korean typographical environment were specified for the future aged population. It is expected that this study will be used as basic data for a universal web interface through which older adults can easily use and acquire information.

(Image Analysis of Electrophoresis Gels by using Region Growing with Multiple Peaks) (다중 피크의 영역 성장 기법에 의한 전기영동 젤의 영상 분석)

  • 김영원;전병환
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.5_6
    • /
    • pp.444-453
    • /
    • 2003
  • Recently, interest in biotechnology (BT) has grown, and image analysis techniques for electrophoresis gels are in high demand for analyzing genetic information or searching for new bioactive materials. For this purpose, the location and quantity of each band in a lane should be measured. Most existing techniques search for peaks in the profile of a lane. But such a peak is improper as the representative of a band, because its location does not correspond to that of the brightest pixel or the center of gravity. These approaches are also improper for measuring band quantity, because various enhancement processes are commonly applied to the original images to make peaks easier to extract. In this paper, we measure the accumulated brightness in each band region as the band quantity, extracting the region without any process that changes relative brightness, and calculate the center of gravity of the region as the band location. Specifically, we first extract lanes with an entropy-based threshold calculated on the gel-image histogram. Then, three different methods are proposed and applied to extract bands. In the MER method, peaks and valleys are searched on a vertical search line that bisects each lane, and the minimum enclosing rectangle of each band is set between two successive valleys. In the RG-1 method, on the other hand, each band is extracted by region growing with a peak as a seed, which separates overlapped neighboring bands. In the RG-2 method, peaks and valleys are searched on two vertical lines that trisect each lane, the left and right peaks may be paired up if they seem to belong to the same band, and each band region is then grown from one peak or, if both exist, from both peaks. To compare the three methods, we measured the location and amount of bands. 
As a result, the average errors in band location of MER, RG-1, and RG-2 were 6%, 3%, and 1%, respectively, when the lane length is normalized to a unit value. And the average errors in band amount were 8%, 5%, and 2%, respectively, when the sum of band amount is normalized to a unit value. In conclusion, RG-2 was shown to be more reliable in the accuracy of measuring the location and amount of bands.
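
The idea of growing a band region from a peak seed and then reading off accumulated brightness and center of gravity can be sketched as below. This is a simplified illustration, not the paper's RG-1 or RG-2 method: the abstract does not give the growing criterion, so the sketch assumes 4-connected growth over pixels at or above a hypothetical `min_brightness` threshold, and it assumes the center of gravity is weighted by brightness. The image is a plain list of rows.

```python
from collections import deque


def grow_region(image, seed, min_brightness):
    """Grow a 4-connected region from seed over pixels >= min_brightness.

    The seed itself is assumed to satisfy the threshold.
    """
    rows, cols = len(image), len(image[0])
    region = {seed}
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in region and image[nr][nc] >= min_brightness:
                region.add((nr, nc))
                queue.append((nr, nc))
    return region


def measure_band(image, region):
    """Return (accumulated brightness, (row, col) brightness-weighted centroid).

    Requires the region to have non-zero total brightness.
    """
    total = sum(image[r][c] for r, c in region)
    row_center = sum(r * image[r][c] for r, c in region) / total
    col_center = sum(c * image[r][c] for r, c in region) / total
    return total, (row_center, col_center)
```

Measuring on the unmodified pixel values, as the abstract emphasizes, keeps the accumulated brightness proportional to the original signal; any contrast enhancement applied beforehand would distort the band quantity.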

Development of a Feature Catalogue for Marine Geographic Information (해양 지리정보 피쳐 카탈로그 작성에 관한 연구)

  • Hong, Sang-Ki;Yun, Suk-Bum
    • Journal of Korea Spatial Information System Society
    • /
    • v.6 no.1 s.11
    • /
    • pp.101-117
    • /
    • 2004
  • Standards are essential to facilitate the efficient use of GIS data. International standards such as ISO TC211's 19100 series and various technical specifications from the OpenGIS Consortium are examples of efforts to maintain interoperability among GIS applications. Marine GIS is no exception to this rule, and in this context developing standards for marine GIS is urgently needed. Using the same meaning and definition for the features commonly found in marine GIS applications is one way to increase interoperability among systems. One of the key requirements for maintaining standard meanings for features is to build a common feature catalogue. This paper examines the concept of a feature catalogue and describes the ways in which a feature catalogue can be organized. To identify the common features found in various marine GIS applications, a comprehensive search was made to collect and analyze the features used in various applications. To maintain interoperability with the National GIS (NGIS) system, the features used in various NGIS applications were analyzed as well. The results of these analyses are used to create a comprehensive list of common features for marine GIS. This paper then explains the common feature catalogue for marine GIS and provides appropriate classification and coding systems for the common features. In addition, a registration tool for registering the common features into the standard registry was developed in this study. This Web-based tool can be used by various applications to input features into the feature catalogue and by standards agencies to maintain a standard-compliant feature catalogue.
