• Title/Summary/Keyword: Collection Methods

Search Result 2,363, Processing Time 0.028 seconds

Water Quality in a Drainage System Discharging Groundwater from Sangdae-ri Water Curtain Cultivation Area near Musimcheon Stream, Cheongju, Korea (무심천 인근 상대리 수막재배지에서 지하수 사용 후 배출되는 최종 배수로 물의 수질 특성)

  • Moon, Sang-Ho;Kim, Yongcheol;Hwang, Jeong
    • Economic and Environmental Geology
    • /
    • v.48 no.5
    • /
    • pp.409-420
    • /
    • 2015
  • The Sangdae-ri riverside around Musimcheon stream, flowing through Gadeok-myon of Cheongju City, is one of the representative strawberry fields employing water curtain cultivation (WCC) in Korea. In this area, annual groundwater use for WCC has been calculated by a few methods. On the assumption that all the water flowing through the final ditch may be mostly composed of groundwater, the discharge rate in it can be used as a good proxy for assessing the groundwater use. However, in the study area, the final ditch was set up in an unpaved state near and parallel to Musimcheon stream. Under such circumstances, the drainwater is likely to be influenced by infiltration and/or inflow of nearby stream. Hence, we examined whether or not stream water has influenced water flowing out through the final ditch in respect of ion concentrations or field parameters such as T, pH and electrical conductivity (EC) values. The period of measuring field parameters and sample collection was from February 2012 through February 2015. The drainwater in the final ditch did not show the average quality of groundwater, but similar quality of stream water in respect of pH, EC, ion contents and water type. From this, it is suggested that measuring the flow rate of the final ditch should not be directly used for assessing groundwater use in the study area. In addition, because of its sensitivity to ambient temperature, water temperature proved not to be appropriate for estimating the interaction between ditch and stream. For accuracy, additional methods will be needed to calculate mixing ratios between stream and ground water within drainage system.

A Study on Economic Value of Daegu Arboretum based on Contingent Valuation Methods (가상가치평가법을 이용한 대구수목원의 경제적 가치평가)

  • Kang, Kee-Rae;Lee, Kee-Cheol;Lee, Hyun-Taek;Ryu, Byong-Ro;Kim, Dong-Pil
    • Korean Journal of Environment and Ecology
    • /
    • v.25 no.5
    • /
    • pp.787-798
    • /
    • 2011
  • An arboretum is defined as a collection of facilities that conserve plant species by surveying, collecting, and proliferating and preserving the plants in nature, perform diverse researches on plants and display the plants in exhibition spaces or outdoors as well as provide the public with educational programs and refreshment spaces according to the laws concerned. The public, however, recognizes the exhibition and education functions on plants of arboretum more importantly compared with the roles to survey, collect, and proliferate plants as regulated by the laws. In particular, arboretum plays a role to offer a pivotal educational place in urban area where the public can obtain an hands-on experience and understanding on a wide range of plant species and natural environment. The study aims to estimate the non market environmental values of Daegu Arboretum operated by Daegu Metropolitan City government by using the Contingent Valuation Methods (CVM), which yields the current monetary estimates for the arboretum. The value estimation was undertaken by using the Double-Bound Dichotomous Choice (DBDC) method, and each estimated value was derived from respective functions based on a logit distribution known to include relatively stable estimates according to the shape of the distribution. Considering the statistical fitness test results, the author estimated the amounts of the Willingness To Pay (WTP) such as mean WTP of 12,718 KRW, median WTP of 11,033 KRW, and truncated mean WTP of 11,468 KRW, which represented the annual recreational values per a person visiting Daegu Arboretum respectively. The analysis showed that Daegu Arboretum created the annual environmental values which were estimated to be approximately 16 to 19 billion KRW. The study also has an implication that the valuation method for the environment of Daegu Arboretum may be effectively applied for estimating the values of other types of environmental goods by altering the locations or goods to be analyzed.

Characterization of Filamentous Cyanobacteria Encapsulated in Alginate Microcapsules (알긴산염 마이크로캡슐 내부에 동결보존된 사상체 남세균의 특성 연구)

  • Park, Mirye;Kim, Z-Hun;Nam, Seung Won;Lee, Sang Deuk;Yun, Suk Min;Kwon, Dae Ryul;Lee, Chang Soo
    • Microbiology and Biotechnology Letters
    • /
    • v.48 no.2
    • /
    • pp.205-214
    • /
    • 2020
  • Cyanobacteria are microorganisms which have important roles in the nitrogen cycle due to their ability to fix nitrogen in water and soil ecosystems. They also produce valuable materials that may be used in various industries. However, some species of cyanobacteria may limit the use of water resources by causing harmful algal blooms in water ecosystems. Many culture collection depositories provide cyanobacterial strains for research, but their systematic preservation is not well-developed in Korea. In this study, we developed a method for the cryopreservation of the cyanobacteria Trichormus variabilis (syn. Anabaena variabilis), using alginate microcapsules. Two approaches were used for the experiments and their outputs were compared. One of the methods involved the cryopreservation of cells using only a cryoprotectant and the other used the cryoprotectant within microcapsules. After cryopreservation for 35 days, cells preserved with both methods were successfully regenerated from the initial 1.0 × 105 cells/ml to a final concentration of 6.7 × 106 cells/ml and 1.1 × 107 cells/ml. Irregular T. variabilis shapes were found after 14 days of regeneration. T. variabilis internal structures were observed by transmission electron microscopy (TEM), revealing that lipid droplets were reduced after cryopreservation. The expression of the mreB gene, known to be related to cell morphology, was downregulated (54.7%) after cryopreservation. Cryopreservation using cryoprotectant alone or with microcapsules is expected to be applicable to other filamentous cyanobacteria in the future.

Key Methodologies to Effective Site-specific Accessment in Contaminated Soils : A Review (오염토양의 효과적 현장조사에 대한 주요 방법론의 검토)

  • Chung, Doug-Young
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.32 no.4
    • /
    • pp.383-397
    • /
    • 1999
  • For sites to be investigated, the results of such an investigation can be used in determining foals for cleanup, quantifying risks, determining acceptable and unacceptable risk, and developing cleanup plans t hat do not cause unnecessary delays in the redevelopment and reuse of the property. To do this, it is essential that an appropriately detailed study of the site be performed to identify the cause, nature, and extent of contamination and the possible threats to the environment or to any people living or working nearby through the analysis of samples of soil and soil gas, groundwater, surface water, and sediment. The migration pathways of contaminants also are examined during this phase. Key aspects of cost-effective site assessment to help standardize and accelerate the evaluation of contaminated soils at sites are to provide a simple step-by-step methodology for environmental science/engineering professionals to calculate risk-based, site-specific soil levels for contaminants in soil. Its use may significantly reduce the time it takes to complete soil investigations and cleanup actions at some sites, as well as improve the consistency of these actions across the nation. To achieve the effective site assessment, it requires the criteria for choosing the type of standard and setting the magnitude of the standard come from different sources, depending on many factors including the nature of the contamination. A general scheme for site-specific assessment consists of sequential Phase I, II, and III, which is defined by workplan and soil screening levels. Phase I are conducted to identify and confirm a site's recognized environmental conditions resulting from past actions. If a Phase 1 identifies potential hazardous substances, a Phase II is usually conducted to confirm the absence, or presence and extent, of contamination. Phase II involve the collection and analysis of samples. And Phase III is to remediate the contaminated soils determined by Phase I and Phase II. However, important factors in determining whether a assessment standard is site-specific and suitable are (1) the spatial extent of the sampling and the size of the sample area; (2) the number of samples taken: (3) the strategy of taking samples: and (4) the way the data are analyzed. Although selected methods are recommended, application of quantitative methods is directed by users having prior training or experience for the dynamic site investigation process.

  • PDF

A Study on the Design of the Grid-Cell Assessment System for the Optimal Location of Offshore Wind Farms (해상풍력발전단지의 최적 위치 선정을 위한 Grid-cell 평가 시스템 개념 설계)

  • Lee, Bo-Kyeong;Cho, Ik-Soon;Kim, Dae-Hae
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.24 no.7
    • /
    • pp.848-857
    • /
    • 2018
  • Recently, around the world, active development of new renewable energy sources including solar power, waves, and fuel cells, etc. has taken place. Particularly, floating offshore wind farms have been developed for saving costs through large scale production, using high-quality wind power and minimizing noise damage in the ocean area. The development of floating wind farms requires an evaluation of the Maritime Safety Audit Scheme under the Maritime Safety Act in Korea. Floating wind farms shall be assessed by applying the line and area concept for systematic development, management and utilization of specified sea water. The development of appropriate evaluation methods and standards is also required. In this study, proper standards for marine traffic surveys and assessments were established and a systemic treatment was studied for assessing marine spatial area. First, a marine traffic data collector using AIS or radar was designed to conduct marine traffic surveys. In addition, assessment methods were proposed such as historical tracks, traffic density and marine traffic pattern analysis applying the line and area concept. Marine traffic density can be evaluated by spatial and temporal means, with an adjusted grid-cell scale. Marine traffic pattern analysis was proposed for assessing ship movement patterns for transit or work in sea areas. Finally, conceptual design of a Marine Traffic and Safety Assessment Solution (MaTSAS) was competed that can be analyzed automatically to collect and assess the marine traffic data. It could be possible to minimize inaccurate estimation due to human errors such as data omission or misprints through automated and systematic collection, analysis and retrieval of marine traffic data. This study could provides reliable assessment results, reflecting the line and area concept, according to sea area usage.

Survey of Knowledge on Insomnia for Sleep Clinic Clients (수면클리닉을 방문한 환자들의 불면증에 대한 인식조사)

  • Soh, Minah
    • Sleep Medicine and Psychophysiology
    • /
    • v.26 no.1
    • /
    • pp.23-32
    • /
    • 2019
  • Objectives: Insomnia is not only the most common sleep-related disorder, but also is one of the most important. Knowledge of the comorbidities of insomnia is essential for proper treatment including pharmacological and non-pharmacological methods to prevent disease chronification. This study aimed to determine sleep clinic patients' knowledge of insomnia. Methods: This study recruited 44 patients (24 males and 20 females; mean age $54.11{\pm}16.30years$) from the sleep clinic at National Center for Mental Health. All subjects were asked to complete a self-report questionnaire about their reasons for visiting a sleep clinic and about their knowledge of treatment and comorbidities of insomnia. Results: The reasons for visiting the sleep clinic were insomnia symptoms of daytime sleepiness, irregular sleeping time, nightmares, snoring, and sleep apnea, in that order. Of the responders, 72.7% had a comorbidity of insomnia, and 22.7% showed high-risk alcohol use. In addition, 70.5% of responders chose pharmacological treatment of insomnia as the first option and reported collection of information about treatment of insomnia mainly from the internet and medical staff. More than half (52.3%) of the respondents reported that they had never heard about non-pharmacological treatments of insomnia such as cognitive behavioral treatment (CBT-I) or light therapy. The response rate about comorbidities of varied, with 75% of responders reporting knowledge of the relation between insomnia and depression, but only 38.6% stating awareness of the relation between insomnia and alcohol use disorder. Of the total responders, 68.2% were worried about hypnotics for insomnia treatment, and 70% were concerned about drug dependence. Conclusion: This study showed that patients at a sleep clinic had limited knowledge about insomnia. It is necessary to develop standardized insomnia treatment guidelines and educational handbooks for those suffering from insomnia. In addition, evaluation of alcohol use disorders is essential in the initial assessment of sleep disorders.

Basic Research on the Possibility of Developing a Landscape Perceptual Response Prediction Model Using Artificial Intelligence - Focusing on Machine Learning Techniques - (인공지능을 활용한 경관 지각반응 예측모델 개발 가능성 기초연구 - 머신러닝 기법을 중심으로 -)

  • Kim, Jin-Pyo;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.51 no.3
    • /
    • pp.70-82
    • /
    • 2023
  • The recent surge of IT and data acquisition is shifting the paradigm in all aspects of life, and these advances are also affecting academic fields. Research topics and methods are being improved through academic exchange and connections. In particular, data-based research methods are employed in various academic fields, including landscape architecture, where continuous research is needed. Therefore, this study aims to investigate the possibility of developing a landscape preference evaluation and prediction model using machine learning, a branch of Artificial Intelligence, reflecting the current situation. To achieve the goal of this study, machine learning techniques were applied to the landscaping field to build a landscape preference evaluation and prediction model to verify the simulation accuracy of the model. For this, wind power facility landscape images, recently attracting attention as a renewable energy source, were selected as the research objects. For analysis, images of the wind power facility landscapes were collected using web crawling techniques, and an analysis dataset was built. Orange version 3.33, a program from the University of Ljubljana was used for machine learning analysis to derive a prediction model with excellent performance. IA model that integrates the evaluation criteria of machine learning and a separate model structure for the evaluation criteria were used to generate a model using kNN, SVM, Random Forest, Logistic Regression, and Neural Network algorithms suitable for machine learning classification models. The performance evaluation of the generated models was conducted to derive the most suitable prediction model. The prediction model derived in this study separately evaluates three evaluation criteria, including classification by type of landscape, classification by distance between landscape and target, and classification by preference, and then synthesizes and predicts results. As a result of the study, a prediction model with a high accuracy of 0.986 for the evaluation criterion according to the type of landscape, 0.973 for the evaluation criterion according to the distance, and 0.952 for the evaluation criterion according to the preference was developed, and it can be seen that the verification process through the evaluation of data prediction results exceeds the required performance value of the model. As an experimental attempt to investigate the possibility of developing a prediction model using machine learning in landscape-related research, this study was able to confirm the possibility of creating a high-performance prediction model by building a data set through the collection and refinement of image data and subsequently utilizing it in landscape-related research fields. Based on the results, implications, and limitations of this study, it is believed that it is possible to develop various types of landscape prediction models, including wind power facility natural, and cultural landscapes. Machine learning techniques can be more useful and valuable in the field of landscape architecture by exploring and applying research methods appropriate to the topic, reducing the time of data classification through the study of a model that classifies images according to landscape types or analyzing the importance of landscape planning factors through the analysis of landscape prediction factors using machine learning.

A Ranking Algorithm for Semantic Web Resources: A Class-oriented Approach (시맨틱 웹 자원의 랭킹을 위한 알고리즘: 클래스중심 접근방법)

  • Rho, Sang-Kyu;Park, Hyun-Jung;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.17 no.4
    • /
    • pp.31-59
    • /
    • 2007
  • We frequently use search engines to find relevant information in the Web but still end up with too much information. In order to solve this problem of information overload, ranking algorithms have been applied to various domains. As more information will be available in the future, effectively and efficiently ranking search results will become more critical. In this paper, we propose a ranking algorithm for the Semantic Web resources, specifically RDF resources. Traditionally, the importance of a particular Web page is estimated based on the number of key words found in the page, which is subject to manipulation. In contrast, link analysis methods such as Google's PageRank capitalize on the information which is inherent in the link structure of the Web graph. PageRank considers a certain page highly important if it is referred to by many other pages. The degree of the importance also increases if the importance of the referring pages is high. Kleinberg's algorithm is another link-structure based ranking algorithm for Web pages. Unlike PageRank, Kleinberg's algorithm utilizes two kinds of scores: the authority score and the hub score. If a page has a high authority score, it is an authority on a given topic and many pages refer to it. A page with a high hub score links to many authoritative pages. As mentioned above, the link-structure based ranking method has been playing an essential role in World Wide Web(WWW), and nowadays, many people recognize the effectiveness and efficiency of it. On the other hand, as Resource Description Framework(RDF) data model forms the foundation of the Semantic Web, any information in the Semantic Web can be expressed with RDF graph, making the ranking algorithm for RDF knowledge bases greatly important. The RDF graph consists of nodes and directional links similar to the Web graph. As a result, the link-structure based ranking method seems to be highly applicable to ranking the Semantic Web resources. However, the information space of the Semantic Web is more complex than that of WWW. For instance, WWW can be considered as one huge class, i.e., a collection of Web pages, which has only a recursive property, i.e., a 'refers to' property corresponding to the hyperlinks. However, the Semantic Web encompasses various kinds of classes and properties, and consequently, ranking methods used in WWW should be modified to reflect the complexity of the information space in the Semantic Web. Previous research addressed the ranking problem of query results retrieved from RDF knowledge bases. Mukherjea and Bamba modified Kleinberg's algorithm in order to apply their algorithm to rank the Semantic Web resources. They defined the objectivity score and the subjectivity score of a resource, which correspond to the authority score and the hub score of Kleinberg's, respectively. They concentrated on the diversity of properties and introduced property weights to control the influence of a resource on another resource depending on the characteristic of the property linking the two resources. A node with a high objectivity score becomes the object of many RDF triples, and a node with a high subjectivity score becomes the subject of many RDF triples. They developed several kinds of Semantic Web systems in order to validate their technique and showed some experimental results verifying the applicability of their method to the Semantic Web. Despite their efforts, however, there remained some limitations which they reported in their paper. First, their algorithm is useful only when a Semantic Web system represents most of the knowledge pertaining to a certain domain. In other words, the ratio of links to nodes should be high, or overall resources should be described in detail, to a certain degree for their algorithm to properly work. Second, a Tightly-Knit Community(TKC) effect, the phenomenon that pages which are less important but yet densely connected have higher scores than the ones that are more important but sparsely connected, remains as problematic. Third, a resource may have a high score, not because it is actually important, but simply because it is very common and as a consequence it has many links pointing to it. In this paper, we examine such ranking problems from a novel perspective and propose a new algorithm which can solve the problems under the previous studies. Our proposed method is based on a class-oriented approach. In contrast to the predicate-oriented approach entertained by the previous research, a user, under our approach, determines the weights of a property by comparing its relative significance to the other properties when evaluating the importance of resources in a specific class. This approach stems from the idea that most queries are supposed to find resources belonging to the same class in the Semantic Web, which consists of many heterogeneous classes in RDF Schema. This approach closely reflects the way that people, in the real world, evaluate something, and will turn out to be superior to the predicate-oriented approach for the Semantic Web. Our proposed algorithm can resolve the TKC(Tightly Knit Community) effect, and further can shed lights on other limitations posed by the previous research. In addition, we propose two ways to incorporate data-type properties which have not been employed even in the case when they have some significance on the resource importance. We designed an experiment to show the effectiveness of our proposed algorithm and the validity of ranking results, which was not tried ever in previous research. We also conducted a comprehensive mathematical analysis, which was overlooked in previous research. The mathematical analysis enabled us to simplify the calculation procedure. Finally, we summarize our experimental results and discuss further research issues.

Application of Support Vector Regression for Improving the Performance of the Emotion Prediction Model (감정예측모형의 성과개선을 위한 Support Vector Regression 응용)

  • Kim, Seongjin;Ryoo, Eunchung;Jung, Min Kyu;Kim, Jae Kyeong;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.185-202
    • /
    • 2012
  • .Since the value of information has been realized in the information society, the usage and collection of information has become important. A facial expression that contains thousands of information as an artistic painting can be described in thousands of words. Followed by the idea, there has recently been a number of attempts to provide customers and companies with an intelligent service, which enables the perception of human emotions through one's facial expressions. For example, MIT Media Lab, the leading organization in this research area, has developed the human emotion prediction model, and has applied their studies to the commercial business. In the academic area, a number of the conventional methods such as Multiple Regression Analysis (MRA) or Artificial Neural Networks (ANN) have been applied to predict human emotion in prior studies. However, MRA is generally criticized because of its low prediction accuracy. This is inevitable since MRA can only explain the linear relationship between the dependent variables and the independent variable. To mitigate the limitations of MRA, some studies like Jung and Kim (2012) have used ANN as the alternative, and they reported that ANN generated more accurate prediction than the statistical methods like MRA. However, it has also been criticized due to over fitting and the difficulty of the network design (e.g. setting the number of the layers and the number of the nodes in the hidden layers). Under this background, we propose a novel model using Support Vector Regression (SVR) in order to increase the prediction accuracy. SVR is an extensive version of Support Vector Machine (SVM) designated to solve the regression problems. The model produced by SVR only depends on a subset of the training data, because the cost function for building the model ignores any training data that is close (within a threshold ${\varepsilon}$) to the model prediction. Using SVR, we tried to build a model that can measure the level of arousal and valence from the facial features. To validate the usefulness of the proposed model, we collected the data of facial reactions when providing appropriate visual stimulating contents, and extracted the features from the data. Next, the steps of the preprocessing were taken to choose statistically significant variables. In total, 297 cases were used for the experiment. As the comparative models, we also applied MRA and ANN to the same data set. For SVR, we adopted '${\varepsilon}$-insensitive loss function', and 'grid search' technique to find the optimal values of the parameters like C, d, ${\sigma}^2$, and ${\varepsilon}$. In the case of ANN, we adopted a standard three-layer backpropagation network, which has a single hidden layer. The learning rate and momentum rate of ANN were set to 10%, and we used sigmoid function as the transfer function of hidden and output nodes. We performed the experiments repeatedly by varying the number of nodes in the hidden layer to n/2, n, 3n/2, and 2n, where n is the number of the input variables. The stopping condition for ANN was set to 50,000 learning events. And, we used MAE (Mean Absolute Error) as the measure for performance comparison. From the experiment, we found that SVR achieved the highest prediction accuracy for the hold-out data set compared to MRA and ANN. Regardless of the target variables (the level of arousal, or the level of positive / negative valence), SVR showed the best performance for the hold-out data set. ANN also outperformed MRA, however, it showed the considerably lower prediction accuracy than SVR for both target variables. The findings of our research are expected to be useful to the researchers or practitioners who are willing to build the models for recognizing human emotions.

An Analytical Approach Using Topic Mining for Improving the Service Quality of Hotels (호텔 산업의 서비스 품질 향상을 위한 토픽 마이닝 기반 분석 방법)

  • Moon, Hyun Sil;Sung, David;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.21-41
    • /
    • 2019
  • Thanks to the rapid development of information technologies, the data available on Internet have grown rapidly. In this era of big data, many studies have attempted to offer insights and express the effects of data analysis. In the tourism and hospitality industry, many firms and studies in the era of big data have paid attention to online reviews on social media because of their large influence over customers. As tourism is an information-intensive industry, the effect of these information networks on social media platforms is more remarkable compared to any other types of media. However, there are some limitations to the improvements in service quality that can be made based on opinions on social media platforms. Users on social media platforms represent their opinions as text, images, and so on. Raw data sets from these reviews are unstructured. Moreover, these data sets are too big to extract new information and hidden knowledge by human competences. To use them for business intelligence and analytics applications, proper big data techniques like Natural Language Processing and data mining techniques are needed. This study suggests an analytical approach to directly yield insights from these reviews to improve the service quality of hotels. Our proposed approach consists of topic mining to extract topics contained in the reviews and the decision tree modeling to explain the relationship between topics and ratings. Topic mining refers to a method for finding a group of words from a collection of documents that represents a document. Among several topic mining methods, we adopted the Latent Dirichlet Allocation algorithm, which is considered as the most universal algorithm. However, LDA is not enough to find insights that can improve service quality because it cannot find the relationship between topics and ratings. To overcome this limitation, we also use the Classification and Regression Tree method, which is a kind of decision tree technique. Through the CART method, we can find what topics are related to positive or negative ratings of a hotel and visualize the results. Therefore, this study aims to investigate the representation of an analytical approach for the improvement of hotel service quality from unstructured review data sets. Through experiments for four hotels in Hong Kong, we can find the strengths and weaknesses of services for each hotel and suggest improvements to aid in customer satisfaction. Especially from positive reviews, we find what these hotels should maintain for service quality. For example, compared with the other hotels, a hotel has a good location and room condition which are extracted from positive reviews for it. In contrast, we also find what they should modify in their services from negative reviews. For example, a hotel should improve room condition related to soundproof. These results mean that our approach is useful in finding some insights for the service quality of hotels. That is, from the enormous size of review data, our approach can provide practical suggestions for hotel managers to improve their service quality. In the past, studies for improving service quality relied on surveys or interviews of customers. However, these methods are often costly and time consuming and the results may be biased by biased sampling or untrustworthy answers. The proposed approach directly obtains honest feedback from customers' online reviews and draws some insights through a type of big data analysis. So it will be a more useful tool to overcome the limitations of surveys or interviews. Moreover, our approach easily obtains the service quality information of other hotels or services in the tourism industry because it needs only open online reviews and ratings as input data. Furthermore, the performance of our approach will be better if other structured and unstructured data sources are added.