• Title/Summary/Keyword: mining analysis

Search Result 3,173, Processing Time 0.032 seconds

Preliminary Study on the Application of Remote Sensing to Mineral Exploration Using Landsat and ASTER Data (Landsat과 ASTER 위성영상 자료를 이용한 광물자원탐사로의 적용 가능성을 위한 예비연구)

  • Lee, Hong-Jin;Park, Maeng-Eon;Kim, Eui-Jun
    • Economic and Environmental Geology
    • /
    • v.43 no.5
    • /
    • pp.467-475
    • /
    • 2010
  • The Landsat and ASTER data have been used in mineralogical and lithological studies, and they have also proved to be useful tool in the initial steps for mineral exploration throughout Nevada mining district, US. Huge pyrophyllite quarry mines, including Jungang, Samsung, Kyeongju, and Naenam located in the southeastern part of Gyeongsang Basin. The geology of study area consists mainly of Cretaceous volcanic rocks, which belong into Cretaceous Hayang and Jindong Group. They were intruded by Bulgugsa granites, so called Sannae-Eonyang granites. To extraction of Ratio model for pyrophyllite deposits, tuffaceous rock and pyrophyllite ores from the Jungang mine used in reflectance spectral analysis and these results were re-sampled to Landsat and ASTER bandpass. As a result of these processes, the pyrophyllite ores spectral features show strong reflectance at band 5, whereas strong absorption at band 7 in Landsat data. In the ASTER data, the pyrophyllite ores spectral features show strong absorption at band 5 and 8, whereas strong reflectance at band 4 and 7. Based on these spectral features, as a result of application of $Py_{Landsat}$ model to hydrothermal alteration zone and other exposed sites, the DN values of two different areas are 1.94 and 1.19 to 1.49, respectively. The differences values between pyrophyllite deposits and concrete-barren area are 0.472 and 0.399 for $Py_{ASTER}$ model, 0.452 and 0.371 for OHIb model, 0.365 and 0.311 for PAK model, respectively. Thus, $Py_{ASTER}$ and $Py_{Landsat}$ model proposed from this study proved to be more useful tool for the extraction of pyrophyllite deposits relative to previous models.

A Study on the Research Trends in Library & Information Science in Korea using Topic Modeling (토픽모델링을 활용한 국내 문헌정보학 연구동향 분석)

  • Park, Ja-Hyun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.1
    • /
    • pp.7-32
    • /
    • 2013
  • The goal of the present study is to identify the topic trend in the field of library and information science in Korea. To this end, we collected titles and s of the papers published in four major journals such as Journal of the Korean Society for information Management, Journal of the Korean Society for Library and Information Science, Journal of Korean Library and Information Science Society, and Journal of the Korean BIBLIA Society for library and Information Science during 1970 and 2012. After that, we applied the well-received topic modeling technique, Latent Dirichlet Allocation(LDA), to the collected data sets. The research findings of the study are as follows: 1) Comparison of the extracted topics by LDA with the subject headings of library and information science shows that there are several distinct sub-research domains strongly tied with the field. Those include library and society in the domain of "introduction to library and information science," professionalism, library and information policy in the domain of "library system," library evaluation in the domain of "library management," collection development and management, information service in the domain of "library service," services by library type, user training/information literacy, service evaluation, classification/cataloging/meta-data in the domain of "document organization," bibliometrics/digital libraries/user study/internet/expert system/information retrieval/information system in the domain of "information science," antique documents in the domain of "bibliography," books/publications in the domain of "publication," and archival study. The results indicate that among these sub-domains, information science and library services are two most focused domains. Second, we observe that there is the growing trend in the research topics such as service and evaluation by library type, internet, and meta-data, but the research topics such as book, classification, and cataloging reveal the declining trend. Third, analysis by journal show that in Journal of the Korean Society for information Management, information science related topics appear more frequently than library science related topics whereas library science related topics are more popular in the other three journals studied in this paper.

Diagnosis of Conflict Problem between the Marine Environmental Conservation and Development, and Policy Implication for Marine Spatial Planning (해양환경보전과 이용·개발의 상충 분석과 해양공간계획에 대한 시사점)

  • Lee, Dae In;Tac, Dae Ho;Kim, Gui Young
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.19 no.3
    • /
    • pp.227-235
    • /
    • 2016
  • This paper emphasized the necessity of the marine spatial planning (MSP) through the analysis of the major developmental projects which could make a contradiction based on the adequacy of the site selection and environmental impacts. The conflicting affairs between space utilization and management plan happen in the following ways: marine renewable energy development, sand mining, reclamation, construction of golf course in coastal area, thermal effluent and waste heat, erosion causing port development. The conflict of stakeholder continues caused by the accumulated environmental impact. For the reasons mentioned above, we found two things. First, it is necessary to comprehend the fact of developmental planning and MSP. Second, it is still unsatisfactory to connect the relevance of laws related to the spatial planning. For the reinforcement of marine environmental policy management, it is necessary to consolidate the property of site selection and assessment of developmental scale. Especially, while the strategic environmental assessment is in progress based on site selection and property of scale, consistent diagnosis is needed in the following concerns: the fact of the marine spatial planning, the relevance between national developmental plan and regional developmental plan, fisheries regulation, marine protected animals. For the environmentally sound and sustainable development (ESSD), MSP should have to be prepared based in a way of top-down including coastal and EEZ plan, relevance of ocean-use zoning and sector planning, 3-D spatial information. And also integrated information system have to be prepared through high-tech marine spatial information. In conclusion, consistent and relevant strategy for MSP should have to include the whole information related to the maritime affairs such as harbor, fishing port, fishing ground, coastal management, marine ecosystem generally.

The Adaptive Personalization Method According to Users Purchasing Index : Application to Beverage Purchasing Predictions (고객별 구매빈도에 동적으로 적응하는 개인화 시스템 : 음료수 구매 예측에의 적용)

  • Park, Yoon-Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.95-108
    • /
    • 2011
  • TThis is a study of the personalization method that intelligently adapts the level of clustering considering purchasing index of a customer. In the e-biz era, many companies gather customers' demographic and transactional information such as age, gender, purchasing date and product category. They use this information to predict customer's preferences or purchasing patterns so that they can provide more customized services to their customers. The previous Customer-Segmentation method provides customized services for each customer group. This method clusters a whole customer set into different groups based on their similarity and builds predictive models for the resulting groups. Thus, it can manage the number of predictive models and also provide more data for the customers who do not have enough data to build a good predictive model by using the data of other similar customers. However, this method often fails to provide highly personalized services to each customer, which is especially important to VIP customers. Furthermore, it clusters the customers who already have a considerable amount of data as well as the customers who only have small amount of data, which causes to increase computational cost unnecessarily without significant performance improvement. The other conventional method called 1-to-1 method provides more customized services than the Customer-Segmentation method for each individual customer since the predictive model are built using only the data for the individual customer. This method not only provides highly personalized services but also builds a relatively simple and less costly model that satisfies with each customer. However, the 1-to-1 method has a limitation that it does not produce a good predictive model when a customer has only a few numbers of data. In other words, if a customer has insufficient number of transactional data then the performance rate of this method deteriorate. In order to overcome the limitations of these two conventional methods, we suggested the new method called Intelligent Customer Segmentation method that provides adaptive personalized services according to the customer's purchasing index. The suggested method clusters customers according to their purchasing index, so that the prediction for the less purchasing customers are based on the data in more intensively clustered groups, and for the VIP customers, who already have a considerable amount of data, clustered to a much lesser extent or not clustered at all. The main idea of this method is that applying clustering technique when the number of transactional data of the target customer is less than the predefined criterion data size. In order to find this criterion number, we suggest the algorithm called sliding window correlation analysis in this study. The algorithm purposes to find the transactional data size that the performance of the 1-to-1 method is radically decreased due to the data sparity. After finding this criterion data size, we apply the conventional 1-to-1 method for the customers who have more data than the criterion and apply clustering technique who have less than this amount until they can use at least the predefined criterion amount of data for model building processes. We apply the two conventional methods and the newly suggested method to Neilsen's beverage purchasing data to predict the purchasing amounts of the customers and the purchasing categories. We use two data mining techniques (Support Vector Machine and Linear Regression) and two types of performance measures (MAE and RMSE) in order to predict two dependent variables as aforementioned. The results show that the suggested Intelligent Customer Segmentation method can outperform the conventional 1-to-1 method in many cases and produces the same level of performances compare with the Customer-Segmentation method spending much less computational cost.

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.21-44
    • /
    • 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.

Development of a Detection Model for the Companies Designated as Administrative Issue in KOSDAQ Market (KOSDAQ 시장의 관리종목 지정 탐지 모형 개발)

  • Shin, Dong-In;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.157-176
    • /
    • 2018
  • The purpose of this research is to develop a detection model for companies designated as administrative issue in KOSDAQ market using financial data. Administration issue designates the companies with high potential for delisting, which gives them time to overcome the reasons for the delisting under certain restrictions of the Korean stock market. It acts as an alarm to inform investors and market participants of which companies are likely to be delisted and warns them to make safe investments. Despite this importance, there are relatively few studies on administration issues prediction model in comparison with the lots of studies on bankruptcy prediction model. Therefore, this study develops and verifies the detection model of the companies designated as administrative issue using financial data of KOSDAQ companies. In this study, logistic regression and decision tree are proposed as the data mining models for detecting administrative issues. According to the results of the analysis, the logistic regression model predicted the companies designated as administrative issue using three variables - ROE(Earnings before tax), Cash flows/Shareholder's equity, and Asset turnover ratio, and its overall accuracy was 86% for the validation dataset. The decision tree (Classification and Regression Trees, CART) model applied the classification rules using Cash flows/Total assets and ROA(Net income), and the overall accuracy reached 87%. Implications of the financial indictors selected in our logistic regression and decision tree models are as follows. First, ROE(Earnings before tax) in the logistic detection model shows the profit and loss of the business segment that will continue without including the revenue and expenses of the discontinued business. Therefore, the weakening of the variable means that the competitiveness of the core business is weakened. If a large part of the profits is generated from one-off profit, it is very likely that the deterioration of business management is further intensified. As the ROE of a KOSDAQ company decreases significantly, it is highly likely that the company can be delisted. Second, cash flows to shareholder's equity represents that the firm's ability to generate cash flow under the condition that the financial condition of the subsidiary company is excluded. In other words, the weakening of the management capacity of the parent company, excluding the subsidiary's competence, can be a main reason for the increase of the possibility of administrative issue designation. Third, low asset turnover ratio means that current assets and non-current assets are ineffectively used by corporation, or that asset investment by corporation is excessive. If the asset turnover ratio of a KOSDAQ-listed company decreases, it is necessary to examine in detail corporate activities from various perspectives such as weakening sales or increasing or decreasing inventories of company. Cash flow / total assets, a variable selected by the decision tree detection model, is a key indicator of the company's cash condition and its ability to generate cash from operating activities. Cash flow indicates whether a firm can perform its main activities(maintaining its operating ability, repaying debts, paying dividends and making new investments) without relying on external financial resources. Therefore, if the index of the variable is negative(-), it indicates the possibility that a company has serious problems in business activities. If the cash flow from operating activities of a specific company is smaller than the net profit, it means that the net profit has not been cashed, indicating that there is a serious problem in managing the trade receivables and inventory assets of the company. Therefore, it can be understood that as the cash flows / total assets decrease, the probability of administrative issue designation and the probability of delisting are increased. In summary, the logistic regression-based detection model in this study was found to be affected by the company's financial activities including ROE(Earnings before tax). However, decision tree-based detection model predicts the designation based on the cash flows of the company.

Risk Assessment of Arsenic by Human Exposure of Contaminated Soil, Groundwater and Rice Grain (오염된 토양, 지하수 및 쌀의 인체노출에 따른 비소의 위해성 평가)

  • Lee Jin-Soo;Chon Hyo-Taek
    • Economic and Environmental Geology
    • /
    • v.38 no.5 s.174
    • /
    • pp.535-545
    • /
    • 2005
  • Environmental survey from some abandoned metal mine areas was undertaken on to assess the risk of adverse health effects on human exposure to arsenic influenced by past Au-Ag mining activities. Elevated levels of As were found in tailings from the studied mine areas. This high concentration may have a impact on soils and waters around the tailing piles. In order to perform the human risk assessment, chemical analysis data of soils, rice grains and waters fur As have been used. The HQ values fer As via the rice grain and groundwater consumption were significantly higher compared with other exposure pathways in all metal mine areas. However, there were minimal soil and water dermal contact risks. The resulting Hl values of As from the Dongil, Okdong and Hwacheon mine areas were higher than 5.0, and their toxic risk due to drinking water and rice grain was strong in these mine areas. The cancer risk of being exposed to As by the rice grain route from the Dongil, Okdong and Hwacheon mine areas was $5.2\times10^{-4},\;6.0\times10^{-4}\;and\;8.1\times10^{-4}$, respectively. The As cancer risk via the exposure pathway of drinking water from these mine areas exceeded the acceptable risk of 1 in 10,000 fer regulatory purposes. Thus, the daily intakes of groundwater and rice grain by the local residents from the Dongil, Okdong and Hwacheon mine areas can pose a potential health threat if exposed by long-term arsenic exposure.

Consumers Perceptions on Monosodium L-glutamate in Social Media (소셜미디어 분석을 통한 소비자들의 L-글루타민산나트륨에 대한 인식 조사)

  • Lee, Sooyeon;Lee, Wonsung;Moon, Il-Chul;Kwon, Hoonjeong
    • Journal of Food Hygiene and Safety
    • /
    • v.31 no.3
    • /
    • pp.153-166
    • /
    • 2016
  • The purpose of this study was to investigate consumers' perceptions on monosodium L-glutamate (MSG) in social media. Data were collected from Naver blogs and Naver web communities (Korean representative portal web-site), and media reports including comment sections on a Yonhap news website (Korean largest news agency). The results from Naver blogs and Naver web communities showed that it was primarily mentioned MSG-use restaurant reviews, 'MSG-no added' products, its safety, and methods of reducing MSG in food. When TV shows on current affairs, newspaper, or TV news reported uses and side effects of MSG, search volume for MSG has increased in both PC and mobile search engines. Search volume has increased especially when TV shows on current affairs reported it. There are more periods with increased search volume for Mobile than PC. Also, it was mainly commented about safety of MSG, criticism of low-quality foods, abuse of MSG, and distrust of government below the news on the Yonhap news site. The label of MSG-no added products in market emphasized "MSG-free" even though it is allocated as an acceptable daily intake (ADI) not-specified by the Joint FAO/WHO Expert Committee on Food Additives (JECFA). When consumers search for MSG (monosodium L-glutamate) or purchase food on market, they might perceive that 'MSG-no added' products are better. Competent authorities, offices of education and local government provide guidelines based on no added MSG principle and these policies might affect consumers' perceptions. TV program or news program could be a powerful and effective consumer communication channel about MSG through Mobile rather than PC. Therefore media including TV should report item on monosodium L-glutamate with responsibility and information based on scientific background for consumers to get reliable information.

Treatment of Contaminated Sediment for Water Quality Improvement of Small-scale Reservoir (소하천형 호수의 수질개선을 위한 퇴적저니 처리방안 연구)

  • 배우근;이창수;정진욱;최동호
    • Journal of Soil and Groundwater Environment
    • /
    • v.7 no.4
    • /
    • pp.31-39
    • /
    • 2002
  • Pollutants from industry, mining, agriculture, and other sources have contaminated sediments in many surface water bodies. Sediment contamination poses a severe threat to human health and environment because many toxic contaminants that are barely detectable in the water column can accumulate in sediments at much higher levels. The purpose of this study was to make optimal treatment and disposal plan o( sediment for water quality improvement in small-scale resevoir based on an evaluation of degree of contamination. The degree of contamination were investigated for 23 samples of 9 site at different depth of sediment in small-scale J river. Results for analysis of contaminated sediments were observed that copper concentration of 4 samples were higher than the regulation of hazardous waste (3 mg/L) and that of all samples were exceeded soil pollution warning levels for agricultural areas. Lead and mercury concentration of all samples were detected below both regulations. Necessary of sediment dredge was evaluated for organic matter and nutrient through standard levels of Paldang lake and the lower Han river in Korea and Tokyo bay and Yokohama bay in Japan. The degree of contamination for organic matter and nutrient was not serious. Compared standard levels of Japan, America, and Canada for heavy metal, contaminated sediment was concluded as lowest effect level or limit of tolerance level because standard levels of America and Canada was established worst effect of benthic organisms. The optimal treatment method of sediment contained heavy metal was cement-based solidification/stabilization to prevent heavy metal leaching.

Chemical Speciation of Arsenic in the Water System from Some Abandoned Au-Ag Mines in Korea (국내 폐금은광산 주변 수계내의 As의 화학적 특성)

  • 이지민;이진수;전효택
    • Economic and Environmental Geology
    • /
    • v.36 no.6
    • /
    • pp.481-490
    • /
    • 2003
  • The objectives of this study are (1) to determine the extent and degree of As contamination of the water and sediments influenced by mining activity of the abandoned Au-Ag mines, (2) to examine As speciation In contaminated water, (3) to monitor variation of As contamination in water system throughout the dry and wet seasons, and (4) to investigate the As chemical form in the sediments through the sequential extraction analyses. Natural water(mine water, surface water and groundwater) and sediments were collected in six abandoned Au-Ag mine(Au-bearing quartz veins) areas. The contamination level of As in mine water of the Dongil(524${\mu}m$/L) is more higher than the tolerance level(500 ${\mu}m$/L) for waste water of mine area in Korea. Elevated levels of As in stream water were also found in the Dongil(range of 63.7∼117.6 ${\mu}m$/L.) and Gubong(range of 56.1∼62.9 ${\mu}m$/L) mine areas. Arsenic contamination levels in groundwater used by drinking water were more significant in the Dongil(11.3∼63.5 ${\mu}m$/L), Okdong(0.2∼68.9 ${\mu}m$/L) and Gubong(2.0∼101.0${\mu}m$/L) mine areas. Arsenate[As(V), $H_2AsO_4^-$] is more dominant than arsenite[As(III), $H_3AsO_3$] in water system of the most mine areas. The concentration ratios of As(III) to As(total), however, extend to the 95% in stream water of the Okdong mine area and 70∼82% in groundwater of the Okdong and Dongjung mine areas. As a study of seasonal variation in the water system, relatively high levels of As from the dongil mine area were found in April rather than in September. Sequential extraction analysis showed that As was predominantly present as coprecipitated with Fe hydroxides from sediment samples of the Dongjung and Gubong mine(35.9∼40.5%), which indicates its possibility of re-extraction and inducing elevated contamination of As in the reductive condition. In sediments from the Dongil, Okdong and Hwachon mine area, high percentage(55.2∼83.4%) of As sulfide form was found.