• 제목/요약/키워드: Standard Dataset

검색결과 191건 처리시간 0.028초

Global Productivity and Market Structure Implications of the US-China Trade War: A CGE Modeling Approach

  • Jung, Jaewon
    • Journal of Korea Trade
    • /
    • 제24권8호
    • /
    • pp.153-170
    • /
    • 2020
  • Purpose - As the US-China trade war intensifies and lasts long time, there is growing concern about its potential effects on the global economy. In particular, for the countries like Korea that have a large economic dependence on the economy of the two countries, the US-China trade war may have a great repercussion in many ways. The aim of this paper is to investigate the global productivity and market structure implications of the US-China trade war for Korea, as well as for other surrounding countries and regions. Design/methodology - In this paper, we develop a full multi-country/region multi-sector computable general equilibrium (CGE) model of global trade incorporating heterogeneous workers and firms in individual skill levels and used technologies. We then calibrate the model using a global Social Accounting Matrix (SAM) dataset extracted from the recently released GTAP 10 Database, and assess the potential effects of the US-China trade war on the aggregate real productivity and the market structure for Korea, as well as for other surrounding countries and regions. Findings - We show that the US-China trade war may largely affect the aggregate productivity in each sector in each country/region, as well as the global market structure through entry and exit of firms, which results finally in considerable changes in the industrial comparative advantage of each country/region. Though the effects are diverse sector by sector, the results show that Korea may also be affected significantly: concerning the real productivity implications, it is shown that the machinery industry may be affected the most negatively; on the other hand, it is shown that the number of exporting firms may decrease the most in the other transports industry. Originality/value - As the US-China trade war intensifies, many studies have tried to estimate the possible implications, and for this usually the CGE models have largely been used as the standard tool for evaluating the impacts of changes in trade policies. Standard CGE models, however, cannot be used to assess the global productivity and market structure implications due to the symmetric and simplified base assumptions. This paper is the first to analyze and quantify the possible impacts of the US-China trade war on the aggregate productivity and global market structure using a CGE model incorporating endogenous skill-technology assignment of heterogeneous workers and firms.

Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach

  • Marwat, M. Irfan;Khan, Javed Ali;Alshehri, Dr. Mohammad Dahman;Ali, Muhammad Asghar;Hizbullah;Ali, Haider;Assam, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권3호
    • /
    • pp.830-860
    • /
    • 2022
  • [Introduction] Nowadays, many companies are shifting their businesses online due to the growing trend among customers to buy and shop online, as people prefer online purchasing products. [Problem] Users share a vast amount of information about products, making it difficult and challenging for the end-users to make certain decisions. [Motivation] Therefore, we need a mechanism to automatically analyze end-user opinions, thoughts, or feelings in the social media platform about the products that might be useful for the customers to make or change their decisions about buying or purchasing specific products. [Proposed Solution] For this purpose, we proposed an automated SentiDecpective approach, which classifies end-user reviews into negative, positive, and neutral sentiments and identifies deceptive crowd-users rating information in the social media platform to help the user in decision-making. [Methodology] For this purpose, we first collected 11781 end-users comments from the Amazon store and Flipkart web application covering distant products, such as watches, mobile, shoes, clothes, and perfumes. Next, we develop a coding guideline used as a base for the comments annotation process. We then applied the content analysis approach and existing VADER library to annotate the end-user comments in the data set with the identified codes, which results in a labelled data set used as an input to the machine learning classifiers. Finally, we applied the sentiment analysis approach to identify the end-users opinions and overcome the deceptive rating information in the social media platforms by first preprocessing the input data to remove the irrelevant (stop words, special characters, etc.) data from the dataset, employing two standard resampling approaches to balance the data set, i-e, oversampling, and under-sampling, extract different features (TF-IDF and BOW) from the textual data in the data set and then train & test the machine learning algorithms by applying a standard cross-validation approach (KFold and Shuffle Split). [Results/Outcomes] Furthermore, to support our research study, we developed an automated tool that automatically analyzes each customer feedback and displays the collective sentiments of customers about a specific product with the help of a graph, which helps customers to make certain decisions. In a nutshell, our proposed sentiments approach produces good results when identifying the customer sentiments from the online user feedbacks, i-e, obtained an average 94.01% precision, 93.69% recall, and 93.81% F-measure value for classifying positive sentiments.

스마트폰 음성녹음 파일 구조 및 메타데이터의 위변조 기법에 관한 연구 (A Study on Forgery Techniques of Smartphone Voice Recording File Structure and Metadata)

  • 박재완;곽원준;이상현
    • 문화기술의 융합
    • /
    • 제8권6호
    • /
    • pp.807-812
    • /
    • 2022
  • 최근 음성녹음 파일도 법정 증거로 제출되는 수가 늘어남에 따라 위변조를 주장하는 사례도 증가하고 있다. 객관적 근거인 음성녹음 파일 구조 및 메타데이터를 완벽하게 위변조 할 경우에는 정교한 음성녹음 파일의 위변조 검출은 사실상 불가능하다. 위변조된 음성녹음 파일을 가지고 수행된 파일 구조 및 메타데이터 분석이 법정에서 거부되는 것은 쉽지 않다. 본 연구는 음성녹음 파일 구조 및 메타데이터의 위변조가 손쉽게 가능하다는 것을 증명하는 것을 목적으로 한다. 이를 위해 본 연구에서는 음성녹음 파일의 편집 방법의 유형화를 기반으로 정교한 편집이 가능한 '혼합붙여넣기' 기능을 적용할 경우 위변조 검출의 불가능함을 소개했다. 더욱이 실험을 통해 파일 구조 및 메타데이터의 위변조가 가능하다는 것을 증명했다. 따라서 음성녹음 파일이 디지털 증거로 채택됨에 있어서 더 엄격한 증거능력 판단 기준이 필요하다. 본 연구는 법관이 디지털 증거를 채택함에 무결성의 기준에 공헌할 뿐만 아니라 향후 개발될 것으로 예상되는 녹음파일 위변조 검출 인공지능을 위한 데이터셋 구축 방법에 공헌할 것이다.

StyleGAN Encoder를 활용한 표정 이미지 생성에서의 연령 왜곡 감소에 대한 연구 (A study on age distortion reduction in facial expression image generation using StyleGAN Encoder)

  • 이희열;이승호
    • 전기전자학회논문지
    • /
    • 제27권4호
    • /
    • pp.464-471
    • /
    • 2023
  • 본 논문에서는 StyleGAN Encoder를 활용한 표정 이미지 생성에서의 연령 왜곡을 감소시키는 방법을 제안한다. 표정 이미지 생성 과정은 StyleGAN Encoder를 사용하여 얼굴 이미지를 생성하고, SVM을 이용하여 학습된 boundary를 잠재 벡터에 적용하여 표정을 변화시킨다. 그러나 웃는 표정의 boundary를 학습할 때 표정 변화에 따른 연령 왜곡이 발생한다. 웃는 표정에 대한 SVM 학습에서 생성된 smile boundary는 표정 변화로 인해 생긴 주름이 학습 요소로 포함되어 있으며 연령에 대한 특성도 함께 학습된 것으로 판단한다. 이를 해결하기 위해, 제안된 방법에서는 smile boundary와 age boundary의 상관계수를 계산하고, 이를 이용하여 smile boundary에서 age boundary를 상관계수에 비례하여 조절하는 방식을 도입한다. 제안된 방법의 효과를 확인하기 위해 공개된 표준 얼굴 데이터셋인 FFHQ 데이터셋을 사용하고 FID score를 측정하여 실험한 결과는 다음과 같다. Smile 이미지에서는 기존 방법에 비하여, Ground Truth와 제안된 방법으로 생성된 smile 이미지의 FID score가 약 0.46 향상되었다. 또한, Smile 이미지에서 기존 방법에 비하여, StyleGAN Encoder로 생성된 이미지와 제안된 방법으로 생성된 smile 이미지의 FID score가 약 1.031 향상되었다. Non-smile 이미지에서는 기존 방법에 비하여, Ground Truth와 본 논문에서 제안된 방법으로 생성된 non-smile 이미지의 FID score가 약 2.25 향상되었다. 또한, Non-smile 이미지에서 기존 방법에 비하여, StyleGAN Encoder로 생성된 이미지와 제안된 방법으로 생성된 non-smile 이미지의 FID score가 약 약 1.908 향상됨을 확인하였다. 한편, 각 생성된 표정 이미지의 연령을 추정하여 StyleGAN Encoder로 생성된 이미지의 추정된 연령과 MSE를 측정한 결과, 기존방법 대비 제안하는 방법이 smile 이미지에서 약 1.5, non-smile 이미지에서 약 1.63의 성능 향상되어 제안한 방법에 대한 성능의 효율성이 입증되었다.

가스 하이드레이트 부존층의 구조파악을 위한 탄성파 AVO 분석 AVO모델링, AVO역산 (Seismic AVO Analysis, AVO Modeling, AVO Inversion for understanding the gas-hydrate structure)

  • 김건득;정부흥
    • 한국신재생에너지학회:학술대회논문집
    • /
    • 한국신재생에너지학회 2005년도 춘계학술대회
    • /
    • pp.643-646
    • /
    • 2005
  • The gas hydrate exploration using seismic reflection data, the detection of BSR(Bottom Simulating Reflector) on the seismic section is the most important work flow because the BSR have been interpreted as being formed at the base of a gas hydrate zone. Usually, BSR has some dominant qualitative characteristics on seismic section i.e. Wavelet phase reversal compare to sea bottom signal, Parallel layer with sea bottom, Strong amplitude, Masking phenomenon above the BSR, Cross bedding with other geological layer. Even though a BSR can be selected on seismic section with these guidance, it is not enough to conform as being true BSR. Some other available methods for verifying the BSR with reliable analysis quantitatively i.e. Interval velocity analysis, AVO(Amplitude Variation with Offset)analysis etc. Usually, AVO analysis can be divided by three main parts. The first part is AVO analysis, the second is AVO modeling and the last is AVO inversion. AVO analysis is unique method for detecting the free gas zone on seismic section directly. Therefore it can be a kind of useful analysis method for discriminating true BSR, which might arise from an Possion ratio contrast between high velocity layer, partially hydrated sediment and low velocity layer, water saturated gas sediment. During the AVO interpretation, as the AVO response can be changed depend upon the water saturation ratio, it is confused to discriminate the AVO response of gas layer from dry layer. In that case, the AVO modeling is necessary to generate synthetic seismogram comparing with real data. It can be available to make conclusions from correspondence or lack of correspondence between the two seismograms. AVO inversion process is the method for driving a geological model by iterative operation that the result ing synthetic seismogram matches to real data seismogram wi thin some tolerance level. AVO inversion is a topic of current research and for now there is no general consensus on how the process should be done or even whether is valid for standard seismic data. Unfortunately, there are no well log data acquired from gas hydrate exploration area in Korea. Instead of that data, well log data and seismic data acquired from gas sand area located nearby the gas hydrate exploration area is used to AVO analysis, As the results of AVO modeling, type III AVO anomaly confirmed on the gas sand layer. The Castagna's equation constant value for estimating the S-wave velocity are evaluated as A=0.86190, B=-3845.14431 respectively and water saturation ratio is $50\%$. To calculate the reflection coefficient of synthetic seismogram, the Zoeppritz equation is used. For AVO inversion process, the dataset provided by Hampson-Rushell CO. is used.

  • PDF

ADVANTAGES OF USING ARTIFICIAL NEURAL NETWORKS CALIBRATION TECHNIQUES TO NEAR-INFRARED AGRICULTURAL DATA

  • Buchmann, Nils-Bo;Ian A.Cowe
    • 한국근적외분광분석학회:학술대회논문집
    • /
    • 한국근적외분광분석학회 2001년도 NIR-2001
    • /
    • pp.1032-1032
    • /
    • 2001
  • Artificial Neural Network (ANN) calibration techniques have been used commercially for agricultural applications since the mid-nineties. Global models, based on transmission data from 850 to 1050 nm, are used routinely to measure protein and moisture in wheat and barley and also moisture in triticale, rye, and oats. These models are currently used commercially in approx. 15 countries throughout the world. Results concerning earlier European ANN models are being published elsewhere. Some of the findings from that study will be discussed here. ANN models have also been developed for coarsely ground samples of compound feed and feed ingredients, again measured in transmission mode from 850 to 1050 nm. The performance of models for pig- and poultry feed will be discussed briefly. These models were developed from a very large data set (more than 20,000 records), and cover a very broad range of finished products. The prediction curves are linear over the entire range for protein, fat moisture, fibre, and starch (measured only on poultry feed), and accuracy is in line with the performance of smaller models based on Partial Least Squares (PLS). A simple bias adjustment is sufficient for calibration transfer across instruments. Recently, we have investigated the possible use of ANN for a different type of NIR spectrometer, based on reflectance data from 1100 to 2500 nm. In one study, based on data for protein, fat, and moisture measured on unground compound feed samples, dedicated ANN models for specific product classes (cattle feed, pig feed, broiler feed, and layers feed) gave moderately better Standard Errors of Prediction (SEP) compared to modified PLS (MPLS). However, if the four product classes were combined into one general calibration model, the performance of the ANN model deteriorated only slightly compared to the class-specific models, while the SEP values for the MPLS predictions doubled. Brix value in molasses is a measure of sugar content. Even with a huge dataset, PLS models were not sufficiently accurate for commercial use. In contrast an ANN model based on the same data improved the accuracy considerably and straightened out non-linearity in the prediction plot. The work of Mr. David Funk (GIPSA, U. S. Department of Agriculture) who has studied the influence of various types of spectral distortions on ANN- and PLS models, thereby providing comparative information on the robustness of these models towards instrument differences, will be discussed. This study was based on data from different classes of North American wheat measured in transmission from 850 to 1050 nm. The distortions studied included the effect of absorbance offset pathlength variation, presence of stray light bandwidth, and wavelength stretch and offset (either individually or combined). It was shown that a global ANN model was much less sensitive to most perturbations than class-specific GIPSA PLS calibrations. It is concluded that ANN models based on large data sets offer substantial advantages over PLS models with respect to accuracy, range of materials that can be handled by a single calibration, stability, transferability, and sensitivity to perturbations.

  • PDF

의학 저널에서 사용되는 구조적 초록의 소표제들에 관한 분석 (Abstracts in Medical Science Journals: An Analysis of Subheadings in Structured Abstracts)

  • Kim, Eungi
    • 한국도서관정보학회지
    • /
    • 제47권1호
    • /
    • pp.199-216
    • /
    • 2016
  • 본 논문은 의학분야 논문가운데 구조적 초록만을 선별하여 소표제들을 분석함으로서 문헌정보학 관점에서 의학분야의 논문초록양식에 소표제의 의미를 논의하였다. 다양한 의학분야 가운데 암, 윤리, 유전학, 감염성 질환, 신경과, 소아과, 면역학, 정신의학 및 심장학의 9개 세부분야를 선택하였고, PubMed 데이터베이스에서 샘플 데이터로 출판된 논문초록 정보를 추출하였다. 이러한 데이터는 최근 초록인 2010년부터 2015까지 5년 동안 출판된 초록들로 제한하였다. 연구는 추출된 샘플들의 양상과 구조적 형식에서 사용된 소표제들의 변종과 변종의 빈도 수 등을 분석하였다. 요약한 연구결과는 다음과 같다. 1) 대다수의 세부 의학분야에서 출판되는 논문들은 구조적 초록이 아닌 비구조화 초록을 주로 사용하고 있다는 것이 드러났다. 2) 의학분야의 논문에서는 소표제 항목을 평균적으로 4.1을 사용하는 것으로 나타났다. 3) 일반적으로 가장 자주 사용되는 부제는 OBJECTIVES(목적), METHODS(방법), RESULTS(결과), CONCLUSIONS(결론)이였다. 특히 이 연구에서 제기된 문제점들과 보고된 소표제 분석결과가 의료과학 저널 편집자와 의학 및 문헌정보학자들에게 유용한 정보가 될 것이다.

한국인의 시리얼 섭취실태와 우유 및 칼슘섭취와의 관련성 연구 - 2001년도 국민건강영양조사 자료를 이용하여 - (Ready-to-eat Cereal Consumption Enhances Milk and Calcium Intake in Korean Population from 2001 Korean National Health and Nutrition Survey)

  • 정진은
    • Journal of Nutrition and Health
    • /
    • 제39권8호
    • /
    • pp.786-794
    • /
    • 2006
  • The purpose of this study was to establish an association between the consumption of ready-to-eat cereal (RTEC), milk, and calcium within the context of the most current population dietary practice in Korea. Inadequate calcium intake among Korean children and adults is one of the important public health concern. Milk is one of the best calcium sources because or its bioavailability, and RTEC is one or the foods commonly consumed with milk. The most recent Korean National Health and Nutrition Survey, 2001 dataset was used as the source of data for this research. Subjects excluding pregnant women, were categorized according to gender and age ($1{\sim}5,\;6{\sim}11,\;12{\sim}19,\;20{\sim}49,\;50+$ years) and then by consumption of RTEC and milk. SAS and SUDAAN were used for statistical analyses. Sample weighted means, standard errors, and population percentages were calculated, and multiple regression model with adjustment for covariates were used to determine the predictability of total daily calcium intake from inclusion of RTEC and milk compared to the meal without RTEC and milk. RTEC was consumed by 2.4% or Korean people. Average calcium intake was 17 times greater when RTEC was consumed with milk than when RTEC was consumed without milk. Respondents who consumed RTEC with milk had significantly higher mean daily calcium and other nutrient intakes than respondents who consumed neither. in the multiple regression analysis, milk consumption with or without RTEC predicted total daily calcium intake after adjusting for age, income, and alcohol consumption (p<0.0001). The percentage of respondents below the estimated average requirement (EAR) level for calcium was lower for RTEC consumers than for RTEC non-consumers in all age-gender groups, especially significant differences were in children aged $1{\sim}5$, boys and girls aged $12{\sim}19$, men aged $20{\sim}49$, and women older than 50 years of age. RTEC consumption was not associated with intake in excess of the tolerable upper intake level (UL) for calcium. In conclusion, RTEC consumption was positively associated with both milk and calcium intakes in all age and gender groups in Korean population.

SWAT: 분산 인-메모리 시스템 기반 SWRL과 ATMS의 효율적 결합 연구 (SWAT: A Study on the Efficient Integration of SWRL and ATMS based on a Distributed In-Memory System)

  • 전명중;이완곤;바트셀렘;박현규;박영택
    • 정보과학회 논문지
    • /
    • 제45권2호
    • /
    • pp.113-125
    • /
    • 2018
  • 최근 빅데이터의 시대가 도래하여 다양한 분야로부터 다량의 지식을 얻을 수 있다. 수집된 지식은 정형화된 형태의 지식으로 가공하여 표현되며, 그 중 W3C의 온톨로지 표준 언어인 OWL이 대표적인 정형화 표현 형식이다. 이렇게 표현된 대용량의 온톨로지로부터 내재된 정보를 도출하기 위해 다양한 방법의 심볼릭 추론(Symbolic Reasoning) 연구가 활발하게 진행되고 있다. 그러나 대부분의 추론 연구들은 서술논리(Description Logic)표현 기반의 제한적인 규칙표현을 지원하며 실생활 기반의 서비스를 구축하기에는 많은 제약이 따른다. 또한 잘못된 지식으로부터 도출된 결과는 규칙들 사이의 종속관계에 따라 연쇄적으로 잘못된 지식이 생산될 수 있기 때문에 이러한 잘못된 지식에 대한 처리를 위한 지식관리가 필요하다. 따라서 본 논문에서는 해당 문제를 해결하기 위해 SWRL(Semantic Web Rule Language) 기반의 추론과 ATMS(Assumption-based Truth Maintenance System)간의 결합을 통해 새롭게 도출된 지식에 대한 관리를 할 수 있는 SWAT(SWRL + ATMS) 시스템을 제안한다. 또한 이 시스템은 대용량 데이터를 처리하기 위해 분산 인-메모리 프레임워크 기반의 SWRL추론과 ATMS를 병합 구축하였으며 이를 바탕으로 웹 형태의 ATMS 모니터링 시스템을 통하여 사용자가 손쉽게 잘못된 지식을 검색 및 수정할 수 있도록 한다. 본 논문에서 제안하는 방법에 대한 평가를 위해 LUBM(Lehigh University Benchmark)데이터 셋을 사용하였으며, 대용량 데이터에 대한 SWRL 추론과 잘못 추론된 정보에 대한 삭제를 통해 효율적인 추론과 관리가 가능한 결합 방법임을 증명한다.

Temporal Change in Radiological Environments on Land after the Fukushima Daiichi Nuclear Power Plant Accident

  • Saito, Kimiaki;Mikami, Satoshi;Andoh, Masaki;Matsuda, Norihiro;Kinase, Sakae;Tsuda, Shuichi;Sato, Tetsuro;Seki, Akiyuki;Sanada, Yukihisa;Wainwright-Murakami, Haruko;Yoshimura, Kazuya;Takemiya, Hiroshi;Takahashi, Junko;Kato, Hiroaki;Onda, Yuichi
    • Journal of Radiation Protection and Research
    • /
    • 제44권4호
    • /
    • pp.128-148
    • /
    • 2019
  • Massive environmental monitoring has been conducted continuously since the Fukushima Daiichi Nuclear Power accident in March of 2011 by different monitoring methods that have different features together with migration studies of radiocesium in diverse environments. These results have clarified the characteristics of radiological environments and their temporal change around the Fukushima site. At three months after the accident, multiple radionuclides including radiostrontium and plutonium were detected in many locations; and it was confirmed that radiocesium was most important from the viewpoint of long-term exposure. Radiation levels around the Fukushima site have decreased greatly over time. The decreasing trend was found to change variously according to local conditions. The air dose rates in environments related to human living have decreased faster than expected from radioactive decay by a factor of 2-3 on average; those in pure forest have decreased more closely to physical decay. The main causes of air dose rate reduction were judged to be radioactive decay, movement of radiocesium in vertical and horizontal directions, and decontamination. Land-use categories and human activities have significantly affected the reduction tendency. Difference in the air dose rate reduction trends can be explained qualitatively according to the knowledge obtained in radiocesium migration studies; whereas, the quantitative explanation for individual sites is an important future challenge. The ecological half-lives of air dose rates have been evaluated by several researchers, and a short-term half-life within 1 year was commonly observed in the studies. An empirical model for predicting air dose rate distribution was developed based on statistical analysis of an extensive car-borne survey dataset, which enabled the prediction with confidence intervals. Different types of contamination maps were integrated to better quantify the spatial data. The obtained data were used for extended studies such as for identifying the main reactor that caused the contamination of arbitrary regions and developing standard procedures for environmental measurement and sampling. Annual external exposure doses for residents who intended to return to their homes were estimated as within a few millisieverts. Different forms of environmental data and knowledge have been provided for wide spectrum of people. Diverse aspects of lessons learned from the Fukushima accident, including practical ones, must be passed on to future generations.