• Title/Summary/Keyword: Data Transform

Search Result 2,230, Processing Time 0.028 seconds

E-Governance Practices in Developing Countries. Its Benefits and Challenges. -The Case of Pakistan- (개발도상국의 전자정부 시행에 따른 장점과 문제 -파키스탄 중심으로-)

  • Aftab, Muhammad
    • Industry Promotion Research
    • /
    • v.4 no.1
    • /
    • pp.79-86
    • /
    • 2019
  • The quality of service delivery by governments to their citizens is a subject of concern in the contemporary society. E-governance is a critical aspect that is transforming government operation and service delivery to citizens and other bodies through application of information technology. This research explores the state of e-governing focusing on nations that are developing with special attention to Pakistan. The difficulties and benefits encountered are identified. The results are vital for implementers of e-governing systems in these countries. The methodology applied entails a secondary research that involves analysis and synthesis of literature relating the research topic. The results reveal that Pakistan has made incredible steps in setting up e-governance systems with growth in internet use and access of data from a digital platform. The findings reveal that e-government is associated with multiple benefits including enhanced quality of services, cost efficiency in service provision, enhance transparency and elimination of corruption, provide the basis for eradication of poverty, boost economic stability of a country, and provide room for direct democracy. The research also found that developing countries experience challenges in form of financial constraints, poor ICT infrastructure, illiteracy on e-government, political consensus constraints, legal obstacles, social and cultural constraints. E-government has the capability to transform the quality of governance provided by governments, and policymakers and implementers should address the constraints that act as a hindrance to its implementation.

Changes in Science Teaching Revealed through the Life History of a Highly Experienced Elementary School Teacher (고경력 초등 교사의 생애사를 통해 본 과학 수업의 변화)

  • Hong, Jiyeong;Oh, Phil Seok
    • Journal of The Korean Association For Science Education
    • /
    • v.41 no.3
    • /
    • pp.251-266
    • /
    • 2021
  • The purpose of this study is to explore the life history of a highly experienced elementary school teacher, Mr. Park and to understand how his science teaching practices have changed throughout his educational career. Using life history interviews with Mr. Park and his life history materials as data sources, a life story of the participant teacher was constructed. The ways teaching changed in the inter-relationship between external and personal factors were then analyzed according to three temporal periods of teaching changes. It was revealed that in the first period of teaching change, Mr. Park changed his science teaching practices by actively accepting influences from external factors, which in turn enabled him to start developing his expertise in student-centered science instruction. By contrast, in the second period of teaching change, Mr. Park strengthened his own ways of teaching while responding critically to the trends of educational change by external factors. In the third and final period of teaching change, Mr. Park made changes in teaching practices by taking advantage of his personal factors to convert influences of external factors into positive ones. Based on these findings, it was suggested that educational policies for teaching changes should be provided in consideration of teachers' life cycles and their expertise and that teachers should make efforts to reflect on the factors for educational change and transform them into positive ones.

Korean Morphological Analysis Method Based on BERT-Fused Transformer Model (BERT-Fused Transformer 모델에 기반한 한국어 형태소 분석 기법)

  • Lee, Changjae;Ra, Dongyul
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.4
    • /
    • pp.169-178
    • /
    • 2022
  • Morphemes are most primitive units in a language that lose their original meaning when segmented into smaller parts. In Korean, a sentence is a sequence of eojeols (words) separated by spaces. Each eojeol comprises one or more morphemes. Korean morphological analysis (KMA) is to divide eojeols in a given Korean sentence into morpheme units. It also includes assigning appropriate part-of-speech(POS) tags to the resulting morphemes. KMA is one of the most important tasks in Korean natural language processing (NLP). Improving the performance of KMA is closely related to increasing performance of Korean NLP tasks. Recent research on KMA has begun to adopt the approach of machine translation (MT) models. MT is to convert a sequence (sentence) of units of one domain into a sequence (sentence) of units of another domain. Neural machine translation (NMT) stands for the approaches of MT that exploit neural network models. From a perspective of MT, KMA is to transform an input sequence of units belonging to the eojeol domain into a sequence of units in the morpheme domain. In this paper, we propose a deep learning model for KMA. The backbone of our model is based on the BERT-fused model which was shown to achieve high performance on NMT. The BERT-fused model utilizes Transformer, a representative model employed by NMT, and BERT which is a language representation model that has enabled a significant advance in NLP. The experimental results show that our model achieves 98.24 F1-Score.

Exploiting Chunking for Dependency Parsing in Korean (한국어에서 의존 구문분석을 위한 구묶음의 활용)

  • Namgoong, Young;Kim, Jae-Hoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.7
    • /
    • pp.291-298
    • /
    • 2022
  • In this paper, we present a method for dependency parsing with chunking in Korean. Dependency parsing is a task of determining a governor of every word in a sentence. In general, we used to determine the syntactic governor in Korean and should transform the syntactic structure into semantic structure for further processing like semantic analysis in natural language processing. There is a notorious problem to determine whether syntactic or semantic governor. For example, the syntactic governor of the word "먹고 (eat)" in the sentence "밥을 먹고 싶다 (would like to eat)" is "싶다 (would like to)", which is an auxiliary verb and therefore can not be a semantic governor. In order to mitigate this somewhat, we propose a Korean dependency parsing after chunking, which is a process of segmenting a sentence into constituents. A constituent is a word or a group of words that function as a single unit within a dependency structure and is called a chunk in this paper. Compared to traditional dependency parsing, there are some advantage of the proposed method: (1) The number of input units in parsing can be reduced and then the parsing speed could be faster. (2) The effectiveness of parsing can be improved by considering the relation between two head words in chunks. Through experiments for Sejong dependency corpus, we have shown that the USA and LAS of the proposed method are 86.48% and 84.56%, respectively and the number of input units is reduced by about 22%p.

Novel two-stage hybrid paradigm combining data pre-processing approaches to predict biochemical oxygen demand concentration (생물화학적 산소요구량 농도예측을 위하여 데이터 전처리 접근법을 결합한 새로운 이단계 하이브리드 패러다임)

  • Kim, Sungwon;Seo, Youngmin;Zakhrouf, Mousaab;Malik, Anurag
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.spc1
    • /
    • pp.1037-1051
    • /
    • 2021
  • Biochemical oxygen demand (BOD) concentration, one of important water quality indicators, is treated as the measuring item for the ecological chapter in lakes and rivers. This investigation employed novel two-stage hybrid paradigm (i.e., wavelet-based gated recurrent unit, wavelet-based generalized regression neural networks, and wavelet-based random forests) to predict BOD concentration in the Dosan and Hwangji stations, South Korea. These models were assessed with the corresponding independent models (i.e., gated recurrent unit, generalized regression neural networks, and random forests). Diverse water quality and quantity indicators were implemented for developing independent and two-stage hybrid models based on several input combinations (i.e., Divisions 1-5). The addressed models were evaluated using three statistical indices including the root mean square error (RMSE), Nash-Sutcliffe efficiency (NSE), and correlation coefficient (CC). It can be found from results that the two-stage hybrid models cannot always enhance the predictive precision of independent models confidently. Results showed that the DWT-RF5 (RMSE = 0.108 mg/L) model provided more accurate prediction of BOD concentration compared to other optimal models in Dosan station, and the DWT-GRNN4 (RMSE = 0.132 mg/L) model was the best for predicting BOD concentration in Hwangji station, South Korea.

Rendering Quality Improvement Method based on Depth and Inverse Warping (깊이정보와 역변환 기반의 포인트 클라우드 렌더링 품질 향상 방법)

  • Lee, Heejea;Yun, Junyoung;Park, Jong-Il
    • Journal of Broadcast Engineering
    • /
    • v.26 no.6
    • /
    • pp.714-724
    • /
    • 2021
  • The point cloud content is immersive content recorded by acquiring points and colors corresponding to the real environment and objects having three-dimensional location information. When a point cloud content consisting of three-dimensional points having position and color information is enlarged and rendered, the gap between the points widens and an empty hole occurs. In this paper, we propose a method for improving the quality of point cloud contents through inverse transformation-based interpolation using depth information for holes by finding holes that occur due to the gap between points when expanding the point cloud. The points on the back are rendered between the holes created by the gap between the points, acting as a hindrance to applying the interpolation method. To solve this, remove the points corresponding to the back side of the point cloud. Next, a depth map at the point in time when an empty hole is generated is extracted. Finally, inverse transform is performed to extract pixels from the original data. As a result of rendering content by the proposed method, the rendering quality improved by 1.2 dB in terms of average PSNR compared to the conventional method of increasing the size to fill the blank area.

Non-Destructive Scientific Analysis of the Gold Fabric Excavated of Cheongsong Shim's Grave (청송심씨 묘에서 출토된 금직물의 비파괴 과학적 분석)

  • Lee, Hwang-Jo;Wi, Koang-Chul
    • Journal of Conservation Science
    • /
    • v.38 no.3
    • /
    • pp.243-253
    • /
    • 2022
  • Using non-destructive analytical methods, we identified the material characteristics of two gold fabric artifacts excavated from the Cheongsong Sim clan (Bugeum Wonsam, Jikgeum Chima), including the artifact condition, fiber type, surface contamination, and metallic threads. We found that the artifacts were buried and had turned brown; thus, we were unable to determine their original color. The fiber type was determined to be silk from cocoons, based on scanning electron microscopy, Fourier transform infrared (FT-IR) analyses of Amide I, II, III, and IV peaks, and color reactions Further, the FT-IR and X-ray fluorescence (XRF) analyses identified the white and black stains as natural resin hydrolyzed substances, such as lipids and proteins, that occurred as microbial decomposition due to body decay. Finally, the XRF analyses identified the thin gold layer of the metallic yarn as gold (Au). According to the FT-IR data and the color reaction to the metallic yarn medium, the adhesive component of the medium was a product of-Amides I, II, III, and 3000 cm-1 within Amides A and B (an animal type), respectively. Thus, the medium was identified as Hanji (Korean paper), which is made from domestically produced Broussonetia kazinoki fibers.

Malicious Traffic Classification Using Mitre ATT&CK and Machine Learning Based on UNSW-NB15 Dataset (마이터 어택과 머신러닝을 이용한 UNSW-NB15 데이터셋 기반 유해 트래픽 분류)

  • Yoon, Dong Hyun;Koo, Ja Hwan;Won, Dong Ho
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.2
    • /
    • pp.99-110
    • /
    • 2023
  • This study proposed a classification of malicious network traffic using the cyber threat framework(Mitre ATT&CK) and machine learning to solve the real-time traffic detection problems faced by current security monitoring systems. We applied a network traffic dataset called UNSW-NB15 to the Mitre ATT&CK framework to transform the label and generate the final dataset through rare class processing. After learning several boosting-based ensemble models using the generated final dataset, we demonstrated how these ensemble models classify network traffic using various performance metrics. Based on the F-1 score, we showed that XGBoost with no rare class processing is the best in the multi-class traffic environment. We recognized that machine learning ensemble models through Mitre ATT&CK label conversion and oversampling processing have differences over existing studies, but have limitations due to (1) the inability to match perfectly when converting between existing datasets and Mitre ATT&CK labels and (2) the presence of excessive sparse classes. Nevertheless, Catboost with B-SMOTE achieved the classification accuracy of 0.9526, which is expected to be able to automatically detect normal/abnormal network traffic.

Evaluating Physical Characteristics of Raindrop in Anseong, Gyeonggi Province (강우입자의 물리적 특성평가: 경기도 안성시 지역을 사례로)

  • KIM, Jin Kwan;YANG, Dong Yoon;KIM, Min Seok
    • Journal of The Geomorphological Association of Korea
    • /
    • v.17 no.1
    • /
    • pp.49-57
    • /
    • 2010
  • To evaluate physical characteristics of open rainfall in Korea, terminal velocity of raindrop and drop size distributions (DSD) were continuously measured using by laser-optical disdrometer around Gosam reservoir, Anseong-si, Gyeonggi-do during three rainfall events from 2008 to 2009. The relationships between kinetic energies (KE, Jm-2mm-1; KER, Jm-2h-1) and rainfall intensity were obtained, respectively. Moreover, we compared the rainfall intensity from a disdrometer with the rainfall intensity from a tipping bucket raingauge to transform the kinetic energy of rainfall using the data from a tipping bucket raingauge. Therefore, the established relationships between kinetic energies (KE and KER) and rainfall intensity could be a useful model to consider the kinetic energy of raindrop using the rainfall intensity below 40mmh-1 of max 5-min rainfall intensity in the middle of South Korea. However, to better examine the relationship between kinetic energy and rainfall intensity, further measurement will be required.

Prediction of Germination of Korean Red Pine (Pinus densiflora) Seed using FT NIR Spectroscopy and Binary Classification Machine Learning Methods (FT NIR 분광법 및 이진분류 머신러닝 방법을 이용한 소나무 종자 발아 예측)

  • Yong-Yul Kim;Ja-Jung Ku;Da-Eun Gu;Sim-Hee Han;Kyu-Suk Kang
    • Journal of Korean Society of Forest Science
    • /
    • v.112 no.2
    • /
    • pp.145-156
    • /
    • 2023
  • In this study, Fourier-transform near-infrared (FT-NIR) spectra of Korean red pine seeds stored at -18℃ and 4℃ for 18 years were analyzed. To develop seed-germination prediction models, the performance of seven machine learning methods, namely XGBoost, Boosted Tree, Bootstrap Forest, Neural Networks, Decision Tree, Support Vector Machine, PLS-DA, were compared. The predictive performance, assessed by accuracy, misclassification, and area under the curve (0.9722, 0.0278, and 0.9735 for XGBoost, and 0.9653, 0.0347, and 0.9647 for Boosted Tree), was better for the XGBoost and decision tree models when compared with other models. The 54 wave-number variables of the two models were of high relative importance in seed-germination prediction and were grouped into six spectral ranges (811~1,088 nm, 1,137~1,273 nm, 1,336~1,453 nm, 1,666~1,671 nm, 1,879~2,045 nm, and 2,058~2,409 nm) for aromatic amino acids, cellulose, lignin, starch, fatty acids, and moisture, respectively. Use of the NIR spectral data and two machine learning models developed in this study gave >96% accuracy for the prediction of pine-seed germination after long-term storage, indicating this approach could be useful for non-destructive viability testing of stored seed genetic resources.