• Title/Summary/Keyword: position prediction

Search Result 537, Processing Time 0.023 seconds

Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

  • Park, Hee Jin;Jang, Kyoung Ye;Lee, Youn Ho;Kim, Woo Je;Kang, Pil Sung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.11
    • /
    • pp.509-520
    • /
    • 2015
  • College Scholastic Ability Test(CSAT) is a primary test to evaluate the study achievement of high-school students and used by most universities for admission decision in South Korea. Because its level of difficulty is a significant issue to both students and universities, the government makes a huge effort to have a consistent difficulty level every year. However, the actual levels of difficulty have significantly fluctuated, which causes many problems with university admission. In this paper, we build two types of data-driven prediction models to predict correct answer rate and to identify significant factors for CSAT English test through accumulated test data of CSAT, unlike traditional methods depending on experts' judgments. Initially, we derive candidate question-specific factors that can influence the correct answer rate, such as the position, EBS-relation, readability, from the annual CSAT practices and CSAT for 10 years. In addition, we drive context-specific factors by employing topic modeling which identify the underlying topics over the text. Then, the correct answer rate is predicted by multiple linear regression and level of difficulty is predicted by classification tree. The experimental results show that 90% of accuracy can be achieved by the level of difficulty (difficult/easy) classification model, whereas the error rate for correct answer rate is below 16%. Points and problem category are found to be critical to predict the correct answer rate. In addition, the correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of expected correct answer rate for both question-level and entire test-level, which will help CSAT examiners to control the level of difficulties.

A Study on Predicting the Logistics Demand of Inland Ports on the Yangtze River (장강 내수로 항만의 물류 수요 예측에 관한 연구)

  • Zhen Wu;Hyun-Chung Kim
    • Korea Trade Review
    • /
    • v.48 no.3
    • /
    • pp.217-242
    • /
    • 2023
  • This study aims to analyze the factors influencing the logistics demand of inland ports along the Yangtze River and predict future port logistics demand based on these factors. The logistics demand prediction using system dynamics techniques was conducted for a total of six ports, including Chongqing and Yibin ports in the upper reaches, Jingzhou and Wuhan ports in the middle reaches, and Nanjing and Suzhou ports in the lower reaches of the Yangtze River. The logistics demand for all ports showed an increasing trend in the mid-term prediction until 2026. The logistics demand of Chongqing port was mainly influenced by the scale of the hinterland economy, while Yibin port appeared to heavily rely on the level of port automation. In the case of the upper and middle reach ports, logistics demand increased as the energy consumption of the hinterland increased and the air pollution situation worsened. The logistics demand of the middle reach ports was greatly influenced by the hinterland infrastructure, while the lower reach ports were sensitive to changes in the urban construction area. According to the sensitivity analysis, the logistics demand of ports relying on large cities was relatively stable against the increase and decrease of influential factors, while ports with smaller hinterland city scales reacted sensitively to changes in influential factors. Therefore, a strategy should be established to strengthen policy support for Chongqing port as the core port of the upper Yangtze River and have surrounding ports play a supporting role for Chongqing port. The upper reach ports need to play a supporting role for Chongqing port and consider measures to enhance connections with middle and lower reach ports and promote the port industry. The development strategy for inland ports along the Yangtze River suggests the establishment of direct routes and expansion of the transportation network for South Korean ports and stakeholders. It can suggest expanding the hinterland network and building an efficient transportation system linked with the logistics hub. Through cooperation, logistics efficiency can be enhanced in both regions, which will contribute to strengthening the international position and competitiveness of each port.

A Characterization of Oil Sand Reservoir and Selections of Optimal SAGD Locations Based on Stochastic Geostatistical Predictions (지구통계 기법을 이용한 오일샌드 저류층 해석 및 스팀주입중력법을 이용한 비투멘 회수 적지 선정 사전 연구)

  • Jeong, Jina;Park, Eungyu
    • Economic and Environmental Geology
    • /
    • v.46 no.4
    • /
    • pp.313-327
    • /
    • 2013
  • In the study, three-dimensional geostatistical simulations on McMurray Formation which is the largest oil sand reservoir in Athabasca area, Canada were performed, and the optimal site for steam assisted gravity drainage (SAGD) was selected based on the predictions. In the selection, the factors related to the vertical extendibility of steam chamber were considered as the criteria for an optimal site. For the predictions, 110 borehole data acquired from the study area were analyzed in the Markovian transition probability (TP) framework and three-dimensional distributions of the composing media were predicted stochastically through an existing TP based geostatistical model. The potential of a specific medium at a position within the prediction domain was estimated from the ensemble probability based on the multiple realizations. From the ensemble map, the cumulative thickness of the permeable media (i.e. Breccia and Sand) was analyzed and the locations with the highest potential for SAGD applications were delineated. As a supportive criterion for an optimal SAGD site, mean vertical extension of a unit permeable media was also delineated through transition rate based computations. The mean vertical extension of a permeable media show rough agreement with the cumulative thickness in their general distribution. However, the distributions show distinctive disagreement at a few locations where the cumulative thickness was higher due to highly alternating juxtaposition of the permeable and the less permeable media. This observation implies that the cumulative thickness alone may not be a sufficient criterion for an optimal SAGD site and the mean vertical extension of the permeable media needs to be jointly considered for the sound selections.

The relationship between the morphology of mandibular symphysis and the craniofacial morphology in class III malocclusion (III급 부정교합자의 이부형태와 두개안면형태의 연관성)

  • KIM, Sang-Doo;KWON, Oh-Won;SUNG, Jae-Hyun
    • The korean journal of orthodontics
    • /
    • v.26 no.5 s.58
    • /
    • pp.509-522
    • /
    • 1996
  • By studying the relationship between the morphology of mandibular symphysis and craniofacial morphology in classIII malocclusion, this study aims at deciding whether the morphogy of mandibular symphysis can be used as a predictor on the growth of mandible. The materials used for this study were the cephalometric radiographs of male class III malocclusion. The subjected age groups were 10-12(G1 group) and 20 and above(G2 group): 50 were selected from each group. Each group was again divided, according to the ratio of symphysis, into Large(L), Average(A), and Small(S). The results of this study were summarized as follows: 1. In average the ratio of symphysis, G2 group showed significantly bigger than G1 group(p<0.05) 2. In both G1 and G2 groups, the ratio of symphysis had no relationship with the measurements on the cranial base and the maxilla(p>0.05). 3. In both G1 and G2 groups, there was not distinct difference in the antero-posterior positions among L, A, S subgroups. 4. L and A subgroups showed significantly larger than S subgroup in lower gonial angle and chin angle in G1 group (p<0.05). 5. In the measurements on the vortical relation of the face, anterior total face height(ATFH) and anterior lower face height(ALFH) of L subgroup were significantly larger than that of S subgroup in G1 group(p<0.05) and also mandible showed a tendency to grow downward vertically. 6. In the measurements on the tooth position and inclination, L subgroup showed as compared with S subgroup a tendency of extrusion of maxillary and mandibular teeth in G1 group, but G2 group showed such tendency only in mandibular teeth. 7. In the measurements on the abnormal growth prediction by Schulhof, in G1, there was no significant difference among L, A, S sugroups. 8. In the correlative analysis of the ratio of symphysis and other measurements, G1 group showed significant correlationships in chin angle, PP/MP angie, ANS-Me and other, while G2 group showed the same only in MP-LIT and MP-LMMC(p<0.05, p<0.01). In summarizing the above, in the G1 group, consisting of young males, no difference was noted in horizontal relation between L and S subgroups; in vertical relation, L subgroup showed a stronger tendency of downward growth of mandible than S subgroup. In adult male G2 group, however, no distinct morphological difference of craniofacial complex by the ratio of symphysis.

  • PDF

Landslide Susceptibility Mapping Using Deep Neural Network and Convolutional Neural Network (Deep Neural Network와 Convolutional Neural Network 모델을 이용한 산사태 취약성 매핑)

  • Gong, Sung-Hyun;Baek, Won-Kyung;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_2
    • /
    • pp.1723-1735
    • /
    • 2022
  • Landslides are one of the most prevalent natural disasters, threating both humans and property. Also landslides can cause damage at the national level, so effective prediction and prevention are essential. Research to produce a landslide susceptibility map with high accuracy is steadily being conducted, and various models have been applied to landslide susceptibility analysis. Pixel-based machine learning models such as frequency ratio models, logistic regression models, ensembles models, and Artificial Neural Networks have been mainly applied. Recent studies have shown that the kernel-based convolutional neural network (CNN) technique is effective and that the spatial characteristics of input data have a significant effect on the accuracy of landslide susceptibility mapping. For this reason, the purpose of this study is to analyze landslide vulnerability using a pixel-based deep neural network model and a patch-based convolutional neural network model. The research area was set up in Gangwon-do, including Inje, Gangneung, and Pyeongchang, where landslides occurred frequently and damaged. Landslide-related factors include slope, curvature, stream power index (SPI), topographic wetness index (TWI), topographic position index (TPI), timber diameter, timber age, lithology, land use, soil depth, soil parent material, lineament density, fault density, normalized difference vegetation index (NDVI) and normalized difference water index (NDWI) were used. Landslide-related factors were built into a spatial database through data preprocessing, and landslide susceptibility map was predicted using deep neural network (DNN) and CNN models. The model and landslide susceptibility map were verified through average precision (AP) and root mean square errors (RMSE), and as a result of the verification, the patch-based CNN model showed 3.4% improved performance compared to the pixel-based DNN model. The results of this study can be used to predict landslides and are expected to serve as a scientific basis for establishing land use policies and landslide management policies.

Dynamic Equilibrium Position Prediction Model for the Confluence Area of Nakdong River (낙동강 합류부 삼각주의 동적 평형 위치 예측 모델: 감천-낙동강 합류점 중심 분석 연구)

  • Minsik Kim;Haein Shin;Wook-Hyun Nahm;Wonsuck Kim
    • Economic and Environmental Geology
    • /
    • v.56 no.4
    • /
    • pp.435-445
    • /
    • 2023
  • A delta is a depositional landform that is formed when sediment transported by a river is deposited in a relatively low-energy environment, such as a lake, sea, or a main channel. Among these, a delta formed at the confluence of rivers has a great importance in river management and research because it has a significant impact on the hydraulic and sedimentological characteristics of the river. Recently, the equilibrium state of the confluence area has been disrupted by large-scale dredging and construction of levees in the Nakdong River. However, due to the natural recovery of the river, the confluence area is returning to its pre-dredging natural state through ongoing sedimentation. The time-series data show that the confluence delta has been steadily growing since the dredging, but once it reaches a certain size, it repeats growth and retreat, and the overall size does not change significantly. In this study, we developed a model to explain the sedimentation-erosion processes in the confluence area based on the assumption that the confluence delta reaches a dynamic equilibrium. The model is based on two fundamental principles: sedimentation due to supply from the tributary and erosion due to the main channel. The erosion coefficient that represents the Nakdong River confluence areas, was obtained using data from the tributaries of the Nakdong River. Sensitivity analyses were conducted using the developed model to understand how the confluence delta responds to changes in the sediment and water discharges of the tributary and the main channel, respectively. We then used annual average discharge of the Nakdong River's tributaries to predict the dynamic equilibrium positions of the confluence deltas. Finally, we conducted a simulation experiment on the development of the Gamcheon-Nakdong River delta using recorded daily discharge. The results showed that even though it is a simple model, it accurately predicted the dynamic equilibrium positions of the confluence deltas in the Nakdong River, including the areas where the delta had not formed, and those where the delta had already formed and predicted the trend of the response of the Gamcheon-Nakdong River delta. However, the actual retreat in the Gamcheon-Nakdong River delta was not captured fully due to errors and limitations in the simplification process. The insights through this study provide basic information on the sediment supply of the Nakdong River through the confluence areas, which can be implemented as a basic model for river maintenance and management.

Visualizing the Results of Opinion Mining from Social Media Contents: Case Study of a Noodle Company (소셜미디어 콘텐츠의 오피니언 마이닝결과 시각화: N라면 사례 분석 연구)

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.89-105
    • /
    • 2014
  • After emergence of Internet, social media with highly interactive Web 2.0 applications has provided very user friendly means for consumers and companies to communicate with each other. Users have routinely published contents involving their opinions and interests in social media such as blogs, forums, chatting rooms, and discussion boards, and the contents are released real-time in the Internet. For that reason, many researchers and marketers regard social media contents as the source of information for business analytics to develop business insights, and many studies have reported results on mining business intelligence from Social media content. In particular, opinion mining and sentiment analysis, as a technique to extract, classify, understand, and assess the opinions implicit in text contents, are frequently applied into social media content analysis because it emphasizes determining sentiment polarity and extracting authors' opinions. A number of frameworks, methods, techniques and tools have been presented by these researchers. However, we have found some weaknesses from their methods which are often technically complicated and are not sufficiently user-friendly for helping business decisions and planning. In this study, we attempted to formulate a more comprehensive and practical approach to conduct opinion mining with visual deliverables. First, we described the entire cycle of practical opinion mining using Social media content from the initial data gathering stage to the final presentation session. Our proposed approach to opinion mining consists of four phases: collecting, qualifying, analyzing, and visualizing. In the first phase, analysts have to choose target social media. Each target media requires different ways for analysts to gain access. There are open-API, searching tools, DB2DB interface, purchasing contents, and so son. Second phase is pre-processing to generate useful materials for meaningful analysis. If we do not remove garbage data, results of social media analysis will not provide meaningful and useful business insights. To clean social media data, natural language processing techniques should be applied. The next step is the opinion mining phase where the cleansed social media content set is to be analyzed. The qualified data set includes not only user-generated contents but also content identification information such as creation date, author name, user id, content id, hit counts, review or reply, favorite, etc. Depending on the purpose of the analysis, researchers or data analysts can select a suitable mining tool. Topic extraction and buzz analysis are usually related to market trends analysis, while sentiment analysis is utilized to conduct reputation analysis. There are also various applications, such as stock prediction, product recommendation, sales forecasting, and so on. The last phase is visualization and presentation of analysis results. The major focus and purpose of this phase are to explain results of analysis and help users to comprehend its meaning. Therefore, to the extent possible, deliverables from this phase should be made simple, clear and easy to understand, rather than complex and flashy. To illustrate our approach, we conducted a case study on a leading Korean instant noodle company. We targeted the leading company, NS Food, with 66.5% of market share; the firm has kept No. 1 position in the Korean "Ramen" business for several decades. We collected a total of 11,869 pieces of contents including blogs, forum contents and news articles. After collecting social media content data, we generated instant noodle business specific language resources for data manipulation and analysis using natural language processing. In addition, we tried to classify contents in more detail categories such as marketing features, environment, reputation, etc. In those phase, we used free ware software programs such as TM, KoNLP, ggplot2 and plyr packages in R project. As the result, we presented several useful visualization outputs like domain specific lexicons, volume and sentiment graphs, topic word cloud, heat maps, valence tree map, and other visualized images to provide vivid, full-colored examples using open library software packages of the R project. Business actors can quickly detect areas by a swift glance that are weak, strong, positive, negative, quiet or loud. Heat map is able to explain movement of sentiment or volume in categories and time matrix which shows density of color on time periods. Valence tree map, one of the most comprehensive and holistic visualization models, should be very helpful for analysts and decision makers to quickly understand the "big picture" business situation with a hierarchical structure since tree-map can present buzz volume and sentiment with a visualized result in a certain period. This case study offers real-world business insights from market sensing which would demonstrate to practical-minded business users how they can use these types of results for timely decision making in response to on-going changes in the market. We believe our approach can provide practical and reliable guide to opinion mining with visualized results that are immediately useful, not just in food industry but in other industries as well.