• 제목/요약/키워드: Multi-dimensional Approach

Search Result 329, Processing Time 0.024 seconds

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • According to the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected document is tokenized and structured to convert the original document into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of analysis. Until recently, text mining-related studies have been focused on the application of the second steps, such as document classification, clustering, and topic modeling. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have actively been studied to improve the quality of analysis results by preserving the meaning of words and documents in the process of representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, Unstructured text should be preceded by a structuring task that transforms the original document into a form that the computer can understand before analysis. It is called "Embedding" that arbitrary objects are mapped to a specific dimension space while maintaining algebraic properties for structuring the text data. Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents in various aspects. Particularly, with the demand for analysis of document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec which extends word2Vec and embeds each document into one vector is most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document. This causes a limit that the document vector is affected by not only core words but also miscellaneous words. Additionally, the traditional document embedding schemes usually map each document into a single corresponding vector. Therefore, it is difficult to represent a complex document with multiple subjects into a single vector accurately using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. In the case of a document without keywords, this method can be applied after extract keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. all text in a document is tokenized and each token is represented as a vector having N-dimensional real value through word embedding. After that, to overcome the limitations of the traditional document embedding method that is affected by not only the core word but also the miscellaneous words, vectors corresponding to the keywords of each document are extracted and make up sets of keyword vector for each document. Next, clustering is conducted on a set of keywords for each document to identify multiple subjects included in the document. Finally, a Multi-vector is generated from vectors of keywords constituting each cluster. The experiments for 3.147 academic papers revealed that the single vector-based traditional approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.

Exploring Policy Contexts and Sustainable Management Structure for Park Regeneration - A Focus on the Case of Green Estate Ltd, Sheffield, UK - (공원 재생을 위한 정책 및 지속 가능한 경영구조 연구 - 그린 에스테이트 사례를 중심으로 -)

  • Nam, Jin-Vo;Kim, Nam-Choon;Kim, Du-Won
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.22 no.4
    • /
    • pp.15-34
    • /
    • 2019
  • Today, there is increasing recognition of the importance of urban regeneration for better public places. Urban parks as a public area play an important role in harnessing its positive impact on people's well-being: where the standards and funding of/for the parks are getting worse. There is however less a focus on policy approach to park regeneration in the country. Neverthless, a few UK's cases of such innovative park management(PM) has shown successful park regeneration based on policy support. Therefore, the aim of this research is to draw policy implications by exploring a case of successful park regeneration. To address the aim, this research conducts an in-depth case study of 'Manor Fields Park, UK', digging into its PM structure and PM body 'Green Estate Ltd' in relation to relevant policy. The data is mainly collected by interviews including a group interview. The analytical framework 'Place-keeping(PK)' and its six dimensions are employed to determine the characteristics of MFP's PM structure. Resultingly, there is a significant shift in the approach to PM which stresses the principle of long-term and self-sustaining structure led by a non-profit organisation and strong impacts of policy. In this context, PK highlights significant drivers for parks regeneration particularly in terms of policy implications: 1)providing policy support to encourage non-profit organisations in PM, 2)extending community involvement in decision-making processes, 3)promoting income generation by community groups, 4)shifting public awareness of shared responsibility for PM, 5)completing regular park maintenance assessment by community groups, and 6)delivering low-maintenance approaches to PM. To support these implications, PM structure for successful parks regeneration does meet a holistic and multi-dimensional approach of place-keeping underlined by understanding policy contexts and rethinking current status quo of PM. Addressing these implications will shed light on urban PM in an era of austerity and ultimately contribute to improving people's well-being.

An investigation of the User Research Techniques in the User-Centered Design Framework - Focused on the on-line community services development for 13-18 Young Adults (사용자 중심 디자인 프레임워크에서 사용자 조사기법의 역할에 관한 연구 - 13-18 청소년용 온라인 커뮤니티 컨텐트 개발 프로젝트를 중심으로)

  • 이종호
    • Archives of design research
    • /
    • v.17 no.2
    • /
    • pp.77-86
    • /
    • 2004
  • User-Centered Design Approach plays important role in dealing with usability issues for developing modern technology products. Yet it is still questionable whether the User-Centered approach is enough for the development of successful consumer contents since the User-Centered Design is originated from the software engineering field where meeting customers' functional requirement is the most critical aspect in developing a software. However, modern consumer market is already saturated and in order to meet ever increasing consumer requirements, the User-Centered Design approach needs to be expanded. As a way of incorporating the User-Centered Approach into the consumer product development, Jordan suggested the 'Pleasure-based Approach' in industrial design field, which usually generates multi-dimensional user requirements: 1)physical, 2)cognitive, 3)identity and 4) social. It is the current tendency that many portal and community service providers focus on fulfilling both functional and emotional needs for users when developing new items, contents and services. Previously fulfilling consumers' emotional needs solely depend on visual designer's graphical sense and capability. However, taking the customer-centered approach on withdrawing consumers' unknown needs is getting critical in the competitive market environment. This paper reviews different types of user research techniques and categorized into 6 ways based on Kano(1992)'s product quality model. Based on his theory, only performance factors, such as suability, can be identified through the user-centered design approach. The user-centered design approach has to be expanded to include factors include personality, sociability, pleasure, and so on. In order to identify performance as well as excellent factors through user research, a user-research framework was established and tested through the case study, which is ' the development of new online service for teens '. The results of the user research were summarized at the end of the paper and the pros and cons of each research techniques were analyzed.

  • PDF

A methodology for assessing fatigue life of a countersunk riveted lap joint

  • Li, Gang;Renaud, Guillaume;Liao, Min;Okada, Takao;Machida, Shigeru
    • Advances in aircraft and spacecraft science
    • /
    • v.4 no.1
    • /
    • pp.1-19
    • /
    • 2017
  • Fatigue life prediction of a multi-row countersunk riveted lap joint was performed numerically. The stress and strain conditions in a highly stressed substructure of the joint were analysed using a global/local finite element (FE) model coupling approach. After validation of the FE models using experimental strain measurements, the stress/strain condition in the local three-dimensional (3D) FE model was simulated under a fatigue loading condition. This local model involved multiple load cases with nonlinearity in material properties, geometric deformation, and contact boundary conditions. The resulting stresses and strains were used in the Smith-Watson-Topper (SWT) strain life equation to assess the fatigue "initiation life", defined as the life to a 0.5 mm deep crack. Effects of the rivet-hole clearance and rivet head deformation on the predicted fatigue life were identified, and good agreement in the fatigue life was obtained between the experimental and the numerical results. Further crack growth from a 0.5 mm crack to the first linkup of two adjacent cracks was evaluated using the NRC in-house tool, CanGROW. Good correlation in the fatigue life was also obtained between the experimental result and the crack growth analysis. The study shows that the selected methodology is promising for assessing the fatigue life for the lap joint, which is expected to improve research efficiency by reducing test quantity and cost.

Marketing for Real and Virtual Museums: A marketing Model to Explain Visitor Behavior in Real Museums and an Outlook on its Applicability to Virtual Museums

  • Terlutter, Ralf;Diehl, Sandra
    • Journal of Global Scholars of Marketing Science
    • /
    • v.10
    • /
    • pp.45-70
    • /
    • 2002
  • The purpose of this study is to obtain more insight into the explanation and prognosis of consumer behavior in real and virtual museums. The analysis focuses on the influence of the museum environment on the museum patrons (rather than on the influence of the art objects). On the basis of the emotional approach to environmental psychology by Mehrabian and Russell (1974), a behavior model has been developed for museums. The model, which is based on the emotional variables pleasure, arousal and dominance (PAD), is also enhanced by cognitive variabies (learning attractiveness, education standard and information demand). The enhancement of the classical model was necessary because cognitive variables play a major role in cultural institutions such as museums: One important objective of museums is the communication of cultural knowledge to visitors. The model is tested empirically using structural equation modeling. 301 visitors were interviewed individually. Two different museum environments were represented using visual stimuli. The theoretical model for museums can be proved empirically. The degree to which the model fits the empirical data was extensively tested. The model showed high compatibility with the data and could be accepted. The study proves that a model can be developed, which explains visitor behavior in museums. The model shows museum designers how museums should be designed to be both emotionally appealing and a learning environment. Based on empirical studies in virtual stores on the Internet, it is discussed whether the research findings in these environments may be applied to virtual museum environments. In order to create an emotionally appealing virtual museum, it is recommended that one uses a 3-dimensional representation to offer various possibilities for interaction and to create a multi-sensual environment that appears highly realistic.

  • PDF

Quantitative and Qualitative Gradient of Pain Experience, Sleep Quality and Psychological Distress in Patients with Different Phenotypes of Temporomandibular Disorders

  • Choi, Hee Hun;Kim, Hye-Kyoung;Kim, Mee-Eun
    • Journal of Oral Medicine and Pain
    • /
    • v.45 no.3
    • /
    • pp.56-64
    • /
    • 2020
  • Purpose: Temporomandibular disorders (TMD) is a mosaic of clinical signs and symptoms that can be regarded as a set of phenotypes that are affected by various factors including pain sensitivity, pain disability, sleep and psychological functioning. The aims of this study were to evaluate association of pain experience, sleep quality and psychological distress with different phenotypes of TMD patients. Methods: This retrospective study included a cohort (n=1,858; 63.8% for female, mean age=34.9±15.9 years) of patients with TMD. A set of self-administered questionnaires concerning pain interference (Brief Pain Inventory), pain disability (Graded Chronic Pain Scale), sleep quality (Pittsburg Sleep Questionnaire Index), psychological distress (Symptom Checklist-90 revised), and pain catastrophizing (Pain Catastrophizing Scale) were administered to all participants at the first consultation. All TMD patients were classified into four groups including TMD with internal derangement without pain (TMD_ID, n=370), TMD with joint pain (TMD_J, n=571), TMD with muscle pain (TMD_M, n=541) and TMD with muscle-joint combined pain (TMD_MJ, n=376). Results: The female ratio was particularly high in the group with TMD_MJ (p=0.001). The patients with muscle pain and both muscle and joint pain had longer symptom duration (p=0.004) and presented significantly higher scores in pain experience (p<0.001), subjective sleep quality (p<0.001), pain catastrophizing (p<0.001) and psychological distress (p<0.05) except for paranoid-ideation than the groups with only joint problems. Conclusions: The results of this study highlight the importance of multi-dimensional approach that consider pain disability, sleep quality, and psychological functioning in the management of TMD with muscle component. This study would contribute to a better understanding of interaction between heterogeneous TMD and multiple risk factors in order to build tailored treatment based on different phenotypes.

Field Measurement of Suspended Material Distribution at the River Confluence (하천 합류부에서의 부유입자 분포에 대한 현장측정)

  • Kwak, Sunghyun;Lee, Kyungsu;Cho, Hanil;Seo, Yongjae;Lyu, Siwan
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.37 no.2
    • /
    • pp.467-474
    • /
    • 2017
  • Each river confluence has the inherent hydraulic and mixing characteristics coming from its bathymetry and topography. It is necessary to make the measurement covering the spatial extent of studying area in order to catch these 2-dimensional intrinsic characteristics. This study focuses to investigate the hydraulic and mixing characteristics at the confluence of Nakdong and Geumho River, from field measurement of flow, water quality, and suspended particle distribution with ADCP (Riversurveyor M9), multi-parameter water quality sonde (YSI6600V2), and submersible system for in-situ observations of particle size distribution and volume concentration (LISST : Laser In-Situ Scattering & Transmissometry), respectively. From the results, it can be found that the field measurement of suspended particle and water quality distribution can be the useful approach to catch the hydraulic and mixing characteristics at a river confluence.

Observation on the Shoreline Changes Using Digital Aerial Imagery for Bangamoeri Beaches (디지털항공영상을 활용한 방아머리 해빈의 해안선 변화 관측)

  • Yun, Kong-Hyun;Song, Yeong Sun
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.6_1
    • /
    • pp.971-980
    • /
    • 2017
  • In this research, it was presented that the strategic approach for the long-term shoreline changes using historic digital aerial images can be effective for the analysis on the bangameori beach, west coast of South Korea. For this purpose, we collected several historic digital aerial images over 9 years in the research filed and conducted GPS-VRS surveying for GCP (Ground Control Point) acquisition. Also we collected existing two dimensional shoreline digital map which was published by KHOA (Korea Hydrographic and Oceanographic Agency) in the year 2013. With these multi data sets, we provided quantitative analysis on coastal erosion using the long-term shoreline changes in the beach. Also, As the results it was found that 2m sea level was retreated in the research period with maximum 0.31m length.

Multi-faceted Citation Analysis for Quality Assessment of Scholarly Publications (학술논문 품질평가를 위한 다방면 인용분석방식)

  • Yang, Ki-Duk;Meho, Lokman
    • Journal of the Korean Society for information Management
    • /
    • v.28 no.2
    • /
    • pp.79-96
    • /
    • 2011
  • Despite the widespread use, critics claim that citation analysis has serious limitations in evaluating the research performance of scholars. First, conventional citation analysis methods yield one-dimensional and sometimes misleading evaluation as a result of not taking into account differences in citation quality, not filtering out citation noise such as self-citations, and not considering non-numeric aspects of citations such as language, culture, and time. Second, the citation database coverage of today is disjoint and incomplete, which can result in conflicting quality assessment outcomes across different data sources. This paper discuss the findings from a citation analysis study that measured the impact of scholarly publications based on the data mined from Web of Science, Scopus, and Google Scholar, and briefly describes a work-in-progress prototype system called CiteSearch, which is designed to overcome the weaknesses of existing citation analysis methods with a robust citation-based quality assessment approach.

Reference Node Selection Scheme for Estimating Relative Locations of Mobile Robots (이동 로봇의 상대위치 추정을 위한 기준노드 선택 기법)

  • Ha, Taejin;Kim, Sunyong;Park, Sun Young;Kwon, Daehoon;Ham, Jaehyun;Lim, Hyuk
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.19 no.4
    • /
    • pp.508-516
    • /
    • 2016
  • When GPS signals are not available, a relative localization can be alternatively used to represent the topological relationship between mobile nodes. A relative location map of a network can be constructed by using the distance information between all the pairs of nodes in the network. If a network is large, a number of small local maps are individually constructed and are merged to obtain the whole map. However, this approach may result in a high computation and communication overhead. In this paper, we propose a reference-node selection scheme for relative localization map construction, which chooses a subset of nodes as a reference node that is supposed to construct local maps. The scheme is a greedy algorithm that iteratively chooses nodes with high degree as a reference node until the chosen local maps are successfully merged with a sufficient number of common nodes between nearby local maps. The simulation results indicate that the proposed scheme achieves higher localization accuracy with a reduced computational overhead.