• Title/Summary/Keyword: Singular value

Search Result 568, Processing Time 0.027 seconds

Personalized Size Recommender System for Online Apparel Shopping: A Collaborative Filtering Approach

  • Dongwon Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.8
    • /
    • pp.39-48
    • /
    • 2023
  • This study was conducted to provide a solution to the problem of sizing errors occurring in online purchases due to discrepancies and non-standardization in clothing sizes. This paper discusses an implementation approach for a machine learning-based recommender system capable of providing personalized sizes to online consumers. We trained multiple validated collaborative filtering algorithms including Non-Negative Matrix Factorization (NMF), Singular Value Decomposition (SVD), k-Nearest Neighbors (KNN), and Co-Clustering using purchasing data derived from online commerce and compared their performance. As a result of the study, we were able to confirm that the NMF algorithm showed superior performance compared to other algorithms. Despite the characteristic of purchase data that includes multiple buyers using the same account, the proposed model demonstrated sufficient accuracy. The findings of this study are expected to contribute to reducing the return rate due to sizing errors and improving the customer experience on e-commerce platforms.

Haunting the London Streets: Virginia Woolf's Urban Travelogues Re-appraised

  • Choi, Young Sun
    • Journal of English Language & Literature
    • /
    • v.55 no.3
    • /
    • pp.415-427
    • /
    • 2009
  • Woolf s preoccupation with the interplay between gender, commercialism, and the modern city is exposed in higher relief by her feminist remapping of the city through a discourse of fl nerie, which is epitomized in her singular urban travelogues such as Street Haunting and The London Scene essays. A fanatical London-adventurer herself, she assumes the persona of the fl neuse in exploring the street of modern London and especially the public sphere of the marketplace, as represented in Oxford Street Tide. Living and working in the quarter of Bloomsbury, in close proximity to the capital s famous sites of tourism, entertainment, and mass consumption, Woolf was placed at the heart of urban spectacle. In spite of the lack of critical analysis of this high-profile writer s interest in such quotidian matters as shopping, fashion and appearance, which would be informed by a hierarchy of value within literary criticism, it seems that they are inextricably intertwined with her quest into more serious-minded topics that revolve around the twin pillars of her literary project: feminism and modernism. Her essays, in particular, suggest this point in one way or another, mirroring her extraordinary susceptibility to such concerns. For Woolf, street sauntering is synonymous with an act of creative mobility, by which she plays with the notion of shifting identities, rediscovers the urban rarities and splendors, and ultimately pins them down in her literary output. By adopting the identity of a masterly rambler/observer/explorer with an omnipotent gaze, she firmly anchors herself as an active interpreter of urban modernity and viewer of its spectacle. She thus challenges the idea of public space as a male domain, which is central to the classic androcentric discourse of loitering, spectatorship and urban modernity.

Multi-environment Trial Analysis for Yield-related Traits of Early Maturing Korean Rice Cultivars

  • Seung Young Lee;Hyun-Sook Lee;Chang-Min Lee;Su-Kyung Ha;Youngjun Mo;Ji-Ung Jeung
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.252-252
    • /
    • 2022
  • Genotype-by-environment interaction (GEI) refers to the comparative response of genotypes to different environments conditions. Thus, understanding GEI is a fundamental component for selecting superior genotypes for breeding programs. The significance of utilizing early maturing cultivars not only provides flexibility in planting dates, but also serves as an effective strategy to reduce methane emission from the paddy fields. In this study, we conducted multi-environment trials (METs) to evaluate yield-related traits such as culm length, panicle length, panicle number, spikelet per plant, and thousand grain weight. A total of eighty-one Korean commercial rice cultivars categorized as early maturing cultivars, were cultivated in three regions, two planting seasons for two years. The genotype main effect plus genotype-by-environment interaction (GGE) biplot analysis of yield-related traits and grain yield explained 70.02-91.24% of genotype plus GEI variation, and exhibited various patterns of mega-environment delineation, discriminating ability, representativeness, and genotype rankings across the planting seasons and environments. Moreover, simultaneous selection using weighted average of absolute scores from the singular value decomposition (WAASB) and multi-trait stability index (MTSI) revealed six highly recommended genotypes with high stability and crop productivity. The winning genotypes under specific environment can be utilized as useful genetic materials to develop regional specialty cultivars, and recommended genotypes can be used as elite climate-resilient parents to improve yield-potential and reduce methane emission as part to accomplish carbon-neutrality.

  • PDF

A comparison study of canonical methods: Application to -Omics data (오믹스 자료를 이용한 정준방법 비교)

  • Seungsoo Lee;Eun Jeong Min
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.2
    • /
    • pp.157-176
    • /
    • 2024
  • Integrative analysis for better understanding of complex biological systems gains more attention. Observing subjects from various perspectives and conducting integrative analysis of those multiple datasets enables a deeper understanding of the subject. In this paper, we compared two methods that simultaneously consider two datasets gathered from the same objects, canonical correlation analysis (CCA) and co-inertia analysis (CIA). Since CCA cannot handle the case when the data exhibit high-dimensionality, two strategies were considered instead: Utilization of a ridge constant (CCA-ridge) and substitution of covariance matrices of each data to identity matrix and then applying penalized singular value decomposition (CCA-PMD). To illustrate CIA and CCA, both extensions of CCA and CIA were applied to NCI60 cell line data. It is shown that both methods yield biologically meaningful and significant results by identifying important genes that enhance our comprehension of the data. Their results shows some dissimilarities arisen from the different criteria used to measure the relationship between two sets of data in each method. Additionally, CIA exhibits variations dependent on the weight matrices employed.

Reduced Order Modeling of Marine Engine Status by Principal Component Analysis (주성분 분석을 통한 선박 기관 상태의 차수 축소 모델링)

  • Seungbeom Lee;Jeonghwa Seo;Dong-Hwan Kim;Sangmin Han;Kwanwoo Kim;Sungwook Chung;Byeongwoo Yoo
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.61 no.1
    • /
    • pp.8-18
    • /
    • 2024
  • The present study concerns reduced order modeling of a marine diesel engine, which can be used for outlier detection in status monitoring and carbon intensity index calculation. Principal Component Analysis (PCA) is introduced for the reduced order modeling, focusing on the feasibility of detecting and treating nonlinear variables. By cross-correlation, it is found that there are seven non-linear data channels among 23 data channels, i.e., fuel mode, exhaust gas temperature after the turbocharger, and cylinder coolant temperatures. The dataset is handled so that the mean is located at the nominal continuous rating. Polynomial presentation of the dataset is also applied to reflect the linearity between the engine speed and other channels. The first principal mode shows strong effects of linearity of the most data channels to show the linearity of the system. The non-linear variables are effectively explained by other modes. second mode concerns the temperature of the cylinder cooling water, which shows small correlation with other variables. The third and fourth modes correlates the fuel mode and turbocharger exhaust gas temperature, which have inferior linearity to other channels. PCA is proven to be applicable to data given in binary type of fuel mode selection, as well as numerical type data.

Optimal supervised LSA method using selective feature dimension reduction (선택적 자질 차원 축소를 이용한 최적의 지도적 LSA 방법)

  • Kim, Jung-Ho;Kim, Myung-Kyu;Cha, Myung-Hoon;In, Joo-Ho;Chae, Soo-Hoan
    • Science of Emotion and Sensibility
    • /
    • v.13 no.1
    • /
    • pp.47-60
    • /
    • 2010
  • Most of the researches about classification usually have used kNN(k-Nearest Neighbor), SVM(Support Vector Machine), which are known as learn-based model, and Bayesian classifier, NNA(Neural Network Algorithm), which are known as statistics-based methods. However, there are some limitations of space and time when classifying so many web pages in recent internet. Moreover, most studies of classification are using uni-gram feature representation which is not good to represent real meaning of words. In case of Korean web page classification, there are some problems because of korean words property that the words have multiple meanings(polysemy). For these reasons, LSA(Latent Semantic Analysis) is proposed to classify well in these environment(large data set and words' polysemy). LSA uses SVD(Singular Value Decomposition) which decomposes the original term-document matrix to three different matrices and reduces their dimension. From this SVD's work, it is possible to create new low-level semantic space for representing vectors, which can make classification efficient and analyze latent meaning of words or document(or web pages). Although LSA is good at classification, it has some drawbacks in classification. As SVD reduces dimensions of matrix and creates new semantic space, it doesn't consider which dimensions discriminate vectors well but it does consider which dimensions represent vectors well. It is a reason why LSA doesn't improve performance of classification as expectation. In this paper, we propose new LSA which selects optimal dimensions to discriminate and represent vectors well as minimizing drawbacks and improving performance. This method that we propose shows better and more stable performance than other LSAs' in low-dimension space. In addition, we derive more improvement in classification as creating and selecting features by reducing stopwords and weighting specific values to them statistically.

  • PDF

Usefulness of Permeability Map by Perfusion MRI of Brain Tumor the Grade Assessment (뇌종양의 등급분류를 위한 관류 자기공명영상을 이용한 투과성영상(Permeability Map)의 유용성 평가)

  • Bae, Sung-Jin;Lee, Joo-Young;Chang, Hyuk-Won
    • Journal of radiological science and technology
    • /
    • v.32 no.3
    • /
    • pp.325-334
    • /
    • 2009
  • Purpose : This study was conducted to assess how effective the permeability ratio and relative cerebral blood volume ratio are to tumor through perfusion MRI by measuring and reflecting the grade assessment and differential diagnosis and the permeability and relative cerebral blood volume of contrast media plunged from blood vessel into organ due to breakdown of blood-brain barrier in cerebral. Subject and Method : Subject of study was 29 patients whose diagnosis were confirmed by biopsy after surgery and 550 (11 slice$\times$50 image) perfusion MRI were used to make image of relative cerebral blood volume with the program furnished on instrument. The other method was to transmit to private computer and the image analysis was made additionally by making image of relative cerebral blood volume-reformulated singular value decomposition, rCBV-rSVD and permeability using IDL.6.2. In addition, Kruskal-wallis test tonggyein non numerical average by a comparative analysis of brain tumors Results : The rCBV ratio (Functool PF; GE Medical Systems and IDL 6.2 program by analysis) and permeability ratio of tumors were as follows; high grade glioma(n=4), (14.75, 19.25) 13.13. low grade astrocytoma(n=5) (14.80, 15.90) 11.60, glioblastoma(n=5) (10.90, 18.60), 22.00, metastasis(n=6) (11.00, 15.08). 22.33. meningioma(n=6) (18.58, 7.67), 5.58. oliogodendroglioma(n=3) (23.33, 16.33, 15.67. Conclusion : It was not easy to classify the grade with the relative cerebral blood volume ratio measured by using the relative cerebral blood image by type of tumors, however, permeability ratio measured by permeability image revealed that the higher the grade of tumor, the higher the measured permeability ratio, showing the assessment of tumor grade is more effective to differential diagnosis.

  • PDF

Recommender system using BERT sentiment analysis (BERT 기반 감성분석을 이용한 추천시스템)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.2
    • /
    • pp.1-15
    • /
    • 2021
  • If it is difficult for us to make decisions, we ask for advice from friends or people around us. When we decide to buy products online, we read anonymous reviews and buy them. With the advent of the Data-driven era, IT technology's development is spilling out many data from individuals to objects. Companies or individuals have accumulated, processed, and analyzed such a large amount of data that they can now make decisions or execute directly using data that used to depend on experts. Nowadays, the recommender system plays a vital role in determining the user's preferences to purchase goods and uses a recommender system to induce clicks on web services (Facebook, Amazon, Netflix, Youtube). For example, Youtube's recommender system, which is used by 1 billion people worldwide every month, includes videos that users like, "like" and videos they watched. Recommended system research is deeply linked to practical business. Therefore, many researchers are interested in building better solutions. Recommender systems use the information obtained from their users to generate recommendations because the development of the provided recommender systems requires information on items that are likely to be preferred by the user. We began to trust patterns and rules derived from data rather than empirical intuition through the recommender systems. The capacity and development of data have led machine learning to develop deep learning. However, such recommender systems are not all solutions. Proceeding with the recommender systems, there should be no scarcity in all data and a sufficient amount. Also, it requires detailed information about the individual. The recommender systems work correctly when these conditions operate. The recommender systems become a complex problem for both consumers and sellers when the interaction log is insufficient. Because the seller's perspective needs to make recommendations at a personal level to the consumer and receive appropriate recommendations with reliable data from the consumer's perspective. In this paper, to improve the accuracy problem for "appropriate recommendation" to consumers, the recommender systems are proposed in combination with context-based deep learning. This research is to combine user-based data to create hybrid Recommender Systems. The hybrid approach developed is not a collaborative type of Recommender Systems, but a collaborative extension that integrates user data with deep learning. Customer review data were used for the data set. Consumers buy products in online shopping malls and then evaluate product reviews. Rating reviews are based on reviews from buyers who have already purchased, giving users confidence before purchasing the product. However, the recommendation system mainly uses scores or ratings rather than reviews to suggest items purchased by many users. In fact, consumer reviews include product opinions and user sentiment that will be spent on evaluation. By incorporating these parts into the study, this paper aims to improve the recommendation system. This study is an algorithm used when individuals have difficulty in selecting an item. Consumer reviews and record patterns made it possible to rely on recommendations appropriately. The algorithm implements a recommendation system through collaborative filtering. This study's predictive accuracy is measured by Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). Netflix is strategically using the referral system in its programs through competitions that reduce RMSE every year, making fair use of predictive accuracy. Research on hybrid recommender systems combining the NLP approach for personalization recommender systems, deep learning base, etc. has been increasing. Among NLP studies, sentiment analysis began to take shape in the mid-2000s as user review data increased. Sentiment analysis is a text classification task based on machine learning. The machine learning-based sentiment analysis has a disadvantage in that it is difficult to identify the review's information expression because it is challenging to consider the text's characteristics. In this study, we propose a deep learning recommender system that utilizes BERT's sentiment analysis by minimizing the disadvantages of machine learning. This study offers a deep learning recommender system that uses BERT's sentiment analysis by reducing the disadvantages of machine learning. The comparison model was performed through a recommender system based on Naive-CF(collaborative filtering), SVD(singular value decomposition)-CF, MF(matrix factorization)-CF, BPR-MF(Bayesian personalized ranking matrix factorization)-CF, LSTM, CNN-LSTM, GRU(Gated Recurrent Units). As a result of the experiment, the recommender system based on BERT was the best.

Improvement of Small Baseline Subset (SBAS) Algorithm for Measuring Time-series Surface Deformations from Differential SAR Interferograms (차분 간섭도로부터 지표변위의 시계열 관측을 위한 개선된 Small Baseline Subset (SBAS) 알고리즘)

  • Jung, Hyung-Sup;Lee, Chang-Wook;Park, Jung-Won;Kim, Ki-Dong;Won, Joong-Sun
    • Korean Journal of Remote Sensing
    • /
    • v.24 no.2
    • /
    • pp.165-177
    • /
    • 2008
  • Small baseline subset (SBAS) algorithm has been recently developed using an appropriate combination of differential interferograms, which are characterized by a small baseline in order to minimize the spatial decorrelation. This algorithm uses the singular value decomposition (SVD) to measure the time-series surface deformation from the differential interferograms which are not temporally connected. And it mitigates the atmospheric effect in the time-series surface deformation by using spatially low-pass and temporally high-pass filter. Nevertheless, it is not easy to correct the phase unwrapping error of each interferogram and to mitigate the time-varying noise component of the surface deformation from this algorithm due to the assumption of the linear surface deformation in the beginning of the observation. In this paper, we present an improved SBAS technique to complement these problems. Our improved SBAS algorithm uses an iterative approach to minimize the phase unwrapping error of each differential interferogram. This algorithm also uses finite difference method to suppress the time-varying noise component of the surface deformation. We tested our improved SBAS algorithm and evaluated its performance using 26 images of ERS-1/2 data and 21 images of RADARSAT-1 fine beam (F5) data at each different locations. Maximum deformation amount of 40cm in the radar line of sight (LOS) was estimated from ERS-l/2 datasets during about 13 years, whereas 3 cm deformation was estimated from RADARSAT-1 ones during about two years.

A Numerical Solution Method of the Boundary Integral Equation -Axisymmetric Flow- (경계적분방정식의 수치해법 -축대칭 유동-)

  • Chang-Gu,Kang
    • Bulletin of the Society of Naval Architects of Korea
    • /
    • v.27 no.3
    • /
    • pp.38-46
    • /
    • 1990
  • A numerical solution method of the boundary integral equation for axisymmetric potential flows is presented. Those are represented by ring source and ring vorticity distribution. Strengths of ring source and ring vorticity are approximated by linear functions of a parameter $\zeta$ on a segment. The geometry of the body is represented by a cubic B-spline. Limiting integral expressions as the field point tends to the surface having ring source and ring vorticity distribution are derived upto the order of ${\zeta}ln{\zeta}$. In numerical calculations, the principal value integrals over the adjacent segments cancel each other exactly. Thus the singular part proportional to $\(\frac{1}{\zeta}\)$ can be subtracted off in the calculation of the induced velocity by singularities. And the terms proportional to $ln{\zeta}$ and ${\zeta}ln{\zeta}$ can be integrated analytically. Thus those are subtracted off in the numerical calculations and the numerical value obtained from the analytic integrations for $ln{\zeta}$ and ${\zeta}ln{\zeta}$ are added to the induced velocity. The four point Gaussian Quadrature formula was used to evaluate the higher order terms than ${\zeta}ln{\zeta}$ in the integration over the adjacent segments to the field points and the integral over the segments off the field points. The root mean square errors, $E_2$, are examined as a function of the number of nodes to determine convergence rates. The convergence rate of this method approaches 2.

  • PDF