This work is partially funded by the National Research Foundation of Korea (NRF) grants 2018R1D1A1B07043034 and 2019R1A4A1028134, and by Korea University grant K2105791.
- Chapelle O, Sindhwani V, and Keerthi SS (2008). Optimization techniques for semi-supervised support vector machines, Journal of Machine Learning Research, 9, 203-233.
- Collobert R, Sinz F, Weston J, and Bottou L (2006). Large scale transductive SVMs, Journal of Machine Learning Research, 7, 1687-1712.
- Dempster AP, Laird NM, and Rubin DB (1977). Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society: Series B (Methodological), 39, 1-22.
- Ester M, Kriegel HP, Sander J, and Xu X (1996). A density-based algorithm for discovering clusters in large spatial databases with noise, KDD, 96, 226-231.
- Le Thi Hoai A and Tao PD (1997). Solving a class of linearly constrained indefinite quadratic problems by DC algorithms, Journal of Global Optimization, 11, 253-285.
- Lu Y, Liu PY, Xiao P, and Deng HW (2005). Hotelling's T² multivariate profiling for detecting differential expression in microarrays, Bioinformatics, 21, 3105-3113.
- Shaw RG and Mitchell-Olds T (1993). ANOVA for unbalanced data: an overview, Ecology, 74, 1638-1645.
- Shen X, Tseng GC, Zhang X, and Wong WH (2003). On ψ-learning, Journal of the American Statistical Association, 98, 724-734.
- Vapnik V (2015). The Nature of Statistical Learning Theory, Springer Science & Business Media.
- Wahba G (1990). Spline Models for Observational Data, Philadelphia, SIAM.
- Wu Y and Liu Y (2007). Robust truncated hinge loss support vector machines, Journal of the American Statistical Association, 102, 974-983.
- Yarowsky D (1995). Unsupervised word sense disambiguation rivaling supervised methods, 33rd Annual Meeting of the Association for Computational Linguistics, 189-196.