• Title/Summary/Keyword: Voice training


Comparative study of data augmentation methods for fake audio detection (음성위조 탐지에 있어서 데이터 증강 기법의 성능에 관한 비교 연구)

  • KwanYeol Park;Il-Youp Kwak
    • The Korean Journal of Applied Statistics / v.36 no.2 / pp.101-114 / 2023
  • Data augmentation is used to mitigate model overfitting by letting the model see the training dataset from various perspectives. In addition to image augmentation techniques such as rotation, cropping, horizontal flip, and vertical flip, occlusion-based data augmentation methods such as Cutmix and Cutout have been proposed. For models based on speech data, an occlusion-based augmentation technique can be applied after converting the 1D speech signal into a 2D spectrogram. In particular, SpecAugment is an occlusion-based augmentation technique for speech spectrograms. In this study, we compare data augmentation techniques that can be used for fake audio detection. Using data from the ASVspoof2017 and ASVspoof2019 competitions on fake audio detection, an LCNN model was trained on datasets augmented with the occlusion-based methods Cutout, Cutmix, and SpecAugment. All three augmentation techniques generally improved the performance of the model. Cutmix performed best on ASVspoof2017, Mixup on ASVspoof2019 LA, and SpecAugment on ASVspoof2019 PA. In addition, increasing the number of masks for SpecAugment helped to improve performance. In conclusion, the appropriate augmentation technique differs depending on the situation and data.
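
The occlusion idea behind SpecAugment-style masking described in this abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `mask_spectrogram`, the mask widths, and the choice to fill masked bands with zeros are all assumptions made for illustration, and only the masking step is shown.

```python
import numpy as np


def mask_spectrogram(spec, num_freq_masks=1, num_time_masks=1,
                     max_freq_width=8, max_time_width=16, rng=None):
    """Return a copy of `spec` (freq_bins x time_frames) with random bands zeroed.

    Hypothetical helper: parameter names and defaults are illustrative only.
    """
    rng = np.random.default_rng() if rng is None else rng
    out = spec.copy()
    n_freq, n_time = out.shape

    # Frequency masks: zero a random band of consecutive frequency bins.
    for _ in range(num_freq_masks):
        width = int(rng.integers(0, max_freq_width + 1))
        start = int(rng.integers(0, max(1, n_freq - width + 1)))
        out[start:start + width, :] = 0.0

    # Time masks: zero a random band of consecutive time frames.
    for _ in range(num_time_masks):
        width = int(rng.integers(0, max_time_width + 1))
        start = int(rng.integers(0, max(1, n_time - width + 1)))
        out[:, start:start + width] = 0.0

    return out


# Example: two frequency masks and two time masks on a dummy spectrogram.
spec = np.ones((40, 100))
augmented = mask_spectrogram(spec, num_freq_masks=2, num_time_masks=2,
                             rng=np.random.default_rng(0))
```

Raising `num_freq_masks` or `num_time_masks` corresponds to the "number of masks" setting that the abstract reports as helpful.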

Screen Performance and Social Attitude of Song Gang-Ho (송강호의 스크린 퍼포먼스와 사회적 태도)

  • Kim, Jong-Guk
    • Journal of Korea Entertainment Industry Association / v.13 no.2 / pp.123-132 / 2019
  • This paper analyzes the performances of actor Song Kang-Ho against the background of interdisciplinary and integrated film acting, using performance rather than acting as the general term. If acting is a concept limited to acting training or acting skills, performance is a broad concept that includes expressions, movements, and emotions. Performance on the screen can be explained in the context of film and can be extended to the social attitude of acting. In addition, I use the term screen in the sense of representation, rather than film, which refers to the medium. Song Kang-Ho has performed various characters in more than 30 films. Although his facial expressions, gestures, and voices suited to individual characters in various genres are represented in various ways, the personality inherent in the actor Song Kang-Ho integrates persona with character. What drives this is the social attitude of screen performance. As a sign, acting is an ideological construct and foregrounds a character who describes a certain social and historical moment. Song Gang-Ho as actor, persona, and character, who commands popularity, speaks to society and creates discourse. His comic performance always confronts the tragedy of life; his face is the spirit of the times and expands into social meaning. The face in close-up does not laugh at all, the gesture symbolized by the curved rear view is exaggerated in a disorderly and disturbing way, and the voice, with its dialect accent, does not follow the vocal standard.

A Study on The Adoption of Drama for Improving Early Childhood Teacher's Artistic Competence (유아교사의 예술적 역량 함양을 위한 교육연극 활용에 관한 고찰)

  • Kim, Ji-Youn;Kim, Su-youn
    • (The) Research of the performance art and culture / no.41 / pp.69-92 / 2020
  • This study describes the impact of early childhood teachers' artistic competence on art education pedagogy and improved curriculum design. Furthermore, it explains the effect of drama as a way of improving early childhood teachers' artistic competence. Many researchers have mentioned that early childhood is a period of sensitivity and potential. Therefore, it is helpful if children meet a teacher who understands them and inspires their innate artistic sense at their eye level. The study explains which aspects of artistic competence teacher training should focus on. There are many approaches to developing early childhood teachers' artistic competence; adopting drama is one of them. The strengths of drama for improving artistic competence are as follows. First, human movement and voice are the main artistic channels in drama. What we do in daily life is found in the world of drama. This means that if early childhood teachers experience drama activities, they will feel more comfortable and familiar with drama. In addition, early childhood teachers tend to be familiar with dramatic play, so they can more easily access the world of drama. Second, drama helps in understanding different feelings and in broadening and deepening understanding of others' standpoints. For early childhood teachers, drama activities help in understanding how the dramatic art form works and in leading children's play in diverse and sincere ways. In addition, drama activities are useful for building horizontal and democratic relationships between children and the teacher, which is one of the main emphases of the 2019 revised Nori national curriculum. To sum up, drama is an excellent method for developing early childhood teachers' artistic competence. Thus, it is hoped that they will have more opportunities to experience drama as an art form.

Construction of Cham Identity in Cambodia

  • Maunati, Yekti;Sari, Betti Rosita
    • SUVANNABHUMI / v.6 no.1 / pp.107-135 / 2014
  • Cham identities, which are socially constructed and multilayered, display their markers in a variety of elements, including homeland attachment to the former Kingdom of Champa, religion, language, and cultural traditions, to mention a few. However, unlike other contemporary diasporic experiences that bind the homeland and the host country, the Cham diaspora in Cambodia has a unique pattern, as it seems to have no voice in the political and economic spheres in Vietnam, its homeland. The relations between the Cham in Cambodia and Vietnam seem to be limited to cultural heritage such as Cham musical traditions, traditional clothing, and architectural heritage. Many Cham people have established networks outside Cambodia with areas of the Muslim world, such as Malaysia, Indonesia, southern Thailand, and the Middle Eastern countries. Pursuing education or training in Islam as well as working in those countries, especially Malaysia, has become a way for the Cham to widen their networks and increase their knowledge, particularly of Islam. Returning to Cambodia, these people become religious teachers or ustadz (Islamic teachers in the pondok [Islamic boarding school]). This has developed slowly, side by side with the formation of their identity as Cham Muslims. Among certain Cham, the absence of an ancient cultural heritage as an identity marker has been replaced by Islamic culture as the important element of identity. However, being Cham is not a single identity; it is fluid and contested. Many scholars argue that the Cham in Cambodia constitute three groups: the Cham Chvea, Cham, and Cham Bani (Cham Jahed). The so-called Cham Jahed have a unique practice of Islam. Unlike other Cham, who pray five times a day, Cham Jahed people pray once a week, on Fridays. They also have a different ritual for the wedding ceremony, which they regard as the authentic tradition of the Cham.
Indeed, they consider themselves pure descendants of the Cham in Vietnam, retaining Cham traditions and tending to maintain their relationship with their fellow Cham in Central Vietnam. In terms of language, another marker of identity, the Cham and the Cham Jahed share the same language, but the Cham Jahed preserve the written Cham script more often than the Cham. In addition, the Cham Jahed teach the language intensively to the young generation. This paper, based on fieldwork in Cambodia in 2010 and 2011, focuses on the process of the formation of Cham identity, especially among those called Cham and Cham Jahed.


Study on Improving Maritime English Proficiency Through the Use of a Maritime English Platform (해사영어 플랫폼을 활용한 표준해사영어 실력 향상에 관한 연구)

  • Jin Ki Seor;Young-soo Park;Dongsu Shin;Dae Won Kim
    • Journal of the Korean Society of Marine Environment & Safety / v.29 no.7 / pp.930-938 / 2023
  • Maritime English is a specialized language system designed for ship operations, maritime safety, and external and internal communication onboard. According to the International Maritime Organization's (IMO) International Convention on Standards of Training, Certification and Watchkeeping for Seafarers (STCW), it is imperative that navigational officers engaged in international voyages have a thorough understanding of Maritime English including the use of Standard Marine Communication Phrases (SMCP). This study measured students' proficiency in Maritime English using a learning and testing platform that includes voice recognition, translation, and word entry tasks to evaluate the resulting improvement in Maritime English exam scores. Furthermore, the study aimed to investigate the level of platform use needed for cadets to qualify as junior navigators. The experiment began by examining the correlation between students' overall English skills and their proficiency in SMCP through an initial test, followed by the evaluation of improvements in their scores and changes in exam duration during the mid-term and final exams. The initial test revealed a significant difference in Maritime English test scores among groups based on individual factors, such as TOEIC scores and self-assessment of English ability, and both the mid-term and final tests confirmed substantial score improvements for the group using the platform. This study confirmed the efficacy of a learning platform that could be extensively applied in maritime education and potentially expanded beyond the scope of Maritime English education in the future.

A Survey on the Certification and Curriculum Development for Hospice and Palliative Care Professionals (호스피스.완화의료 전문인력 자격인증방안과 교육과정개발을 위한 설문조사)

  • Kang, Jin-A;Kim, Do-Yeun;Shin, Dong-Wook;Kim, Si-Young;Lee, Soon-Nam
    • Journal of Hospice and Palliative Care / v.13 no.1 / pp.32-40 / 2010
  • Purpose: This survey aimed to provide basic data for developing a certification system for hospice and palliative care professionals. Methods: The National Cancer Center (NCC) and the Korean Society for Hospice & Palliative Care (KSHPC) surveyed hospice and palliative care professionals who worked at 34 palliative care units designated by the Ministry of Health, Welfare, and Family Affairs (MW), as well as members of the KSHPC. The survey was conducted via e-mail from June 17 to 23, 2009. A total of 220 professionals were surveyed. Results: Most hospice and palliative care professionals reported a great need for a certification system: physicians, 90% (n=51); nurses, 84% (n=134); social workers, 89% (n=35). With regard to the requirements for certification, 46% of physicians preferred a diploma course, while social workers (46%) preferred a training course for medical social workers. Concerning the certification body, physicians (45%) preferred the KSHPC and the MW almost equally, while nurses (50%) and social workers (60%) strongly preferred the MW. As for the body to develop and accredit advanced training courses for each profession, most physicians (51%) preferred the KSHPC, whereas nurses and social workers preferred collaboration of the MW (or NCC) with a professional society, such as the KSHPC (23%), the Korean Hospice & Palliative nurses association for nurses (21%), or the Korean association of (medical) social workers (37%). Lastly, all respondents preferred a course format of once a week, full day, with some field study on weekends. Conclusion: Korean hospice and palliative care professionals identified a great need for a certification system; therefore, an adequate system should be developed to reflect their voice.

Speech Recognition Using Linear Discriminant Analysis and Common Vector Extraction (선형 판별분석과 공통벡터 추출방법을 이용한 음성인식)

  • 남명우;노승용
    • The Journal of the Acoustical Society of Korea / v.20 no.4 / pp.35-41 / 2001
  • This paper describes Linear Discriminant Analysis and common vector extraction for speech recognition. A voice signal contains psychological and physiological properties of the speaker as well as dialect differences, acoustical environment effects, and phase differences. For these reasons, the same word spoken by different speakers can sound very different. This property of speech signals makes it very difficult to extract common properties within the same speech class (word or phoneme). Linear algebra methods such as the KLT (Karhunen-Loeve Transformation) are generally used to extract common properties from speech signals, but this paper uses the common vector extraction method suggested by M. Bilginer et al. The method of M. Bilginer et al. extracts an optimized common vector from the speech signals used for training, and it achieves 100% recognition accuracy on the training data used for common vector extraction. Despite these characteristics, the method has some drawbacks: a large number of speech signals cannot be used for training, and the discriminant information among common vectors is not defined. This paper suggests an advanced method that reduces the error rate by maximizing the discriminant information among common vectors. A novel method to normalize the size of the common vector is also added. The results show improved algorithm performance and recognition accuracy 2% better than that of the conventional method.
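
The abstract does not spell out the common vector computation, so the following is only a sketch of the idea as it is commonly described: the common vector of a class is what remains of any training vector after removing its projection onto the subspace spanned by the differences between training vectors. The function name `common_vector` is hypothetical, and the sketch assumes fewer training vectors than feature dimensions and linearly independent difference vectors.

```python
import numpy as np


def common_vector(samples):
    """Common vector of one class.

    samples: (m, n) array of m training feature vectors of dimension n.
    Assumes m <= n and linearly independent difference vectors.
    """
    ref = samples[0]
    diffs = samples[1:] - ref             # (m-1, n) difference vectors
    # Orthonormal basis of the difference subspace (columns of q).
    q, _ = np.linalg.qr(diffs.T)          # q has shape (n, m-1)
    # Remove the component of the reference vector inside that subspace.
    return ref - q @ (q.T @ ref)


# Example with random stand-in features: 5 utterances, 20 dimensions.
rng = np.random.default_rng(0)
features = rng.normal(size=(5, 20))
cv = common_vector(features)
```

Under these assumptions every training vector of the class yields the same common vector, which is consistent with the perfect accuracy on training data mentioned in the abstract, and the requirement that the difference subspace not fill the whole feature space is one way to read the stated limit on the number of training signals.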


Increasing Accuracy of Stock Price Pattern Prediction through Data Augmentation for Deep Learning (데이터 증강을 통한 딥러닝 기반 주가 패턴 예측 정확도 향상 방안)

  • Kim, Youngjun;Kim, Yeojeong;Lee, Insun;Lee, Hong Joo
    • The Journal of Bigdata / v.4 no.2 / pp.1-12 / 2019
  • As Artificial Intelligence (AI) technology develops, it is being applied to various fields such as image, voice, and text. AI has shown good results in certain areas. Researchers have tried to predict the stock market using artificial intelligence as well. Predicting the stock market is known to be a difficult problem, since the stock market is affected by various factors such as the economy and politics. In the field of AI, there are attempts to predict the ups and downs of stock prices by studying stock price patterns using various machine learning techniques. This study suggests a way of predicting stock price patterns based on the Convolutional Neural Network (CNN), one of these machine learning techniques. A CNN classifies images by extracting features from them through convolutional layers. Therefore, this study classifies candlestick images made from stock data in order to predict patterns. This study has two objectives. The first, referred to as Case 1, is to predict patterns from images made from same-day stock price data. The second, referred to as Case 2, is to predict next-day stock price patterns from images produced from daily stock price data. In Case 1, data augmentation methods (random modification and Gaussian noise) are applied to generate more training data, and the generated images are fed into the model for fitting. Given that deep learning requires a large amount of data, this study suggests a method of data augmentation for candlestick images. This study also compares the accuracies of the images with Gaussian noise and different classification problems. All data in this study were collected through the OpenAPI provided by DaiShin Securities. Case 1 has five labels depending on the pattern: up with up closing, up with down closing, down with up closing, down with down closing, and staying.
The images in Case 1 are created by removing the last candle (-1candle), the last two candles (-2candles), and the last three candles (-3candles) from 60-minute, 30-minute, 10-minute, and 5-minute candle charts. A 60-minute candle chart means that one candle in the image carries 60 minutes of information: an open price, high price, low price, and close price. Case 2 has two labels, up and down. For Case 2, images were generated from 60-minute, 30-minute, 10-minute, and 5-minute candle charts without removing any candle. Considering the nature of stock data, moving the candles in the images is suggested instead of existing data augmentation techniques. How much the candles are moved is defined as the modified value. The average difference in closing prices between candles was 0.0029. Therefore, in this study, 0.003, 0.002, 0.001, and 0.00025 are used as modified values. The number of images was doubled after data augmentation. For Gaussian noise, the mean was 0 and the variance was 0.01. For both Case 1 and Case 2, the model is based on VGG-Net16, which has 16 layers. As a result, 10-minute -1candle showed the best accuracy among the 60-minute, 30-minute, 10-minute, and 5-minute candle charts. Thus, 10-minute images were used for the rest of the experiment in Case 1. The images with three candles removed were selected for data augmentation and application of Gaussian noise. 10-minute -3candle resulted in 79.72% accuracy. The accuracy of the images with a 0.00025 modified value and 100% changed candles was 79.92%. Applying Gaussian noise raised the accuracy to 80.98%. According to the outcomes of Case 2, 60-minute candle charts could predict the next day's patterns with 82.60% accuracy. To sum up, this study is expected to contribute to further studies on the prediction of stock price patterns using images. This research provides a possible method for data augmentation of stock data.
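
The Gaussian noise step reported in this abstract (mean 0, variance 0.01) can be sketched as follows. This is an illustrative sketch rather than the authors' code: the function name `add_gaussian_noise`, the assumption that pixel values are scaled to [0, 1], and the clipping step are all assumptions made here.

```python
import numpy as np


def add_gaussian_noise(image, mean=0.0, variance=0.01, rng=None):
    """Add Gaussian noise to an image with values assumed to lie in [0, 1]."""
    rng = np.random.default_rng() if rng is None else rng
    # NumPy takes a standard deviation, so convert from the stated variance.
    noise = rng.normal(loc=mean, scale=np.sqrt(variance), size=image.shape)
    # Clip so the result is still a valid image (an assumption of this sketch).
    return np.clip(image + noise, 0.0, 1.0)


# Example: a dummy mid-gray 224 x 224 RGB image standing in for a candlestick chart.
chart = np.full((224, 224, 3), 0.5)
noisy_chart = add_gaussian_noise(chart, rng=np.random.default_rng(0))
```

A variance of 0.01 corresponds to a standard deviation of 0.1, which is a noticeable perturbation on a [0, 1] pixel scale; on a 0 to 255 scale the same setting would be nearly invisible, so the scaling assumption matters.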


The Adoption and Diffusion of Semantic Web Technology Innovation: Qualitative Research Approach (시맨틱 웹 기술혁신의 채택과 확산: 질적연구접근법)

  • Joo, Jae-Hun
    • Asia pacific journal of information systems / v.19 no.1 / pp.33-62 / 2009
  • Internet computing is a disruptive IT innovation. The Semantic Web can be considered an IT innovation because Semantic Web technology has the potential to reduce information overload and enable semantic integration, using capabilities such as semantics and machine-processability. How should organizations adopt the Semantic Web? What factors affect the adoption and diffusion of Semantic Web innovation? Most studies on the adoption and diffusion of innovation use empirical analysis as a quantitative research methodology in the post-implementation stage. There is criticism that the positivist approach, which requires theoretical rigor, can sacrifice relevance to practice. Rapid advances in technology require studies relevant to practice. In particular, it is realistically impossible to take a quantitative approach to the factors affecting adoption of the Semantic Web because the Semantic Web is in its infancy. However, at this early stage of the Semantic Web's introduction, it is necessary to give practitioners and researchers a model and some guidelines for the adoption and diffusion of the technology innovation. Thus, the purpose of this study is to present a model of adoption and diffusion of the Semantic Web and to offer propositions as guidelines for successful adoption through a qualitative research method including multiple case studies and in-depth interviews. The researcher conducted face-to-face interviews with 15 people and 2 interviews by telephone and e-mail to collect data to saturate the categories. Nine interviews, including the 2 telephone interviews, were with nine user organizations adopting the technology innovation, and the others were with three supply organizations. Semi-structured interviews were used to collect data. The interviews were recorded on a digital voice recorder and subsequently transcribed verbatim. 196 pages of transcripts were obtained from about 12 hours of interviews.
Triangulation of evidence was achieved by examining each organization's website and various documents, such as brochures and white papers. The researcher read the transcripts several times and underlined core words, phrases, or sentences. Then, data analysis used the procedure of open coding, in which the researcher forms initial categories of information about the phenomenon being studied by segmenting information. QSR NVivo version 8.0 was used to categorize sentences including similar concepts. 47 categories derived from interview data were grouped into 21 categories, from which six factors were named. Five factors affecting adoption of the Semantic Web were identified. The first factor is demand pull, including requirements for improving search and integration services of the existing systems and for creating new services. Second, environmental conduciveness, reference models, uncertainty, technology maturity, potential business value, government sponsorship programs, promising prospects for technology demand, complexity, and trialability affect the adoption of the Semantic Web from the perspective of technology push. Third, absorptive capacity plays an important role in the adoption. Fourth, supplier's competence includes communication with and training for users, and the absorptive capacity of the supply organization. Fifth, over-expectation, which results in a gap between the user's expectation level and perceived benefits, has a negative impact on the adoption of the Semantic Web. Finally, a factor including critical mass of ontology, budget, and visible effects is identified as a determinant affecting routinization and infusion. The researcher suggested a model of adoption and diffusion of the Semantic Web, representing relationships between the six factors and adoption/diffusion as dependent variables. Six propositions are derived from the adoption/diffusion model to offer some guidelines to practitioners and a research model for further studies.
Proposition 1: Demand pull has an influence on the adoption of the Semantic Web. Proposition 1-1: The stronger the degree of requirements for improving existing services, the more successfully the Semantic Web is adopted. Proposition 1-2: The stronger the degree of requirements for new services, the more successfully the Semantic Web is adopted. Proposition 2: Technology push has an influence on the adoption of the Semantic Web. Proposition 2-1: From the perspective of user organizations, the technology push forces such as environmental conduciveness, reference models, potential business value, and government sponsorship programs have a positive impact on the adoption of the Semantic Web, while uncertainty and lower technology maturity have a negative impact on its adoption. Proposition 2-2: From the perspective of suppliers, the technology push forces such as environmental conduciveness, reference models, potential business value, government sponsorship programs, and promising prospects for technology demand have a positive impact on the adoption of the Semantic Web, while uncertainty, lower technology maturity, complexity, and lower trialability have a negative impact on its adoption. Proposition 3: The absorptive capacities such as organizational formal support systems, officers' or managers' competency in analyzing technology characteristics, their passion or willingness, and top management support are positively associated with successful adoption of the Semantic Web innovation from the perspective of user organizations. Proposition 4: Supplier's competence has a positive impact on the absorptive capacities of user organizations and technology push forces. Proposition 5: The greater the gap of expectation between users and suppliers, the later the Semantic Web is adopted.
Proposition 6: The post-adoption activities such as budget allocation, reaching critical mass, and sharing ontology to offer sustainable services are positively associated with successful routinization and infusion of the Semantic Web innovation from the perspective of user organizations.

The Evolution of Cyber Singer Viewed from the Coevolution of Man and Machine (인간과 기계의 공진화적 관점에서 바라본 사이버가수의 진화과정)

  • Kim, Dae-Woo
    • Cartoon and Animation Studies / s.39 / pp.261-295 / 2015
  • Cyber singers appeared in the late 1990s and disappeared after a brief existence. Although there were a few attempts in the 2000s, they did not achieve significant success. The cyber singer was born thanks to the technical development of the IT industry and the emergence of an idol training system in the music industry. Starting from 'Adam', it developed into the Vocaloid 'Seeyou'. Cyber singers, differentiated from typical digital characters in cartoons or games, can become objects of idolization with music as the medium. They are also characterized by forming multiple fandoms. Therefore, although such attempts and repeated failures could be considered a passing fashion, the flow of content creation and the ongoing attempts to take advantage of new media such as Vocaloid show that there are expectations for a true cyber-born singer. Early cyber singers were made only to resemble human appearance, but 'Sciart' and 'Seeyou' have been evolving to become more like humans in their capabilities. This paper argues that the stylized cyber singers that disappeared in the past were not simply failures; in the process of technology developing toward its own artificial life, they were attempts that gradually changed public perception of the image of the machine. The direction of this evolution is for the machine to acquire human functions and to share enjoyment, exchange, and mutual feeling with humans, becoming an artificial life form that has evolved in both appearance and function. To support this argument, I refer to Bruce Mazlish's study of the coevolution of man and machine. From the perspective of Mazlish's research, I analyze the evolution of the cyber singer through the development process since the late 1990s, covering the planning of the eight singers who have appeared, the design of the cyber characters, and the voice (vocals), which is important in evaluating them as singers. Machines have been coevolving with humans.
Cyber singers are recognized as ambivalent objects of development, yet the effort to create new artificial creatures out of human desire and the fear of death continues. Therefore, new cyber-organisms are likely to take the same style as 'Seeyou'. This is because, although the cartoon form and whirring voice may not be a signifier of real human desire, this type can be created at the point where the contemporary public's desire and technical development intersect.