• Title/Summary/Keyword: speech effort

Search Result 65, Processing Time 0.022 seconds

A Study on Routine Formulas and Downgraders of Request Act in High School English Textbooks

  • Yang, Eun-Mi
    • English Language & Literature Teaching
    • /
    • v.11 no.2
    • /
    • pp.111-134
    • /
    • 2005
  • This paper examines high school English textbooks to ascertain if they appropriately reflect the kinds and frequencies of routine formulas and downgraders of request act used by English native speakers. It is important to present authentic routine formulas in textbooks for students to acquire proper, efficient and safe communication strategies to communicate with other English speakers. For the analysis, currently available 7 series of 21 high school English textbooks under the $7^{th}$ National Curriculum were selected. Each series of textbooks contains 3 school grade textbooks as High School English, High School English I, and High School English II. The results show that the high school English textbooks generally demonstrate a secund reflection of the English native speakers' use of request strategies and downgraders. That is, the textbooks were found to have presented mostly casual forms of routine formulas while they have not presented sufficient coverage of elaborated polite routine formulas for requesting which English native speakers frequently use. The presence of some kinds of the frequently used downgraders was also very small in proportion in the textbooks. More effort should be given to complement the deficiency in this area by teachers and researchers.

  • PDF

An Empirical Analysis of Auditory Interfaces in Human-computer Interaction

  • Nam, Yoonjae
    • International Journal of Contents
    • /
    • v.9 no.3
    • /
    • pp.29-34
    • /
    • 2013
  • This study attempted to compare usability of auditory interfaces, which is a comprehensive concept that includes safety, utility, effectiveness, and efficiency, in personal computing environments: verbal messages (speech sounds), earcons (musical sounds), and auditory icons (natural sounds). This study hypothesized that verbal messages would offer higher usability than earcons and auditory icons, since the verbal messages are easy to interpret and understand based on semiotic process. In this study, usability was measured by a set of seven items: ability to inform what the program is doing, relevance to visual interfaces, degree of stimulation, degree of understandability, perceived time pressure, clearness of sound outputs, and degrees of satisfaction. Through the experimental research, the results showed that verbal messages provided the highest level of usability. On the contrary, auditory icons showed the lowest level of usability, as they require users to establish new coding schemes, and thus demand more mental effort from users.

Survey of Recent Research in Education based on Artificial Intelligence (AI 기반 교육 현황과 기술 동향)

  • Jeon, H.B.;Chung, H.;Kang, B.O.;Lee, Y.K.
    • Electronics and Telecommunications Trends
    • /
    • v.36 no.1
    • /
    • pp.71-80
    • /
    • 2021
  • Artificial intelligence (AI) will have a huge impact on future education. We look at the role of AI in education and changes in schools. Personalized education is being attempted in limited services, and an interactive tutor service with speech recognition/dialog technology is being developed. In the future, we look forward to fully personalized education for individual students through AI teachers. Teachers are expected to make more effort to teach creative thinking, critical thinking, communication, and collaboration. As the speed of development of AI technology accelerates, we expect that AI-based education will be deeply established around us in the near future. We first introduce the details of the personalization technology and then discuss the AI-based foreign language speaking education research conducted by ETRI.

Interactive Data Acquisition System based on Hand Tracking to evaluate Children's Cognitive Abilities

  • Ekaterina, Ten;Lee, Suk-Ho
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.14 no.3
    • /
    • pp.108-114
    • /
    • 2022
  • Autism (ASD) is a mental disorder characterized by a pronounced deficit in personal, social, speech, and other aspects of development and communication skills. Since autism is a complex developmental disorder that requires a lot of effort to recognize, this research was conducted to develop an interactive data Acquisition System and detect the first signs of ASD in children. The proposed system presents several variants of the tasks in an entertaining form, using hand tracking. Hand tracking is used to attract children's attention and interest them more to achieve more accurate results. The creation of the system is based on such libraries as OpenCV, PyGame, TensorFlow, and Mediapipe. The ultimate goal of the paper is to obtain data on the disease of autism in children for use in further diagnosis by medical experts.

A Study on Low Pitch Accent Produced in Different Locations in English Sentences (영어 문장 내 상이한 위치에 나타난 저성조 피치 액센트 연구)

  • Yi, So-Pae;Kim, Soo-Jung
    • Phonetics and Speech Sciences
    • /
    • v.3 no.4
    • /
    • pp.63-70
    • /
    • 2011
  • Recent studies on English $L^*$ (low pitch accent) have revealed the difference of changes in acoustic manifestation between utterances produced by Koreans and those produced by native speakers of English. However, not much effort has been made to compare $L^*$ focused constituents and non-focused constituents. At the same time, most previous works on focus realization are lacking in terms of normalization of acoustic measurement. Therefore, this research is dedicated to comparing the $L^*$ focused items and non-focused items realized by Koreans and Americans and to examining the realization of English $L^*$ produced by the two language groups with improved normalization of the acoustic features (F0, intensity and duration). Within-group analysis comparing focused words and non-focused words showed both Americans and Koreans prolonged the $L^*$ focused syllables but the effect size of syllable lengthening made by Koreans was far less than that made by Americans. Furthermore, significant F0 lowering was found in Americans but not in Koreans. However, the effect of intensity change caused by $L^*$ focus was not significant within each group. The effect of focused words was tested between the two groups revealing that Koreans implemented English $L^*$ focus with higher F0, lower intensity and shorter duration than Americans. In the instances in which a significant Group x Focus Location (initial, middle and final of a sentence) interaction was found, further analysis testing the effect of Group on each Focus Location was conducted. The testing showed that the Koreans produced shorter syllables at initial and middle of a sentence and higher F0 at initial of a sentence than Americans. Implications for the intonation training were also discussed.

  • PDF

WalkieTagging : Efficient Speech-Based Video Annotation Method for Smart Devices (워키태깅 : 스마트폰 환경에서 음성기반의 효과적인 영상 콘텐츠 어노테이션 방법에 관한 연구)

  • Park, Joon Young;Lee, Soobin;Kang, Dongyeop;Seok, YoungTae
    • Journal of Information Technology Services
    • /
    • v.12 no.1
    • /
    • pp.271-287
    • /
    • 2013
  • The rapid growth and dissemination of touch-based mobile devices such as smart phones and tablet PCs, gives numerous benefits to people using a variety of multimedia contents. Due to its portability, it enables users to watch a soccer game, search video from YouTube, and sometimes tag on contents on the road. However, the limited screen size of mobile devices and touch-based character input methods based on this, are still major problems of searching and tagging multimedia contents. In this paper, we propose WalkieTagging, which provides a much more intuitive way than that of previous one. Just like any other previous video tagging services, WalkieTagging, as a voice-based annotation service, supports inserting detailed annotation data including start time, duration, tags, with little effort of users. To evaluate our methods, we developed the Android-based WalkieTagging application and performed user study via a two-week. Through our experiments by a total of 46 people, we observed that experiment participator think our system is more convenient and useful than that of touch-based one. Consequently, we found out that voice-based annotation methods can provide users with much convenience and satisfaction than that of touch-based methods in the mobile environments.

Emotion Recognition of Low Resource (Sindhi) Language Using Machine Learning

  • Ahmed, Tanveer;Memon, Sajjad Ali;Hussain, Saqib;Tanwani, Amer;Sadat, Ahmed
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.8
    • /
    • pp.369-376
    • /
    • 2021
  • One of the most active areas of research in the field of affective computing and signal processing is emotion recognition. This paper proposes emotion recognition of low-resource (Sindhi) language. This work's uniqueness is that it examines the emotions of languages for which there is currently no publicly accessible dataset. The proposed effort has provided a dataset named MAVDESS (Mehran Audio-Visual Dataset Mehran Audio-Visual Database of Emotional Speech in Sindhi) for the academic community of a significant Sindhi language that is mainly spoken in Pakistan; however, no generic data for such languages is accessible in machine learning except few. Furthermore, the analysis of various emotions of Sindhi language in MAVDESS has been carried out to annotate the emotions using line features such as pitch, volume, and base, as well as toolkits such as OpenSmile, Scikit-Learn, and some important classification schemes such as LR, SVC, DT, and KNN, which will be further classified and computed to the machine via Python language for training a machine. Meanwhile, the dataset can be accessed in future via https://doi.org/10.5281/zenodo.5213073.

Using Syntax and Shallow Semantic Analysis for Vietnamese Question Generation

  • Phuoc Tran;Duy Khanh Nguyen;Tram Tran;Bay Vo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.10
    • /
    • pp.2718-2731
    • /
    • 2023
  • This paper presents a method of using syntax and shallow semantic analysis for Vietnamese question generation (QG). Specifically, our proposed technique concentrates on investigating both the syntactic and shallow semantic structure of each sentence. The main goal of our method is to generate questions from a single sentence. These generated questions are known as factoid questions which require short, fact-based answers. In general, syntax-based analysis is one of the most popular approaches within the QG field, but it requires linguistic expert knowledge as well as a deep understanding of syntax rules in the Vietnamese language. It is thus considered a high-cost and inefficient solution due to the requirement of significant human effort to achieve qualified syntax rules. To deal with this problem, we collected the syntax rules in Vietnamese from a Vietnamese language textbook. Moreover, we also used different natural language processing (NLP) techniques to analyze Vietnamese shallow syntax and semantics for the QG task. These techniques include: sentence segmentation, word segmentation, part of speech, chunking, dependency parsing, and named entity recognition. We used human evaluation to assess the credibility of our model, which means we manually generated questions from the corpus, and then compared them with the generated questions. The empirical evidence demonstrates that our proposed technique has significant performance, in which the generated questions are very similar to those which are created by humans.

A Study on Art's Public Features and Social Intervention by Keith Haring (미술의 공공성과 키스 해링(Keith Haring)의 사회적 개입에 관한 연구)

  • Kim, Jee-Young
    • The Journal of Art Theory & Practice
    • /
    • no.8
    • /
    • pp.59-87
    • /
    • 2009
  • This thesis started from the attempt to make it clear that 80's American artist Keith Haring(1958-1990) had conducted social intervention of criticism, resistance, and participation through his works, and so pursued public value. Haring of graffiti fame left popular and familiar cartoon style pictures on the street wall, the billboards, the posters and so on. Popular and playful works was explained as his unique characteristics, but Haring's creative way at the field has more value than just being grasped as artist's personal characteristics. Haring's work pieces became everyday art by joining with people's life, and are working as a social speaking place. So I think that these Haring's art works possess characteristics of 'the public sphere'. 'The Public Sphere' means that is independent and free from the government or partisan economic forces, so that is not connected with the interested relations, and that is the sphere of rational argumentation without 'disguise' or 'fabrication', and that is the sphere where general public can participate in and is inspected by them. The public sphere between the sphere of public authority such a nation and a market and the private sphere of free individual, it is mutually connected with them and works as the space forming public opinion. Private individuals communicate with this public sphere and perform a role of direct and indirect check, balance, and social criticism way off from power. Openness that should include the voice of not only leading power but also the socially weak such as citizens, women, homosexuals, minority races, and so on, and alienated class, is an index of the public characteristics. The public sphere is not working just with speech and mass media. Many artists as well as Haring open their mouth and act through an art at the center of society, and create another public sphere by an art. I understood that the real participatory and practical characteristics on the Haring's work is a phenomenon and current of a part of the art world including Haring. Such current started from 1960s is the in-depth effort to be connected with the life more closely, to communicate with people, and to improve problems of life. And it has pursued public value on the different way from the nation or public power. Artists have intervened in the society with strategic and positive ways in order to raise pushed-out value and sinked rights as the public agenda, and labored to accept the value of variety and difference at the society. The aspect of such social intervention is the notable features, findable on the Haring's works and process. Haring's works include art historical meanings and are expressed with familiar and plastic language, so they were able to communicate with various classes. And he secured various customers at the field and the street. This communicative and public approach factor raised the possibility much for his works to work as the public sphere. Haring presented critical and resistant speech toward society with his works based on this factor. He asserted his position and justice of gender identity as a sexual minority. And his such work continued to movement for alienated class and social week over his own rights. His speech and message on the wall painting, poster, T-shirts, billboard of the subway, and so on worked as a spectacle and pressed concern with social issues and consciousness shift. And he's been trying to protect and care people who is injured by HIV and drug and to realize social justice through social week protection. Haring's works planned to meet many people as much as possible performed its role of intervening in society through criticism, resistance, speech, and participation, and controlling and checking social issues. These things considered, Haring's works show his consciousness about public attributes of art, and obviously include public value seeking. And also we can find the meaning of such his work as that an art is working as the public sphere and shows the possibility to discuss and practice public issues.

  • PDF

Classification of Consonants by SOM and LVQ (SOM과 LVQ에 의한 자음의 분류)

  • Lee, Chai-Bong;Lee, Chang-Young
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.6 no.1
    • /
    • pp.34-42
    • /
    • 2011
  • In an effort to the practical realization of phonetic typewriter, we concentrate on the classification of consonants in this paper. Since many of consonants do not show periodic behavior in time domain and thus the validity for Fourier analysis of them are not convincing, vector quantization (VQ) via LBG clustering is first performed to check if the feature vectors of MFCC and LPCC are ever meaningful for consonants. Experimental results of VQ showed that it's not easy to draw a clear-cut conclusion as to the validity of Fourier analysis for consonants. For classification purpose, two kinds of neural networks are employed in our study: self organizing map (SOM) and learning vector quantization (LVQ). Results from SOM revealed that some pairs of phonemes are not resolved. Though LVQ is free from this difficulty inherently, the classification accuracy was found to be low. This suggests that, as long as consonant classification by LVQ is concerned, other types of feature vectors than MFCC should be deployed in parallel. However, the combination of MFCC/LVQ was not found to be inferior to the classification of phonemes by language-moded based approach. In all of our work, LPCC worked worse than MFCC.