• Title/Summary/Keyword: MOS score

Quality Evaluation of JPEG2000 Compressed Images in PACS Environments (PACS 환경에서 JPEG2000 압축 영상의 화질 평가)

  • Lee, Yong-Jai
    • Proceedings of the Korean Information Science Society Conference / 2005.07b / pp.682-684 / 2005
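  • Many hospitals have now adopted PACS and use it to good effect. Radiographic image information plays an important role in hospital care. A radiographic image is obtained by irradiating the body after the radiation dose has been set through the tube voltage (kVp) and tube current (mAs), and image quality varies with kVp, mAs, and the thickness of the body. Images acquired on the equipment are read and used for treatment, then compressed and archived after a certain period; the higher the compression ratio applied, the greater the economic benefit for storage. The author therefore 1) acquired chest images under different CR and DR exposure conditions and applied JPEG 2000 compression to evaluate how the exposure conditions affect the compressed images, and 2) used MOS (mean opinion score) evaluation to propose an effective compression ratio that does not affect image interpretation.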
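
A minimal sketch of how a MOS-based effective compression ratio could be picked. The ratings, the 4.0 threshold, and the names `ratings_by_ratio` and `highest_acceptable_ratio` are assumptions for illustration; the abstract does not give the paper's actual scores or acceptance criterion.

```python
from statistics import mean

# Hypothetical reader ratings (1 = bad ... 5 = excellent) per compression ratio.
# These numbers are illustrative only; they are not taken from the paper.
ratings_by_ratio = {
    10: [5, 5, 4, 5],
    20: [4, 5, 4, 4],
    40: [4, 3, 4, 4],
    80: [2, 3, 2, 3],
}


def mean_opinion_score(ratings):
    """MOS is the arithmetic mean of the individual opinion scores."""
    return mean(ratings)


def highest_acceptable_ratio(ratings_by_ratio, min_mos):
    """Return the largest ratio whose MOS is at least min_mos, or None."""
    acceptable = [ratio for ratio, scores in ratings_by_ratio.items()
                  if mean_opinion_score(scores) >= min_mos]
    return max(acceptable, default=None)


mos_table = {ratio: mean_opinion_score(s) for ratio, s in ratings_by_ratio.items()}
effective_ratio = highest_acceptable_ratio(ratings_by_ratio, min_mos=4.0)
```

With these made-up ratings, the 20:1 setting is the highest ratio that still meets the assumed threshold.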

A Study on a Improvement of the Speech Quality with Variable Window in CELP Vocoder (가변 윈도우를 이용한 CELP 부호화기의 음질 향상에 관한 연구)

  • Ju, Sang-Gyu
    • Proceedings of the KAIS Fall Conference / 2010.05a / pp.265-268 / 2010
  • Two types of low-bit-rate vocoder have been proposed to date: the MBE type, which uses spectrum modeling, and the CELP type, which uses hybrid coding. Of the two, the CELP type has been studied more. In particular, much attention has focused on CELP vocoders owing to the emergence of Internet telephony and PCS in the domestic market. To improve speech quality in a CELP vocoder, this paper proposes a new spectrum analysis algorithm with a variable window. In a CELP vocoder, the spectrum of the synthesized speech signal is distorted because a fixed-size window is used for spectrum analysis. We therefore measured the spectral leakage and adjusted the window size to minimize it. Applying this method to G.723.1 ACELP yields a spectral distortion (SD) reduction of 0.084 dB, a residual energy reduction of 6.3%, and a mean opinion score (MOS) improvement of 0.1.
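
The idea of adjusting the window size to minimize spectral leakage can be illustrated with a toy example. This is a sketch under stated assumptions, not the paper's algorithm: the leakage measure (energy outside the strongest DFT bin of a rectangular window), the candidate lengths, and the test tone are all chosen here for illustration.

```python
import cmath
import math


def leakage(frame):
    """Fraction of one-sided spectral energy outside the strongest DFT bin."""
    n = len(frame)
    power = []
    for k in range(n // 2 + 1):  # non-negative frequencies only
        x_k = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        power.append(abs(x_k) ** 2)
    return 1.0 - max(power) / sum(power)


def best_window_length(signal, candidates):
    """Pick the candidate length whose leading frame shows the least leakage."""
    return min(candidates, key=lambda n: leakage(signal[:n]))


# Toy input: a pure tone with a 20-sample period (an assumption for illustration).
tone = [math.sin(2 * math.pi * t / 20) for t in range(240)]
candidates = [190, 200, 210]
chosen = best_window_length(tone, candidates)
```

A 200-sample window holds exactly ten periods of the tone, so its energy falls in a single bin, while the 190- and 210-sample windows cut the tone mid-period and smear energy across neighboring bins.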

Speech Packet Transmission Using the AMR-WB Coder with FEC (FEC기능을 추가한 AMR-WB 음성 부호화기를 이용한 음성 패킷 전송)

  • 황정준;이인성
    • Journal of the Institute of Electronics Engineers of Korea TC / v.40 no.11 / pp.63-71 / 2003
  • This paper proposes a packet loss recovery method for real-time communication over the Internet. To reduce the effects of packet loss, forward error correction (FEC), which adds redundant information to voice packets, can be used. The coder used is the Adaptive Multi-Rate Wideband (AMR-WB) codec, which was recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third-generation WCDMA mobile communication system and has also been standardized by ITU-T for wideband speech services. The major cause of speech quality degradation in IP networks is packet loss. We therefore recover a single lost packet using FEC and conceal consecutive errors. The proposed scheme is evaluated with the Gilbert Internet channel model. High audio quality is maintained up to a packet loss rate of 30%.
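
The split between FEC recovery of isolated losses and concealment of consecutive losses can be sketched with a two-state Gilbert channel. Everything specific here is an assumption for illustration: the transition probabilities, the simplification that every packet sent in the bad state is lost, and the FEC layout in which each packet carries a redundant copy of the previous frame. The abstract does not describe the paper's actual redundancy scheme.

```python
import random


def gilbert_losses(n, p_good_to_bad, p_bad_to_good, rng):
    """Two-state Gilbert channel: every packet sent in the bad state is lost."""
    lost, bad = [], False
    for _ in range(n):
        if bad:
            bad = rng.random() >= p_bad_to_good  # stay bad unless we recover
        else:
            bad = rng.random() < p_good_to_bad
        lost.append(bad)
    return lost


def classify(lost):
    """Assume packet i+1 carries a redundant copy of frame i (hypothetical layout)."""
    recovered = concealed = 0
    for i, is_lost in enumerate(lost):
        if not is_lost:
            continue
        next_arrived = i + 1 < len(lost) and not lost[i + 1]
        if next_arrived:
            recovered += 1   # rebuilt from the copy in the following packet
        else:
            concealed += 1   # inside a burst or at the end: needs concealment
    return recovered, concealed


rng = random.Random(0)
lost = gilbert_losses(10_000, p_good_to_bad=0.1, p_bad_to_good=0.6, rng=rng)
recovered, concealed = classify(lost)
```

Under this layout, only the last packet of each loss burst can be rebuilt, so longer bursts shift more of the work onto concealment.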

Research for measuring degradation of IPTV-serviced videos (IPTV 서비스 영상에 대한 객관적 품질측정 방안 연구)

  • Kim, Won-Jun;Kim, Chang-Ick;Kim, Jin-Sul;Lee, Hyun-Woo;Ryu, Won
    • Journal of Broadcast Engineering / v.13 no.4 / pp.440-451 / 2008
  • With the advent of multimedia services based on IP networks, demand for IPTV is increasing rapidly. Unlike conventional coaxial-cable TV, IPTV provides a variety of convergence services over the IP network. However, because IPTV service quality is strongly affected by network degradation such as packet loss and jitter, it may not be guaranteed. In this paper, we propose an objective measure for various degradations of IPTV video that takes subjective assessment into account. To this end, we first determine the QoE (Quality of Experience) indicators that can affect human visual perception. We then develop a video quality metric for each QoE indicator. A subjective assessment based on MOS is conducted and used to construct a mapping between each measure and perceived visual quality. Experiments on various videos confirm the efficiency and robustness of the proposed method and show high correlation with the subjective assessment.
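
A mapping from an objective measure to MOS, and its correlation with the subjective scores, might be computed as follows. This is a sketch only: the linear form of the mapping, the Pearson coefficient as the correlation measure, and the data points are assumptions, since the abstract does not specify them.

```python
from math import sqrt


def fit_line(xs, ys):
    """Ordinary least-squares fit of ys ~ slope * xs + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def pearson(xs, ys):
    """Pearson linear correlation coefficient between two sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)


# Hypothetical data: an objective degradation measure and the MOS it received.
metric = [0.0, 0.1, 0.2, 0.3, 0.4]
mos = [4.8, 4.1, 3.5, 2.9, 2.0]

slope, intercept = fit_line(metric, mos)
predicted = [slope * m + intercept for m in metric]
r = pearson(predicted, mos)
```

In practice the fit would be made on one set of videos and the correlation checked on another, so that the mapping is not scored on its own training data.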

IDS Performance on MANET with Packet Aggregation Transmissions (패킷취합전송이 있는 MANET에서 IDS 성능)

  • Kim, Young-Dong
    • The Journal of the Korea institute of electronic communication sciences / v.9 no.6 / pp.695-701 / 2014
  • Blackhole attacks, which make unauthorized changes to routing data, have critical effects on transmission performance. Transmission performance can be restored to a certain level by using an IDS (intrusion detection system) or IPS (intrusion prevention system) as a countermeasure to blackhole attacks. In this paper, the effects of an IDS on the end-to-end performance of packet aggregation transmission are analyzed on a MANET (mobile ad hoc network) under blackhole attacks. An NS-2-based MANET simulator is used to analyze standard performance parameters such as MOS, connection ratio, delay, and packet loss rate, along with another performance factor suggested in this paper. VoIP (Voice over Internet Protocol) traffic, one kind of voice service, is used for the performance analysis. As one of the results, a suggestion for IDS implementation on MANETs with packet aggregation under blackhole attacks is presented.
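
The abstract does not say how MOS is obtained from the simulation. One common approach for simulated VoIP traffic, assumed here purely for illustration, is to compute a transmission rating factor R from delay and loss and convert it to an estimated MOS with the mapping given in the ITU-T G.107 E-model. The sketch covers only that final conversion; how R would be derived from the simulated traffic is left to the caller.

```python
def r_to_mos(r):
    """ITU-T G.107 mapping from rating factor R to an estimated MOS."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6


# Example: ratings for a poor, a middling, and a good connection.
estimates = {r: r_to_mos(r) for r in (20, 50, 90)}
```

The cubic term makes the curve flatten at both ends, so a given loss in R costs more MOS in the middle of the range than near the top.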

CA Joint Resource Allocation Algorithm Based on QoE Weight

  • LIU, Jun-Xia;JIA, Zhen-Hong
    • KSII Transactions on Internet and Information Systems (TIIS) / v.12 no.5 / pp.2233-2252 / 2018
  • For the problem of cross-layer joint resource allocation (JRA) in the Long-Term Evolution (LTE)-Advanced standard using carrier aggregation (CA) technology, it is difficult to obtain the optimal resource allocation scheme. This paper proposes a joint resource allocation algorithm based on the weights of user's average quality of experience (JRA-WQOE). In contrast to prevalent algorithms, the proposed method can satisfy the carrier aggregation abilities of different users and consider user fairness. An optimization model is established by considering the user quality of experience (QoE) with the aim of maximizing the total user rate. In this model, user QoE is quantified by the mean opinion score (MOS) model, where the average MOS value of users is defined as the weight factor of the optimization model. The JRA-WQOE algorithm consists of the iteration of two algorithms, a component carrier (CC) and resource block (RB) allocation algorithm called DABC-CCRBA and a subgradient power allocation algorithm called SPA. The former is used to dynamically allocate CC and RB for users with different carrier aggregation capacities, and the latter, which is based on the Lagrangian dual method, is used to optimize the power allocation process. Simulation results showed that the proposed JRA-WQOE algorithm has low computational complexity and fast convergence. Compared with existing algorithms, it affords obvious advantages such as improving the average throughput and fairness to users. With varying numbers of users and signal-to-noise ratios (SNRs), the proposed algorithm achieved higher average QoE values than prevalent algorithms.

End-to-end non-autoregressive fast text-to-speech (End-to-end 비자기회귀식 가속 음성합성기)

  • Kim, Wiback;Nam, Hosung
    • Phonetics and Speech Sciences / v.13 no.4 / pp.47-53 / 2021
  • Autoregressive Text-to-Speech (TTS) models suffer from inference instability and slow inference speed. Inference instability occurs when a poorly predicted sample at time step t affects all the subsequent predictions. Slow inference speed arises from a model structure that forces the predicted samples from time steps 1 to t-1 to predict the sample at time step t. In this study, an end-to-end non-autoregressive fast text-to-speech model is suggested as a solution to these problems. The results of this study show that this model's Mean Opinion Score (MOS) is close to that of Tacotron 2 - WaveNet, while this model's inference speed and stability are higher than those of Tacotron 2 - WaveNet. Further, this study aims to offer insight into the improvement of non-autoregressive models.

A Clinical Study on the Effect of Acupuncture and Bee-Venom Acupuncture for Patients with Chronic Whiplash Injury (교통사고 후 편타성 손상에 대한 침치료 및 봉독약침치료의 유효성 평가)

  • Kim, Kun-Hyung;Choi, Yang-Sik;Nam, Dong-Woo;Kim, Jong-In;Cho, Ki-Ho;Choi, Do-Young;Lee, Jae-Dong
    • Journal of Acupuncture Research / v.23 no.6 / pp.145-152 / 2006
  • Objectives: The aim of this study is to investigate the effect of acupuncture and bee-venom acupuncture for patients with chronic whiplash injury. Methods: Subjects were voluntarily recruited through newspapers and the Internet. Acupuncture (Eo-Hyeol Bang) and bee-venom acupuncture were performed twice a week for 4 weeks. The patients' symptoms were assessed before and after the 4 weeks of treatment using the Visual Analogue Scale (VAS) and the Medical Outcome Study (MOS) 36-Item Short-Form Health Survey (SF-36). Results: The VAS score improved significantly after 4 weeks (p<0.05) compared with pre-treatment. There were significant changes in the physical functioning (PF), social functioning (SF), role-physical (RP), role-emotional (RE), mental health (MH), and bodily pain (BP) scores of the SF-36 after 4 weeks (p<0.05), but no significant changes in the vitality (VT) and general health (GH) scores. Conclusion: This study suggests that acupuncture (Eo-Hyeol Bang) and bee-venom acupuncture can be used to improve symptoms in patients with chronic whiplash injury.

Adaptive Enhancement Algorithm of Perceptual Filter Using Variable Threshold (가변 임계값을 이용한 지각 필터의 적응적인 음질 개선 알고리즘)

  • 차형태
    • The Journal of the Acoustical Society of Korea / v.23 no.6 / pp.446-453 / 2004
  • In this paper, a new adaptive perceptual filter that uses a variable threshold is proposed to enhance audio signals degraded by additive nonstationary noise. The adaptive perceptual filter updates the variable threshold at each step according to the signal power and the effect of noise variation, so the noisy audio signal is enhanced by a method that controls residual noise effectively. The proposed algorithm uses a perceptual filter that transforms the time-domain signal into the frequency domain and calculates an intensity energy and an excitation energy in the Bark domain. In this method, the stage at which the filter response is updated is determined by the threshold. Using the variable threshold, the proposed algorithm effectively controls residual noise based on the energy difference of audio signals degraded by additive nonstationary noise. The proposed method is tested on noisy audio signals degraded by nonstationary noise at various signal-to-noise ratios (SNRs). NMR and MOS tests are carried out at input SNRs of 15 dB, 20 dB, 25 dB, and 30 dB. For these inputs, approximate improvements of 17.4 dB, 15.3 dB, 12.8 dB, and 9.8 dB in NMR and enhancements of 2.9, 2.5, 2.3, and 1.7 in the MOS test are achieved, respectively.

Salient Region Detection Algorithm for Music Video Browsing (뮤직비디오 브라우징을 위한 중요 구간 검출 알고리즘)

  • Kim, Hyoung-Gook;Shin, Dong
    • The Journal of the Acoustical Society of Korea / v.28 no.2 / pp.112-118 / 2009
  • This paper proposes a rapid salient region detection algorithm for a music video browsing system, which can be applied to mobile devices and digital video recorders (DVRs). The input music video is decomposed into music and video tracks. For the music track, the music highlight, including the chorus, is detected through structure analysis using energy-based peak position detection. Using emotional models generated by an SVM-AdaBoost learning algorithm, the music signal of each music video is automatically classified into one of the predefined emotional classes. For the video track, face scenes showing the singer or an actor/actress are detected with a boosted cascade of simple features. Finally, the salient region is generated by aligning the boundaries of the music highlight and the visual face scene. Users first select their favorite music videos on the mobile device or DVR using the information about each video's emotion, and can then quickly browse the 30-second salient region produced by the proposed algorithm. A mean opinion score (MOS) test with a database of 200 music videos is conducted to compare the detected salient region with the predefined manual part. The MOS test results show that the salient region detected by the proposed method performed much better than the predefined manual part without audiovisual processing.