Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
On Enhancement and Quality Assessment of Audio and Video in Communication Systems
Blekinge Tekniska Högskola, Fakulteten för teknikvetenskaper, Institutionen för tillämpad signalbehandling.
2014 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

The use of audio and video communication has increased exponentially over the last decade and has gone from speech over GSM to HD resolution video conference between continents on mobile devices. As the use becomes more widespread the interest in delivering high quality media increases even on devices with limited resources. This includes both development and enhancement of the communication chain but also the topic of objective measurements of the perceived quality. The focus of this thesis work has been to perform enhancement within speech encoding and video decoding, to measure influence factors of audio and video performance, and to build methods to predict the perceived video quality. The audio enhancement part of this thesis addresses the well known problem in the GSM system with an interfering signal generated by the switching nature of TDMA cellular telephony. Two different solutions are given to suppress such interference internally in the mobile handset. The first method involves the use of subtractive noise cancellation employing correlators, the second uses a structure of IIR notch filters. Both solutions use control algorithms based on the state of the communication between the mobile handset and the base station. The video enhancement part presents two post-filters. These two filters are designed to improve visual quality of highly compressed video streams from standard, block-based video codecs by combating both blocking and ringing artifacts. The second post-filter also performs sharpening. The third part addresses the problem of measuring audio and video delay as well as skewness between these, also known as synchronization. This method is a black box technique which enables it to be applied on any audiovisual application, proprietary as well as open standards, and can be run on any platform and over any network connectivity. The last part addresses no-reference (NR) bitstream video quality prediction using features extracted from the coded video stream. Several methods have been used and evaluated: Multiple Linear Regression (MLR), Artificial Neural Network (ANN), and Least Square Support Vector Machines (LS-SVM), showing high correlation with both MOS and objective video assessment methods as PSNR and PEVQ. The impact from temporal, spatial and quantization variations on perceptual video quality has also been addressed, together with the trade off between these, and for this purpose a set of locally conducted subjective experiments were performed.

Ort, förlag, år, upplaga, sidor
Karlskrona: Blekinge Institute of Technology , 2014.
Serie
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 16
Nyckelord [en]
QoE, video quality assessment, video quality metric, multi-linear regression, artificial neural network, support vector machine, quality predictor, machine learning, temporal scaling, spatial scaling, video compression, deblocking filter, noise cancelling, synchronization, audio delay, video delay, GSM interference signal, noise cancellation, notch filtering
Nationell ämneskategori
Signalbehandling
Identifikatorer
URN: urn:nbn:se:bth-00604Lokalt ID: oai:bth.se:forskinfoDE7E8BB7B60A3B4EC1257D820035F0B6ISBN: 978-91-7295-295-9 (tryckt)OAI: oai:DiVA.org:bth-00604DiVA, id: diva2:833999
Tillgänglig från: 2014-12-11 Skapad: 2014-10-31 Senast uppdaterad: 2025-09-30Bibliografiskt granskad

Open Access i DiVA

fulltext(3373 kB)1231 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 3373 kBChecksumma SHA-512
e85008425fd65aa57f0362fa574545dfe250a3ac2aea95f16c0b523f35d0d63787c355ba43344729c1f448251b3f95e59757dcb5b530cb7c749c0dcc2899772c
Typ fulltextMimetyp application/pdf

Person

Rossholm, Andreas

Sök vidare i DiVA

Av författaren/redaktören
Rossholm, Andreas
Av organisationen
Institutionen för tillämpad signalbehandling
Signalbehandling

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 1241 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 753 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf