Microphone Array Wiener Beamforming with modeling of SRP-PHAT for Speaker Localization
Blekinge Tekniska Högskola, Sektionen för ingenjörsvetenskap (School of Engineering).
2012 (English). Independent thesis, advanced level (degree of Master). Student thesis (degree project)
Abstract [en]

Using microphone arrays to acquire and recognize speech in meetings and conferences poses several problems for speech processing, because many speakers share a small space, typically around a table. The need for a microphone array system with minimal noise and more efficient localization algorithms is drawing the attention of researchers. Extensive research is being carried out on microphone array beamforming to make such systems robust, viable, and elegant for commercial use, and this study shares that objective. A system of 4 microphones arranged in a linear array is set up in a simulated reverberant environment. Filter-and-sum beamforming is implemented in both the time domain and the frequency domain, and a Wiener filter is chosen as the post-filtering technique. One of the main goals of the thesis is to improve the quality of the primary speech signal using a microphone array with Wiener beamforming (filter-and-sum beamforming with Wiener post-filtering). A weighted overlap-add (WOLA) filter bank is also implemented as part of the frequency-domain Wiener beamformer to enable subband beamforming, and the recursive least squares (RLS) algorithm is used to make the subband beamforming adaptive. Speaker localization plays a pivotal role in speech enhancement methods that require the speaker position. Among many localization algorithms, Steered Response Power (SRP) combined with the Phase Transform (PHAT), called SRP-PHAT, has proved robust in many studies. As part of this project, SRP-PHAT is also modeled to detect the speaker position for the system described above. To evaluate system performance, the signal-to-noise ratio (SNR) is calculated for both the original and the beamformed signals. Perceptual Evaluation of Speech Quality (PESQ), an International Telecommunication Union (ITU-T) standard for evaluating speech quality, is used to determine the Mean Opinion Score (MOS) for both the original and the beamformed signals.
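
The SRP-PHAT idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: it assumes a far-field source, a single analysis frame, and a linear array whose microphone positions are given along one axis. The function name `srp_phat_doa`, its parameters, and the 1-degree angle grid are hypothetical choices made for illustration only.

```python
import numpy as np

def srp_phat_doa(signals, mic_positions, fs, c=343.0, angles_deg=None):
    """Estimate direction of arrival with a far-field SRP-PHAT scan.

    signals:       array (n_mics, n_samples), one frame per microphone
    mic_positions: array (n_mics,), positions along the array axis in metres
    fs:            sampling rate in Hz
    c:             assumed speed of sound in m/s
    Angles are measured from the array axis (0 to 180 degrees).
    """
    if angles_deg is None:
        angles_deg = np.arange(0.0, 181.0, 1.0)  # assumed 1-degree grid
    angles_deg = np.asarray(angles_deg, dtype=float)
    signals = np.asarray(signals, dtype=float)
    mic_positions = np.asarray(mic_positions, dtype=float)

    n_mics, n_samples = signals.shape
    spectra = np.fft.rfft(signals, axis=1)
    freqs = np.fft.rfftfreq(n_samples, d=1.0 / fs)
    cos_theta = np.cos(np.deg2rad(angles_deg))

    power = np.zeros(angles_deg.size)
    for i in range(n_mics):
        for j in range(i + 1, n_mics):
            # Cross-power spectrum of the microphone pair.
            cross = spectra[i] * np.conj(spectra[j])
            # PHAT weighting: keep only the phase, discard the magnitude.
            cross = cross / (np.abs(cross) + 1e-12)
            # Far-field inter-microphone delay for every candidate angle.
            tau = (mic_positions[i] - mic_positions[j]) * cos_theta / c
            # Steer the pair to each angle and accumulate the response power.
            steer = np.exp(-2j * np.pi * np.outer(tau, freqs))
            power += np.real(steer @ cross)

    return float(angles_deg[np.argmax(power)]), power
```

The function returns the grid angle with the highest steered response together with the full power curve, so a caller can inspect how sharp the peak is. In a reverberant room the scan would normally be averaged over many frames, and a near-field model would be needed for speakers close to the array.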

Place, publisher, year, edition, pages
2012, p. 66
Keywords [en]
Microphone array, Wiener beamforming, Source localization
National subject category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-5988
Local ID: oai:bth.se:arkivex5C48E84513A16AD3C12579E5005DFE87
OAI: oai:DiVA.org:bth-5988
DiVA, id: diva2:833404
Uppsök
Technology
Supervisors
Available from: 2015-04-22 Created: 2012-04-19 Last updated: 2015-06-30 Bibliographically approved

Open Access in DiVA

fulltext (1028 kB), 515 downloads
File information
File name: FULLTEXT01.pdf
File size: 1028 kB
Checksum (SHA-512): 67ad1faf14f48d358d6ec3842a976e35a25e153c139714aadc6f50c947892f1639891ccaf978faa9caa76e60bb20d279624c38c93e6d47d1efec2717fd117884
Type: fulltext
Mimetype: application/pdf
