Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank
Blekinge Institute of Technology, School of Engineering.
2012 (English). Independent thesis, Advanced level (degree of Master (Two Years)). Student thesis
Alternative title
Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank (Swedish)
Abstract [en]

This thesis describes the design and implementation of a speech enhancement system that applies 4-channel microphone array beamforming and speech enhancement algorithms to a speech signal in a multiple-source environment. Estimating the Direction of Arrival (DOA) of a source accurately requires a suitable microphone array and an efficient localization algorithm. The goal of the system is to improve the quality of the primary speech signal. A filter bank is a signal processing tool that facilitates manipulation of signals in the frequency domain. The Weighted Overlap-Add (WOLA) filter bank is an efficient way to implement a uniformly spaced multi-channel filter bank, and it is generally used in applications that demand high-quality filters in terms of stop-band rejection and filter shape. Beamformers steer an array of microphones towards a desired look direction by exploiting signal information rather than by physically moving the array. In this work a Wiener beamformer is examined: the input signals are first split into frequency bands so that Wiener beamforming techniques can be applied. Many algorithms have been developed for estimating the number of sources and their DOA, such as Bayesian algorithms, Kalman filtering, Generalized Cross Correlation (GCC) and Steered Response Power (SRP). Of these, SRP with its steered beamforming technique is the more robust choice for speaker localization with a microphone array. The Phase Transform (PHAT) has attracted considerable attention in recent research for its robust response in low-noise but reverberant environments. Combining the two as SRP-PHAT therefore yields a robust localizer for reverberant environments. Experiments were carried out on recorded data of human talkers. The algorithm gives an accurate DOA for the dominant speaker. In addition, listener opinion testing was performed.
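The PHAT-weighted localization idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it shows only pairwise GCC-PHAT for two microphones, the building block that SRP-PHAT sums over all microphone pairs and candidate directions. The function names `gcc_phat` and `tdoa_to_angle`, the 16 kHz sampling rate, the 0.2 m microphone spacing, the far-field model and the speed of sound of 343 m/s are all assumptions made for illustration.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (seconds) of `sig` relative to `ref` with GCC-PHAT.

    A positive result means `sig` arrives later than `ref`.
    """
    n = len(sig) + len(ref)  # zero-pad so the circular correlation does not wrap
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)
    cross /= np.abs(cross) + 1e-12  # PHAT weighting: discard magnitude, keep phase
    cc = np.fft.irfft(cross, n=n)

    max_shift = n // 2
    if max_tau is not None:
        # Restrict the search to physically possible lags (at least one sample).
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs


def tdoa_to_angle(tau, mic_distance, c=343.0):
    """Convert a delay to a far-field arrival angle in degrees (assumed model)."""
    ratio = np.clip(c * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))


if __name__ == "__main__":
    fs = 16000                       # assumed sampling rate in Hz
    rng = np.random.default_rng(0)
    ref = rng.standard_normal(4096)  # synthetic stand-in for a speech frame
    delay = 5                        # true delay in samples
    sig = np.concatenate((np.zeros(delay), ref[:-delay]))

    tau = gcc_phat(sig, ref, fs, max_tau=0.001)
    print("estimated delay (samples):", round(tau * fs))
    print("arrival angle (degrees):", tdoa_to_angle(tau, mic_distance=0.2))
```

In a full SRP-PHAT localizer the same PHAT-weighted correlations would be evaluated for every microphone pair of the array and accumulated over a grid of candidate directions, with the highest accumulated power taken as the DOA estimate.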

Place, publisher, year, edition, pages
2012. p. 64
Keywords [en]
RIR, Beamforming, filterbank, SRP-PHAT
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-2190
Local ID: oai:bth.se:arkivex08107A2082A82C53C1257A0C005EF2CD
OAI: oai:DiVA.org:bth-2190
DiVA, id: diva2:829457
Uppsok
Technology
Supervisors
Available from: 2015-04-22
Created: 2012-05-28
Last updated: 2015-06-30
Bibliographically approved

Open Access in DiVA

fulltext (781 kB), 2213 downloads
File information
File name: FULLTEXT01.pdf
File size: 781 kB
Checksum SHA-512: ddd8dff2719af29751f0eebb85975362cb13e590ebf324d2d7f86bc79c3950a00a967c07fe9e09ced85e32e5a86e53172a5a8e242dd2c44d0f20df9b9e195991
Type: fulltext
Mimetype: application/pdf
