Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank
Blekinge Tekniska Högskola, Sektionen för ingenjörsvetenskap.
2012 (engelsk)Independent thesis Advanced level (degree of Master (Two Years))OppgaveAlternativ tittel
Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank (svensk)
Abstract [en]

This thesis describes the design and implementation of a speech enhancement system that uses 4-channel microphone array beam forming and speech enhancement algorithms applied to a speech signal in a multiple source environment. To locate the accurate Direction of Arrival (DOA) from the source, it is necessary to design a suitable microphone array system with more efficient localization algorithm. The goal of the system is to improve the quality of the primary speech signal. A filter bank is a signal processing tool that can facilitate manipulation of signals in the frequency domain. The WOLA (Weighted Overlap and Add) filter is an efficient method used to implement a uniformly distributed multi-channel filter bank. The WOLA is generally used in applications that demand high quality filters in term of stop band rejection and filter shape. Beamformers work by means of steering an array of microphones towards a desired look direction through utilizing signal information rather than physically moving the array. In this research, Wiener beam former is examined the input signals are first split into frequency bands so that Wiener beam forming techniques can be used. There are many algorithms developed for estimating the number of sources and locating the DOA, such as Bayesian algorithm, kalman filtering, Generalized Cross Correlation (GCC) and Steered Response Power (SRP) algorithm. But SRP algorithm with its steered beam forming technique for speaker localization is more robust using microphone array. The Phase Alignment Transform (PHAT) has gained a lot of attention in the recent research for its quite robust response in low noise, but reverberant environment. So combining SRP-PHAT will become the robust localizer in reverberant environment. Experiments were done on recorded data of human talkers. The algorithm gives accurate DOA from the dominant speaker. In addition to these, listener opinion testing is performed.

sted, utgiver, år, opplag, sider
2012. , s. 64
Emneord [en]
RIR, Bemaforming, filterbank, srp-phat
HSV kategori
Identifikatorer
URN: urn:nbn:se:bth-2190Lokal ID: oai:bth.se:arkivex08107A2082A82C53C1257A0C005EF2CDOAI: oai:DiVA.org:bth-2190DiVA, id: diva2:829457
Uppsök
Technology
Veileder
Tilgjengelig fra: 2015-04-22 Laget: 2012-05-28 Sist oppdatert: 2025-09-30bibliografisk kontrollert

Open Access i DiVA

fulltekst(781 kB)2343 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 781 kBChecksum SHA-512
ddd8dff2719af29751f0eebb85975362cb13e590ebf324d2d7f86bc79c3950a00a967c07fe9e09ced85e32e5a86e53172a5a8e242dd2c44d0f20df9b9e195991
Type fulltextMimetype application/pdf

Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 2343 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 266 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf