Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Direction of Arrival Estimation and Localization of Multiple Speech Sources in Enclosed Environments
Blekinge Tekniska Högskola, Sektionen för ingenjörsvetenskap, Avdelningen för elektroteknik.
Ansvarig organisation
2012 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Speech communication is gaining in popularity in many different contexts as technology evolves. With the introduction of mobile electronic devices such as cell phones and laptops, and fixed electronic devices such as video and teleconferencing systems, more people are communicating which leads to an increasing demand for new services and better speech quality. Methods to enhance speech recorded by microphones often operate blindly without prior knowledge of the signals. With the addition of multiple microphones to allow for spatial filtering, many blind speech enhancement methods have to operate blindly also in the spatial domain. When attempting to improve the quality of spoken communication it is often necessary to be able to reliably determine the location of the speakers. A dedicated source localization method on top of the speech enhancement methods can assist the speech enhancement method by providing the spatial information about the sources. This thesis addresses the problem of speech-source localization, with a focus on the problem of localization in the presence of multiple concurrent speech sources. The primary work consists of methods to estimate the direction of arrival of multiple concurrent speech sources from an array of sensors and a method to correct the ambiguities when estimating the spatial locations of multiple speech sources from multiple arrays of sensors. The thesis also improves the well-known SRP-based methods with higher-order statistics, and presents an analysis of how the SRP-PHAT performs when the sensor array geometry is not fully calibrated. The thesis is concluded by two envelope-domain-based methods for tonal pattern detection and tonal disturbance detection and cancelation which can be useful to further increase the usability of the proposed localization methods. The main contribution of the thesis is a complete methodology to spatially locate multiple speech sources in enclosed environments. New methods and improvements to the combined solution are presented for the direction-of-arrival estimation, the location estimation and the location ambiguity correction, as well as a sensor array calibration sensitivity analysis.

Ort, förlag, år, upplaga, sidor
Karlskrona: Blekinge Institute of Technology , 2012.
Serie
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 3
Nyckelord [en]
Beamforming, Detection and classification, Speech enhancement, Source localization
Nationell ämneskategori
Signalbehandling
Identifikatorer
URN: urn:nbn:se:bth-00520Lokalt ID: oai:bth.se:forskinfoACD1267A1007C477C125796400452ABBISBN: 978-91-7295-226-3 (tryckt)OAI: oai:DiVA.org:bth-00520DiVA, id: diva2:835040
Tillgänglig från: 2012-09-18 Skapad: 2011-12-12 Senast uppdaterad: 2016-09-06Bibliografiskt granskad

Open Access i DiVA

fulltext(73 kB)99 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 73 kBChecksumma SHA-512
8db4849343d43bd58995ac1414fa476a6a47f0481fc1b7fc74140fe8f90b908277323a3641a740c90617b036fcc82c07a4556a5650d6483c3c979db287d6b539
Typ fulltextMimetyp application/pdf
fulltext(2333 kB)710 nedladdningar
Filinformation
Filnamn FULLTEXT02.pdfFilstorlek 2333 kBChecksumma SHA-512
690abb779ad1c0750203350d66ca17b676d925c7b564235feb829f2059321f702769c84950d297c4cd7fdf5de884a2a03fb538f52d8de94d22568c07a6740ab9
Typ fulltextMimetyp application/pdf

Personposter BETA

Swartling, Mikael

Sök vidare i DiVA

Av författaren/redaktören
Swartling, Mikael
Av organisationen
Avdelningen för elektroteknik
Signalbehandling

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 809 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 503 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf