Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Direction of Arrival Estimation and Localization of Multiple Speech Sources in Enclosed Environments
Blekinge Institute of Technology, School of Engineering, Department of Electrical Engineering.
Responsible organisation
2012 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Speech communication is gaining in popularity in many different contexts as technology evolves. With the introduction of mobile electronic devices such as cell phones and laptops, and fixed electronic devices such as video and teleconferencing systems, more people are communicating which leads to an increasing demand for new services and better speech quality. Methods to enhance speech recorded by microphones often operate blindly without prior knowledge of the signals. With the addition of multiple microphones to allow for spatial filtering, many blind speech enhancement methods have to operate blindly also in the spatial domain. When attempting to improve the quality of spoken communication it is often necessary to be able to reliably determine the location of the speakers. A dedicated source localization method on top of the speech enhancement methods can assist the speech enhancement method by providing the spatial information about the sources. This thesis addresses the problem of speech-source localization, with a focus on the problem of localization in the presence of multiple concurrent speech sources. The primary work consists of methods to estimate the direction of arrival of multiple concurrent speech sources from an array of sensors and a method to correct the ambiguities when estimating the spatial locations of multiple speech sources from multiple arrays of sensors. The thesis also improves the well-known SRP-based methods with higher-order statistics, and presents an analysis of how the SRP-PHAT performs when the sensor array geometry is not fully calibrated. The thesis is concluded by two envelope-domain-based methods for tonal pattern detection and tonal disturbance detection and cancelation which can be useful to further increase the usability of the proposed localization methods. The main contribution of the thesis is a complete methodology to spatially locate multiple speech sources in enclosed environments. New methods and improvements to the combined solution are presented for the direction-of-arrival estimation, the location estimation and the location ambiguity correction, as well as a sensor array calibration sensitivity analysis.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Institute of Technology , 2012.
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 3
Keywords [en]
Beamforming, Detection and classification, Speech enhancement, Source localization
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-00520Local ID: oai:bth.se:forskinfoACD1267A1007C477C125796400452ABBISBN: 978-91-7295-226-3 (print)OAI: oai:DiVA.org:bth-00520DiVA, id: diva2:835040
Available from: 2012-09-18 Created: 2011-12-12 Last updated: 2016-09-06Bibliographically approved

Open Access in DiVA

fulltext(73 kB)179 downloads
File information
File name FULLTEXT01.pdfFile size 73 kBChecksum SHA-512
8db4849343d43bd58995ac1414fa476a6a47f0481fc1b7fc74140fe8f90b908277323a3641a740c90617b036fcc82c07a4556a5650d6483c3c979db287d6b539
Type fulltextMimetype application/pdf
fulltext(2333 kB)2907 downloads
File information
File name FULLTEXT02.pdfFile size 2333 kBChecksum SHA-512
690abb779ad1c0750203350d66ca17b676d925c7b564235feb829f2059321f702769c84950d297c4cd7fdf5de884a2a03fb538f52d8de94d22568c07a6740ab9
Type fulltextMimetype application/pdf

Authority records

Swartling, Mikael

Search in DiVA

By author/editor
Swartling, Mikael
By organisation
Department of Electrical Engineering
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 3091 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 742 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf