Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Direction of Arrival Estimation for Speech Sources using Fourth Order Cross Cumulants
Responsible organisation
2008 (English)Conference paper, (Refereed) Published
Abstract [en]

In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known Steered Response Power with Phase Transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a Probability Density Function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources.

Place, publisher, year, edition, pages
Seattle: IEEE , 2008.
Keyword [en]
Localization, Delay estimation, Higher order statistics
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-8499ISI: 000258532101155Local ID: oai:bth.se:forskinfo4F23DB790184F372C12574A40059C144OAI: oai:DiVA.org:bth-8499DiVA: diva2:836225
Conference
International Symposium on Circuits and Systems
Available from: 2012-09-18 Created: 2008-08-13 Last updated: 2015-06-30Bibliographically approved

Open Access in DiVA

fulltext(154 kB)102 downloads
File information
File name FULLTEXT01.pdfFile size 154 kBChecksum SHA-512
b142724b8b319baaca0772ce911b4684f1a9f0d5bb8331245392176b507aaec4fc7ffda84c6c719adb8887c8691ac18f8621902e8db695c95abc7b5f026a3caa
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Swartling, Mikael
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 102 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 22 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf