Complex-Valued Independent Component Analysis for Online Blind Speech Extraction
2008 (English). In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, E-ISSN 1558-7924, Vol. 16, no 8, p. 1624-1632. Article in journal (Refereed). Published.
Abstract [en]

This paper presents a theoretical analysis of a criterion for complex-valued independent component analysis (ICA), with a focus on blind speech extraction (BSE) of a spatio–temporally nonstationary speech source. The proposed criterion, denoted KSICA, is related to the well-known FastICA method with the kurtosis contrast function. The proposed method is shown to share the important fixed-point feature with the FastICA method. An improvement of the proposed method is that it does not exhibit the divergent behavior that the FastICA method tends to show for a mixture of Gaussian-only sources, and it performs better in online implementations. Compared to FastICA, the KSICA method provides a 10 dB higher source extraction performance and a 10 dB lower standard deviation in a data batch approach when the data batch size is less than 100 samples. For larger batch sizes, the KSICA method performs equally well. In an online application with spatially stationary sources, the KSICA method provides around 10 dB higher interference suppression and 1 MOS-unit lower speech distortion compared to FastICA for a 0.15 s time constant in the algorithm update parameter. Thus, the FastICA performance matches the KSICA performance for a time constant above 1 s. Finally, in an online application with a moving speech source, the KSICA method provides 10 dB higher interference suppression compared to FastICA for the same algorithm settings. All in all, the proposed KSICA method is shown to be a viable alternative for online BSE of complex-valued signal mixtures.
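
As an illustration of the baseline the abstract compares against, the following is a minimal sketch of a one-unit fixed-point iteration with a kurtosis contrast on whitened complex-valued data, in the textbook complex FastICA form w ← E{z y* |y|²} − 2w with y = wᴴz, followed by normalization. It is not the KSICA criterion, whose update rule is not given in this record. The function names (`whiten`, `kurtosis_fixed_point`), the synthetic sources, and the mixing matrix are all assumptions made for the example.

```python
import numpy as np

def whiten(x):
    """Center and whiten complex data x of shape (channels, samples)."""
    x = x - x.mean(axis=1, keepdims=True)
    cov = x @ x.conj().T / x.shape[1]          # sample covariance (Hermitian)
    d, e = np.linalg.eigh(cov)                 # eigenvalues d, eigenvectors e
    return (e / np.sqrt(d)).conj().T @ x       # D^(-1/2) E^H x

def kurtosis_fixed_point(z, n_iter=100, tol=1e-9, seed=0):
    """One-unit kurtosis-based fixed-point iteration on whitened data z.

    Assumes second-order circular sources, so that the update reduces to
    w <- E{z y* |y|^2} - 2 w, followed by normalization to unit length.
    """
    rng = np.random.default_rng(seed)
    m = z.shape[0]
    w = rng.standard_normal(m) + 1j * rng.standard_normal(m)
    w /= np.linalg.norm(w)
    for _ in range(n_iter):
        y = w.conj() @ z                                   # y = w^H z
        w_new = (z * (y.conj() * np.abs(y) ** 2)).mean(axis=1) - 2.0 * w
        w_new /= np.linalg.norm(w_new)
        # w is only defined up to a phase factor, so compare |w_new^H w| to 1
        converged = abs(abs(np.vdot(w_new, w)) - 1.0) < tol
        w = w_new
        if converged:
            break
    return w

# Hypothetical test signals: one super-Gaussian and one sub-Gaussian source.
rng = np.random.default_rng(1)
n = 5000
s1 = (rng.laplace(size=n) + 1j * rng.laplace(size=n)) / 2.0   # unit variance
s2 = np.exp(2j * np.pi * rng.random(n))                       # constant modulus
s = np.vstack([s1, s2])

# Fixed, arbitrarily chosen (invertible) complex mixing matrix.
a = np.array([[1.0, 0.6 + 0.3j],
              [0.4 - 0.5j, 1.0]])
z = whiten(a @ s)

w = kurtosis_fixed_point(z)
y = w.conj() @ z

# Magnitude of the normalized correlation between the output and each source.
corr = [abs(np.vdot(si, y)) / (np.linalg.norm(si) * np.linalg.norm(y)) for si in s]
print("correlation with each source:", np.round(corr, 3))
```

A one-unit iteration of this kind extracts a single component, and which source it locks onto depends on the initial vector, so the check of interest is that the output correlates strongly with one of the two sources. This batch form uses sample averages over the whole block. An online variant would replace them with recursively updated averages, which is where a time constant such as the 0.15 s mentioned above would enter.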

Place, publisher, year, edition, pages
IEEE, 2008. Vol. 16, no 8, p. 1624-1632
Keywords [en]
Array signal processing, higher order statistics, speech enhancement
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-8385
DOI: 10.1109/TASL.2008.2002058
ISI: 000260463800022
Local ID: oai:bth.se:forskinfo65AB698F1066AAF9C12574EC002AE0E4
OAI: oai:DiVA.org:bth-8385
DiVA, id: diva2:836100
Available from: 2012-09-18. Created: 2008-10-24. Last updated: 2017-12-04. Bibliographically approved.

Open Access in DiVA

fulltext (474 kB), 273 downloads
File information
File name: FULLTEXT01.pdf
File size: 474 kB
Checksum (SHA-512): e3a737cea417515d2a1350a88c590676ed8d95553ec1018f11c46478a4028fd79e74f3091301b44e401771553a970aa538e6b3fd9f6b70e3f26ef4c4825d0856
Type: fulltext
Mimetype: application/pdf

Authority records

Claesson, Ingvar
