Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Simple and Computationally Efficient Algorithm for Real-Time Blind Source Separation of Speech Mixtures
Responsible organisation
2006 (English)Conference paper, Published paper (Refereed) Published
Abstract [en]

In this paper we exploit the amplitude diversity provided by two sensors to achieve blind separation of two speech sources. We propose a simple and highly computationally efficient method for separating sources that are W-disjoint orthogonal (W-DO), that are sources whose time-frequency representations are disjoint sets. The Degenerate Unmixing and Estimation Technique (DUET), a powerful and efficient method that exploits the W-disjoint orthogonality property, requires extensive computations for maximum likehood parameter learning. Our proposed method avoids all the computations required for parameters estimation by assuming that the sources are "cross high-low diverse (CH-LD)", an assumption that is explained later and that can be satisfied exploiting the sensors settings/directions. With this assumption and the W-disjoint orthogonality property, two binary time-frequency masks that can extract the original sources from one of the two mixtures, can be constructed directly from the amplitude ratios of the time-frequency points of the two mixtures. The method works very well when tested with both artificial and real mixtures. Its performance is comparable to DUET, and it requires only 2% of the computations required by the DUET method. Moreover, it is free of convergence problems that lead to poor SIR ratios in the first parts of the signals. As with all binary masking approaches, the method suffers from artifacts that appear in the output signals.

Place, publisher, year, edition, pages
Setubal, PORTUGAL: INSTICC-INST SYST TECHNOLOGIES INFORMATION CONTROL & COMMUNICATION , 2006.
Keywords [en]
bSS, blind source separation, speech enhancement, speech analysis, speech synthesis
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-8742ISI: Setubal, PORTUGALLocal ID: oai:bth.se:forskinfoB18A94B4264EDB92C12573C6007AA154ISBN: 972-8865-64-3 (print)OAI: oai:DiVA.org:bth-8742DiVA, id: diva2:836494
Conference
International Conference on Signal Processing and Multimedia Applications
Available from: 2012-09-18 Created: 2008-01-04 Last updated: 2015-06-30Bibliographically approved

Open Access in DiVA

No full text in DiVA

Signal Processing

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 422 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf