A Simple and Computationally Efficient Algorithm for Real-Time Blind Source Separation of Speech Mixtures
Responsible organisation
2006 (English) Conference paper, Published paper (Refereed) Published
Abstract [en]

In this paper we exploit the amplitude diversity provided by two sensors to achieve blind separation of two speech sources. We propose a simple and highly computationally efficient method for separating sources that are W-disjoint orthogonal (W-DO), that is, sources whose time-frequency representations are disjoint sets. The Degenerate Unmixing and Estimation Technique (DUET), a powerful and efficient method that exploits the W-disjoint orthogonality property, requires extensive computations for maximum likelihood parameter learning. Our proposed method avoids all the computations required for parameter estimation by assuming that the sources are "cross high-low diverse (CH-LD)", an assumption that is explained later and that can be satisfied by exploiting the sensor settings/directions. With this assumption and the W-disjoint orthogonality property, two binary time-frequency masks that can extract the original sources from one of the two mixtures can be constructed directly from the amplitude ratios of the time-frequency points of the two mixtures. The method works very well when tested with both artificial and real mixtures. Its performance is comparable to that of DUET, and it requires only 2% of the computations required by the DUET method. Moreover, it is free of the convergence problems that lead to poor SIR in the first parts of the signals. As with all binary masking approaches, the method suffers from artifacts that appear in the output signals.
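The masking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not define the CH-LD condition, so the code assumes it means that one source is stronger at sensor 1 and the other is stronger at sensor 2. The function name `amplitude_ratio_masks`, the threshold of 1.0, the mixing gains, and the toy time-frequency grid are all hypothetical choices made for illustration.

```python
import numpy as np


def amplitude_ratio_masks(X1, X2, threshold=1.0, eps=1e-12):
    """Build two complementary binary masks from the ratio |X2| / |X1|.

    X1, X2: complex time-frequency representations (for example STFTs)
    of the two mixtures, with identical shapes.
    threshold: assumed decision boundary; 1.0 is an illustrative choice,
    not a value taken from the paper.
    """
    ratio = np.abs(X2) / (np.abs(X1) + eps)  # eps avoids division by zero
    mask_a = ratio < threshold  # points where mixture 1 is the louder one
    mask_b = ~mask_a            # all remaining points
    return mask_a, mask_b


# Toy W-disjoint orthogonal sources: at each time-frequency point
# exactly one of the two sources is non-zero.
S1 = np.array([[1 + 0j, 0, 2], [0, 0, 1j]])
S2 = np.array([[0, 3 + 0j, 0], [1, 2j, 0]])

# Assumed mixing gains: source 1 is louder at sensor 1,
# source 2 is louder at sensor 2.
X1 = 1.0 * S1 + 0.5 * S2
X2 = 0.5 * S1 + 1.0 * S2

mask_a, mask_b = amplitude_ratio_masks(X1, X2)

# Both estimates are extracted from the same mixture (X1).
est1 = X1 * mask_a
est2 = X1 * mask_b
```

In this toy case `est1` recovers `S1` and `est2` recovers `S2` scaled by its gain at sensor 1. No parameters are learned: each mask comes from a single element-wise comparison, which is consistent with the low computational cost the abstract reports.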

Place, publisher, year, edition, pages
Setubal, PORTUGAL: INSTICC-INST SYST TECHNOLOGIES INFORMATION CONTROL & COMMUNICATION, 2006.
Keywords [en]
BSS, blind source separation, speech enhancement, speech analysis, speech synthesis
National subject category
Signal Processing
Identifiers
URN: urn:nbn:se:bth-8742
ISI: Setubal, PORTUGAL
Local ID: oai:bth.se:forskinfoB18A94B4264EDB92C12573C6007AA154
ISBN: 972-8865-64-3 (print)
OAI: oai:DiVA.org:bth-8742
DiVA, id: diva2:836494
Conference
International Conference on Signal Processing and Multimedia Applications
Available from: 2012-09-18 Created: 2008-01-04 Last updated: 2015-06-30 Bibliographically approved
