Speaker Separation Investigation
Blekinge Institute of Technology, School of Engineering, Department of Telecommunication Systems.
2007 (English). Independent thesis, Advanced level (degree of Master (One Year)). Student thesis.
Alternative title
Högtalareavskiljandeutredning (Swedish)
Abstract [en]

This report describes two investigations that formed part of a larger project on separating overlapping speech signals. The first investigation uses chirp signals to measure the acoustic transfer functions typically encountered in the speaker separation project. It explains how chirps behave in acoustic environments; besides measuring transfer functions for speaker separation, the same measurements can be used to characterise room reverberation. Linear and logarithmic chirps of different lengths are analysed in two acoustic environments. The main findings come from a comparative analysis of the chirps in terms of their cross-correlations, spectrograms and power spectrum magnitudes. The second investigation uses an automatic speech recognition (ASR) system to test the performance of the speaker separation algorithm in terms of word accuracy for different speakers. Speakers were recorded in two scenarios: non-overlapping, in which each speaker spoke alone, and overlapping, in which two speakers spoke simultaneously. To improve speaker separation in the overlapping scenario, I worked closely with my colleague Mr. Holfeld, who was improving the existing speech separation algorithm. After cross-examining our findings, we improved the algorithm, which in turn improved the word accuracy of the speech recognition software in the overlapping scenario.
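
The chirp-based measurement described in the abstract can be illustrated with a minimal sketch. It is not taken from the thesis: the sample rate, sweep range, chirp length and the two-tap "room" are assumptions chosen only to keep the example small, and all function names are hypothetical. The sketch generates a linear and a logarithmic chirp, passes each through a toy impulse response (a direct path plus one weaker echo), and cross-correlates the result with the original chirp. The lag of the largest correlation peak estimates the delay of the direct path.

```python
import math


def linear_chirp(f0, f1, duration, fs):
    """Sine sweep whose frequency rises linearly from f0 to f1 (Hz)."""
    n = round(duration * fs)
    rate = (f1 - f0) / duration  # sweep rate in Hz per second
    return [math.sin(2 * math.pi * (f0 * (i / fs) + 0.5 * rate * (i / fs) ** 2))
            for i in range(n)]


def log_chirp(f0, f1, duration, fs):
    """Sine sweep whose frequency rises exponentially from f0 to f1 (Hz)."""
    n = round(duration * fs)
    ratio = f1 / f0
    scale = 2 * math.pi * f0 * duration / math.log(ratio)
    return [math.sin(scale * (ratio ** (i / fs / duration) - 1.0))
            for i in range(n)]


def convolve(signal, impulse_response):
    """Direct-form linear convolution (stands in for the acoustic path)."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            if h:
                out[i + j] += s * h
    return out


def cross_correlate(recorded, reference, max_lag):
    """Cross-correlation of recorded with reference for lags 0..max_lag."""
    result = []
    for lag in range(max_lag + 1):
        total = 0.0
        for n, r in enumerate(reference):
            if n + lag < len(recorded):
                total += recorded[n + lag] * r
        result.append(total)
    return result


def estimate_delay(recorded, reference, max_lag):
    """Lag (in samples) of the largest cross-correlation value."""
    xcorr = cross_correlate(recorded, reference, max_lag)
    return max(range(len(xcorr)), key=xcorr.__getitem__)


# Assumed toy room: direct path after 25 samples, half-amplitude echo after 60.
FS = 8000
toy_room = [0.0] * 61
toy_room[25] = 1.0
toy_room[60] = 0.5

for name, make_chirp in (("linear", linear_chirp), ("logarithmic", log_chirp)):
    chirp = make_chirp(500.0, 3500.0, 0.05, FS)
    recorded = convolve(chirp, toy_room)
    delay = estimate_delay(recorded, chirp, max_lag=100)
    print(f"{name} chirp: direct path at {delay} samples ({1000 * delay / FS:.2f} ms)")
```

In a real measurement the recorded signal would come from a microphone rather than a simulated convolution, and the full correlation curve, not only its largest peak, would be inspected to see later reflections.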

Place, publisher, year, edition, pages
2007. p. 121
Keywords [en]
Speaker Separation, Acoustics, Cross-correlation, Automatic Speech Recognition, Room Impulse Response, Maximal Length Sequences, Chirps
National Category
Signal Processing; Telecommunications
Identifiers
URN: urn:nbn:se:bth-5794
Local ID: oai:bth.se:arkivex553AFA8D8FE87E03C125733D00354E49
OAI: oai:DiVA.org:bth-5794
DiVA, id: diva2:833197
Uppsok
Technology
Supervisors
Note
Email Contact: foxandanchor@gmail.com
Mobile: +46708290539
Available from: 2015-04-22
Created: 2007-08-20
Last updated: 2015-06-30
Bibliographically approved

Open Access in DiVA

fulltext (1001 kB), 666 downloads
File information
File name: FULLTEXT01.pdf
File size: 1001 kB
Checksum (SHA-512):
361aa654cb479aff90f63755b24807f49be0d8212359ab5f79836145ef0ab27d4bd27f748219ba42314bfef4b67f7cb0744b363854942f14bb58290bf9932590
Type: fulltext
Mimetype: application/pdf

Total: 668 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 361 hits