Intelligent Camera Tracking using SRP-based sound Source localization in frequency domain
Blekinge Institute of Technology, School of Engineering.
2012 (English). Independent thesis, Advanced level (degree of Master (Two Years)). Student thesis.
Abstract [en]

The Steered Response Power Phase Transform (SRP-PHAT) is one of the most robust sound source localization methods for noisy and reverberant environments. Direction of Arrival (DOA) estimation has important applications in human-computer interfaces such as video conferencing, speech enhancement and speech recognition. In this thesis, the SRP-PHAT method is implemented for a 16-element microphone array arranged in 4 rows and 4 columns, in the presence of noise and reverberation. To compute the time difference of arrival (TDOA) for each pair of microphones in a row or a column, generalized cross-correlation estimates are calculated, and the source position is computed from them. Averaging the TDOA values obtained row-wise and those obtained column-wise then gives the most accurate estimate of the source position. A Weighted Overlap-Add (WOLA) filter bank is used within the SRP-PHAT method to find the TDOA in the frequency domain. The original TDOAs and the TDOAs estimated by SRP-PHAT are compared to analyse the performance of the method. The mean estimation error and the standard deviation are calculated to assess the accuracy of the estimated TDOA values.
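
The pairwise TDOA step described in the abstract can be illustrated with a minimal sketch of generalized cross-correlation with PHAT weighting (GCC-PHAT) for a single microphone pair. This is not the thesis implementation: it uses a naive DFT instead of a WOLA filter bank, and the function names (`dft`, `gcc_phat_delay`), the frame length and the test signal are assumptions made only for illustration.

```python
import cmath
import random


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(sig, ref, max_lag):
    """Estimate by how many samples `sig` lags `ref` (negative if it leads)."""
    n = len(sig) + len(ref)  # zero-pad so the correlation is linear, not circular
    a = dft(list(sig) + [0.0] * (n - len(sig)))
    b = dft(list(ref) + [0.0] * (n - len(ref)))
    cross = [x * y.conjugate() for x, y in zip(a, b)]
    # PHAT weighting: discard the magnitude, keep only the phase
    phat = [c / abs(c) if abs(c) > 1e-12 else 0j for c in cross]
    cc = [v.real for v in dft(phat, inverse=True)]
    # Negative lags are stored at the end of the circular buffer
    return max(range(-max_lag, max_lag + 1), key=lambda lag: cc[lag % n])


# Hypothetical test signal: one noise burst seen by two microphones,
# with the second microphone receiving it 5 samples later.
random.seed(0)
source = [random.gauss(0.0, 1.0) for _ in range(69)]
mic_ref = source[5:69]   # reference microphone
mic_sig = source[0:64]   # same burst, delayed by 5 samples
print("estimated lag in samples:", gcc_phat_delay(mic_sig, mic_ref, max_lag=10))
```

Dividing the estimated lag by the sampling rate gives the TDOA in seconds. In an array such as the 4 x 4 one described above, an estimate of this kind would be computed for each microphone pair in a row or column before the values are averaged.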

Place, publisher, year, edition, pages
2012. p. 55
Keywords [en]
SRP-PHAT, acoustics, camera tracking, GCC-PHAT
National Category
Signal Processing; Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:bth-4046
Local ID: oai:bth.se:arkivexA2081AA51BDEA180C1257A9800222099
OAI: oai:DiVA.org:bth-4046
DiVA, id: diva2:831365
Uppsok
Technology
Note
0046761925672
Available from: 2015-04-22. Created: 2012-10-15. Last updated: 2015-06-30. Bibliographically approved.

Open Access in DiVA

fulltext (3108 kB), 496 downloads
File information
File name: FULLTEXT01.pdf
File size: 3108 kB
Checksum SHA-512: d10ebefd1dc5e764dd77ecadd9270a54ba3d72467235ce9a2241e61864f716ffec840a1416883203b3011c27cf0a657ba6d23b5ca02a123dd692fb00440ae8dc
Type: fulltext
Mimetype: application/pdf

Total: 502 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 159 hits