Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
SPEECH RECOGNITION FOR WEB BASED TELEPHONY
Blekinge Institute of Technology, School of Engineering.
2010 (English)Independent thesis Advanced level (degree of Master (Two Years))Student thesis
Abstract [en]

Web based telephony purges the need of explicit downloading and installing a VoIP client software. Calls in web based telephony can be made directly from the browser. The combination of web technologies and traditional telephony makes it possible to introduce new exciting services. One such new service is introduced as a result of this thesis work. The voicemails received are automatically transcribed and converted into text; the text is then saved to an inbox. The performance of the introduced service is good and gives a better recognition rate in the current configuration. The speech recognition covers a continuous speech of English and a maximum vocabulary of 64 thousand words. Adobe Flash 10 has a proprietary protocol for the streaming of audio over internet. Red5 server is an open source server that has support for RTMP plug in. Red5Phone is an open source SIP phone containing a flash based client. The new service introduced is added to the existing Red5Phone solution. Speech recognition for web based telephony was investigated, developed, implemented, and tested. Sphinx-4 is an open source state-of-the art ASR system. It is capable of keeping up with the requirement of large vocabulary transcription. Sphinx-4 was configured and integrated with the developed service for the transcription of voicemails. The performance of Sphinx-4 was rigorously evaluated before its configuration.

Place, publisher, year, edition, pages
2010. , p. 63
Keywords [en]
Speech Recognition, VoIP, Web Telephony, RTMP, SIP, Red5Phone, Sphinx-4, Voicemail
National Category
Signal Processing Telecommunications
Identifiers
URN: urn:nbn:se:bth-3426Local ID: oai:bth.se:arkivex7083C58C4987F9CAC125770B0041C141OAI: oai:DiVA.org:bth-3426DiVA, id: diva2:830732
Uppsok
Technology
Supervisors
Available from: 2015-04-22 Created: 2010-04-20 Last updated: 2015-06-30Bibliographically approved

Open Access in DiVA

fulltext(914 kB)207 downloads
File information
File name FULLTEXT01.pdfFile size 914 kBChecksum SHA-512
ef6c1579db347f292fe2508b800c69a137a7e5f5168341f46d2f139fe9f4bf6bbcb4378a04b77e685126653a553f19cb50d6bb6e98b04f69abf21c862b4b7118
Type fulltextMimetype application/pdf

By organisation
School of Engineering
Signal ProcessingTelecommunications

Search outside of DiVA

GoogleGoogle Scholar
Total: 207 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 244 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf