Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Pattern Acquisition Methods for Information Extraction Systems
Blekinge Institute of Technology, School of Engineering, Department of Systems and Software Engineering.
2007 (English)Independent thesis Advanced level (degree of Master (One Year))Student thesis
Abstract [en]

This master thesis treats about Event Recognition in the reports of Polish stockholders. Event Recognition is one of the Information Extraction tasks. This thesis provides a comparison of two approaches to Event Recognition: manual and automatic. In the manual approach regular expressions are used. Regular expressions are used as a baseline for the automatic approach. In the automatic approach three Machine Learning methods were applied. In the initial experiment the Decision Trees, naive Bayes and Memory Based Learning methods are compared. A modification of the standard Memory Based Learning method is presented which goal is to create a classifier that uses only positives examples in the classification task. The performance of the modified Memory Based Learning method is presented and compared to the baseline and also to other Machine Learning methods. In the initial experiment one type of annotation is used and it is the meeting date annotation. The final experiment is conducted using three types of annotations: the meeting time, the meeting date and the meeting place annotation. The experiments show that the classification can be performed using only one class of instances with the same level of performance.

Place, publisher, year, edition, pages
2007. , p. 68
Keywords [en]
Natural Language Processing, Information Extraction, Patterns Acquisition, Linguistic Patterns, Memory Based Learning, Event Recognition
National Category
Computer Sciences Software Engineering
Identifiers
URN: urn:nbn:se:bth-4291Local ID: oai:bth.se:arkivex449AF033BA2EF5ACC125736E0045D807OAI: oai:DiVA.org:bth-4291DiVA, id: diva2:831623
Uppsok
Technology
Supervisors
Note
(+48)669808616Available from: 2015-04-22 Created: 2007-10-08 Last updated: 2018-01-11Bibliographically approved

Open Access in DiVA

fulltext(1087 kB)251 downloads
File information
File name FULLTEXT01.pdfFile size 1087 kBChecksum SHA-512
221e674b5ed49c97f5d8484a49e69cebbd21a7e7629419b09139c8a27e04d60c6a1acde5e1dcd4b01344072af32521241efeca77552686eb3543865a195fe30f
Type fulltextMimetype application/pdf

By organisation
Department of Systems and Software Engineering
Computer SciencesSoftware Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 251 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 803 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf