Deakin University

File(s) under permanent embargo

Evaluating parts-of-speech taggers for use in a text-to-scene conversion system

conference contribution
posted on 2005-01-01, 00:00, authored by K Glass, Shaun Bangay
This paper presents parts-of-speech tagging as a first step towards an autonomous text-to-scene conversion system. It categorizes some freely available taggers according to the techniques each uses to identify word classes automatically. In addition, the performance of each identified tagger is verified experimentally. The SUSANNE corpus is used for testing and reveals the complexity of working with different tagsets, resulting in substantially lower accuracies in our tests than in those reported by the developers of each tagger. The taggers are then grouped to form a voting system in an attempt to raise accuracies, but in no case do the combined results improve upon the individual accuracies. Additionally, a new metric, agreement, is tentatively proposed as an indication of confidence in the output of a group of taggers where such output cannot be validated.
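The abstract does not spell out how the voting system or the agreement metric is computed, so the following Python sketch is only one plausible reading, not the authors' method. It assumes that all taggers have already been mapped to a common tagset, that votes are combined by simple plurality with ties going to the earliest tagger in the list, and that agreement is the fraction of tokens on which every tagger outputs the same tag. The function names and the example tags are hypothetical.

```python
from collections import Counter


def majority_vote(tags_per_tagger):
    """Combine per-token tags from several taggers by plurality vote.

    tags_per_tagger: list of tag sequences, one per tagger, all of equal length.
    Assumption: ties are broken in favour of the earliest tagger in the list.
    """
    if not tags_per_tagger:
        return []
    length = len(tags_per_tagger[0])
    if any(len(seq) != length for seq in tags_per_tagger):
        raise ValueError("all taggers must tag the same token sequence")
    voted = []
    for token_tags in zip(*tags_per_tagger):
        counts = Counter(token_tags)
        best = max(counts.values())
        # The first tag (in tagger order) that reaches the top count wins.
        voted.append(next(t for t in token_tags if counts[t] == best))
    return voted


def agreement(tags_per_tagger):
    """Assumed definition: share of tokens on which all taggers agree."""
    columns = list(zip(*tags_per_tagger))
    if not columns:
        return 0.0
    unanimous = sum(1 for col in columns if len(set(col)) == 1)
    return unanimous / len(columns)


def accuracy(predicted, gold):
    """Share of tokens whose predicted tag matches the reference tag."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


# Hypothetical outputs of three taggers for the tokens "the cat sat".
tagger_a = ["DT", "NN", "VBD"]
tagger_b = ["DT", "NN", "VBN"]
tagger_c = ["DT", "JJ", "VBD"]
gold = ["DT", "NN", "VBD"]

outputs = [tagger_a, tagger_b, tagger_c]
voted = majority_vote(outputs)
print(voted, accuracy(voted, gold), agreement(outputs))
```

Under this reading, agreement needs no reference tags, which is what would make it usable where the output cannot be validated, whereas accuracy requires a gold-standard corpus such as SUSANNE.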

History

Event

South African Institute for Computer Scientists and Information Technologists (2005: White River, South Africa)

Series

ACM International Conference Proceeding Series

Pagination

20 - 28

Publisher

South African Institute for Computer Scientists and Information Technologists

Location

White River, South Africa

Place of publication

Pretoria, South Africa

Start date

2005-09-20

End date

2005-09-22

ISBN-10

1595932585

Language

eng

Publication classification

E1.1 Full written paper - refereed

Copyright notice

2005, SAICSIT

Editor/Contributor(s)

J Bishop, D Kourie

Title of proceedings

SAICSIT '05 : Research for a Changing World – Proceedings of SAICSIT 2005
