NexTech 2021 Congress
October 03, 2021 to October 07, 2021 - Barcelona, Spain

  • UBICOMM 2021, The Fifteenth International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies
  • ADVCOMP 2021, The Fifteenth International Conference on Advanced Engineering Computing and Applications in Sciences
  • SEMAPRO 2021, The Fifteenth International Conference on Advances in Semantic Processing
  • AMBIENT 2021, The Eleventh International Conference on Ambient Computing, Applications, Services and Technologies
  • EMERGING 2021, The Thirteenth International Conference on Emerging Networks and Systems Intelligence
  • DATA ANALYTICS 2021, The Tenth International Conference on Data Analytics
  • GLOBAL HEALTH 2021, The Tenth International Conference on Global Health Challenges
  • CYBER 2021, The Sixth International Conference on Cyber-Technologies and Cyber-Systems

SoftNet 2021 Congress
October 03, 2021 to October 07, 2021 - Barcelona, Spain

  • ICSEA 2021, The Sixteenth International Conference on Software Engineering Advances
  • ICSNC 2021, The Sixteenth International Conference on Systems and Networks Communications
  • CENTRIC 2021, The Fourteenth International Conference on Advances in Human-oriented and Personalized Mechanisms, Technologies, and Services
  • VALID 2021, The Thirteenth International Conference on Advances in System Testing and Validation Lifecycle
  • SIMUL 2021, The Thirteenth International Conference on Advances in System Simulation
  • SOTICS 2021, The Eleventh International Conference on Social Media Technologies, Communication, and Informatics
  • INNOV 2021, The Tenth International Conference on Communications, Computation, Networks and Technologies
  • HEALTHINFO 2021, The Sixth International Conference on Informatics and Assistive Technologies for Health-Care, Medical Support and Wellbeing

NetWare 2021 Congress
November 14, 2021 to November 18, 2021 - Athens, Greece

  • SENSORCOMM 2021, The Fifteenth International Conference on Sensor Technologies and Applications
  • SENSORDEVICES 2021, The Twelfth International Conference on Sensor Device Technologies and Applications
  • SECURWARE 2021, The Fifteenth International Conference on Emerging Security Information, Systems and Technologies
  • AFIN 2021, The Thirteenth International Conference on Advances in Future Internet
  • CENICS 2021, The Fourteenth International Conference on Advances in Circuits, Electronics and Micro-electronics
  • ICQNM 2021, The Fifteenth International Conference on Quantum, Nano/Bio, and Micro Technologies
  • FASSI 2021, The Seventh International Conference on Fundamentals and Advances in Software Systems Integration
  • GREEN 2021, The Sixth International Conference on Green Communications, Computing and Technologies

TrendNews 2021 Congress
November 14, 2021 to November 18, 2021 - Athens, Greece

  • CORETA 2021, Advances on Core Technologies and Applications
  • DIGITAL 2021, Advances on Societal Digital Transformation

 


ThinkMind // INTELLI 2013, The Second International Conference on Intelligent Systems and Applications // View article intelli_2013_1_20_80015


Automated Annotation of Text Using the Classification-based Annotation Workbench (CLAW)

Authors:
R.oy George
Hema Nair
Khalil Shujaee
David Krooks
Chandler Armstrong

Keywords: Text Annotation, Multi-label Classification, Bayes Theorem, Annotation Workbench.

Abstract:
Text annotation is used to mark up text using highlights, comments, footnotes, tags, and links. Manual annotation is a human intensive process and is not feasible for a large corpus of text. Classification is a technique that may be used to automate the annotation process. This paper develops a Classification-based Text Annotation Workbench (CLAW), an annotation assistance tool that incorporates automated classification to reduce the difficulty of manual annotation. There are several technical challenges posed by the practical nature of the text corpus and the annotation methodology. The text corpus, is large and consists of numerous reports, lessons learnt and best practices. Complexity is introduced due to the size of the documents, the variety of formats and the range of subject matter. The annotation taxonomy is extensive and unstructured and may be applied to the text body without constraints. Consequently, the search space for the label(s) become prohibitively large and it becomes necessary to adopt strategies that reduce the complexity of the classification process. We introduce a simplification technique to reduce the large classification search space. We improve precision by supplementing these predictive algorithms with similarity based measures and evaluate CLAW for performance using both prediction-based metrics and ranking-based metrics. It is shown that CLAW performs better than a competing algorithm on all evaluation metrics.

Pages: 6 to 11

Copyright: Copyright (c) IARIA, 2013

Publication date: April 21, 2013

Published in: conference

ISSN: 2308-4065

ISBN: 978-1-61208-269-1

Location: Venice, Italy

Dates: from April 21, 2013 to April 26, 2013

SERVICES CONTACT
2010 - 2017 © ThinkMind. All rights reserved.
Read Terms of Service and Privacy Policy.