Arrow Research search
Back to AAAI

AAAI 1999

Regression Testing for Wrapper Maintenance

Conference Paper Technical Papers Artificial Intelligence

Abstract

Recent workon Internet information integration ~sumesa library of wrappers, specialized informationextraction procedures. Maintainingwrappersis difficult, becausethe formatting regularities on whichthey rely often change. Thewrapperverification problemis to determine whethera wrapperis correct. Standard regression testing approachesare inappropriate, because both the formatting regularities anda site’s underlying content may change. Weintroduce RAPTURE, a fully-implemented, domain-independent verification algorithm. RAPTURE uses well-motivated heuristics to computethe similarity betweena wrapper’s expected and observed output. Experimentswith 27 actual Internet sites showa substantial performanceimprovementover standard regression testing.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue
AAAI Conference on Artificial Intelligence
Archive span
1980-2026
Indexed papers
28718
Paper id
1030888095946653331