Learning (k,l)-contextual tree languages for information extraction from web pages

Abstract

This paper introduces a novel method, based on (k,l)-contextual tree languages, for learning a wrapper that extracts information from web pages. It also introduces a method for learning good values of k and l from a few positive and negative examples. Finally, it describes how the algorithm can be integrated into a tool for information extraction.
