UTCS Artificial Intelligence
ELIXIR: A Library for Writing Wrappers in Java (2001)
Edward Wild
ELIXIR is a library for writing wrappers in Java. ELIXIR provides a way to combine text extraction and spidering in wrappers. Since wrappers using ELIXIR are Java programs, they are easy to integrate with other Java programs. The user can also extend the functionality of ELIXIR by implementing new ItemExtractors. In one experiment, a wrapper written using ELIXIR showed an 89% reduction in non-comment source statements compared with a wrapper written using a prototype of ELIXIR. In another experiment, a wrapper written using ELIXIR showed a 90% reduction in non-comment source statements compared with a wrapper written using SPHINX, a Java toolkit for writing spiders.
View: PDF, PS
Citation:
Undergraduate Honor Thesis, Department of Computer Sciences, University of Texas at Austin.
Bibtex:
@misc{wild:undergrad2001,
  title={ELIXIR: A Library for Writing Wrappers in Java},
  author={Edward Wild},
  month={December},
  note={Undergraduate Honor Thesis, Department of Computer Sciences, University of Texas at Austin},
  url="http://www.cs.utexas.edu/users/ai-lab?wild:undergrad2001",
  year={2001}
}
Areas of Interest
Information Extraction
Machine Learning
Labs
Machine Learning