The Scorpion Open Source project offers software that implements a system for automatically classifying Web-accessible text documents. Scorpion is intended for use by investigators who have a machine-readable subject classification scheme or thesaurus and wish to incorporate it into an automatic classification system.
The following pages have many links to articles that describe the development and evaluation of OCLC's Scorpion project.
- Automatic classification
This page contains links to current research papers and related projects.
- The Scorpion project
This site contains links to early research papers and a Web demo.
As of 2006 we are issuing software under the Apache License, Version 2.0.
If you would like to use this software under the Apache license, please contact us and we may be able to update the software to use the Apache license.
You may download the complete Scorpion code without using CVS for use or evaluation. This download is Release 1.1 of the software.
Scorpion is an application of Pears and Gwen. For a complete installation, both of them must be installed. In addition, the Dbutils support classes must be installed. The CVS repository for Scorpion contains the software and documentation required for designing Scorpion databases, custom-handling the results of a database search, and implementing a Web demo.