| |





















































|

|

Projects: SourceFinder
|
|
Bhasha developed a client/server software tool to gather "good" documents
from the web to a database. The user specifies "good" documents -
and their categories - by providing example-documents tagged with their
categories. The server then crawls the web looking for "similar" documents.
It analyzes the documents for a large number of features to check for
their similarity with the given example documents. It stores the "good"
documents in a database. The users can review the collected documents
and their analyses through the client software.



|
|


Last modified: January 2, 2001.

|