

Overview
WebGlimpse adds search capabilities to your WWW site
automatically and easily.
It attaches a small search box to the bottom of every
HTML page, and allows the search to cover the neighborhood
of that page or the whole site.
With WebGlimpse there is
no need to construct separate search pages, and no need
to interrupt the users from their browsings.
All pages remain unchanged except for the extra search capabilities.
It is even possible for the search to efficiently cover remote
pages linked from your pages.
(WebGlimpse will collect such remote pages to your disk
and index them.)
Installation, customization (e.g., deciding which pages to collect
and which ones to index), and maintenance are easy.
WebGlimpse uses the Glimpse
search engine, a fast and flexible searching tool.
WebGlimpse is now at version 1.5.
These pages refer to the newest version of WebGlimpse.
WebGlimpse borrows ideas from our previous
GlimpseHTTP tool
(which is not supported anymore -- see the
security alert
about it),
and from the Harvest
system.
(WebGlimpse can be thought of as a ``mini Harvest'':
Its gathering capabilities are more limited -- only HTML and
text pages are handled -- but its abilities
to define and search only neighborhoods may prove more
flexible. It is also generally easier to install and maintain.)



Main Features of WebGlimpse
-
Automatic addition of search features to all selected pages with one script.
The following form is added to the bottom of all pages:
(This is for illustration only, it's not a working form.)
-
Neighborhoods can be defined for each page (e.g., all pages within
2 links away), and the search can be restricted to the current
page's neighborhood.
This allows a convenient combination of
searching and browsing.
-
Automatic collection and indexing of remote pages that are linked
from your pages.
-
Very flexible search, using glimpse, supporting
a search of only new documents (e.g., last 8 days), allowing
misspellings, Boolean queries, regular expressions, and more.
Glimpse's space requirements for the index are minimal
(less than 10% of the source).
Clicking on ``Search Options'' in the form above brings the more
extended options form:
(This is for illustration only, it's not a working form.)
-
Each answer includes the title of (and a link to) the URL, the date
it was last modified, and
the list of all lines that matched
the query.
For example, a query for ``matching'' may return
Udi Manber --
Home Page, ( May 14 1996)
- pattern matching.
- agrep -- an improved grep that allows approximate matching,
- A fast algorithm for multi-pattern matching
-
Easy maintenance and installation.
-
At the current time, WebGlimpse works only on UNIX systems.
We hope to have an NT version soon.

IMPORTANT: Read the Copyright Notice
NOTE: WebGlimpse requires glimpse version 4.0b1
and above, and Perl version 5.0 and above.
See Glimpse Home Pages
for download information.
For commercial sites,
professional support for WebGlimpse is
now available.

Documentations
An Article
describing the design and implementation of WebGlimpse.
Installation of WebGlimpse
configuring an archive using the confarc command
Indexing an archive
Removing an archive using rmarc
Technical documentation (for the advanced user only)
Contacts and Credits for WebGlimpse

To be put on the glimpse mailing list, send email to
glimpse-request@cs.arizona.edu
with the word "add" or "subscribe" anywhere in the message.
(To be removed from the list, use the word "remove".)
This is not an automatic listserv.
Maintained by
Udi Manber.
glimpse@cs.arizona.edu
Last modified: January 6, 1998.