h_12.gif (1147 octets)

Evaluation of Selected Internet Search Tools

Search tools or search engines attempt to retrieve resources (usually Web pages) containing the terms that describe your topic. Although not all search tools permit the use of the Boolean operators AND, OR, and NOT, most do use relevancy ranking for retrieval and display. The documentation accompanying many of the prominent search tools is replete with claims about speed, database size, search features, accuracy, and precision. The truth is, even the smaller search tools will sometimes retrieve useful material that the bigger ones miss. There is also no unanimous agreement on how to best calculate the size of a search tool's database. Some tools only count the Web pages they have actually retrieved and indexed. Others also include the number of links mentioned in those retrieved pages. Try several to ensure a more complete search. Almost all the search tools listed below provide helpful search tips. Use this information to take full advantage of each search tool! You may also wish to examine the Search Engine Watch site for more valuable information and resources on search engines. There is also a guide to Getting the most out of Guides and Search Engines that may prove useful in your work or research.

AltaVista
http://altavista.digital.com/
AltaVista says it provides access to "31 million pages found on 476,000 servers, and four million articles from 14,000 Usenet news groups."
Databases: WWW pages, Usenet news groups
Search features: Full text; Boolean operators AND, OR, AND NOT; restriction of search to certain portions of the web page; phrase searching; case sensitive; truncation/wildcard; use of parentheses to group search expressions.
Results: Results can be viewed in standard, compact, and detailed form. Rankings criteria must be specified for "Advanced Query" mode.
Comments: AltaVista's ability to target those WWW pages that link to your site is one of its most popular features. Extensive search tips for features not mentioned above are available.
DejaNews
http://www.dejanews.com/
Databases: Archived Usenet news groups "ranging back to March, 1995."
Search features: Select "Power Search" to take advantage of the full range of search options including the Boolean operators AND, OR, AND NOT, and NEAR. Other features are phrase, proximity and field searching.
Results:
Comments: Updated daily. An extensive help index is available.
Excite
http://www.excite.com/
Excite's producers say that their NetSearch database is not padded by including the number of links embedded in indexed pages.
Databases: Two weeks of Usenet news group postings, Usenet classified advertisements, and NetReviews, a collection of over 60,000 brief Web site reviews.
Search features: Full text of WWW pages, excluding common stopwords; use of Boolean AND, OR, AND NOT; parentheses to group portions of Boolean queries together; "query by example" option finds pages similar to the search result you like best. "Concept-based search" (not available with above search features) attempts to retrieve documents with your exact query words and related words.
Results: The annotated results are returned in order of decreasing relevance.
Comments: Select "Advanced Query Language" for tips on how to fine-tune your search. The index is rebuilt approximately once a week.
Galaxy Search
http://galaxy.tradewave.com/search.html
Databases: WWW pages, Gopher titles, and Telnet resources from the Hytelnet hypertext telnet database.
Search features: Full, title, and text searches, retrieval of ANY or ALL search terms, automatic truncation.
Results: Output length (annotation of results) may be selected.
Comments: Use the name of the sub-directory included with each retrieved item to go seek other similar items in Galaxy. Provides some search tips.
HotBot
http://www.hotbot.com/
HotBot is powered by the Inktomi search engine.
Databases: WWW pages, Usenet news groups.
Search features: Retrieval of all the words, any of the words, phrase, person, and URL. For extra options, select the "modify" and "expert" buttons.
Results: The annotated results are returned in order of decreasing relevance.
Comments: Excludes common stopwords.
 
InfoSeek Guide
http://guide.infoseek.com/
The InfoSeek Guide database counts unique Web pages, not the URLs (or links) mentioned in those pages.
Databases: WWW pages, Usenet news groups, InfoSeek Select Sites (reviewed), directory of companies, e-mail addresses, one month of news stories, FAQs
Search features: Full-text, excluding common stopwords; case sensitive; symbol and numbers recognition; phrase searching; proximity; use of the + (plus) and - (minus) symbols to designate required and unwanted terms.
Results: The annotated results are returned in order of decreasing relevance. Some results include a list of related topics and news groups.
Comments: InfoSeek endeavors to be selective in the pages it indexes so as to reduce the number of duplicate and irrelevant pages retrieved. For best results, use the search tips.
Lycos
http://www.lycos.com/
Lycos merges the results of its continuous WWW sampling into its catalog on a weekly basis. Its database includes the URLs and links (some Gopher and FTP) just mentioned in the Web pages Lycos has actually indexed.
Databases: WWW pages, some FTP and Gopher links, A2Z (the most frequently visited sites arranged by subject) and WWW reviews by Point Communications
Search features: Ignores common stopwords; searches the title, headings, links, keywords, and first 20 lines of Web pages; default search assumes OR between search terms; search form available via the "Custom search" option allows you to choose match strength (loose to strong), a match of ALL or ANY terms, and three levels of display detail. Also allows narrowing the search to sounds or images.
Results: Results are listed in order of decreasing relevance.
Comments: Provides extensive search tips.
The Open Text Index
http://index.opentext.net/
The Open Text Index indexes the full text (even common stopwords) of Web pages.
Databases: WWW pages, Usenet news groups (Deja News)
Search features: Choice between Simple Search and Power Search; Boolean AND, OR, BUT NOT, NEAR, FOLLOWED BY; limit of search to certain parts of Web pages such as URLs, titles, headings.
Results: Special features allow users to view their search terms in context and find pages similar to the ones they like best.
Comments: Open Text does not truncate pluralized words. Be sure to use the search tips.
SavvySearch
http://www.cs.colostate.edu/~dreiling/smartform.html
SavvySearch is a multi-threaded tool, designed to take your query to several search tools at once, gather the information, and return the results.
Databases:WWW pages
Search features: Using a common interface for several search tools prevents the use of certain special search features available at some sites.
Results: Option to select length of format. Results are arranged by name of search tool. Presence of annotation varies depending on the original search tool
Comments: Due to its popularity and the problems inherent in contacting multiple search tools, SavvySearch can be difficult to reach during the day.
WebCrawler
http://webcrawler.com/
WebCrawler allows document title and content searches of its submission and robot-constructed database. The database consists of both explored and unexplored Web pages.
Databases: WWW pages
Search features: Words automatically stripped of their endings; use of Boolean AND, OR, NOT; phrase searching; adjacency; parentheses for grouping searches; proximity searches.
Results: Search results are returned in order of decreasing relevance in an unannotated, easy-to-browse list.
Comments: Useful search tips and examples available.
Yahoo! Search
http://search.yahoo.com/bin/search/options
Databases: WWW pages, e-mail addresses, Usenet news groups.
Search features: Matches can contain all or at least one of the search terms, select substrings or whole words, select the number of matches displayed per page. Yahoo! is not case sensitive.
Results: Yahoo! returns a list of categories that match your keywords, end-sites that match your keywords, and names of categories where those end-sites are listed.
Comments: Yahoo! searchers may use the name of the categories included in search results to go seek other similar items.

 

Source: Northwestern University Library
1935 Sheridan Road
Evanston, IL 60208