Documentation

 

Home
SLO Information
Instruction Page
Documentation
Practice Modules
Tips and Tricks
Discussion List
Search Links
Submit a Question

Glossary

Introduction

Finding information on the World Wide Web has been compared to being in the world's largest library and not having access to a card catalog or information desk.  Selecting the appropriate search tools can overcome these limitations.

There are three major classifications of Internet Search Tools:

1)    Directories

2)    Search Engines

3)    Meta-Search Engines

General Search Tips

Search Engine Syntax

When do I use a directory and when do I use a search engine?

Popular Search Engines

Subject and Other Guides

Search Tools for Children

How do I find graphic files and image files using a search engine?

Evaluating K-12 Websites

Directories

Directories are man-made databases that organize sites by subject areas - not unlike a subject-based library card catalog.  Sites are reviewed by the directory's staff and classified under some appropriate topic heading.  For example, a school district web page, might be found under Education/K-12/school/sites/.

The most famous directory is the Yahoo database.  Yahoo contains 14 subject areas and each area also contains several subtopics.  Yahoo also offers many other services besides its directory.  It has free E-mail, auctions, news, weather, sports, chat rooms, online shopping, etc.  A site like Yahoo that contains numerous features and activities is known as a Portal.

Yahoo is based on a hierarchy of subjects.  For example, to find web sites that contain Social Studies lesson plans, the user would navigate through a series of topics and subtopics:

Education -> K-12 ->  Social Studies -> Lesson Plans

Yahoo also contains a children's site - Yahooligans, which offers a watered down subject list and filtering of inappropriate materials.

The Argus Clearinghouse is a site that lists topical guides to select Internet-based information.  The guides are evaluated on a scale of five checkmarks.  This site has high standards and is very selective about its listings.  

Advantages

The advantages of using Yahoo and other directories are that they are easy to navigate and present a good starting point if you just want to browse what's available under a subject area.   A directory provides a wide scope of info listed under broad topic headings.  Searching is also possible; however, you will only be searching the directory's database.  In Yahoo, if an item isn't found in the Yahoo database, the site defaults to an Internet search engine.

Disadvantages

Directories such as Yahoo only cover a small percentage of the sites available on the web - less than 3%.  Most directories are not very good for finding very specific information.  If you're looking for very specific information try using a search engine.
                                                                                                                                TOP

Search Engines

Search engines are software programs that traverse web servers and index the files they collect.  This software is known as a spider, robot or crawler.  The spider visits a web page, reads it, and then follows links to other pages at that site.  The spider returns to the site periodically to update any changes.

The indexes that are created can be collected from the title, full-text, URL, etc.  An index is sometimes known as a catalog.  It may take a while for new pages or changes that the spider finds to be added to the index.  The user utilizes an interface that allows entry of keywords into a search box.  When you do a keyword search, the search engine matches your query words against all the records it has in its indexed databases to present a listing of possible documents meeting your request.  Boolean or logical operators (AND, OR, NOT) can be used on many search engines. 

The results of the keyword search or query are displayed in relevancy ranked order.  Every search engine has a different algorithm (formula) for ranking the display of relevant hits.  The algorithm usually looks at the keywords location and frequency on the page and will generally rank pages high with keywords found in the title and repeated many times.  

Not all search engines are created equal.  Database size varies and performance is linked to how often updates are made, the number of unique hits, and the number of inactive or dead links.

Advantages

Search engines contain a much larger and more current database than a directory search tool.  They also allow the use of Boolean operators or special syntax.

Disadvantages

Can present an overwhelming number of results that may not be relevant to the original search request.  Users must read the help files contained in the search engine documentation.  Unfortunately, not all search engines share the same syntax.

                                                                                                                           
TOP

Popular Search Engines

AltaVista is a portal site that presents numerous search options.  Its simple search interface supports pseudo-Boolean syntax and the advanced search tool can handle Boolean queries.  There are four tabs above the Search box that toggle users between an advanced search, Images, MP3/Audio, and video search screens.

AltaVista also claims to index more of the Internet than any other search engine and also has the capability to conduct searches in different languages.   Raging Search is a recent AltaVista product that is a search-only site.

Fast Search is a simple, uncluttered, easy-to-use, quick search engine.  It does not display advertisements and portal buttons to distract your attention.  It has both a simple and advanced search interface.  The simple search supports pseudo-Boolean syntax and the advanced search filters words through drop-down menus.  Domain filtering, filtering for offensive content and support for multiple languages are also available at this site.  Search interfaces for MP3, multimedia and a listening room are provided through Lycos.

Google is a clean interface search engine unencumbered by clusters of buttons, adds and links.  Its PageRank function indicates the number of Web pages that point to a particular document and determines which pages to display first.  It also includes an "I'm Feeling Lucky" button that displays the search engines top pick.  Another interesting feature is Google's cache function.  Each page found after a search is automatically saved so if a link you need no longer exists, clicking on the "Show matches (Cache)" link will display the last saved version of the page.

HotBot is powered by the Inktomi search engine.  Featured on this search site is an easy to set collection of fine-tuning controls that work without using Boolean operators.  These controls appear in drop-down menus in both the standard and advanced search interfaces.  Also featured are word filters, specific date ranges and domain choices.  The site also includes subject categories, links to commercial sites, E-mail, etc.  Boolean syntax, wild cards and case-sensitive searching are also supported.  The site utilizes the Direct Hit popularity engine, which includes a list of the ten most visited sites for your query.

The Northern Light search engine groups search results into "custom folders" based on subject, document type, source and language, that enables more efficient location of relevant information.  There is also support for Boolean syntax, nested queries, wildcards and truncation.  In the Power Search mode, searches can be limited by date, language, and type of Web page (.edu, .org, etc.).  Another unique feature of this site is a fee-based special collection of articles not available elsewhere.

                                                                                                                            TOP

Search Engine Syntax

Boolean or logical operators (AND, OR, NOT) are not universally accepted by all search engines.  It's important to read the help files of the search engine you intend to use.

Steps in constructing a keyword search:

a)    Identify keywords

When you do a search break down the topic into key concepts.  For example, to find information on the orbit of Neptune's moons, the keywords may be:

Neptune   moons   orbit

In some search engines the order of the words is critical.  The primary subject of your search should be listed first with subtopics or secondary info listed after.

b)    Use Boolean or logical operators, if allowed.  (AltaVista, Excite, Fast Search, HotBot, Lycos)

Neptune AND moons AND Orbit

In the above construct only pages where the words Neptune, moons, and orbit appear on the same page will be displayed in the list of search results.  AND is used to limit a search.

The keywords:

Neptune OR moons OR orbit

This construct will produce pages that contain any or all of the keywords in the list of results.  OR is used to expand a search.

The keywords:

Cowboys AND NOT Dallas 

The above construct will produce pages that contain information on cowboys but not web pages about the "Dallas Cowboys" football team.

AND NOT is used to filter out undesired websites.

c)    Use Pseudo-Boolean or implied operators. (+ -) (AltaVista Simple Search, Excite, Fast Search, Google, HotBot, InfoSeek, Lycos) 

Many search engines allow the + and - to be used instead of AND, and AND NOT.  There is no space between the plus or minus sign and the keyword.

To use OR in these engines do not use a symbol. 

+k-12 +"lesson plans"

The above syntax will find pages that have the term k-12 and the phrase "lesson plans" on the same page.

+"lesson plans" -math -science

The construct above will produce pages that have the phrase "lesson plans" and exclude pages that also contain the words math or science.

apples oranges

This construct will result in pages that have the words apples or oranges on them.

d)    Phrase Searching

To keep multiple words or phrases together use double quotes.  For example:

+"heart disease" +cause

The above example instructs the search engine to retrieve pages where the phrase "heart disease" appears along with the word cause somewhere else on the page.

e)    Capitals and Plurals

Most search engines interpret small case letters as either as either small or upper case.  If you desire both lower and upper case occurrences returned, use lower case letters.

Some search engines support truncation or wildcard features that allow variations in spelling or word forms.  An asterisk(*) tells the search engine to return alternate spellings for a word at the point that the asterisk appears.

For example swim* returns hits with swim, swimming, swimmer, swimsuit, etc.

f)    Title Search  (AltaVista, Fast Search, HotBot, InfoSeek, Lycos, Northern Light)

One of the most effective ways to narrow search results is to search by fields: title, domain, host, URL, and link.

+title:"Bill Clinton" +President +Hillary

The search above directs the search engine to return web pages where the phrase Bill Clinton appears in the title and the words President and Hillary appear somewhere on the page.

Please note: Check the help files of the specific search engine to see what features are supported.

                                                                                                     TOP

Meta-Search Engines

A meta-search engine utilizes a number of search engines at the same time.  It uses a keyword search method similar to a search engine.  The results are either listed by the search engine used or integrated into a single listing.

AskJeeves is a meta-search engine that supports natural language searching.  Users can enter a question in plain English and then choose a relevant website by selecting it from a pull-down menu and and clicking on the ASK! button.

Dogpile searches up to 25 other search engines.  It also permits the user to specify which search engines to use by choosing "Custom Search" under "Dogpile destinations."

MetaCrawler developed by the University of Washington in 1994 does not maintain an internal database.  Keyword queries are sent to several Web search engines, including AltaVista, Excite, InfoSeek, Lycos, etc.  The results are collated into a single list with duplicate entries deleted.  MetaCrawler also allows sorting by relevance and source.

Advantages

Can produce results when other search tools do not.  They are also tolerant of inaccurate or imprecise search syntax.

Disadvantages

Are not as effective for difficult searches as a search engine as they may present a large number of irrelevant results.

                                                                                                                            TOP

General Search Tips

  1. Decide whether you need to use a directory or search engine.  
  2. It is generally better to start with a specific search and then try more general terms if you are unsuccessful.
  3. Enter proper nouns with capital letters and enclose phrases in double quotation marks.
  4. Watch your spelling.
  5. On some search engines word order is important.  Enter your most important term first.
  6. If you don't find what you seek in the first 10-20 hits, change your query or use another search engine.
  7. Use a metasearch engine, such as Dogpile or Inference Find, if you don't get the desired results with a standard search engine.
  8. Use Boolean (logical) operators (AND, OR, NOT, and NEAR), where allowed.
  9. Be prepared to refine and redo your search if the results produced are not helpful.

                                                                                                                             TOP

Use a subject directory:

a)    when searching a broad topic or idea to research.
b)    to produce a list of recommended and reviewed sites.
c)    to produce a list of documents that are highly relevant to your topic, as opposed to  search engines which can produce less relevant documents.
d
)    when you want to produce a list of top-level page sites rather than many pages contained within sites.

Use a search engine:

a)    when a simple search using a directory is unsuccessful.
b)    to find information that is very precise or exact.  An example of this could be a search to find the maiden name of former President Jimmy Carter's mother.
c)    when you need to limit or expand your results by using Boolean operators.
d)    to find as much current information available on a particular topic.

                                                                                                                             TOP

Finding Graphic and Image Files

The search engine ditto.com specializes in finding pictures on the Web.  A user can either enter keywords in a search field or browse through 16 subject categories.  The site also supports an advanced search interface that allows queries by picture type, file size, color depth, picture width and height, and date added.  The site claims to filter offensive or inappropriate images from its image database.  Boolean and pseudo-Boolean operators are supported.  In addition, filename, URL, and title searches are supported.

AltaVista also allows for image searching.  The feature is contained in a separate tab on the main page.  Filter criteria allow the user to choose among photos, graphics, buttons/banners, color or black and white.  Support is also included for domain, host, media, title, and URL contexts.

The Amazing Picture Machine contains a list of sites on graphical resources on the Internet.

Pics4Learning

Please keep in mind that it is necessary to obtain permission from the image owner or source on copyrighted images before using the graphic on a web page.

                                                                                                                            TOP

Subject and Other Guides

Biography is the electronic version of Who's Who.  Also contains a quiz and anagrams and links to A&E and The History Channel.

Britannica Internet Guide is an interesting portal site that allows users to search, the Britannica Encyclopedia, reviewed web sites, magazine articles and related books.   The site provides good coverage of general interest topics.

INFOMINE contains a collection of scholarly Internet resources relevant to higher education.

MapQuest is a website totally devoted to maps.  The interface allows the user to input an address and an interactive map is returned.  Driving directions can also be obtained.  The site also contains a Yellow and White Pages directory and a travel and city guide section.

                                                                                                                            TOP

Search Tools for Children

Most search engines do not filter out sites that may contain pornography, violence, and hate-filled messages.  Search tools for children mainly use humans to review indexed sites, unlike adult search engines which rely on software called spiders or web-bots to search out files on web servers.  Consequently, the size of children's search engine databases are much smaller than the search tools using web-bots.

Listed below are search engines and directories designed for children.  The details on what is filtered out varies from site-to-site.  It's recommended that you read the help or about files to determine the extent of filtering.

Ask Jeeves for Kids presents an easy-to-use interface that's similar to AskJeeves.  The site tries to produce results that contain one or two relevant sites rather than presenting an overwhelming number of hits.  Responses are also filtered to only include "G-rated" pages and those specifically written for children.

CyberSleuth Kids

Education World

Fact Monster

Go Guardian

Internet Search Engines for Kids

Ithaki for Kids

KidsClick

Kids Search Tools

Lycos Kids

OneKey: The Kid Safe Search Engine

Yahooligans is the children's version of Yahoo.  This site seems to mainly focus on the entertainment and social needs of young users.  Topics are arranged by categories and subtopics as in Yahoo.  The site also also contains guides for parents and teachers.

                                                                                                             TOP

Evaluating K-12 Websites

Anyone can create and publish a web page.  Reliability, accuracy, and credibility can be a problem with over a billion web pages to choose from.  Educators should be aware of a few common sense guidelines and questions to ask before recommending websites for curricular integration.

  1. Who is the author or source of the page?
    Is the page an endorsement for a commercial product or cause?
    Who is the target audience of the web page?

    Look at the top level domain: .edu, .org, .com, .net, .gov.  Pages 
     that originate at higher education institutions, government
     facilities, trade and professional associations or not for profit
     organizations may have more credibility or less of a hidden
     agenda.

    Why is the information on the web?

  2. Is the information reliable? 
    Are there citations for sources that can be checked for
     authenticity?
    Do these citations or links reveal any biases?
    What topics are included on the site and are they explored in
     depth?
    Has the information been reviewed by peers, authorities in the
     field, other publications, libraries.
    Does the author or organization provide contact information, such
     as an E-mail link?
  1. How current is the page?
    When was the page originally produced?
    When was the page last updated?
    How current are the links?

  2. Is the site constructed in a clear and logical fashion?  
    Can you navigate the site easily?
    Are any special programs or plug-ins required to view the site?
    Can the information be displayed on a slow Internet connection  -
     site contains large graphics or multimedia files?
    Does the site contain correct spelling and grammar and can it be
     understood by the intended audience?
    Is the site copyrighted and are copyright laws respected?

                                                                                                               TOP

 

NERIC Home Page    NCC Home Page    Training Page    SLO Home Page    NYS Learning Standards

Copyright 2003 Capital Region BOCES NERIC    All Rights Reserved