4:00 P.M.
Assembly Room, A. K. Smiley Public
Library
Over the past 50 years the world's
ability to gather record and analyze information has changed in ways unimaginable to all
but the most extreme visionaries.
The evolution of computers allowed
storage and analysis of information. Suddenly
we had huge stores of information easily available and shareable among computers. A need to interconnect computers for more
computing power created the ability to exchange data between large computers at
universities and government installations.
Business adapted this data transfer
technology for large corporations. The same
businesses then developed public access technology and at the same time that personal
computers became available. This allowed the
general public to access information from external services.
The general public was now looking for
information on the Internet. Large
corporations, small business and non-profits have developed web sites to provide
information or sell their product or simply provide a service. The information available is extensive but can be
easily searched using Internet search engines.
Various types of search engines and
search techniques are discussed in this paper.
Hyperlinks are provided so that you can
visit the sites discussed in the paper.
Internet Search Engines: Help or
Overload?
Introduction
Everyone in this room grew up in a
world where words written on paper were the basis for gathering information to keep up
with the news, study, shop, prepare to travel, invest, and find out what is playing at the
movies.
These words were distributed through
newspapers, books, magazines, and periodicals specializing in numerous fields of interest. We had them delivered to our homes, bought them in
stores or found them in our libraries. The
information available was and still is overwhelming. Those who have the ability to access,
further organize and understand this wealth of information are respected leaders and
thinkers.
If you wanted to research the actions
of a public figure through the use of newspaper articles you could go to the Periodical
index in the library. Then you could refine
the search for the information you wanted.
This approach is rapidly changing. Today use of the electronic word is increasing and
to many it is preferable to the printed word on paper. With a computer the right software and an Internet connection we have rapid access
to an almost infinitely large information source.
This paper will review the background
of how the Internet was developed and provide some examples of how search engines help us
use it.
This paper viewed through our
Fortnightly Club web site on the Internet provides hyperlinks to the sites I will discuss. Just click the link and you will visit the
designated site.
The Internet
According to Encarta Encyclopedia the
Internet is a:
"
computer-based worldwide information network.
The Internet is composed of a large number of smaller interconnected networks called
Internets. These Internets may connect tens, hundreds, or thousands of computers, enabling them to share information with each
other and to share various resources, such as powerful supercomputers and databases of information. The Internet has made it
possible for people all over the world to effectively and inexpensively communicate with
each other. Unlike traditional broadcasting media, such as radio and television, the
Internet is a decentralized system. Each connected individual can communicate with anyone
else on the Internet, can publish ideas, and can sell products with a minimum overhead
cost. In the future, the Internet may have a dramatic impact on higher education and
business as more universities offer courses and more companies offer goods and services
online."[1]
The first cooperative network known as
ARPANET began to operate in 1975 as a defense project in support of US missile and nuclear
programs. The concept was almost instantly
accepted in the scientific and business communities. It eventually became known as the World Wide
Web.
Worldwide the use of the Internet is
growing at an incredible rate. As of August
2001 NUA Internet surveys estimates that over 180,000,000 million users exist in North
America. The same company estimates there are
513,000,000 worldwide users. This compares to
an estimated 30,000,000 in North America in 1996 with 45,000,000 worldwide.
To access the Internet you need a
computer, the appropriate software in the computer, and an Internet connection.
The Personal Computer:
The German scientist Wilhelm Schikard
created the first computer in the year 1623. It could add multiply and divide. Evolution was gradual over the next three
centuries. World War II accelerated computer
development and by 1945 an electronic digital machine know as ENIAC was built at the
University of Pennsylvania in Philadelphia. The
first personal computers appeared in 1975. They were simple and had limited use. Their appeal was limited. Some of you might remember the Altair 8800.
The
big boost for the personal computer came in the late 1970s.
American
computer designers Steven
Jobs
and Stephen
Wozniak,
working out of their garage, created the Apple II in 1977. The Apple II was one of the
first PCs to incorporate a color video display and a keyboard that made the computer easy
to use. Jobs and Wozniak later founded Apple Computer Corporation.
In
1981 International Business Machines Corporation (IBM) introduced the IBM PC. It was
designed with an open architecture that enabled other computer manufacturers to create
similar machines, or clones, that could also run software designed for the IBM PC. The
design of the IBM PC and its clones soon became the PC standard.
The
Apple Macintosh, developed in 1984, featured a graphical user interface (GUI)a
visually appealing way to represent computer commands and data on the screen. The
Macintosh GUI combined icons (pictures that represent files or programs) with windows
(boxes that each contain an open file or program). Information on the screen was
controlled by a pointing device known as a mouse. Inspired by earlier work of computer
scientists at Xerox Corporation, the Macintosh user interface made computers easy and fun
to use and eliminated the need to type in complex commands.[2]
Personal
Computers allowed the home and small business user to use software for multiple uses.
People used them primarily for word processing, spreadsheets, databases and gaming. Who can forget Pong and Pacman? The next question was how to access the world of
information for business and personal use.
Internet Service Providers:
Along with a
computer you need a connection to the Internet. This
connection usually involves a modem installed in your computer and a telephone connection
through your ordinary telephone line.
A Company known
as CompuServe initially created a business providing access to shared files for the
general public.
" Jeffrey Wilkins founded CompuServe in 1969
CompuServe sold time on the insurance companys mainframe computer, allowing
other companies to access and use the system via modem. This service was known as computer
time-sharing. Ten years later CompuServe began offering technical support, electronic
mail, and electronic discussion groups to personal-computer users via modem.
By
1989 CompuServe had more than 500,000 subscribers, making it the largest service of its
kind. CompuServe expanded to countries on five continents in the late 1980s and early
1990s, teaming up with Fujitsu Limited to launch the NiftyServe online service in Japan.
In
the early 1990s CompuServes discussion groups, or forums, gained popularity as a way
for computer enthusiasts to share information and discuss ideas. The forums then branched
out to include topics ranging from music and art to sports and politics. These discussion
forums helped popularize online services. [3]
However
the files shared by CompuServe were unique to CompuServe or were contained completely on
the CompuServe network. They did not provide access to other available files.
Based on it's potential the Internet
became a major growth industry. Major growth
businesses known as Internet Service Providers offered connections for computer users
wishing access to the Internet.
Service
Provider,
company that sells computer access to the Internet,
A user buys a subscription to a service provider, which gives the user an
identifying username and password and a phone number. With his or her computer and modem,
the user calls, connects to, and logs
on to
the service provider's computer. The user's
computer then lets the service provider's computer take over,
The user can then
utilize any of the tools the service provider's computer furnishes, which can give access
to the World
Wide Web
(WWW), electronic mail (e-mail), File
Transfer Protocol
(FTP), Telnet,
and other Internet services.[4]
Internet Browsers:
Computers still needed software to find
and read information posted on the Internet. This
software is called an Internet browser.
Browser,
a program that enables a computer to locate, download, and display documents
containing text, sound, video, graphics, animation, and photographs located on computer
networks. The act of viewing and moving about between documents on computer networks is
called browsing.
The largest open network is the Internet, a
worldwide computer network that provides access to sites on the World
Wide Web.
Browsers
allow users to access Web information by locating documents on remote computers that
function as Web servers. A
browser downloads information over phone lines to a users computer through the
users modem
and then displays the information on the computer. Most browsers can display a variety of
text and graphics that may be integrated into such a document, including animation, audio
and video.
Documents
on networks are called hypertext if
the media is text only, or hypermedia if
the media includes graphics as well as text. Every hypertext or hypermedia document on an
Internet has a unique address called a uniform resource locator (URL).
Hypertext documents usually contain references to other URLs that appear in bold,
underlined, or colored text. The user can connect to the site indicated by the URL by
clicking on it. This use of a URL within a Web site is known as a hyperlink. When the user clicks on a hyperlink, the
browser moves to this next server and downloads and displays the document targeted by the
link. Using this method, browsers can rapidly take users back and forth between different
sites.[5]
The National Center for Supercomputer
Applications developed the first user friendly browser in 1993. It was known as Mosiac. To stimulate interest the development team gave
it away. It emphasized the Graphic User
Interface and allowed the user to navigate primarily using a mouse.
Today three major browsers compete with
each other. These are Internet Explorer,
Netscape Navigator, and America Online. They
are available at no cost and can be downloaded from the Internet.
With access to the Internet, home
computers, and browsers millions now learned how to navigate the Internet and Internet
businesses boomed.
Web Sites
Information on the Internet is
primarily provided through web sites.
Web
Site
a
file of information located on a server connected to the World
Wide Web
(WWW). The WWW is a set of protocols and software that allows the global computer network
called the Internet to
display multimedia
documents. Web sites may include text, photographs, illustrations, video, music, or
computer programs.
Every
web site has a specific address on the WWW, called a Uniform Resource Locator (URL).
The
content presented on a web site usually contains hypertext and icons, pictures that also serve as links to other
sites. By clicking on the hypertext or icons with their mouse, users instruct their
browser program to connect to the web site specified by the URL contained in the hypertext
link. These links are embedded in the web site through the use of Hypertext
Markup Language (HTML), a
special language that encodes the links with the correct URL.[6]
Companies were developed for a variety
of never before heard of businesses. They
were nicknamed dot com companies.
Traditional companies and institutions
also joined the bandwagon. They developed
online businesses to sell virtually everything from automobiles to wedding gifts.
News organizations including magazines,
newspapers and network news teams have developed sites where you get up to the minute news
if you can't wait for the televised evening news.
Airlines post flight information and
rates so that you can pick your own flight and compare prices. It is interesting to note that you often get
different quotes from different sites for the same flight. You can also buy airline tickets at Internet auctions.
Search Engines
When user friendly browsers became
available you had to know an URL address for a site to find it. An annual Internet Yellow Pages document similar
to the telephone yellow pages was published. It
gave you descriptions and URL addresses for sites that you might want to visit. You had to enter each URL and hope that the site
was still at that address.
Search engines were then developed that
now provide the primary basis to find what you are looking for on the Internet.
Search
Engine,
computer software
that compiles lists of documents, most commonly those on the World
Wide Web
(WWW), and the contents of those documents. Search engines respond to a user entry, or query, by searching the lists and displaying a list
of documents (called Web sites when on the WWW) that match the search query.[7]
The
first search engine that made surfing the Internet popular is known as YAHOO!. Two University of Stanford students created it. It was developed initially using volunteers to
catalogue Web Sites and post them on the YAHOO! Web Site. It offered a simple way to search a large portion of the World Wide Web and was a
major reason for the rapid growth of the World Wide Web in the late 1990's. It is still considered the most popular search
engine.
According to one source over 3600
search engine are now available to the user. They
fall broadly into three categories. These
are:
· Crawler based search in which computers
visit Web Sites on a recurring basis and catalogue the information for retrieval when an
individual search is initiated.
· Human indexed search engines that
receive submittals from webmasters and include those Web Sites in a specific category for
the searcher to look through much like a table of contents.
· Hybrids that combine both indexed and
crawler based searches.
How do you search the Internet?
You need a computer, a web browser and
an Internet connection. Start your Internet
browser and connect to the Internet. Most
browsers have a built in home or start page until you change it. This is usually a portal page that provides news
and some indexed categories for you to search.
For purposes of this discussion I will
use the home page for the YAHOO! Search engine. Its
URL is http://www.yahoo.com/. If you are connected to the Internet and click on the link it will take you to that
site.
YAHOO! promotes this Web Site as an
Internet portal page. You can set your
Internet browser to start at this page. It is
also a hybrid search engine. You can enter
search words or you can use the categories
shown on the page to look for the information.
It can be customized so that your
particular interests are displayed. There are
over 20 different categories of information to choose from.
The blue underlined words are called
hyperlinks. If you click them with your mouse
the computer will take you to an expanded page of information and further links on that
specific subject.
To use the crawler based capability of
this site you can enter a specific search term in the dialog box in the top quarter of the
page. For example I have entered the search
term Redlands Fortnightly. (http://google.yahoo.com/bin/query?p=redlands+fortnightly&hc=0&hs=0)
I received a list of 173 hits. At the top of the list is the Fortnightly Web
Site. The number of hits often depends on how
you enter your terms. In this case my
entry allowed the search engine to select any page that contained both the words Redlands
and Fortnightly. If I were to enter the words
in a manner that the search only looked for the exact phrase I would only get 42 hits.
Again the hyperlinks will take you to
the other sites the search has found and let you read the content on that page. In the larger list you find an amazing variety of
references from the recent Los Angeles Times article on the Inland Empire and Redlands to
a Nova Scotia Pharmaceutical Web Site that refers to a January 1920 Redlands Fortnightly
paper by on premature senility by C. F. Curtis.
To use the directory or human catalogue
capability of this site you can select a particular category in the outline on the page.
It will lead through more specific directories.
Let us select News & Media:
Newspapers: Web Directories: U.S. Newspaper Links: California (n-z): Redlands: and we can
read some contents from today's Daily Facts. http://www.redlandsdailyfacts.com/
You could see as we browse through the
categories that the news choices are almost infinite.
Next we will visit the search engine
Google.
http://www.google.com
Google is a top choice for web
searchers. It offers the largest collection of web pages of any crawler-based search
engine. Google makes heavy use of link analysis as a primary way to rank these pages. This
can be especially helpful in finding good sites in response to general searches such as "cars" and "travel," because users across the web have in essence
voted for good sites by linking to them. The system works so well that Google has gained
widespread praise for its high relevancy. Google provides web page search results to a
variety of partners, including Yahoo and Netscape Search (see below). Google also provides
the ability to search for images, through Usenet discussions and its own version of the
Open Directory by SearchEngineWatch.com
Staff
Updated: Jan. 7, 2002
I entered a search for duck
hunting. http://www.google.com/search?q=Duck+hunting&btnG=Google+Search
I thought that at least one of
our members might like this one. In 0.15
seconds it found 219,000 applicable references. These
include commercial hunting as well as the non-profit Ducks Unlimited site. http://www.ducks.org/
A word about search terms is
appropriate. Our initial search for duck
hunting found sites that included either the word duck or the word hunting. Sites with both words got the highest priority. A second search can be made using Google's advanced search feature. Inn this case we asked for the exact phrase
"duck hunting". In 0.28 seconds it
"only" yields 49,300 results. Obviously
Search engines can be used to limit or expand the results of a search.
We have discussed two types of
search engines. By my observation the third
is maybe the most extensively used although I haven't seen any statistics on it. These are indexed specialty sites.
One of the most expansive I have
found is the Library of Congress Site that lists Official US
Executive Branch Web Sites. The
Library of Congress provides a portal to the entire federal government through: (http://lcweb.loc.gov/global/executive/fed.html)
Locally we have a community site called
RedlandsWeb.com. http://www.redlandsweb.com/
It provides information on local
government, entertainment, business, history and other community activities. Through this site you can even find what is
playing at the Krikorian and view the movie previews.
This is the site that hosts our
Redlands Fortnightly Club Web Site.
Select your search terms
This paper can only give a brief
overview of the Internet and its search engines. The
information is larger than any library in the world and can be searched in fractions of a
second. If you are not careful with the selection of a search engine and use of search
terms you can receive millions of results.
For example a search for Jerry Lewis
our congressman entered as Jerry Lewis yields 749,000 hits. Most of them are for the entertainer Jerry Lewis. If you add an additional term to your search for example Jerry Lewis congress you
reduce the number of hits to 41,200. While
some information on the entertainer remains most of the information concerns the
congressman. It includes official sites such
as his official government site and watchdog sites. You
can find his biography, legislative record, campaign fund status and personal financial
disclosures.
You can also use Boolean terms in your
search with some search engines. For example
Redlands and California not Australia.
Select your search engine.
I
have only used the two of the most popular search engines in this paper. There are many with different characteristics and
specialties. One useful Web Site to help you
select your search engine is at http://nuevaschool.org/~debbie/library/research/adviceengine.html.
A good source to find search
engines is http://www.searchenginewatch.com/links/Major_Search_Engines/Guides_To_Search_Engines/index.html
Read the instructions for
advanced search published with the search engine. These
instructions will help you group your search terms and eliminate useless or non-relevant
results. Each search engine uses different
symbols and aids to set up searches. This web
site called "Search
Engine Features for Searchers" gives an overview of how commands
vary from engine to engine. http://searchenginewatch.com/facts/ataglance.html
Further Thoughts
Run your search on several search
engines. Each search engine has different
searching techniques and has crawled or indexed different parts of the World Wide Web. If you are not satisfied with your results with
one search engine try another. You will be
surprised at the differences.
Check to see how often the web
site you are visiting is kept up to date. Some
don't appear to be updated for years. Others
are selling products and may not be giving you the entire story. New sites are good for researching recent news but
have not brought their old content online.
There are excellent medical web
sites. The Merck Medical Manual is available
on line. http://www.merckhomeedition.com/home.html
The national Institute of Health
provides extensive information. http://navigation.helper.realnames.com/framer/1000/default.asp?realname=National+Institute+of+Health&cc=US&lc=en%2DUS&frameid=1565&providerid=112&url=http%3A%2F%2Fwww%2Enih%2Egov
One of my favorites for basic
information is WEBMD. http://my.webmd.com/index
The best way to learn about the
Internet is to explore it. Enjoy!
Now what do you think that people
most often search for? Here are some Web
Sites that give us an answer.
http://50.lycos.com/
http://www.google.com/press/zeitgeist.html
http://buzz.yahoo.com/
About the Author
The author
retired from the County of San Bernardino after 21 years in various executive positions.
He received a Bachelor of Science Degree in Mechanical Engineering from Carnegie Institute
of Technology in 1961. After graduation he
joined the US Air Force and was assigned to The Air Force Rocket Propulsion Laboratory
where he worked on the development of advanced rocket propulsion systems. After 5 1/2 years service he moved to Redlands
where he worked for Lockheed Propulsion Company as a systems engineer.
He has worked with several community
non-profit organizations as a volunteer and officer of the organization including the San
Bernardino county Museum Association, the Boy Scouts of America and local American
Diabetes Association chapter
He and his wife Kathy own a Physician
Recruiting Company that depends heavily on the Internet for new clients and new
candidates. He maintains an active interest
in how new technology evolves and how it can be used.
[1]"Internet,"
Microsoft® Encarta® 98 Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[2]"Personal
Computer," Microsoft® Encarta® 98
Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[3]"CompuServe
Corporation," Microsoft® Encarta® 98
Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[4]"Service
Provider," Microsoft® Encarta® 98
Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[5]"Browser,"
Microsoft® Encarta® 98 Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[6]"Web
Site," Microsoft® Encarta® 98 Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.
[7]"Search
Engine," Microsoft® Encarta® 98
Encyclopedia. © 1993-1997 Microsoft Corporation. All rights reserved.