BEST BETS IN SEARCHING THE INTERNET

 

Best Bets in Searching the Internet

Once you get connected to the Internet, you need to know how to find sites and newsgroups that will be important and of interest to you. Some people enjoy wandering, or surfing from site to site, going wherever the links take you. But sometimes you just don't have the time and you need something specific and you need it now. When it's one of those times, you simply have to use a search engine.

 

Below, is a compilation of useful information on how to find what you need and where you'll most likely find it.

Search engines

Which search engine should I use?

Best Bets

Ten Essential Tips

Seven Stupid Mistakes

If nothing else works


Search engines

A search engine is a service that indexes, organizes, and often rates and reviews Web sites.  It can help you find what you're looking for among the millions of documents available on the Internet. Different search engines work in different ways:

  • Some rely on people to maintain a catalog of Web sites or pages.
  • Some use software to identify key information on sites across the Internet.
  • Some combine both types of service.
  • So when you search their "holdings," you're bound to get different results.

Which search engine should I use?

No search engine keeps track of all the content on the Internet. Even the major search engines—such as Yahoo, Alta Vista, Infoseek, Lycos, and Snap—won't give you everything. (Some studies indicate that even the top search engines find less than half of what's really out there!).  So try several search engines to see which produce the most useful results for the types of information you usually look for. Whenever a search fails to produce the results you want, try another search engine.

 

Here's a quick introduction to some of the major search engines:

 

Alta Vista - At AltaVista, you can search for keywords or type your query in the form of a question. Other features include Babelfish, which translates sites in several languages; Photo & Media Finder, which searches for images; and Family Filter, which helps you limit the types of results produced by the search.

Infoseek - Infoseek's search engine boasts an extensive catalog of sites. Infoseek has a directory, too, that categorizes sites by topic using software. Some sites have been reviewed and are recommended.

Lycos  - Like Excite and Infoseek, Lycos is a search engine with a directory, organized by subject.

Google - Google Inc. is one of the fastest growing search engines on the web.  The key benefits of Google's search engine include its ease of use and clean interface, and highly relevant results.

Snap - Snap is a directory of Web sites that the Snap directory team has chosen, divided into topic areas, and summarized. Within each topic area, sites are divided into more specific subjects to help you locate the type of information you want. Snap's top picks are marked with a Best icon.

Excite  - A hybrid search engine, Excite fulfills the traditional search engine definition (it is always visiting Web sites and cataloging them using a software program) but also has a directory—a subsection of its cataloged sites that have been reviewed by people.

Yahoo! - This is actually a directory rather than a search engine, meaning humans compile and categorize the sites it searches. So you may get fewer results. But Yahoo! makes it easy to search for sites by category and to continue your search using a search engine if you want to.

MetaCrawler - Rather than keeping its own database of Web sites, MetaCrawler conducts your search simultaneously on several search engines including Excite, Infoseek, Lycos, and Yahoo! This takes a little longer than using just one search engine, but it is likely to yield a more comprehensive set of results.

Major search sites generally provide more than a search engine for finding Web sites. They also allow you to look up information such as recent news stories, newsgroup postings, reference material (such as dictionary entries and maps), and e-mail addresses, street addresses, and telephone numbers of business and individuals.

Here are two guidelines for picking a search engine:

  • If you're looking for a broad, common topic, such as how to buy a personal computer, use a search engine that tends to give you fewer results of higher quality (usually the type that relies on people to catalog sites), such as Yahoo!
  • If you're looking for a rare topic, such as 19th-century Hungarian authors, use a site that may be less discriminating but yields more results (usually the type compiled by a software program), such as Excite.

 

Searching For...

Best Bets:

A broad overview of a subject.

Looksmart is a topical directory of the Web, with more than 24,000 categories to choose from.

A large easy to use search engine with LOTS of pages indexed.

AltaVista, HotBot, and Northern Light, all have more than 100 million pages indexed.

A subject using a short phrase of two or three words.

Infoseek recognizes phrases, and won't ignore the most common words that other search engines discard.

The answer to a plain English question.

Ask Jeeves has the answers to more than 7 million common questions, and understands plain English queries.

A topic using rare or uncommon words.

Because AltaVista is the largest Web index, it will find the most documents about rare or uncommon subjects.

The most popular Web sites on a subject.

Search using HotBot, then click on the "Get the Top 10 Most Visited Sites For [Your Query] link, at the top of your results list.

Web pages in a specific language.

Both AltaVista and HotBot let you restrict your search to specific languages.

Web pages that were published on a specific date.

AltaVista's Advanced Search and HotBot's Super Search find documents created within a specific date range.

The most comprehensive results for a topic.

Dogpile and Metacrawler are meta-search engines, meaning they simultaneously scan the indexes of multiple search engines and present you with all of the results.

The ability to search within the results of a search.

Infoseek lets you refine your search using only the results from a previous query.

"Smart results" for cities, companies, or sports teams.

Excite prioritizes results for these types of searches, presenting you with city information including maps and weather, company news and information, and sports schedules, stats, and scores.

Sites that have paid to get top listings in search results.

GoTo.com allows Web masters to bid for position on search results for specific terms.

A business or company

At Kompass.com you can search for companies worldwide by product or service, company or trade name, AND in several languages.

A browsable directory of quality links that have been evaluated by experts.

All links in every About.com Guide site have been carefully chosen by topic experts.

 

Ten Essential Search Tips

Here are ten essential searching tips that range from basic, common-sense searching strategies, to advice on when to use specialized search tools.

 

Choose The Best Search Tool

Every search is different, and you'll get the best results if you choose the best search tool for each unique situation. The most fundamental choice is whether to use a search engine vs. a directory.  Search engines are actually full-text indexes of Web pages. They're similar to telephone white pages, which contain nothing more than lots of names with phone numbers.  Directories, on the other hand, are collections of links to Web sites compiled by people, not software robots.  Directories are similar to telephone yellow pages, because they are organized by category or topic, and often contain more information than bare-bones white pages listings.

As a general rule, if you're searching for a specific fact or particular document, use a search engine like AltaVista or Google. On the other hand, if you want to browse for information to get a general overview of a topic, use a directory like Yahoo or the Open Directory Project.

Consider using specialized search tools for certain types of information. For example, if you're looking for a person or a place, use specialized people finders or maps and directions search sites.  There are specialized search tools for virtually every topic or subject and each search engine has its own tools.  It’s worthwhile to check them out before you start a search.

 

Use Multiple Search Terms

If a stranger came up to you and said "anchovy paste!" or "sibberidge!" how would you respond? Most people would either laugh, or ask questions to find out what the person was talking about. Search engines don't have that option—they simply make a guess and provide the best results they can with limited information.

Good queries are rich in keywords that can narrow the focus of a search. Better queries than those above would be "anchovy paste in licorice tahini pizzas" to get the mouth-watering recipe you're looking for, or "sibberidge marriage banns" for more information on a wedding tradition common in the Middle Ages.

Use Natural Language Search Strings

Most search engines handle natural language queries quite well. In fact, sentence structure can provide valuable clues to the search engine that might be lost if you only enter a few search keywords.

Rather than entering a non-grammatical string of keywords, try a plain-language question. Instead of "snow blowers Minnesota" try "Where can I find a snow blower dealer in Grand Marais, Minnesota?"

Capitalize Proper Nouns

Most search engines are case-sensitive. This means you'll get much better results if you capitalize names, places, and other proper nouns. Results for "John Bull" will likely be about Britain's mascot; results for "john bull" may also contain off-topic (though strictly speaking quite legitimate) information about restrooms in bullfighting arenas.

Don't Make Dumb Mistakes

Well, duh! Obviously you shouldn't make dumb mistakes. But do you really know the range and variety of mistakes you can make while searching? Take a look at Seven Stupid Searching Mistakes to see some of the real doozies made by even smart, well-intentioned people.

These are fairly basic tips. For more advanced tips, from when to use Boolean logic to what to do when search engines fail, read on.

 

Be Careful With Boolean

Most search engines allow you to use Boolean operators (and, or, not) to focus your search. Unless you have lots of experience using Boolean logic, you're probably better off not using it.

There are two reasons for this. First, Boolean searching works slightly differently in each search engine. Unless you know exactly how Boolean operators work for a specific engine, you may end up misusing them and actually hampering your results.

Second, when you use Boolean operators you override the numerous other factors that determine how the search engine calculates relevance. In essence, you're saying to the search engine "I'm smarter than you so let me tell you how to do your job." This is rarely a winning strategy, online or off.

Think Before You Click

There are two parts to a successful search equation: A well-formed query, and an accurate, reliable result. You can save a lot of time by quickly analyzing your search results list before you click through to any specific pages.

For one thing, the best result may not be #1 or #2. Get a feel for all of the results on a page and make your own judgement. An important part of this analysis is to compare the title and the URL. Do they make sense together in the context of your search?

For example, if two documents are titled "One Hundred Faces of Faith," the page from www.vatican.va/faith.html will likely be quite different from www.babepix.com/faith.htm. Of course, the one that meets your needs depends on what you were searching for.

Learn From Expert Searchers

The best way to learn to be a great searcher is to learn from people who are acknowledged masters. Two outstanding magazines that regularly publish tips and techniques from experts are

“Online” at http://www.onlineinc.com/onlinemag, and
”Searcher” at http://www.infotoday.com/searcher/default.htm.

Develop Effective Searching Habits

Like any worthwhile skill, searching is something that needs to be practiced. It's important that you practice techniques that give you effective results. Keep track of the types of queries that give you results and use those tactics again.

 

Skilled searchers don't just pound away on search engines until they get results. They develop habits of thinking that allow them to quickly and effectively find what they're looking for.

Don't Give Up!

Let's face it, sometimes nothing you do seems to produce good results. Often, when your searches come up empty and you feel like you've slammed into a dead-end, it's tempting to just throw up your hands and walk away.

No need to despair. Check out these strategies and coping mechanisms for those times When Search Engines Fail. There's almost always something you can do to salvage a seemingly hopeless search.

 

Seven Stupid Searching Mistakes

In the lighthearted spirit of the popular books for "idiots" and "dummies," here's a look at seven common blunders that are virtually guaranteed to deliver useless, nonsensical, or completely worthless search results. Some of these gaffes might surprise you. But once you recognize them, it's easy to banish these little gremlins forever from your Web search tool kit.

Sputtering on "Stop Words"

Some search engines simply ignore certain words. They are never used to find a matching document, despite what amounts to a direct command when you type them into a search form.

These are called "stop words" because the search engine doesn't when they're found in its index. Why not? Because stop words are either too common to generate meaningful results, or are parts of speech like adverbs, conjunctions, prepositions, or forms of "be" that mean nothing unless they're part of a phrase with more "important" nouns and verbs.

If you use a stop word in a query you may get wildly irrelevant results. For example, the phrase "searching the Web" contains two stop words: "the" and "Web." Though it's not a particularly common word, Web is used so frequently on the Internet that it's virtually worthless as a finding aid.

Stripping out the stop words, "searching the Web" becomes "searching," which will naturally lead to results describing everything from criminal manhunts to quests for enlightenment—and if you're lucky, maybe even something about searching the Web.

Bungling with Boolean

Boolean operators, like "and," "or," and "not," can help narrow search results—when used properly. The problem is that Boolean operators, because of their apparent simplicity, appear to be easy to use. Maybe, maybe not.

According to Ran Hock, author of The Extreme Searcher's Guide to Web Search Engines, search engines implement Boolean features in different ways. For example, while some accept a simple "not," others require "and not" for the same effect. Additionally, some engines require that Boolean operators be capitalized, while others do not (or and do not?). Fortunately, search engines usually have a note or two on tipes for using "their" engine and it's worthwhile checking them out beforehand.

Being Ever So Vulgar

Vulgar comes from the Latin vulgus, meaning common. Like some educated sophisticates, search engines have a problem with common words. It's not that they're being snotty or pretentious. It's that some words are so common that they appear in literally millions of documents, making them virtually useless as a finding aid.

Take weather, for example. There are thousands of sites providing weather information, from local forecasts to elaborate treatises on meteorology. Tighten your query by using focusing words to narrow the scope of your search. Rather than merely searching for "weather," construct a query like "Cicely Alaska annual snowfall," or something equally specific.

Looking for a Rose, By Any Other Name

Many words have multiple meanings. For example, when using the word "bond" we have to ask ourselves, are we looking for information about financial bonds, chemical bonds, or even James Bond? Think about the possible other meanings and phrase your query accordingly.

Search engines are also easily confused by heteronyms, words that are spelled identically but have different meanings when pronounced differently. For example, "lead," pronounced LEED, means to guide. Pronounced as LED, though, the word refers to the metal element.

Committing Capital Offenses

Yet another problem for the searcher is whether to use capital letters in a query. Some engines are case sensitive, while others are not. As a rule of thumb, it's a good idea to always use lower case letters when you search. This will typically return results that contain both upper and lower case letters.

 

If you use uppercase letters in a query to a case sensitive engine, results will only include documents that also use upper case letters. This is usually a good thing for proper nouns like names or places, which use initial upper case letters anyway. But it might cause you to miss other documents where case-sensitivity is less important.

Close, But No Cigar

Most search engines do a good job at matching simple phrases, like "Kosovo refugees," or "space shuttle missions." You run into problems, though, with a phrase like this section's title. Searching for "close but no cigar" on one major engine (which shall remain mercifully unnamed) provided this link as its number two pick: The Common Cold: Relief But No Cure. Definitely no cigar!

The distance between one word and another in a document is referred to as proximity. Some search engines will give a positive result if your query words appear anywhere on a page, whether or not they are near each other, or are used together in a phrase.

If you're searching for something where your keywords must be near each other to get good results, your best bets are AltaVista, which allows you to use the NEAR operator, and Lycos Advanced Search, which offers several ways to specify proximity.

And now for the number one most common searching mistake:

Searching for Hits in all the Wrong Places

If you're determined to find what you're looking for on the Web, be sure you're using the right tools for the job. Search engines vary widely in scope, function, and quality. You'll waste a lot of time if you don't choose the best search engine for each specific searching task.

Should you use Excite or HotBot? AltaVista or InfoSpace? Choose the Best Search Engine shows you how to select the best search tool for any topic, by analyzing your needs and comparing them with the strengths and weaknesses of each search service.

And if you get stuck, don't be hard on yourself. One of the most ridiculous misconceptions I've ever heard is that "you can find anything on the Internet." This is about as true as saying that there are diamonds in every coal mine.

Though it may sound strange reading this in a Web search Guide, sometimes your best bet for finding information is to log off and take a trip to your local library. Libraries have tons of resources that aren't available on the Web. And librarians are trained experts who are usually more than willing to help you find what you're looking for. When you're getting nowhere on the Web, take advantage of these "human search engines."

Begone, Mistakes!

As you gain experience searching the Web, avoiding these seven searching mistakes will become almost second nature. Whenever you get weird or unexpected results, take a close look at your query and try to figure out what happened. You might discover yet another mistake to avoid.

If Nothing Else Works...

We love em, we hate em. Sometimes no matter how hard you try, you can't get the results you need. Then what?

Search engine failures range from swamping you with irrelevant results to surfacing no results at all, with degrees of failure at all levels between these extremes. Here are five common situations when search engines fail, with specific tips on how to turn failure into success.

Forty-four Gazillion Documents Found

The most common failure isn't really a failure, but it sure is annoying. The search engine isn't finding the needle in the haystack—it's returning the entire haystack.

This type of failure is almost always the result of one of two things, both of which are easily fixed. First, your query was probably short, just one or two words. And second, your query words were so common that they were found in countless documents on the Web.

If you use common words in a query, they should always be part of a multi-word phrase, at least three or more words that form a sentence fragment. Whenever you can, use synonyms for common words or better yet, think of rare or uncommon words that describe your topic. The trick here is not to outsmart yourself by being too clever. You don't want to use words that aren't likely to be used by most Web authors.

If you must use a common word or phrase, use inclusion and exclusion operators (the plus and minus signs) to tighten the search focus. For example, "+George Washington -state -D.C. -university" finds documents about George Washington, greatly reducing documents related to Washington state, Washington D.C., and George Washington University.

Your Query Found No Documents

Although not common, it is possible for a search engine to come up with zilch for your query. It's possible that the info your searching for simply isn't on the Web. This is more likely if you're looking for a person, a small or obscure company, or an esoteric fact.

However, more likely, you're searching in the wrong place. Go back to the section on Which Search Engine Should I Use. But basically, for people use a specialized people finder, for companies, start with a business service like Transium or Kompass and for facts, try Ask Jeeves or one of the specialized reference services.

Remember that the major search services vary hugely both in size and scope. Yahoo (less than a million documents) and Webcrawler (2 million) are puny compared with AltaVista, HotBot, and Northern Light, all of which have more than 100 million pages indexed. If you must use a general-purpose engine, try your query with a meta search engine like Dogpile, Mamma, or Savvy Search.

The 404-Not Found Blues

One of the stupidest, yet aggravatingly common mistakes made by Web masters make is to "tidy up" their Web site in the same way they might tidy up their personal hard disk. By bringing order to the chaotic mass of stuff on a Web site by moving everything into well-organized folders with meaningful filenames they also inadvertently break every bookmark, ruin every search engine link, and void every printed URL that formerly allowed people to find the page.

Luckily, this "404," or "document not found" message is being sent to you from the same computer where the document used to be located. With luck, it's still there, and you just need to do a bit of snooping to find its new location. To do so, start with the "URL back-up" strategy. Put your cursor after the "htm" or "html" in the address window, and delete the filename, stopping at the first forward slash. Hit enter. Continue this process until you've either found a menu with a link to your document, or the root URL, which should have either a direct link or a search form you can use to help locate your document.

In most cases, if the document is still on its original server, you'll be able to find it with a little searching.

Moved; Forwarding Address Unknown

Web documents, like people, sometimes move.  Students graduate; businesses move from modest sites on free communities to larger sites under their own domain name; people change ISPs. When this happens, your best bet is to search for either the title of the document or the person who authored it. You can do a title search by simply enclosing the title in double quotes and searching for the phrase. But a better way is to do a "field search" specifically on the title.

Field searching is an advanced function offered by most search engines. It allows you to limit the search to the HTML title that appears on the title bar on your browser (it's also used as a bookmark title). You can do field searching for titles, images, and other specific parts of Web pages.  If you search for the author of the document, try HotBot's search for "the person" option.  Remember, when searching for people, you should always capitalize the name and enclose the name in quotes.

Nothing Works: Now What?

You've tried everything. A search query that worked just fine last week now gives you nonsense results, with your target document nowhere to be found. What is going on? A dirty little secret about search engines is that pages are dropped from their indexes all the time. Sometimes thousands of pages. Sometimes these abandoned pages reappear in the index, sometimes not. Official explanations range from "pruning" the search engine database, in an attempt to keep listings fresh and relevant, to "maintenance" where the index is rebuilt from scratch. All and well, unless the specific document you're looking for never reappears in the index.

Fortunately, there are two archival sources where you may be able to find your document. Google keeps a copy of all documents in its index in "cache." Even if the direct link to the document is broken, clicking the link to the cached version will show you a copy of the page from the last time Google indexed it.

 

Alexa, maker of the popular toolbar that shows links to related pages, maintains an archive of over a million Web pages. Using Alexa, if your browser displays a "not found" message, you can often click on the "archive" link in the Alexa window to see a copy of the page.

Google's cached documents and Alexa's archived documents may be out of date, but in most cases, something is better than nothing.

 

Sources:

http://www.microsoft.com/insider/internet/articles/

http://websearch.about.com/internet/websearch/library/weekly/topicsub_searching.htm

http://websearch.about.com/internet/websearch/library/weekly/aa051299.htm

http://hoohoo.ncsa.uiuc.edu/ftp/faq.html

http://websearch.about.com/internet/websearch/library/weekly/bl_981204a.htm

http://pigseye.kennesaw.edu/~msanche1/