DeepWeb Ad

Saturday, April 25, 2015

Digging Below the Surface

Fortunately, you can uncover this wealth of information by using specialized tools designed to mine databases. For instance, let's say you want to buy a used copy of "Alice in Wonderland." How would you find it? Searching on eBay or Amazon.com--essentially querying their databases--will be more fruitful than using Yahoo! or Google. The same goes for job hunting. Since job postings are stored in a database, most search engines can't find them; searching sites like Craigslist or Monster is a better way to go.
The secret to successful searching is to understand what you want to know, and then using the right Web resource to find it. Ask yourself these questions:
  • Is the information time-sensitive, such as stock quotes or newspaper articles?
  • Are you looking for a photo or a video clip?
  • Do you want to find an MP3 music file or listen to a podcast?
  • Are you searching for specific types of content, such as blogs?
If the answer is "yes," then try using the tools listed in the chart below. And when you find ones that you like, be sure to bookmark them for future use.
Tools for Mining the Deep Web
To find... Try using...
Audio and Music Files
Blogs
Databases
News
Newsgroup & Groups
Photos and Graphic Images
Podcasts
RSS feeds
Sound Effects
Video

Mining the Deep Web

Although search engines like Yahoo!, Bing and Google index billions of web pages and other electronic documents, this represents only a tiny part of the total information available on the World Wide Web. To unearth the buried treasure, you have to understand how to mine the data.
Two Layers of Data Think of the Web as having two layers: a shallow surface and an almost bottomless, deep level. In the top layer, the Surface Web, you will find all the web pages like the one that you're now reading. This page and others like it have fixed web addresses or URLs (in this case, http://www.learnthenet.com/how-to/search-the-deep-web). Also, the information contained in the page doesn't change very often.
The Deep Web contains pages with dynamic content--data that changes frequently and can't be indexed easily by search engines. Most of this information is stored in databases and is assembled "on the fly" when you query the database. For instance, when you search for an item on eBay, information is pulled from eBay's database and instantly assembled on a web page for you. That page did not exist until you performed your search, which is what makes it dynamic; it was customized in response to your query. Because of this fact, search engines can't readily index this information.
Other types of "deep" information include:
  • Multimedia (audio, music and video)
  • Photos and graphics
  • Job listings
  • Financial data (stock and bond prices, currency rates)
  • News
  • Travel-related data (airline and train schedules)
  • Information on sites that require passwords