Site Owners Forums - Webmaster Forums

Site Owners Forums - Webmaster Forums (http://siteownersforums.com/index.php)
-   Social Networks (http://siteownersforums.com/forumdisplay.php?f=43)
-   -   Crawling and cashing (http://siteownersforums.com/showthread.php?t=58291)

ryangetty 07-07-2012 03:16 AM

Crawling and cashing
 
What is the difference between crawling and cashing.

JACKADEN 07-09-2012 04:27 AM

Both information are much to increase my knowledge about Crawling and Cashing..

jessica733 07-10-2012 09:50 AM

Quote:

Originally Posted by ptfe (Post 224860)
crawling,
Is when a search engines robot 'bot' comes to your site and starts looking about at the information and browses for stuff to index.

indexing,
Indexing is when a page on your site gets put into the search index ( so its searchable ) Indexing is the part done after the crawling and is the bit when the search engine looks into what the crawler found and then finds what your page is about so it can be indexed correctly.

and cashing,
Cashing ( Spelt Caching ) is essentially a snapshot of what the crawler found on your site.
Google has a "Cache" of a webpage on your site.. Ie: they have saved a copy of your page to their system for reference.

Great information you have share with us in very easy manners and i am completely agree with your points and explanation, these things are very important to understand for every newbie in SEO field.

satnam1980 07-10-2012 01:52 PM

Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner or in an orderly fashion. Other terms for Web crawlers are ants, automatic indexers, bots, Web spiders, Web robots,or especially in the FOAF community Web scutter..

Caching can happen at many places, including proxies (i.e. the user�s ISP) and the user�s local machine. The objective is to make efficient use of resources and speed the delivery of content to the end user.While caching can have a positive impact on the user�s experience, it can have a negative impact for site publishers, resulting in undercounts of page views and ad impressions. In response to this problem, sites have implemented various cache-busting techniques to better ensure that all performance statistics are accurately measured.

JanPaul 07-10-2012 09:32 PM

Thanks for the post.its very informative...

Elsha 07-10-2012 10:05 PM

Cashing is a snapshot of what the crawler found on your site.

jeny 07-13-2012 10:51 PM

Crawling is the process in which spiders move on every page of your site and fetch data from your site.
Indexing is where search engine has crawled the web and ranks the URLs found using various criteria and places them in the database, or index.

kaithyedward 07-15-2012 09:24 PM

The main difference in crawling and cashing is its processor in search engine like every search engine 1st of all crawling to any site the cashing it.

JACKADEN 07-15-2012 09:27 PM

Quote:

Originally Posted by kaithyedward (Post 226205)
The main difference in crawling and cashing is its processor in search engine like every search engine 1st of all crawling to any site the cashing it.

Thanks brother the such great answer..

wyanesmith1987 07-16-2012 12:42 AM

Crawling

There is a hidden, relentless force that permeates the web and its billions of web pages and files, unbeknownst to the majority of us sentient beings. I'm talking about search engine crawlers and robots here. Every day hundreds of them go out and scour the web, whether it's Google trying to index the entire web, or a spam bot collecting any email address it could find for less than honorable intentions. As site owners, what little control we have over what robots are allowed to do when they visit our sites exist in a magical little file called "robots.txt."

"Robots.txt" is a regular text file that through its name, has special meaning to the majority of "honorable" robots on the web. By defining a few rules in this text file, you can instruct robots to not crawl and index certain files, directories within your site, or at all. For example, you may not want Google to crawl the /images directory of your site, as it's both meaningless to you and a waste of your site's bandwidth. "Robots.txt" lets you tell Google just that.

Caching

Google's cache has been around in the search results for a long time. In fact, Google's cache is often ignored in SEO strategy and analysis. But using it can provide you with a lot of information that can increase leads, sales, user satisfaction and even offer clues to existing problems with your website. This article will examine Google's cache page in detail and recommend possible ways to use the information provided in your search engine optimization strategy.

Basic components of Google's cache

Before we dive deeper, a short introduction to Google's cache is helpful. You can see the Google cache of your website/web pages in three different ways.

Method 1: When you visit the URL that you need to view in the Google cache, click the Google toolbar (e.g using Firefox browser); in the drop down, click "Google snapshot of page."

Method 2: In the Google search result showing the URL that you need to view, click the "Cache" link.

Method 3: In the Google search box, type:

cache:www.thisisyourdomain.com/thisisthepage.htm

Or if you are checking for your home page:

cache:www.thisisyourdomain.com/
Read more at http://www.seochat.com/c/a/Google-Op...fLj0gmlwE2p.99

vishenda 07-16-2012 07:38 AM

thanks for the clarification guys!

jonemike 07-17-2012 02:41 AM

Quote:

Originally Posted by ptfe (Post 224860)
crawling,
Is when a search engines robot 'bot' comes to your site and starts looking about at the information and browses for stuff to index.

indexing,
Indexing is when a page on your site gets put into the search index ( so its searchable ) Indexing is the part done after the crawling and is the bit when the search engine looks into what the crawler found and then finds what your page is about so it can be indexed correctly.

and cashing,
Cashing ( Spelt Caching ) is essentially a snapshot of what the crawler found on your site.
Google has a "Cache" of a webpage on your site.. Ie: they have saved a copy of your page to their system for reference.


Great answer i like it.

jiyaalbert 07-21-2012 12:40 AM

Quote:

Originally Posted by wyanesmith1987 (Post 226259)
Crawling

There is a hidden, relentless force that permeates the web and its billions of web pages and files, unbeknownst to the majority of us sentient beings. I'm talking about search engine crawlers and robots here. Every day hundreds of them go out and scour the web, whether it's Google trying to index the entire web, or a spam bot collecting any email address it could find for less than honorable intentions. As site owners, what little control we have over what robots are allowed to do when they visit our sites exist in a magical little file called "robots.txt."

"Robots.txt" is a regular text file that through its name, has special meaning to the majority of "honorable" robots on the web. By defining a few rules in this text file, you can instruct robots to not crawl and index certain files, directories within your site, or at all. For example, you may not want Google to crawl the /images directory of your site, as it's both meaningless to you and a waste of your site's bandwidth. "Robots.txt" lets you tell Google just that.

Caching

Google's cache has been around in the search results for a long time. In fact, Google's cache is often ignored in SEO strategy and analysis. But using it can provide you with a lot of information that can increase leads, sales, user satisfaction and even offer clues to existing problems with your website. This article will examine Google's cache page in detail and recommend possible ways to use the information provided in your search engine optimization strategy.

Basic components of Google's cache

Before we dive deeper, a short introduction to Google's cache is helpful. You can see the Google cache of your website/web pages in three different ways.

Method 1: When you visit the URL that you need to view in the Google cache, click the Google toolbar (e.g using Firefox browser); in the drop down, click "Google snapshot of page."

Method 2: In the Google search result showing the URL that you need to view, click the "Cache" link.

Method 3: In the Google search box, type:

cache:www.thisisyourdomain.com/thisisthepage.htm

Or if you are checking for your home page:

cache:www.thisisyourdomain.com/
Read more at http://www.seochat.com/c/a/Google-Op...fLj0gmlwE2p.99

Thanks for the great information.

zeilenga569 07-24-2012 06:51 AM

Very informative post thanks for sharing it

jeff.cool25 07-26-2012 05:58 AM

Thanks guys to provide such a great information


All times are GMT -7. The time now is 04:12 PM.


Powered by vBulletin Copyright © 2020 vBulletin Solutions, Inc.