Go Back   Site Owners Forums - Webmaster Forums > Search Engine Optimization > Search Engine Optimization

Notices


Reply
 
Thread Tools Rate Thread Display Modes
Old 08-27-2012, 04:43 AM   #16
john mathew
Registered User
 
Join Date: Feb 2012
Posts: 225
hi,I am reading this article and thanks for sharing this information for about forum posting,
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
john mathew is offline   Reply With Quote
Old 08-27-2012, 05:11 AM   #17
webdesignindia
Registered User
 
Join Date: Oct 2011
Location: Ahmedabad
Posts: 93
Quote:
Originally Posted by C.Rebecca View Post
Robots.txt is a text (not html) file you put on your root directory to tell search robots which files to ignore (or alternatively) which files to crawl. It also helps Search Engines to locate the Sitemap of the website and hence crawl the entire website in depth... helping in your rankings and traffic.
I completely agree with you. Robots.txt allows you to tell search engine to not to crawl any sensitive data or information. Which is its mail benefit.
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
webdesignindia is offline   Reply With Quote
Old 08-31-2012, 04:17 AM   #18
zeilenga569
Registered User
 
Join Date: Jul 2012
Posts: 55
Thanks , Great post information!
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
zeilenga569 is offline   Reply With Quote
Old 08-31-2012, 07:14 AM   #19
lylevasser12
Registered User
 
Join Date: Aug 2012
Posts: 69
Hello ,
It's a text file which instructs search engine spiders or crawlers on what to do. It tells specific web spiders on which specific web pages to index.
lylevasser12 is offline   Reply With Quote
Old 09-05-2012, 05:38 AM   #20
blueapple
Registered User
 
Join Date: Feb 2012
Posts: 92
A robots.txt file is a simple txt file. robots file on a website wills utility as a appeal that specified robots discount specified files or directories when crawling a site.
blueapple is offline   Reply With Quote
Old 09-05-2012, 11:54 PM   #21
wowmadam
Registered User
 
Join Date: Aug 2012
Posts: 7
Robots.txt is a very useful text file to be uploaded on root directory of your site so as to disallow crawling our mentioned url's in robots.txt as not to be displayed to users out there.

Thanks
wowmadam is offline   Reply With Quote
Old 09-27-2012, 12:09 AM   #22
lawrencehayden
Registered User
 
Join Date: Sep 2012
Posts: 10
The Software Exemption Conventional, also known as the Spiders Exemption Method or robots.txt protocol, is a meeting to avoid participating web spiders and other web robots from opening all or part of a web page which is otherwise openly readable. Spiders are often used by google to classify and store web websites, or by web page owners to check resource value.
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
lawrencehayden is offline   Reply With Quote
Old 09-27-2012, 01:04 AM   #23
anshulniet
Registered User
 
Join Date: Aug 2012
Posts: 117
Robots.txt is a text file that you can put on your site to tell search robots which page you like them not to visit. Robots.txt is by no means mandatory for search engines but search engines obey what they are asked not to do. The location of robots.txt is very important as it must to be in main directory.
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
-
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
-
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
anshulniet is offline   Reply With Quote
Old 09-27-2012, 02:38 AM   #24
alluremedspa123
Registered User
 
Join Date: Sep 2012
Location: Mumbai, India
Posts: 10
A robots.txt is a permissions file that can be used to control which webpages of a website a search engine indexes. The file must be located in the root directory of the website for a search engine website-indexing program (spider) to reference
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
alluremedspa123 is offline   Reply With Quote
Old 09-28-2012, 12:33 AM   #25
3idatascraping
Registered User
 
Join Date: Aug 2012
Posts: 13
Robot.txt means to tell search engine of which pages you want to crawl or Not.
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
3idatascraping is offline   Reply With Quote
Old 09-28-2012, 02:27 AM   #26
peterraimi
Registered User
 
Join Date: Sep 2012
Posts: 13
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

Structure of a Robots.txt File :

The structure of a robots.txt is pretty simple (and barely flexible) – it is an endless list of user agents and disallowed files and directories. Basically, the syntax is as follows:

User-agent:

Disallow:

“User-agent” are search engines' crawlers and disallow: lists the files and directories to be excluded from indexing. In addition to “user-agent:” and “disallow:” entries, you can include comment lines – just put the # sign at the beginning of the line:

# All user agents are disallowed to see the /temp directory.

User-agent: *

Disallow: /temp/
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
peterraimi is offline   Reply With Quote
Old 09-28-2012, 04:18 AM   #27
john mathew
Registered User
 
Join Date: Feb 2012
Posts: 225
Robot.txt tells to Google that which page should be crawl in the website.
__________________

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
|
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
john mathew is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
issue in robots.txt file davikerkrish Search Engine Optimization 0 07-20-2012 12:40 AM
Unable to delete robots.txt file notfake Yahoo 0 04-29-2012 02:19 AM
Don’t exceed maximum file size for robots.txt file ClaudiaSchayffe Search Engine Optimization 3 04-02-2012 04:28 AM
What Is Robots.txt? samlko Search Engine Optimization 13 03-09-2012 02:37 PM
sitemap.xml and robots.txt? jamesranatte Search Engine Optimization 5 01-31-2012 11:16 PM


All times are GMT -7. The time now is 09:28 AM.


Powered by vBulletin Copyright © 2020 vBulletin Solutions, Inc.