Site Owners Forums - Webmaster Forums

What is robots.txt? (http://siteownersforums.com/showthread.php?t=178541)

abinaya 09-30-2016 05:19 AM

What is robots.txt?
 
Robots.txt is a text file. Through this file, a site gives search engine crawlers instructions about indexing and caching a webpage, a file, a directory, or the whole domain.

Williams Reus 09-30-2016 04:08 PM

Robots.txt is a text (not HTML) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines, but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way of preventing search engines from crawling your site (i.e. it is not a firewall or a kind of password protection). Putting up a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door: you cannot prevent thieves from coming in, but the good guys will not open the door and enter. That is why we say that if you have really sensitive data, it is too naïve to rely on robots.txt to protect it from being indexed and displayed in search results.
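To make the "note on an unlocked door" point concrete, here is a minimal sketch using Python's standard urllib.robotparser module. The rules, the example.com URLs and the "ExampleBot" name are made up for illustration. The check happens entirely on the crawler's side, so a robot that never asks is not stopped by anything.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules a site owner might publish at /robots.txt.
RULES = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

def may_crawl(url, agent="ExampleBot"):
    """Return True if the rules allow this (hypothetical) bot to fetch url."""
    return parser.can_fetch(agent, url)

# A well-behaved crawler asks first and skips whatever is disallowed.
print(may_crawl("http://example.com/index.html"))
print(may_crawl("http://example.com/private/report.html"))
```

The server will still serve /private/report.html to anyone who requests it; only real access control (a password, for instance) changes that.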

cybexindia15 09-30-2016 11:34 PM

The robots exclusion protocol (REP), or robots.txt, is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

AvantikaThakur 09-30-2016 11:45 PM

It is used for SEO, and it plays a part in how a site ranks.

dongtay1001 10-01-2016 12:31 AM

Quote:

Originally Posted by abinaya (Post 594864)
Robots.txt is a text file. Through this file, a site gives search engine crawlers instructions about indexing and caching a webpage, a file, a directory, or the whole domain.

ok! that was awesome! thanks for posting.

(Very useful post!)

sashwatmegh 10-01-2016 01:29 AM

Robots.txt tells search bots which of a website's files, folders, or content they may or may not read.

Tabassum 10-01-2016 03:24 AM

Robots.txt lets you instruct search engine crawlers how to crawl and index pages on your website.

ChrisRogers123 10-01-2016 09:32 AM

It is a text file that tells bots what to crawl and what not to crawl.

middo 10-01-2016 01:38 PM

The robots exclusion protocol (REP), or robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

othername0104 10-03-2016 08:29 PM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

adlersmith 10-03-2016 10:39 PM

Well, robots.txt is actually a plain text file (the kind you can write in Notepad) in which we put instructions telling web crawlers not to visit and index particular links. One use is to keep crawlers away from a new page that has no content yet, which is why it matters for SEO.

SanGoku 10-04-2016 12:21 AM

A robots.txt file is a file at the root of your site that indicates those parts of your site you don't want accessed by search engine crawlers.
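Since the file always sits at the root, its address can be worked out from any page URL on the same host. A small sketch in Python; the function name robots_txt_url and the example.com URLs are only illustrative:

```python
from urllib.parse import urlsplit, urlunsplit

def robots_txt_url(page_url):
    """Build the robots.txt URL for the host that serves page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

print(robots_txt_url("http://example.com/shop/item.html?id=3"))
print(robots_txt_url("https://blog.example.com/2016/10/post.html"))
```

Each host name gets its own file, so a subdomain such as blog.example.com is looked up separately from example.com.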

mariajerek 10-04-2016 02:46 AM

Robots.txt is a text file used to keep a particular page from being indexed. It is placed in the site's root directory.

alex.thomson 10-04-2016 03:07 AM

* A robots.txt file is a file at the root of your site that indicates those parts of your site you don’t want to be accessed by search engine crawlers. The file uses the Robots Exclusion Standard, which is a protocol with a small set of commands that can be used to indicate access to your site by section and by specific kinds of web crawlers (such as mobile crawlers vs desktop crawlers).
* Robots.txt is the common name of a text file that is uploaded to a Web site's root directory. It does not have to be linked in the site's HTML, because crawlers request it directly at /robots.txt. The file is used to provide instructions about the Web site to Web robots and spiders. Web authors can use robots.txt to keep cooperating Web robots from accessing all or parts of a Web site that they want to keep private.

apextgi 10-04-2016 04:26 AM

A robots.txt file is used to tell search engines which pages of a website to index and which not to index.

rolinsebra 10-04-2016 04:29 AM

Web site owners use the robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

Ramkarma 10-04-2016 05:50 AM

Robots.txt, also known as the robots exclusion protocol (REP), is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website. It is also used on a new website that has no content yet.

johnathan410 10-04-2016 05:57 AM

Robots.txt is the common name of a text file that is uploaded to a Web site's root directory. Crawlers request it there directly, so it does not need to be linked in the site's HTML. The robots.txt file is used to provide instructions about the Web site to Web robots and spiders.

ethanmichael201 10-04-2016 11:16 PM

Robots.txt is a text file. It gives bots and crawlers instructions about indexing and caching a website or webpage.

nancy07 10-05-2016 01:31 AM

Search engines read the robots.txt file as instructions on where they are allowed to crawl (visit) and what they may index (save) for the search results. Robots.txt files are useful if you want search engines to ignore any duplicate pages on your website.

FerinKings 10-05-2016 03:12 AM

Robots.txt is a text file that tells the crawler which pages to crawl and which ones not to crawl.

Jayasree 10-05-2016 03:22 AM

Bots consult robots.txt when they crawl our website; it gives crawlers instructions about indexing its pages.

AshokDixit89 10-10-2016 12:40 AM

The most common use of robots.txt is to ban crawlers from visiting private folders or content that gives them no additional information.

Other common uses:
* Allowing access only to specific crawlers.
* Allowing everything apart from certain patterns of URLs.
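Those last two uses could look roughly like the sketch below. The bot names, paths and the load_rules helper are hypothetical, and the check relies on Python's standard urllib.robotparser, which matches plain path prefixes only (it does not understand wildcard patterns such as * inside a path).

```python
from urllib.robotparser import RobotFileParser

def load_rules(text):
    """Parse robots.txt text into a RobotFileParser (helper for this sketch)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser

# Case 1: only one named crawler is welcome; everyone else is asked to stay out.
only_goodbot = load_rules("""\
User-agent: GoodBot
Disallow:

User-agent: *
Disallow: /
""")
print(only_goodbot.can_fetch("GoodBot", "http://example.com/page.html"))
print(only_goodbot.can_fetch("OtherBot", "http://example.com/page.html"))

# Case 2: everything is open except URLs starting with certain prefixes.
except_prefixes = load_rules("""\
User-agent: *
Disallow: /tmp/
Disallow: /search
""")
print(except_prefixes.can_fetch("AnyBot", "http://example.com/about.html"))
print(except_prefixes.can_fetch("AnyBot", "http://example.com/tmp/cache.html"))
print(except_prefixes.can_fetch("AnyBot", "http://example.com/search?q=robots"))
```

An empty "Disallow:" line means nothing is disallowed for that crawler, while "Disallow: /" covers the whole site.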

pattroderick 10-11-2016 01:48 AM

Robots.txt is a file that tells the crawler what to crawl and what not to crawl.

autocarcovers 10-11-2016 01:56 AM

The robots exclusion protocol (REP), or robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

SaraSanjay 10-12-2016 10:49 PM

The robots exclusion protocol (REP), or robots.txt, is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their site.

NukeBlaster 10-13-2016 09:57 PM

Quote:

Originally Posted by abinaya (Post 594864)
Robots.txt is a text file. Through this file, a site gives search engine crawlers instructions about indexing and caching a webpage, a file, a directory, or the whole domain.

At first I thought you were asking a question, until I realized you had answered it yourself.

jordanangel 10-15-2016 02:22 AM

The robots exclusion protocol (REP), or robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

saravjeet 10-17-2016 02:24 AM

The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned.

jamesvincent21 10-17-2016 02:31 AM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

Shiksha 10-17-2016 03:25 AM

It is great when search engines frequently visit your site and index your content, but there are often cases when having parts of your online content indexed is not what you want. For instance, if you have two versions of a page (one for viewing in the browser and one for printing), you would rather have the printing version excluded from crawling; otherwise you risk a duplicate content penalty. Also, if you happen to have sensitive data on your site that you do not want the world to see, you will prefer that search engines do not index those pages (although in this case the only sure way to keep sensitive data from being indexed is to keep it offline on a separate machine). Additionally, if you want to save some bandwidth by excluding images, stylesheets and JavaScript from indexing, you also need a way to tell spiders to keep away from these items.
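As a rough sketch of the cases above, the snippet below writes a robots.txt that asks all crawlers to skip a print version and the static assets, then checks the result with Python's standard urllib.robotparser. The directory names (/print/, /images/, /css/, /js/), the build_robots_txt helper and the "ExampleBot" name are assumptions; a real site would use its own paths.

```python
from urllib.robotparser import RobotFileParser

def build_robots_txt(disallowed_prefixes, agent="*"):
    """Return robots.txt text asking `agent` to skip the given path prefixes."""
    lines = [f"User-agent: {agent}"]
    lines += [f"Disallow: {prefix}" for prefix in disallowed_prefixes]
    return "\n".join(lines) + "\n"

# Hypothetical layout: printer-friendly pages and static assets in their own folders.
robots_txt = build_robots_txt(["/print/", "/images/", "/css/", "/js/"])
print(robots_txt)

# Check the generated rules the way a cooperating crawler would.
parser = RobotFileParser()
parser.parse(robots_txt.splitlines())
print(parser.can_fetch("ExampleBot", "http://example.com/print/article-1.html"))
print(parser.can_fetch("ExampleBot", "http://example.com/article-1.html"))
```

The generated text would be saved as robots.txt in the site's root directory.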

Shiksha 10-17-2016 03:29 AM

The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned. Robots are often used by search engines to categorize web sites. Not all robots cooperate with the standard; email harvesters, spambots, malware, and robots that scan for security vulnerabilities may even start with the portions of the website where they have been told to stay out. The standard is different from, but can be used in conjunction with, Sitemaps, a robot inclusion standard for websites.
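One common way the two standards are combined is a Sitemap line inside robots.txt itself. A minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and a made-up sitemap URL:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical file: exclusion rules plus a pointer to the sitemap.
RULES = """\
User-agent: *
Disallow: /admin/

Sitemap: http://example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# site_maps() returns the listed sitemap URLs, or None if there are none.
print(parser.site_maps())
print(parser.can_fetch("ExampleBot", "http://example.com/admin/login.html"))
```

Keep in mind that the file is public, so listing /admin/ also tells uncooperative robots where to look.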

Jayasree 02-13-2017 04:09 AM

Robots.txt is a text file. One use is to keep crawlers away from a new web page that has no content yet, which is why it matters for SEO.

quocanh123 02-13-2017 07:45 AM

The robots exclusion protocol (REP), or robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

jamesandy 05-02-2017 04:54 AM

Robots.txt is a file associated with your website used to ask different web crawlers to crawl or not crawl portions of your website.

AliceArifova 05-02-2017 10:36 PM

It is a text file that tells bots what to crawl and what not to crawl.

Hostingsafety 05-02-2017 10:41 PM

Robots.txt is the common name of a text file that is uploaded to a Web site's root directory. Crawlers request it there directly, so it does not need to be linked in the site's HTML.

Jayasree 05-02-2017 11:06 PM

Robots.txt is a text file that instructs search engine crawlers how to crawl and index pages on your website.

komal14 05-03-2017 03:26 AM

Search engines use this file when crawling a website's pages and deciding which of them to index.

jumad 05-03-2017 04:59 AM

A robots.txt file tells search bots which pages should be indexed.



All times are GMT -7.

