The robots META tag tells search engine robots that visit a site whether they may index a page. It can also tell a robot whether to follow the links on the page and index the pages they lead to. The tag is especially useful for those who cannot access the server root to edit the robots.txt file but still want to keep certain files and directories out of the index.
Format of the robots META tag
The robots META tag is placed in the HTML document inside the <HEAD> tag. The format is simple, and the values are not case-sensitive:
<META NAME="ROBOTS" CONTENT="value">
The CONTENT attribute accepts four basic values: INDEX (index the page), NOINDEX (do not index it), FOLLOW (follow the links on the page) and NOFOLLOW (do not follow them). For example, to keep a document out of the index:
<META NAME="ROBOTS" CONTENT="NOINDEX">
To tell search engine robots not to follow the links on the page:
<META NAME="ROBOTS" CONTENT="NOFOLLOW">
To forbid both indexing the page and following its links:
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
Multiple values are separated by commas.
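To make the format concrete, here is a minimal Python sketch of how a robot might read these values. It is only an illustration built on the standard library's html.parser module; the class and function names are made up for this example, and real search engines may parse pages differently.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <META NAME="ROBOTS" CONTENT="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = (attr_map.get("name") or "").strip().lower()
        if name != "robots":
            return
        content = attr_map.get("content") or ""
        # Values are comma-separated and compared case-insensitively.
        for value in content.split(","):
            value = value.strip().lower()
            if value:
                self.directives.add(value)


def robots_directives(html_text):
    """Return the set of lowercase robots directives found in the page."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    return parser.directives


page = '<html><head><META NAME="ROBOTS" CONTENT="NoIndex, NOFOLLOW"></head><body></body></html>'
print(sorted(robots_directives(page)))  # prints ['nofollow', 'noindex']
```

The sketch lowercases both the NAME and the CONTENT values, which mirrors the rule that capitalization does not matter.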
To prevent only Google's robots from indexing a page on your site while allowing other robots to index it, use the following tag:
<META NAME="GOOGLEBOT" CONTENT="NOINDEX, NOFOLLOW">
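The effect of a robot-specific tag can be sketched in Python as follows. This is a hedged illustration, not a description of how Google works internally: the names MetaDirectiveParser and may_index are hypothetical, and the sketch assumes a robot honours both the generic "robots" tag and the tag addressed to it by name.

```python
from html.parser import HTMLParser


class MetaDirectiveParser(HTMLParser):
    """Collect CONTENT values of META tags, grouped by lowercase NAME."""

    def __init__(self):
        super().__init__()
        self.by_name = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = (attr_map.get("name") or "").strip().lower()
        content = attr_map.get("content") or ""
        values = {v.strip().lower() for v in content.split(",") if v.strip()}
        if name and values:
            self.by_name.setdefault(name, set()).update(values)


def may_index(html_text, robot_name):
    """Return False if the page forbids indexing for the given robot."""
    parser = MetaDirectiveParser()
    parser.feed(html_text)
    # Assumption: the generic tag and the robot's own tag both apply.
    generic = parser.by_name.get("robots", set())
    specific = parser.by_name.get(robot_name.lower(), set())
    return "noindex" not in (generic | specific)


page = '<head><META NAME="GOOGLEBOT" CONTENT="NOINDEX, NOFOLLOW"></head>'
print(may_index(page, "googlebot"))  # prints False
print(may_index(page, "otherbot"))   # prints True
```

Here "otherbot" is a placeholder robot name; any robot that is not addressed by the tag sees no restriction.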
To allow robots to index the page but not the images on it, use the following tag:
<META NAME="ROBOTS" CONTENT="NOIMAGEINDEX">
Google automatically creates and stores a "snapshot" of each page it crawls. The cached version lets Google show the page to users even when the original is unavailable (for example, because of a temporary technical problem on the web server). The cached page appears in the form in which Google's robot last crawled it, with a message at the top stating that it is a cached version. Users can open it by clicking the "Cached" link on the search results page.
To prevent all search engines from showing a "Cached" link for your page, add the following tag to the <HEAD> section:
<META NAME="ROBOTS" CONTENT="NOARCHIVE">
To remove the "Cached" link only on Google while leaving it in place for other search engines, use the following tag:
<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
Note. This tag only removes the "Cached" link for the page. Google will continue to index the page and display a snippet.
Preventing or removing snippets
A snippet is the text that is displayed below the page title in the search results and describes the content of the page.
To prevent Google from displaying snippets for your page, add the following tag to the <HEAD> section:
<META NAME="GOOGLEBOT" CONTENT="NOSNIPPET">
Note. Removing the snippet also removes the cached copy of the page.
How to make sure that search engines do not use DMOZ data in search results for your site
One of the sources used to generate page descriptions is the Open Directory Project. To stop all search engines that support the meta tag from using this information to describe your page, add the following tag:
<META NAME="ROBOTS" CONTENT="NOODP">
To stop only Google from using this information in the page description, add the tag:
<META NAME="GOOGLEBOT" CONTENT="NOODP">
If the "robots" meta tag is also used for other directives, they can be combined. Example:
<META NAME="GOOGLEBOT" CONTENT="NOODP, NOFOLLOW">
An example of a description tag:
<META NAME="Description" CONTENT="Information for WEB-developers, programmers and students of higher educational institutions, WEB-technology.">
Search bans
How to tell the search engine that a certain piece of a page should not be indexed, or that a link should not be followed?
Each search engine reinvents the wheel here: Google adds the rel="nofollow" attribute to links, Yahoo adds class="robots-nocontent", and Yandex suggests wrapping the text in <noindex> tags.
Tag NOINDEX
Indexing can be prevented not only for whole files but also for parts of them. To keep certain parts of a file (text) out of the index, mark them with the special tags <NOINDEX> </NOINDEX>.
The NOINDEX tag must not break the nesting of other tags. Only the Yandex search engine understands it.
rel="nofollow"
To tell search engines not to follow a link, add the rel="nofollow" attribute to the <A> tag:
<A rel="nofollow" href="htmlweb.pk">WEB-technology</A>
This attribute does not affect whether the linked page is indexed. Most search engines (except Google) still follow the link. The sole purpose of the attribute is to tell the search engine that the ranking of the page containing the link should not be passed on to the page the link points to.
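As a rough illustration of how a crawler could separate ordinary links from rel="nofollow" links, consider the Python sketch below. The names LinkCollector and collect_links and the example.com addresses are invented for the example; it treats rel as a space-separated list of tokens and makes no claim about how any particular search engine handles such links.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Record every <a href> together with a flag for rel="nofollow"."""

    def __init__(self):
        super().__init__()
        self.links = []  # list of (href, passes_ranking) tuples

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr_map = dict(attrs)
        href = attr_map.get("href")
        if not href:
            return
        # rel may hold several space-separated tokens, e.g. "external nofollow".
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        self.links.append((href, "nofollow" not in rel_tokens))


def collect_links(html_text):
    """Return (href, passes_ranking) for each link in the page."""
    collector = LinkCollector()
    collector.feed(html_text)
    return collector.links


page = (
    '<a rel="nofollow" href="http://example.com/partner">Partner</a>'
    '<a href="/about.html">About</a>'
)
for href, passes_ranking in collect_links(page):
    print(href, passes_ranking)
```

In this sketch the first link is reported with False and the second with True, so a ranking step could skip the first one while an indexing step still sees both.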
If you need to reduce the number of external links counted on the page, wrap them in <NOINDEX> </NOINDEX> tags.
class="robots-nocontent"
The following examples show how to use the class="robots-nocontent" attribute with different tags and for different purposes:
<div class="robots-nocontent">This is the site's navigation menu, common to all pages. It contains many terms and words that are not related to the content of the page.</div>
<span class="robots-nocontent">This is the site header, which is present on every page of the site and is not related to any particular page.</span>
<p class="robots-nocontent">This is boilerplate legal text required on every page of the site.</p>
How often should a spider index the document?
A webmaster can "tell" a search robot or the user's bookmark file that the contents of a file are going to change. In that case the robot will not save the URL, and the user's browser may or may not add the file to its bookmarks. As long as this information is described only in the /robots.txt file, the user will not know that the page is going to change.
The DOCUMENT-STATE meta tag can be useful here. By default, this meta tag is assumed to have CONTENT="STATIC".
<META NAME="DOCUMENT-STATE" CONTENT="STATIC">
<META NAME="DOCUMENT-STATE" CONTENT="DYNAMIC">
Mirrors
How to exclude generated pages or duplicate documents from search engine indexing when there is a mirror server?
Generated pages are pages produced by CGI scripts. They certainly should not be indexed, because trying to open them from a search engine results in an error. As for mirrors, it is pointless for a search engine to return two separate links to different servers with the same content. To avoid this, use the URL meta tag, specifying the absolute URL of the document (for mirrors, the corresponding page on the main server).
<META NAME="URL" CONTENT="absolute_url">
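The idea of collapsing mirrors onto one address can be sketched in Python. This is an assumption-laden illustration: index_key and UrlMetaParser are hypothetical names, the main.example and mirror.example hosts are placeholders, and whether a given search engine honours the URL meta tag at all is not something this sketch can confirm.

```python
from html.parser import HTMLParser


class UrlMetaParser(HTMLParser):
    """Find the first <META NAME="URL" CONTENT="..."> in a page."""

    def __init__(self):
        super().__init__()
        self.declared_url = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.declared_url is not None:
            return
        attr_map = dict(attrs)
        if (attr_map.get("name") or "").strip().lower() == "url":
            self.declared_url = (attr_map.get("content") or "").strip() or None


def index_key(fetched_url, html_text):
    """Return the URL a hypothetical indexer would file this page under."""
    parser = UrlMetaParser()
    parser.feed(html_text)
    # Fall back to the fetched address when no URL meta tag is present.
    return parser.declared_url or fetched_url


main_page = '<head><META NAME="URL" CONTENT="http://main.example/page.html"></head>'
pages = {
    "http://main.example/page.html": main_page,
    "http://mirror.example/page.html": main_page,  # same content on a mirror
    "http://main.example/other.html": "<head></head>",
}
unique = {index_key(url, html) for url, html in pages.items()}
print(len(unique))  # prints 2
```

Three fetched addresses collapse to two index entries, because the mirror copy declares the main server's URL as its own.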