The Indexing

Definition

It is based on META tags. During the submission of a Web page to a search engine, this one will index the page by treating in priority the contents of META tag.

Referencing is not an exact science, according to search engines, META tag can be more or less effective. Indeed, the algorithms of indexing differ according to engines.

Where are the META tags ?

Structure of an html document :

<HTML>
<HEAD>
<TITLE>Titre de la Page</TITLE>
<META NAME=""CONTENT="">
</HEAD>
<BODY>
HTML code
</BODY>
</HTML>

META tags are located between the end of title </TITLE >tag and the </HEAD> tag of the page.


The different types:

<META name="keywords" content="">
<META name="description" content="">
<META name="language" content="">
<META name="robots" content="">
<META name="revisit" content="">


The KEYWORDS attribute
:

- It can contain until 100 words, however, 20 words per page are enough.
- A key word should not be repeated several times.

Example:

<META name="keywords"content="Sport,Automobile,Race">


The DESCRIPTION attribute:

- It should not contain more than 150 characters (spaces included).
- Try to re-use the keywords used in the attribute keyword.

Example
 :

<META name="description"content="Welcome, the site is about F1, you will see my favorite pilots and their victories in championship.">


The LANGUAGE Attribute:

- For a French page you will use Fr
- For an English page you will use En
- For a German page you will use De

In fact this tag contains the first 2 letters of the country in the official language of the country. For example for a German page, we will use De (for Deutsch) and not Ge for Germany.

Example:

<META name="language"content="fr">

The ROBOT attribute:

The robot is in fact the program which consults your page in order to index it. The purpose of this tag is to indicate the behavior that it must have.

You can :
- prohibit or not the robot from indexing the page (index or noindex)
- force or not the robot to consult the links of the page (follow or nofollow)
- index the images or not of the page (imageindex or noimageindex)
- index the links associated with the images (imageclik or noimageclik)
By defect, the following parameters are used: index, follow, imageclick, imageindex.

Example:

<meta name="robot"content="index,nofollow,imageindex,noimageclik">
The REVISIT attribute:
The purpose of this tag is to make visit the site all X days by the robot.
Example:
<meta name="revisit"content="6">
The robot will revisit the site in 6 days.
The AUTHOR attribute:
It is used to insert the name of the autor in an HTML page
Example:
<meta name="author"content="name of the author">


SEARCH Engines

Address
Pages (millions)
Time (week)
http://www.alltheweb.com
340
3
http://www.altavista.fr
1000
6
http://www.nomade.fr
3
3
http://www.google.com
500
6
http://www.hotbot.com
500
8
http://www.lycos.fr
340
3
http://www.voila.fr
160
2
http://www.yahoo.fr
?
?
http://www.dmoz.org
?
?