
Entity escaping

  04/08/07 00:42, by , Categories: Specs

Your Sitemap file must be UTF-8 encoded (you can generally do this when you save the file). As with all XML files, any data values (including URLs) must use entity escape codes for the characters listed in the table below.

Character          Escape Code
Ampersand      &   &amp;
Single Quote   '   &apos;
Double Quote   "   &quot;
Greater Than   >   &gt;
Less Than      <   &lt;
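
As a minimal sketch of applying the table above in a script, the following uses `escape` from Python's standard `xml.sax.saxutils` module. The helper name `entity_escape` and the sample URL are assumptions made for illustration; they are not part of the Sitemap protocol.

```python
from xml.sax.saxutils import escape

# escape() handles &, < and > by default; the two quote characters
# have to be supplied as extra entities.
QUOTE_ENTITIES = {"'": "&apos;", '"': "&quot;"}


def entity_escape(value: str) -> str:
    """Entity-escape the five characters from the table (hypothetical helper)."""
    return escape(value, QUOTE_ENTITIES)


if __name__ == "__main__":
    # Illustrative URL containing an ampersand and a greater-than sign.
    print(entity_escape("http://www.example.com/view?widget=3&count>2"))
    # prints: http://www.example.com/view?widget=3&amp;count&gt;2
```

Because `escape` replaces the ampersand first, the `&` inside the other escape codes is not escaped a second time. Running the helper on a value that is already escaped would double-escape it, so it should be applied exactly once, to raw values.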

In addition, all URLs (including the URL of your Sitemap) must be URL-escaped and encoded so that the web server hosting them can read them. However, if you are using any sort of script, tool, or log file to generate your URLs (anything except typing them in by hand), this is usually already done for you. Check that your URLs follow the RFC 3986 standard for URIs, the RFC 3987 standard for IRIs, and the XML standard.

Below is an example of a URL that uses a non-ASCII character (ü), as well as a character that requires entity escaping (&):



Below is that same URL, ISO-8859-1 encoded (for hosting on a server that uses that encoding) and URL escaped:



Below is that same URL, UTF-8 encoded (for hosting on a server that uses that encoding) and URL escaped:



Below is that same URL, but also entity escaped:



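The four stages described above can be sketched in Python with the standard library. The URL used here is an assumption chosen to match the description (one `ü` and one `&`), and the helper name `url_escape` and its `SAFE` character set are likewise illustrative; a real generator would need to decide which reserved characters to preserve for its own URLs.

```python
from urllib.parse import quote
from xml.sax.saxutils import escape

# Assumed example URL: \u00fc is the non-ASCII character ü, and the
# ampersand is the character that later needs entity escaping.
raw_url = "http://www.example.com/\u00fcmlat.php&q=name"

# Reserved characters left untouched so the URL keeps its structure.
SAFE = ":/?&=#"


def url_escape(url: str, encoding: str) -> str:
    """Percent-encode non-ASCII characters using the server's encoding."""
    return quote(url, safe=SAFE, encoding=encoding)


latin1_url = url_escape(raw_url, "iso-8859-1")
utf8_url = url_escape(raw_url, "utf-8")
# Entity escaping is the last step, applied to the URL-escaped value.
sitemap_value = escape(utf8_url)

if __name__ == "__main__":
    print(latin1_url)     # http://www.example.com/%FCmlat.php&q=name
    print(utf8_url)       # http://www.example.com/%C3%BCmlat.php&q=name
    print(sitemap_value)  # http://www.example.com/%C3%BCmlat.php&amp;q=name
```

The order matters in this sketch: URL escaping runs first and entity escaping second, so the `%` sequences pass through unchanged and only the `&` becomes `&amp;`.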
Sample XML Sitemap

The following example shows a Sitemap in XML format. The Sitemap in the example contains a small number of URLs, each using a different set of optional parameters.


<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
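
As a hedged illustration of producing a Sitemap of this shape from a script, the sketch below builds one with Python's standard `xml.etree.ElementTree` (the `xml_declaration` argument requires Python 3.8 or later). The function name `build_sitemap`, the dictionary layout, and the sample URLs and values are assumptions for illustration; `loc`, `lastmod`, `changefreq`, and `priority` are the tag names defined by the Sitemap protocol, of which only `loc` is required.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
URL_TAGS = ("loc", "lastmod", "changefreq", "priority")


def build_sitemap(entries):
    """Return a UTF-8 encoded Sitemap (bytes) for a list of dicts.

    Each dict must contain "loc"; the other keys are optional.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        for tag in URL_TAGS:
            if tag in entry:
                # ElementTree entity-escapes &, < and > in text on output.
                ET.SubElement(url, tag).text = entry[tag]
    return ET.tostring(urlset, encoding="UTF-8", xml_declaration=True)


if __name__ == "__main__":
    # Illustrative entries, each using a different set of optional tags.
    sample = [
        {"loc": "http://www.example.com/", "lastmod": "2005-01-01",
         "changefreq": "monthly", "priority": "0.8"},
        {"loc": "http://www.example.com/catalog?item=12&desc=vacation",
         "changefreq": "weekly"},
    ]
    print(build_sitemap(sample).decode("utf-8"))
```

Letting the XML library serialize the values avoids hand-written entity escaping: the `&` in the second URL is written out as `&amp;` automatically. The URLs passed in are still expected to be URL-escaped beforehand.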

