
Sitemaps XML format - sitemaps.org - Protocol

This document describes the XML schema for the Sitemap protocol.
The Sitemap protocol format consists of XML tags. All data values in a Sitemap must be entity-escaped. The file itself must be UTF-8 encoded.
The Sitemap must:
  • Begin with an opening <urlset> tag and end with a closing </urlset> tag.
  • Specify the namespace (protocol standard) within the <urlset> tag.
  • Include a <url> entry for each URL, as a parent XML tag.
  • Include a <loc> child entry for each <url> parent tag.
All other tags are optional. Support for these optional tags may vary among search engines. Refer to each search engine's documentation for details.
Also, all URLs in a Sitemap must be from a single host, such as www.example.com or store.example.com. For further details, refer to the "Sitemap file location" section.
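As a rough illustration of the requirements above, the following Python sketch builds a minimal Sitemap with only the required tags. The function name build_sitemap is hypothetical, and the sketch assumes Python 3.8 or later with only the standard library; it is one possible way to produce such a file, not a reference implementation.

```python
import xml.etree.ElementTree as ET

# Namespace (protocol standard) that must be declared on <urlset>.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return UTF-8 encoded Sitemap bytes with one <url>/<loc> pair per URL.

    Hypothetical helper: it does not check the single-host rule or any
    size limits.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for address in urls:
        url = ET.SubElement(urlset, "url")
        # ElementTree entity-escapes text content (for example & becomes &amp;).
        ET.SubElement(url, "loc").text = address
    return ET.tostring(urlset, encoding="UTF-8", xml_declaration=True)


if __name__ == "__main__":
    print(build_sitemap(["http://www.example.com/"]).decode("utf-8"))
```

Letting the XML library do the escaping avoids hand-written entity handling, which is where many malformed Sitemaps come from.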

Sample XML Sitemap

The following example shows a Sitemap that contains just one URL and uses all optional tags (<lastmod>, <changefreq>, and <priority>).
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>http://www.example.com/</loc>
      <lastmod>2005-01-01</lastmod>
      <changefreq>monthly</changefreq>
      <priority>0.8</priority>
   </url>
</urlset>

Also see our example with multiple URLs.

XML tag definitions

The available XML tags are described below.
Tag Required/Optional Description
<urlset> required Encapsulates the file and references the current protocol standard.
<url> required Parent tag for each URL entry. The remaining tags are children of this tag.
<loc> required URL of the page. This URL must begin with the protocol (such as http) and end with a trailing slash, if your web server requires it. This value must be less than 2,048 characters.
<lastmod> optional The date of last modification of the file. This date should be in W3C Datetime format. This format allows you to omit the time portion, if desired, and use YYYY-MM-DD.
Note that this tag is separate from the If-Modified-Since (304) header the server can return, and search engines may use the information from both sources differently.
<changefreq> optional How frequently the page is likely to change. This value provides general information to search engines and may not correlate exactly to how often they crawl the page. Valid values are:
  • always
  • hourly
  • daily
  • weekly
  • monthly
  • yearly
  • never
The value "always" should be used to describe documents that change each time they are accessed. The value "never" should be used to describe archived URLs.
Please note that the value of this tag is considered a hint and not a command. Even though search engine crawlers may consider this information when making decisions, they may crawl pages marked "hourly" less frequently than that, and they may crawl pages marked "yearly" more frequently than that. Crawlers may periodically crawl pages marked "never" so that they can handle unexpected changes to those pages.
<priority> optional The priority of this URL relative to other URLs on your site. Valid values range from 0.0 to 1.0. This value does not affect how your pages are compared to pages on other sites—it only lets the search engines know which pages you deem most important for the crawlers.
The default priority of a page is 0.5.
Please note that the priority you assign to a page is not likely to influence the position of your URLs in a search engine's result pages. Search engines may use this information when selecting between URLs on the same site, so you can use this tag to increase the likelihood that your most important pages are present in a search index.
Also, please note that assigning a high priority to all of the URLs on your site is not likely to help you. Since the priority is relative, it is only used to select between URLs on your site.
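The value rules for the optional tags can be checked before a Sitemap is written. The sketch below is illustrative only: format_lastmod and check_optional_tags are hypothetical helper names, and treating naive datetimes as UTC is an assumption of this example, not something the protocol states.

```python
from datetime import datetime, timezone

# The valid <changefreq> values listed above.
CHANGEFREQ_VALUES = {
    "always", "hourly", "daily", "weekly", "monthly", "yearly", "never",
}


def format_lastmod(dt, date_only=False):
    """Format a datetime for <lastmod>: YYYY-MM-DD or a full timestamp."""
    if date_only:
        return dt.strftime("%Y-%m-%d")
    if dt.tzinfo is None:
        # Assumption of this sketch: naive values are taken to be UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.isoformat(timespec="seconds")


def check_optional_tags(changefreq=None, priority=None):
    """Return a list of problems found in the given optional values."""
    problems = []
    if changefreq is not None and changefreq not in CHANGEFREQ_VALUES:
        problems.append(f"invalid changefreq: {changefreq!r}")
    if priority is not None and not 0.0 <= float(priority) <= 1.0:
        problems.append(f"priority out of range 0.0 to 1.0: {priority!r}")
    return problems
```

For example, format_lastmod(datetime(2004, 12, 23, 18, 0, 15)) yields 2004-12-23T18:00:15+00:00, the same shape as the timestamps used in the samples on this page.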

Entity escaping

Your Sitemap file must be UTF-8 encoded (you can generally do this when you save the file). As with all XML files, any data values (including URLs) must use entity escape codes for the characters listed in the table below.
Character Escape Code
Ampersand & &amp;
Single Quote ' &apos;
Double Quote " &quot;
Greater Than > &gt;
Less Than < &lt;
In addition, all URLs (including the URL of your Sitemap) must be URL-escaped and encoded for readability by the web server on which they are located. However, if you are using any sort of script, tool, or log file to generate your URLs (anything except typing them in by hand), this is usually already done for you. Please check to make sure that your URLs follow the RFC-3986 standard for URIs, the RFC-3987 standard for IRIs, and the XML standard.
Below is an example of a URL that uses a non-ASCII character (ü), as well as a character that requires entity escaping (&):
http://www.example.com/ümlat.php&q=name
Below is that same URL, ISO-8859-1 encoded (for hosting on a server that uses that encoding) and URL escaped:
http://www.example.com/%FCmlat.php&q=name
Below is that same URL, UTF-8 encoded (for hosting on a server that uses that encoding) and URL escaped:
http://www.example.com/%C3%BCmlat.php&q=name
Below is that same URL, but also entity escaped:
http://www.example.com/%C3%BCmlat.php&amp;q=name
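The two escaping steps above can be reproduced with the Python standard library. This is a sketch under stated assumptions: url_escape and entity_escape are hypothetical names, and the set of characters left unescaped (":/?&=") is chosen for this example URL only. Because "%" is not in that set, an already-escaped URL would be escaped a second time, so the function should be given raw URLs.

```python
from urllib.parse import quote
from xml.sax.saxutils import escape


def url_escape(url, encoding="utf-8"):
    """Percent-encode a raw URL, keeping common URL delimiters as they are."""
    return quote(url, safe=":/?&=", encoding=encoding)


def entity_escape(value):
    """Entity-escape the five characters listed in the table above."""
    return escape(value, {"'": "&apos;", '"': "&quot;"})


raw = "http://www.example.com/\u00fcmlat.php&q=name"  # \u00fc is the u-umlaut

print(url_escape(raw, encoding="iso-8859-1"))
print(url_escape(raw))
print(entity_escape(url_escape(raw)))
```

The three printed lines correspond to the ISO-8859-1, UTF-8, and entity-escaped forms shown above.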

Sample XML Sitemap

The following example shows a Sitemap in XML format. The Sitemap in the example contains a small number of URLs, each using a different set of optional parameters.
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>http://www.example.com/</loc>
      <lastmod>2005-01-01</lastmod>
      <changefreq>monthly</changefreq>
      <priority>0.8</priority>
   </url>
   <url>
      <loc>http://www.example.com/catalog?item=12&amp;desc=vacation_hawaii</loc>
      <changefreq>weekly</changefreq>
   </url>
   <url>
      <loc>http://www.example.com/catalog?item=73&amp;desc=vacation_new_zealand</loc>
      <lastmod>2004-12-23</lastmod>
      <changefreq>weekly</changefreq>
   </url>
   <url>
      <loc>http://www.example.com/catalog?item=74&amp;desc=vacation_newfoundland</loc>
      <lastmod>2004-12-23T18:00:15+00:00</lastmod>
      <priority>0.3</priority>
   </url>
   <url>
      <loc>http://www.example.com/catalog?item=83&amp;desc=vacation_usa</loc>
      <lastmod>2004-11-23</lastmod>
   </url>
</urlset>
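For readers who want to consume a Sitemap like the one above, here is a minimal parsing sketch in Python. The function name read_sitemap and the dictionary layout are choices made for this example; the only protocol facts it relies on are the namespace and the tag names.

```python
import xml.etree.ElementTree as ET

# Prefix-to-namespace map used in the search paths below.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def read_sitemap(xml_bytes):
    """Return one dict per <url> entry, holding only the tags that are present."""
    root = ET.fromstring(xml_bytes)
    entries = []
    for url in root.findall("sm:url", NS):
        entry = {"loc": url.findtext("sm:loc", namespaces=NS)}
        for tag in ("lastmod", "changefreq", "priority"):
            value = url.findtext(f"sm:{tag}", namespaces=NS)
            if value is not None:
                entry[tag] = value
        entries.append(entry)
    return entries
```

The parser undoes entity escaping, so a <loc> written with &amp; comes back with a plain & in the returned string.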

Using Sitemap index files (to group multiple sitemap files)

You can provide multiple Sitemap files, but each Sitemap file that you provide must have no more than 50,000 URLs and must be no larger than 10MB (10,485,760 bytes). If you would like, you may compress your Sitemap files using gzip to reduce your bandwidth requirement; however, the Sitemap file, once uncompressed, must still be no larger than 10MB. If you want to list more than 50,000 URLs, you must create multiple Sitemap files.
If you do provide multiple Sitemaps, you should then list each Sitemap file in a Sitemap index file. A Sitemap index file may list no more than 50,000 Sitemaps and must be no larger than 10MB (10,485,760 bytes); like a Sitemap file, it can be compressed. You can have more than one Sitemap index file. The XML format of a Sitemap index file is very similar to the XML format of a Sitemap file.
The Sitemap index file must:
  • Begin with an opening <sitemapindex> tag and end with a closing </sitemapindex> tag.
  • Include a <sitemap> entry for each Sitemap as a parent XML tag.
  • Include a <loc> child entry for each <sitemap> parent tag.
The optional <lastmod> tag is also available for Sitemap index files.
Note: A Sitemap index file can only specify Sitemaps that are found on the same site as the Sitemap index file. For example, http://www.yoursite.com/sitemap_index.xml can include Sitemaps on http://www.yoursite.com but not on http://www.example.com or http://yourhost.yoursite.com. As with Sitemaps, your Sitemap index file must be UTF-8 encoded.
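One way to stay within the per-file URL limit is to split the URL list into chunks and list the resulting files in an index. The Python sketch below assumes the 50,000-URL limit stated above; chunk_urls and build_index are hypothetical names, and the byte-size limit is not checked here.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS = 50_000  # per-file URL limit as stated in the text above


def chunk_urls(urls, size=MAX_URLS):
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def build_index(sitemap_locations):
    """Return UTF-8 encoded Sitemap index bytes listing each location."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for location in sitemap_locations:
        sitemap = ET.SubElement(root, "sitemap")
        ET.SubElement(sitemap, "loc").text = location
    return ET.tostring(root, encoding="UTF-8", xml_declaration=True)
```

A site with 120,001 URLs would produce three chunks (50,000, 50,000, and 20,001 URLs), and the index would then list three Sitemap files hosted on the same site as the index.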

Sample XML Sitemap Index

The following example shows a Sitemap index that lists two Sitemaps:
<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <sitemap>
      <loc>http://www.example.com/sitemap1.xml.gz</loc>
      <lastmod>2004-10-01T18:23:17+00:00</lastmod>
   </sitemap>
   <sitemap>
      <loc>http://www.example.com/sitemap2.xml.gz</loc>
      <lastmod>2005-01-01</lastmod>
   </sitemap>
</sitemapindex>

Note: Sitemap URLs, like all values in your XML files, must be entity escaped.

Sitemap Index XML Tag Definitions

Tag Required/Optional Description
<sitemapindex> required Encapsulates information about all of the Sitemaps in the file.
<sitemap> required Encapsulates information about an individual Sitemap.
<loc> required Identifies the location of the Sitemap.
This location can be a Sitemap, an Atom file, RSS file or a simple text file.
<lastmod> optional Identifies the time that the corresponding Sitemap file was modified. It does not correspond to the time that any of the pages listed in that Sitemap were changed. The value for the lastmod tag should be in W3C Datetime format.
By providing the last modification timestamp, you enable search engine crawlers to retrieve only a subset of the Sitemaps in the index; for example, a crawler may retrieve only the Sitemaps that were modified since a certain date. This incremental Sitemap fetching mechanism allows for the rapid discovery of new URLs on very large sites.
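The incremental fetching idea can be sketched from a crawler's point of view. This is only an illustration of how a client might use <lastmod>, not a description of how any search engine behaves. The helper names are hypothetical, date-only values are assumed to mean midnight UTC, and datetime.fromisoformat accepts only some W3C Datetime forms (for example, a trailing "Z" is rejected before Python 3.11).

```python
from datetime import datetime, timezone


def parse_w3c(value):
    """Parse YYYY-MM-DD or a full timestamp with a numeric offset."""
    dt = datetime.fromisoformat(value)
    if dt.tzinfo is None:
        # Assumption of this sketch: values without an offset are UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt


def sitemaps_modified_since(index_entries, since):
    """Return locations whose lastmod is after `since`, or is missing.

    `index_entries` is a list of (loc, lastmod) pairs; lastmod may be None.
    """
    return [
        loc
        for loc, lastmod in index_entries
        if lastmod is None or parse_w3c(lastmod) > since
    ]
```

Entries without a <lastmod> are kept, since the tag is optional and its absence says nothing about freshness.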

Other Sitemap formats

The Sitemap protocol enables you to provide details about your pages to search engines, and we encourage its use since you can provide additional information about site pages beyond just the URLs. However, in addition to the XML protocol, we support RSS feeds and text files, which provide more limited information.

Syndication feed

You can provide an RSS (Really Simple Syndication) 2.0 feed or an Atom 0.3 or 1.0 feed. Generally, you would use this format only if your site already has a syndication feed. Note that this method may not let search engines know about all the URLs on your site, since the feed may only provide information on recent URLs. Search engines can still use that information to find other pages on your site during their normal crawling by following links inside the pages listed in the feed. Make sure that the feed is located in the highest-level directory you want search engines to crawl. Search engines extract the information from the feed as follows:
  • <link> field - indicates the URL
  • modified date field (the <pubDate> field for RSS feeds and the <updated> date for Atom feeds) - indicates when each URL was last modified. Use of the modified date field is optional.
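The extraction described above can be approximated for an RSS 2.0 feed as follows. This sketch covers RSS only (Atom uses a different namespace and the <updated> field), the function name rss_entries is hypothetical, and it assumes <pubDate> values are in the usual RFC 822 style.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime


def rss_entries(rss_bytes):
    """Return (link, modified) pairs; modified is None when pubDate is absent."""
    root = ET.fromstring(rss_bytes)
    entries = []
    for item in root.iter("item"):
        link = item.findtext("link")
        if not link:
            continue  # an item without a link contributes no URL
        pub_date = item.findtext("pubDate")
        modified = parsedate_to_datetime(pub_date) if pub_date else None
        entries.append((link.strip(), modified))
    return entries
```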

Text file

You can provide a simple text file that contains one URL per line. The text file must follow these guidelines:
  • The text file must have one URL per line. The URLs cannot contain embedded new lines.
  • You must fully specify URLs, including the http.
  • Each text file can contain a maximum of 50,000 URLs and must be no larger than 10MB (10,485,760 bytes). If your site includes more than 50,000 URLs, you can separate the list into multiple text files and add each one separately.
  • The text file must use UTF-8 encoding. You can specify this when you save the file (for instance, in Notepad, this is listed in the Encoding menu of the Save As dialog box).
  • The text file should contain no information other than the list of URLs.
  • The text file should contain no header or footer information.
  • If you would like, you may compress your Sitemap text file using gzip to reduce your bandwidth requirement.
  • You can name the text file anything you wish. Please check to make sure that your URLs follow the RFC-3986 standard for URIs and the RFC-3987 standard for IRIs.
  • You should upload the text file to the highest-level directory you want search engines to crawl and make sure that you don't list URLs in the text file that are located in a higher-level directory.
Sample text file entries are shown below.
http://www.example.com/catalog?item=1
http://www.example.com/catalog?item=11
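The guidelines for text files lend themselves to a quick automated check. The Python sketch below uses the limits as stated in this section; check_text_sitemap is a hypothetical name, and tolerating blank lines is a leniency chosen for this example rather than something the guidelines allow.

```python
from urllib.parse import urlsplit

MAX_URLS = 50_000        # URL limit as stated in the text above
MAX_BYTES = 10_485_760   # size limit as stated in the text above


def check_text_sitemap(data):
    """Return a list of problems found in a text-file Sitemap given as bytes."""
    problems = []
    if len(data) > MAX_BYTES:
        problems.append("file is larger than the size limit")
    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        return problems + ["file is not valid UTF-8"]
    # Assumption of this sketch: blank lines are skipped rather than reported.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if len(lines) > MAX_URLS:
        problems.append("file lists more URLs than the limit")
    for number, line in enumerate(lines, 1):
        parts = urlsplit(line)
        if parts.scheme not in ("http", "https") or not parts.netloc:
            problems.append(f"entry {number} is not a fully specified URL: {line!r}")
    return problems
```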

Sitemap file location

The location of a Sitemap file determines the set of URLs that can be included in that Sitemap. A Sitemap file located at http://example.com/catalog/sitemap.xml can include any URLs starting with http://example.com/catalog/ but cannot include URLs starting with http://example.com/images/.
If you have the permission to change http://example.org/path/sitemap.xml, it is assumed that you also have permission to provide information for URLs with the prefix http://example.org/path/. Examples of URLs considered valid in http://example.com/catalog/sitemap.xml include:
http://example.com/catalog/show?item=23
http://example.com/catalog/show?item=233&user=3453

URLs not considered valid in http://example.com/catalog/sitemap.xml include:
http://example.com/image/show?item=23
http://example.com/image/show?item=233&user=3453
https://example.com/catalog/page1.php

Note that this means that all URLs listed in the Sitemap must use the same protocol (http, in this example) and reside on the same host as the Sitemap. For instance, if the Sitemap is located at http://www.example.com/sitemap.xml, it can't include URLs from http://subdomain.example.com.
URLs that are not considered valid are dropped from further consideration. It is strongly recommended that you place your Sitemap at the root directory of your web server. For example, if your web server is at example.com, then your Sitemap index file would be at http://example.com/sitemap.xml. In certain cases, you may need to produce different Sitemaps for different paths (e.g., if security permissions in your organization compartmentalize write access to different directories).
If you submit a Sitemap using a path with a port number, you must include that port number as part of the path in each URL listed in the Sitemap file. For instance, if your Sitemap is located at http://www.example.com:100/sitemap.xml, then each URL listed in the Sitemap must begin with http://www.example.com:100.
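The location rules in this section (same protocol, same host and port, and a path under the Sitemap's directory) can be expressed as a small predicate. The Python sketch below is a simplified reading of those rules: in_sitemap_scope is a hypothetical name, and cross-submission through robots.txt is not considered.

```python
from urllib.parse import urlsplit


def in_sitemap_scope(sitemap_url, page_url):
    """Return True if page_url may be listed in the Sitemap at sitemap_url."""
    sitemap, page = urlsplit(sitemap_url), urlsplit(page_url)
    # Directory that contains the Sitemap file, with a trailing slash.
    base_path = sitemap.path.rsplit("/", 1)[0] + "/"
    return (
        sitemap.scheme == page.scheme
        # netloc includes the port, so the port must match as well.
        and sitemap.netloc.lower() == page.netloc.lower()
        and page.path.startswith(base_path)
    )
```

With the Sitemap at http://example.com/catalog/sitemap.xml, the predicate accepts http://example.com/catalog/show?item=23 and rejects both http://example.com/image/show?item=23 and https://example.com/catalog/page1.php, matching the examples above.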

Sitemaps & Cross Submits

To submit Sitemaps for multiple hosts from a single host, you need to "prove" ownership of the host(s) for which URLs are being submitted in a Sitemap. Here's an example. Let's say that you want to submit Sitemaps for 3 hosts:
www.host1.com with Sitemap file sitemap-host1.xml
www.host2.com with Sitemap file sitemap-host2.xml
www.host3.com with Sitemap file sitemap-host3.xml

Moreover, you want to place all three Sitemaps on a single host: www.sitemaphost.com. So the Sitemap URLs will be:
http://www.sitemaphost.com/sitemap-host1.xml
http://www.sitemaphost.com/sitemap-host2.xml
http://www.sitemaphost.com/sitemap-host3.xml

By default, this will result in a "cross submission" error since you are trying to submit URLs for www.host1.com through a Sitemap that is hosted on www.sitemaphost.com (and the same applies to the other two hosts). One way to avoid the error is to prove that you own www.host1.com (i.e., that you have the authority to modify its files). You can do this by modifying the robots.txt file on www.host1.com to point to the Sitemap on www.sitemaphost.com.
In this example, the robots.txt file at http://www.host1.com/robots.txt would contain the line "Sitemap: http://www.sitemaphost.com/sitemap-host1.xml". By modifying the robots.txt file on www.host1.com and having it point to the Sitemap on www.sitemaphost.com, you have implicitly proven that you own www.host1.com. In other words, whoever controls the robots.txt file on www.host1.com trusts the Sitemap at http://www.sitemaphost.com/sitemap-host1.xml to contain URLs for www.host1.com. The same process can be repeated for the other two hosts.
Now you can submit the Sitemaps on www.sitemaphost.com.
When a particular host's robots.txt, say http://www.host1.com/robots.txt, points to a Sitemap or a Sitemap index on another host, it is expected that, for each of the target Sitemaps, such as http://www.sitemaphost.com/sitemap-host1.xml, all the URLs belong to the host pointing to it. This is because, as noted earlier, a Sitemap is expected to have URLs from a single host only.
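
The two conditions described above (the host's robots.txt points to the Sitemap, and every URL in that Sitemap belongs to that host) can be sketched as a small check. This is only an illustration: `sitemaps_declared` and `cross_submit_problems` are hypothetical names, the robots.txt body and URL list are passed in as already-fetched values, and real search engines may apply their own rules.

```python
from urllib.parse import urlsplit


def sitemaps_declared(robots_txt):
    """Collect the URLs named on 'Sitemap:' lines of a robots.txt body."""
    declared = []
    for line in robots_txt.splitlines():
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            declared.append(value.strip())
    return declared


def cross_submit_problems(host, robots_txt, sitemap_url, listed_urls):
    """Return a list of problems; an empty list means the check passed."""
    problems = []
    # The host proves ownership by pointing its robots.txt at the Sitemap.
    if sitemap_url not in sitemaps_declared(robots_txt):
        problems.append(f"robots.txt on {host} does not point to {sitemap_url}")
    # A Sitemap is expected to contain URLs from a single host only.
    for url in listed_urls:
        if urlsplit(url).hostname != host:
            problems.append(f"{url} does not belong to {host}")
    return problems


if __name__ == "__main__":
    robots = "User-agent: *\nDisallow:\nSitemap: http://www.sitemaphost.com/sitemap-host1.xml\n"
    print(cross_submit_problems(
        "www.host1.com",
        robots,
        "http://www.sitemaphost.com/sitemap-host1.xml",
        ["http://www.host1.com/page.html"],
    ))
```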

Validating your Sitemap

The following XML schemas define the elements and attributes that can appear in your Sitemap file. You can download these schemas from the links below:
For Sitemaps: http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd
For Sitemap index files: http://www.sitemaps.org/schemas/sitemap/0.9/siteindex.xsd
There are a number of tools available to help you validate the structure of your Sitemap based on this schema. You can find a list of XML-related tools at each of the following locations:

http://www.w3.org/XML/Schema#Tools
http://www.xml.com/pub/a/2000/12/13/schematools.html
In order to validate your Sitemap or Sitemap index file against a schema, the XML file will need additional headers as shown below.
Sitemap:
<?xml version='1.0' encoding='UTF-8'?>
<urlset xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd"
         xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      ...
   </url>
</urlset>

Sitemap index file:
<?xml version='1.0' encoding='UTF-8'?>
<sitemapindex xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/siteindex.xsd"
         xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <sitemap>
      ...
   </sitemap>
</sitemapindex>
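
Before handing a file to a schema-aware validator, it can be useful to confirm that these headers are in place. The sketch below uses only Python's standard library and checks the root element, the default namespace, and the xsi:schemaLocation pair. It is not full schema validation, and `check_headers` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

# Root element (in the Sitemap namespace) -> schema file it should reference.
EXPECTED_SCHEMA = {
    f"{{{SITEMAP_NS}}}urlset": "sitemap.xsd",
    f"{{{SITEMAP_NS}}}sitemapindex": "siteindex.xsd",
}


def check_headers(xml_data):
    """Return (root_name, schema_url) when the validation headers look right.

    Raises ValueError otherwise. Malformed XML raises ET.ParseError.
    """
    root = ET.fromstring(xml_data)
    if root.tag not in EXPECTED_SCHEMA:
        raise ValueError(f"unexpected root element: {root.tag}")
    # schemaLocation holds two whitespace-separated values: namespace, schema URL.
    location = root.get(f"{{{XSI_NS}}}schemaLocation", "").split()
    if len(location) != 2 or location[0] != SITEMAP_NS:
        raise ValueError("xsi:schemaLocation is missing or malformed")
    if not location[1].endswith("/" + EXPECTED_SCHEMA[root.tag]):
        raise ValueError("schema file does not match the root element")
    return root.tag.split("}")[1], location[1]
```

A file that passes this check can still fail validation against the schema, so treat it as a quick pre-flight test only.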

Extending the Sitemaps protocol

You can extend the Sitemaps protocol using your own namespace. Simply specify this namespace in the root element. For example:
<?xml version='1.0' encoding='UTF-8'?>
<urlset xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd"
         xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
         xmlns:example="http://www.example.com/schemas/example_schema"> <!-- namespace extension -->
   <url>
      <example:example_tag>
         ...
      </example:example_tag>
      ...
   </url>
</urlset>
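
A consumer that understands the extension reads it by namespace URI rather than by prefix. The following sketch shows one way to do this with Python's standard library. The function name `read_extension_values` and the prefix "sm" are arbitrary choices for illustration, and the example namespace is the placeholder used above.

```python
import xml.etree.ElementTree as ET

# Prefixes here are local to this script; only the URIs must match the file.
NAMESPACES = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "example": "http://www.example.com/schemas/example_schema",
}


def read_extension_values(xml_data):
    """Map each <loc> to the text of its <example:example_tag>, or None."""
    root = ET.fromstring(xml_data)
    values = {}
    for url in root.findall("sm:url", NAMESPACES):
        loc = url.findtext("sm:loc", default="", namespaces=NAMESPACES).strip()
        values[loc] = url.findtext(
            "example:example_tag", default=None, namespaces=NAMESPACES
        )
    return values
```

Entries without the extension element map to None, so a Sitemap that mixes extended and plain entries is still handled.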

Informing search engine crawlers

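Once you have created the Sitemap file and placed it on your webserver, you need to inform the search engines that support this protocol of its location. You can do this by:
Submitting it via the search engine's submission interface
Specifying the location in your site's robots.txt file
Sending an HTTP request
The search engines can then retrieve your Sitemap and make the URLs available to their crawlers.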

Submitting your Sitemap via the search engine's submission interface

To submit your Sitemap directly to a search engine, which will enable you to receive status information and any processing errors, refer to each search engine's documentation.

Specifying the Sitemap location in your robots.txt file

You can specify the location of the Sitemap using a robots.txt file. To do this, add the following line, including the full URL to the Sitemap:
Sitemap: http://www.example.com/sitemap.xml
This directive is independent of the user-agent line, so it doesn't matter where you place it in your file. If you have a Sitemap index file, you can include the location of just that file. You don't need to list each individual Sitemap listed in the index file.
You can specify more than one Sitemap file per robots.txt file.
Sitemap: http://www.example.com/sitemap-host1.xml

Sitemap: http://www.example.com/sitemap-host2.xml
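
To see which Sitemap locations a robots.txt file declares, Python's standard urllib.robotparser module can be used (its site_maps() method requires Python 3.8 or later). This is a small sketch that parses an in-memory robots.txt body; `sitemap_locations` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser


def sitemap_locations(robots_txt):
    """Return every URL named on a 'Sitemap:' line, in file order."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when the file declares no Sitemaps.
    return parser.site_maps() or []


if __name__ == "__main__":
    robots = (
        "Sitemap: http://www.example.com/sitemap-host1.xml\n"
        "\n"
        "User-agent: *\n"
        "Disallow: /private/\n"
        "\n"
        "Sitemap: http://www.example.com/sitemap-host2.xml\n"
    )
    for location in sitemap_locations(robots):
        print(location)
```

Both Sitemap lines are found regardless of where they sit relative to the user-agent block, which matches the placement rule described above.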

Submitting your Sitemap via an HTTP request

To submit your Sitemap using an HTTP request (replace <searchengine_URL> with the URL provided by the search engine), issue your request to the following URL:
<searchengine_URL>/ping?sitemap=sitemap_url
For example, if your Sitemap is located at http://www.example.com/sitemap.gz, your URL will become:
<searchengine_URL>/ping?sitemap=http://www.example.com/sitemap.gz
URL encode everything after the /ping?sitemap=:
<searchengine_URL>/ping?sitemap=http%3A%2F%2Fwww.example.com%2Fsitemap.gz
You can issue the HTTP request using wget, curl, or another mechanism of your choosing. A successful request will return an HTTP 200 response code; if you receive a different response, you should resubmit your request. The HTTP 200 response code only indicates that the search engine has received your Sitemap, not that the Sitemap itself or the URLs contained in it were valid. An easy way to keep your submissions current is to set up an automated job that generates and submits Sitemaps on a regular basis.
Note: If you are providing a Sitemap index file, you only need to issue one HTTP request that includes the location of the Sitemap index file; you do not need to issue individual requests for each Sitemap listed in the index.
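
The ping request described above can be sketched in Python as follows. This is only an illustration under stated assumptions: `build_ping_url` and `submit_sitemap` are hypothetical names, and the search engine base URL is whatever the search engine documents for <searchengine_URL>; no real endpoint is implied.

```python
from urllib.error import URLError
from urllib.parse import quote
from urllib.request import urlopen


def build_ping_url(search_engine_url, sitemap_url):
    """Build <searchengine_URL>/ping?sitemap=<URL-encoded Sitemap location>."""
    # safe="" makes quote() encode ':' and '/' as well.
    encoded = quote(sitemap_url, safe="")
    return f"{search_engine_url.rstrip('/')}/ping?sitemap={encoded}"


def submit_sitemap(search_engine_url, sitemap_url, timeout=10):
    """Return True if the search engine answered with HTTP 200.

    A True result only means the request was received, not that the
    Sitemap or its URLs are valid. On False, resubmit later.
    """
    try:
        with urlopen(build_ping_url(search_engine_url, sitemap_url),
                     timeout=timeout) as response:
            return response.status == 200
    except URLError:
        # Covers HTTP error responses and network failures alike.
        return False
```

Calling submit_sitemap from a scheduled job is one way to automate regular submissions. For a Sitemap index file, pass the index location once instead of each listed Sitemap.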

Excluding content

The Sitemaps protocol enables you to let search engines know what content you would like indexed. To tell search engines the content you don't want indexed, use a robots.txt file or robots meta tag. See robotstxt.org for more information on how to exclude content from search engines.

Last Updated: 27 February 2008

sitemaps.org - Protocol
