What is a sitemap.xml and why do I need one? +
A sitemap.xml is an XML file that lists all the important URLs on your website along with metadata like last modification date and change frequency. It helps search engines discover and crawl your pages more efficiently. While search engines can find pages through links, a sitemap ensures no important pages are missed — especially new pages or pages with few incoming links.
How do I fix a broken sitemap? +
Common sitemap issues include invalid XML syntax, non-accessible URLs (returning 404 or 500 errors), and missing required fields. Validate your XML format, ensure all listed URLs return 200 status codes, and include the required <urlset> and <url><loc> elements. Most frameworks (Next.js, WordPress, etc.) can auto-generate sitemaps for you.
How many URLs should be in my sitemap? +
A single sitemap can contain up to 50,000 URLs and must not exceed 50MB uncompressed. For larger sites, use a sitemap index file that references multiple sitemaps. Only include canonical, indexable URLs — skip pages with noindex tags, redirects, or error pages. Quality matters more than quantity.
Should my sitemap include every page? +
No. Only include pages you want search engines to index. Exclude pages behind authentication, admin panels, duplicate content, paginated archives, and utility pages like search results. Focus on your most valuable content pages, product pages, and any pages you want to rank in search results.