Tech Stack
The Easiest Way to Generate Perfect XML Sitemaps for Any Website
XML Sitemap Generator is a modern web application that makes creating SEO-optimized sitemaps effortless. Simply enter your website's URL, watch the real-time crawl progress, and download a fully compliant XML sitemap ready for Google, Bing, and other search engines.
The Problem It Solves
SEO is crucial for any website's success, and XML sitemaps are fundamental to getting indexed by search engines. But creating sitemaps manually is tedious, error-prone, and time-consuming especially for large websites. Existing tools are often:
- Expensive with monthly subscriptions
- Limited to small numbers of pages
- Lacking support for modern JavaScript-rendered sites
- Missing real-time feedback during crawling
XML Sitemap Generator solves all of these problems with a free, open-source solution that handles both traditional and modern web architectures.
Key Features
Intelligent Crawling
The crawler is smart enough to handle real-world websites:
- Crawls up to 1000 pages per site (configurable from 10-1000)
- Seamlessly handles Server-Side Rendered (SSR) pages
- Supports Client-Side Rendered (CSR) pages using Puppeteer headless browser
- Automatically extracts internal links while avoiding non-HTML resources
- Concurrent processing with configurable batch sizes for optimal performance
Ethical & Compliant
Being a good web citizen is built into the core:
- robots.txt Compliance: Automatically fetches and respects disallow rules
- Priority-based Sitemap: Assigns intelligent priority values based on page depth (1.0 for homepage, decreasing by 0.1 per level)
- Standards Compliant: Generates XML sitemaps following the official Sitemaps.org protocol
- lastmod Support: Includes last-modified dates from HTTP headers when available
Real-time Progress Tracking
Watch your sitemap being built live:
- Server-Sent Events (SSE) streaming for instant feedback
- Live display of current URL being crawled
- Running total of discovered pages
- Progress indicator showing crawl status
- No page refreshes needed everything updates in real-time
Hybrid Rendering Support
This is what sets this tool apart, it handles modern web applications:
Traditional server-side rendered pages are crawled quickly with Cheerio for HTML parsing. JavaScript-heavy single-page applications (SPAs) are rendered with Puppeteer to extract dynamically loaded content. The crawler automatically detects which approach to use for each page.
Technical Implementation
Backend Architecture
Built with Next.js API routes for serverless deployment:
- Puppeteer integration for headless browser automation
- Cheerio for fast HTML parsing
- Server-Sent Events for real-time communication
- Concurrent batch processing for performance
- Smart caching to avoid redundant requests
Frontend Excellence
Modern React application with TypeScript:
- Real-time UI updates via SSE
- Smooth animations and loading states
- Form validation with clear error messages
- Responsive design with Tailwind CSS
- Optimized for performance and accessibility
Sitemap Generation
Produces fully compliant XML output:
xml<?xml version="1.0" encoding="UTF-8"?> <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"> <url> <loc>https://example.com/</loc> <lastmod>2025-12-27T12:00:00.000Z</lastmod> <priority>1.0</priority> </url> <!-- Additional URLs... --> </urlset>
How It Works
Step 1: Enter URL
Simply paste your website's URL (e.g., https://example.com)
Step 2: Configure Select the maximum number of pages to crawl using the slider (10-1000 pages)
Step 3: Generate Click "Generate Sitemap" and watch the real-time progress as pages are discovered
Step 4: Download
Download the generated sitemap.xml file with one click
Step 5: Deploy
Upload the sitemap to your website's root directory and reference it in robots.txt:
Sitemap: https://example.com/sitemap.xml
Real-World Impact
This tool has helped:
- Developers generate sitemaps for client projects quickly
- SEO professionals audit site structure and discover indexing issues
- Small businesses improve their search engine visibility without expensive tools
- Content creators ensure all their pages are discoverable by search engines
Use Cases
New Website Launches Generate a comprehensive sitemap before going live to ensure search engines can index your content immediately.
SEO Audits Discover how many pages are actually crawlable on your site and identify potential navigation issues.
Site Migrations Ensure all pages from your old site structure are accounted for in the new sitemap.
Regular Updates Re-generate sitemaps periodically to reflect new content and removed pages.
Learning & Education Understand how web crawlers work and what search engines see on your website.
Technical Highlights
Performance Optimization
- Concurrent crawling with batch processing
- Smart resource detection to avoid crawling images, CSS, JS files
- Puppeteer pooling for efficient browser instance management
- Efficient memory usage even with large site crawls
Error Handling
- Graceful handling of unreachable pages
- robots.txt parsing errors don't block crawling
- Timeout management for slow-loading pages
- Clear error messages for user debugging
Deployment Ready
- Serverless-friendly architecture
- Environment variable configuration
- Easy deployment to Vercel, Netlify, or any Node.js host
- Docker support for self-hosting
Open Source & Free Forever
XML Sitemap Generator is completely free and open source. No subscriptions, no page limits, no upsells. The code is available on GitHub for anyone to use, modify, or learn from.
This is SEO tooling done right: powerful, accessible, and community-driven.
Notes
- Fully functional web application deployed and ready to use
- Free and open source (no subscriptions or limits)
- Supports both SSR and CSR websites via Puppeteer
- Real-time progress tracking with Server-Sent Events
- Respects robots.txt and web standards
- Configurable crawl limits (10-1000 pages)
- Mobile responsive design
- Easy self-hosting with Next.js
- Generates standards-compliant XML sitemaps
- Educational value for learning web crawling concepts
