Generate XML Sitemaps
An intelligent site crawler that respects robots.txt, parses image tags, optimizes fetches with conditional HTTP caching, and writes compressed sitemaps.
Incremental Caching & Gzip
Uses ETag and Last-Modified validators to make conditional requests, so unchanged pages answer with 304 Not Modified (the 304 fast path). Sitemaps are written as both plain XML and gzip-compressed .gz files.
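The conditional fetch described above can be sketched roughly as follows. This is a minimal illustration, not the crawler's actual code: `CacheEntry`, `FetchLike`, `buildConditionalHeaders`, and `conditionalFetch` are hypothetical names, and the HTTP client is injected so the sketch does not assume any particular library.

```typescript
// Hypothetical cache entry: validators saved from an earlier 200 response.
interface CacheEntry {
  etag?: string;
  lastModified?: string;
  body: string;
}

// Minimal response/fetch shapes so the sketch is independent of the HTTP client.
interface MinimalResponse {
  status: number;
  headers: { get(name: string): string | null };
  text(): Promise<string>;
}
type FetchLike = (
  url: string,
  init: { headers: Record<string, string> },
) => Promise<MinimalResponse>;

// Turn stored validators into conditional request headers.
function buildConditionalHeaders(entry?: CacheEntry): Record<string, string> {
  const headers: Record<string, string> = {};
  if (entry?.etag) headers["If-None-Match"] = entry.etag;
  if (entry?.lastModified) headers["If-Modified-Since"] = entry.lastModified;
  return headers;
}

async function conditionalFetch(
  url: string,
  cache: Map<string, CacheEntry>,
  fetchImpl: FetchLike,
): Promise<{ body: string; fromCache: boolean }> {
  const cached = cache.get(url);
  const res = await fetchImpl(url, { headers: buildConditionalHeaders(cached) });

  // 304 fast path: the server confirmed the cached copy is still current.
  if (res.status === 304 && cached) {
    return { body: cached.body, fromCache: true };
  }

  const body = await res.text();
  if (res.status === 200) {
    // Remember the new validators for the next crawl.
    cache.set(url, {
      etag: res.headers.get("etag") ?? undefined,
      lastModified: res.headers.get("last-modified") ?? undefined,
      body,
    });
  }
  return { body, fromCache: false };
}
```

In Node 18+, the global `fetch` could be passed as `fetchImpl` through a thin wrapper; a persistent store would replace the in-memory `Map` if the cache must survive between runs.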
Image & Shadow DOM Parsing
Traverses shadow roots and extracts image elements to build sitemaps that use Google's image sitemap extension.
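As a rough sketch of the idea, the code below walks a DOM-like tree, descends into shadow roots, and emits a `<url>` entry with `image:image` children. The names `NodeLike`, `collectImageSources`, and `buildUrlEntry` are hypothetical, and the structural `NodeLike` type stands in for a real `Element`. Only open shadow roots are reachable through `shadowRoot`; closed roots would need a different approach.

```typescript
// Structural stand-in for a DOM Element (hypothetical; a real Element fits this shape).
interface NodeLike {
  tagName: string;
  getAttribute(name: string): string | null;
  children: ArrayLike<NodeLike>;
  shadowRoot?: { children: ArrayLike<NodeLike> } | null;
}

// Depth-first walk over light DOM children, then any open shadow root.
function collectImageSources(node: NodeLike, out: string[] = []): string[] {
  if (node.tagName.toUpperCase() === "IMG") {
    const src = node.getAttribute("src");
    if (src) out.push(src);
  }
  for (const child of Array.from(node.children)) {
    collectImageSources(child, out);
  }
  if (node.shadowRoot) {
    for (const child of Array.from(node.shadowRoot.children)) {
      collectImageSources(child, out);
    }
  }
  return out;
}

// Escape the five XML special characters so URLs are safe inside element text.
function escapeXml(value: string): string {
  return value
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Build one <url> entry; the enclosing <urlset> must declare
// xmlns:image="http://www.google.com/schemas/sitemap-image/1.1".
function buildUrlEntry(pageUrl: string, imageUrls: string[]): string {
  const lines = ["  <url>", `    <loc>${escapeXml(pageUrl)}</loc>`];
  for (const imageUrl of imageUrls) {
    lines.push(
      `    <image:image><image:loc>${escapeXml(imageUrl)}</image:loc></image:image>`,
    );
  }
  lines.push("  </url>");
  return lines.join("\n");
}
```

Relative `src` values would still need to be resolved against the page URL before they are written to the sitemap; that step is left out here.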
Robots.txt & Redirections
Handles redirects responsibly, consolidates protocols, and matches wildcard and Allow rules as specified in RFC 9309.
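The RFC 9309 matching rules can be illustrated with a small sketch: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. `Rule`, `patternToRegExp`, and `isAllowed` are hypothetical names, and the sketch assumes the rules for the relevant user-agent group have already been parsed. Percent-encoding normalization is omitted for brevity.

```typescript
// One parsed Allow/Disallow line from the matching user-agent group.
interface Rule {
  allow: boolean;
  pattern: string;
}

// Translate a robots.txt path pattern into an anchored regular expression.
function patternToRegExp(pattern: string): RegExp {
  const anchored = pattern.endsWith("$");
  const core = anchored ? pattern.slice(0, -1) : pattern;
  const source = core
    .split("*")
    // Escape regex metacharacters in the literal parts between wildcards.
    .map((part) => part.replace(/[.*+?^${}()|[\]\\]/g, "\\$&"))
    .join(".*");
  return new RegExp("^" + source + (anchored ? "$" : ""));
}

function isAllowed(path: string, rules: Rule[]): boolean {
  let best: Rule | undefined;
  for (const rule of rules) {
    if (rule.pattern === "") continue; // an empty pattern matches nothing
    if (!patternToRegExp(rule.pattern).test(path)) continue;
    const longer = !best || rule.pattern.length > best.pattern.length;
    const tieWonByAllow =
      !!best &&
      rule.pattern.length === best.pattern.length &&
      rule.allow &&
      !best.allow;
    if (longer || tieWonByAllow) best = rule;
  }
  // No matching rule means the path may be crawled.
  return best ? best.allow : true;
}
```

For example, with `Disallow: /private/` and `Allow: /private/public/`, the path `/private/public/page` is allowed because the Allow pattern is the longer match.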
Asynchronous Redis Queue & Stability
Crawls run in BullMQ workers, with automatic Chromium recycling and SIGKILL cleanup to mitigate leaks and CFG crashes.
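One way the recycling and cleanup logic could look is sketched below. It is deliberately library-agnostic: `RecyclePolicy`, `BrowserStats`, `BrowserHandle`, `shouldRecycle`, and `closeOrKill` are hypothetical names, the thresholds are placeholders, and the kill function is injected. In a BullMQ setup, a worker's processor function might call `shouldRecycle` after each job and `closeOrKill` before launching a fresh browser.

```typescript
// Hypothetical limits; tune these for the actual workload.
interface RecyclePolicy {
  maxJobsPerBrowser: number;
  maxRssBytes: number;
}

interface BrowserStats {
  jobsServed: number;
  rssBytes: number;
}

// Recycle when the browser has served enough jobs or grown too large.
function shouldRecycle(stats: BrowserStats, policy: RecyclePolicy): boolean {
  return (
    stats.jobsServed >= policy.maxJobsPerBrowser ||
    stats.rssBytes >= policy.maxRssBytes
  );
}

// Minimal browser shape: a process id (if known) and a graceful close.
interface BrowserHandle {
  pid: number | undefined;
  close(): Promise<void>;
}

// Try a graceful close first. If it fails or hangs past the deadline,
// send SIGKILL to the browser process (when its pid is known).
async function closeOrKill(
  browser: BrowserHandle,
  kill: (pid: number, signal: "SIGKILL") => void,
  timeoutMs = 5000,
): Promise<"closed" | "forced"> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<"timeout">((resolve) => {
    timer = setTimeout(() => resolve("timeout"), timeoutMs);
  });
  try {
    const outcome = await Promise.race([
      browser.close().then(() => "closed" as const),
      timeout,
    ]);
    if (outcome === "closed") return "closed";
  } catch {
    // close() itself failed; fall through to the hard kill.
  } finally {
    if (timer !== undefined) clearTimeout(timer);
  }
  if (browser.pid !== undefined) kill(browser.pid, "SIGKILL");
  return "forced";
}
```

In Node, `process.kill` could be passed as the `kill` argument. Injecting it keeps the sketch testable without spawning a real Chromium process.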