# ArchiveSpider This spider starts crawling the given URL, extract all the links to articles in the page, follows "Next" link to the next archive index page, and repeat the process until no link to "Next" page is found. The spider is designed for a typical "Archive" pages commonly found on popular CMS. ## Usage The spider requires two arguments: `archive_next_xpath` and `archive_article_xpath`. `archive_next_xpath` is an XPath ([Wikipedia article](https://en.wikipedia.org/wiki/XPath)) to `href` attribute of the link to the next page. `archive_article_xpath` is an XPath to `href` attribute of the links to archive pages. Suppose, the archive index page has many links to archive pages under a `
January 10, 2025