:py:mod:`generic.spiders.archive` ================================= .. py:module:: generic.spiders.archive .. autodoc2-docstring:: generic.spiders.archive :allowtitles: Module Contents --------------- Classes ~~~~~~~ .. list-table:: :class: autosummary longtable :align: left * - :py:obj:`ArchiveSpiderConfig ` - .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpiderConfig :summary: * - :py:obj:`ArchiveSpider ` - .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider :summary: API ~~~ .. py:class:: ArchiveSpiderConfig(/, **data: typing.Any) :canonical: generic.spiders.archive.ArchiveSpiderConfig Bases: :py:obj:`generic.spiders.read_more.ReadMoreSpiderConfig` .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpiderConfig .. rubric:: Initialization .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpiderConfig.__init__ .. py:attribute:: archive_article_xpath :canonical: generic.spiders.archive.ArchiveSpiderConfig.archive_article_xpath :type: typing.Optional[str] :value: "//main//li[@class!=' pr']//h2[@class='title']//a/@href" .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpiderConfig.archive_article_xpath .. py:attribute:: archive_next_xpath :canonical: generic.spiders.archive.ArchiveSpiderConfig.archive_next_xpath :type: typing.Optional[str] :value: "//div[contains(@class, 'pagination')]//a[contains(text(), '次へ')]/@href" .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpiderConfig.archive_next_xpath .. py:class:: ArchiveSpider(*args, **kwargs) :canonical: generic.spiders.archive.ArchiveSpider Bases: :py:obj:`generic.spiders.base.GenericSpider`\ [\ :py:obj:`generic.spiders.archive.ArchiveSpiderConfig`\ ], :py:obj:`generic.mixins.read_more.ReadMoreMixin` .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider .. rubric:: Initialization .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.__init__ .. py:attribute:: name :canonical: generic.spiders.archive.ArchiveSpider.name :value: 'archive_spider' .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.name .. py:attribute:: allowed_domains :canonical: generic.spiders.archive.ArchiveSpider.allowed_domains :value: ['bunshun.jp'] .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.allowed_domains .. py:attribute:: start_urls :canonical: generic.spiders.archive.ArchiveSpider.start_urls :value: ['https://bunshun.jp/category/latest?page=300'] .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.start_urls .. py:method:: get_config_class() -> typing.Type[generic.spiders.archive.ArchiveSpiderConfig] :canonical: generic.spiders.archive.ArchiveSpider.get_config_class :classmethod: .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.get_config_class .. py:method:: start() :canonical: generic.spiders.archive.ArchiveSpider.start :async: .. py:method:: parse_archive_index(response) :canonical: generic.spiders.archive.ArchiveSpider.parse_archive_index .. autodoc2-docstring:: generic.spiders.archive.ArchiveSpider.parse_archive_index