:py:mod:`generic.pipelines`
===========================

.. py:module:: generic.pipelines

.. autodoc2-docstring:: generic.pipelines
   :allowtitles:

Module Contents
---------------

Classes
~~~~~~~

.. list-table::
   :class: autosummary longtable
   :align: left

   * - :py:obj:`GenericPipeline <generic.pipelines.GenericPipeline>`
     - .. autodoc2-docstring:: generic.pipelines.GenericPipeline
          :summary:
   * - :py:obj:`DropMissingTextPipeline <generic.pipelines.DropMissingTextPipeline>`
     - .. autodoc2-docstring:: generic.pipelines.DropMissingTextPipeline
          :summary:
   * - :py:obj:`FeedStoragePipeline <generic.pipelines.FeedStoragePipeline>`
     - .. autodoc2-docstring:: generic.pipelines.FeedStoragePipeline
          :summary:
   * - :py:obj:`FileItemPipeline <generic.pipelines.FileItemPipeline>`
     - .. autodoc2-docstring:: generic.pipelines.FileItemPipeline
          :summary:
   * - :py:obj:`FileItemStoragePipeline <generic.pipelines.FileItemStoragePipeline>`
     - .. autodoc2-docstring:: generic.pipelines.FileItemStoragePipeline
          :summary:
   * - :py:obj:`SpacyTokenizePipeline <generic.pipelines.SpacyTokenizePipeline>`
     - .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline
          :summary:
   * - :py:obj:`CleanSentencesPipeline <generic.pipelines.CleanSentencesPipeline>`
     - .. autodoc2-docstring:: generic.pipelines.CleanSentencesPipeline
          :summary:

API
~~~

.. py:class:: GenericPipeline
   :canonical: generic.pipelines.GenericPipeline

   .. autodoc2-docstring:: generic.pipelines.GenericPipeline

   .. py:method:: process_item(item, spider)
      :canonical: generic.pipelines.GenericPipeline.process_item

      .. autodoc2-docstring:: generic.pipelines.GenericPipeline.process_item

.. py:class:: DropMissingTextPipeline
   :canonical: generic.pipelines.DropMissingTextPipeline

   .. autodoc2-docstring:: generic.pipelines.DropMissingTextPipeline

   .. py:method:: process_item(item)
      :canonical: generic.pipelines.DropMissingTextPipeline.process_item

      .. autodoc2-docstring:: generic.pipelines.DropMissingTextPipeline.process_item

.. py:class:: FeedStoragePipeline
   :canonical: generic.pipelines.FeedStoragePipeline

   .. autodoc2-docstring:: generic.pipelines.FeedStoragePipeline

   .. py:method:: process_item(item)
      :canonical: generic.pipelines.FeedStoragePipeline.process_item

      .. autodoc2-docstring:: generic.pipelines.FeedStoragePipeline.process_item

.. py:class:: FileItemPipeline
   :canonical: generic.pipelines.FileItemPipeline

   .. autodoc2-docstring:: generic.pipelines.FileItemPipeline

   .. py:method:: process_item(item: generic.items.FileItem, spider: scrapy.Spider) -> generic.items.FileItem
      :canonical: generic.pipelines.FileItemPipeline.process_item

      .. autodoc2-docstring:: generic.pipelines.FileItemPipeline.process_item

   .. py:method:: process_pdf_item(item: generic.items.FileItem, spider: scrapy.Spider) -> generic.items.FileItem
      :canonical: generic.pipelines.FileItemPipeline.process_pdf_item

      .. autodoc2-docstring:: generic.pipelines.FileItemPipeline.process_pdf_item

.. py:class:: FileItemStoragePipeline
   :canonical: generic.pipelines.FileItemStoragePipeline

   .. autodoc2-docstring:: generic.pipelines.FileItemStoragePipeline

   .. py:method:: process_item(item, spider)
      :canonical: generic.pipelines.FileItemStoragePipeline.process_item

      .. autodoc2-docstring:: generic.pipelines.FileItemStoragePipeline.process_item

.. py:class:: SpacyTokenizePipeline(spacy_url)
   :canonical: generic.pipelines.SpacyTokenizePipeline

   .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline

   .. rubric:: Initialization

   .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline.__init__

   .. py:method:: from_crawler(crawler)
      :canonical: generic.pipelines.SpacyTokenizePipeline.from_crawler
      :classmethod:

      .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline.from_crawler

   .. py:method:: process_item(item, spider)
      :canonical: generic.pipelines.SpacyTokenizePipeline.process_item
      :async:

      .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline.process_item

   .. py:method:: close_spider(spider)
      :canonical: generic.pipelines.SpacyTokenizePipeline.close_spider
      :async:

      .. autodoc2-docstring:: generic.pipelines.SpacyTokenizePipeline.close_spider

.. py:class:: CleanSentencesPipeline
   :canonical: generic.pipelines.CleanSentencesPipeline

   .. autodoc2-docstring:: generic.pipelines.CleanSentencesPipeline

   .. py:method:: process_item(item, spider)
      :canonical: generic.pipelines.CleanSentencesPipeline.process_item
      :async:

      .. autodoc2-docstring:: generic.pipelines.CleanSentencesPipeline.process_item