Class PagePdfDocumentReader
java.lang.Object
org.springframework.ai.reader.pdf.PagePdfDocumentReader
- All Implemented Interfaces:
Supplier<List<Document>>
,DocumentReader
Groups the parsed PDF pages into
Document
s. You can group one or more pages
into a single output document. Use PdfDocumentReaderConfig
for customization
options. The default configuration is: - pagesPerDocument = 1 - pageTopMargin = 0 -
pageBottomMargin = 0- Author:
- Christian Tzolov
-
Field Summary
-
Constructor Summary
ConstructorDescriptionPagePdfDocumentReader
(String resourceUrl) PagePdfDocumentReader
(String resourceUrl, PdfDocumentReaderConfig config) PagePdfDocumentReader
(org.springframework.core.io.Resource pdfResource) PagePdfDocumentReader
(org.springframework.core.io.Resource pdfResource, PdfDocumentReaderConfig config) -
Method Summary
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.springframework.ai.document.DocumentReader
read
-
Field Details
-
METADATA_START_PAGE_NUMBER
- See Also:
-
METADATA_END_PAGE_NUMBER
- See Also:
-
METADATA_FILE_NAME
- See Also:
-
document
protected final org.apache.pdfbox.pdmodel.PDDocument document -
resourceFileName
-
-
Constructor Details
-
PagePdfDocumentReader
-
PagePdfDocumentReader
public PagePdfDocumentReader(org.springframework.core.io.Resource pdfResource) -
PagePdfDocumentReader
-
PagePdfDocumentReader
public PagePdfDocumentReader(org.springframework.core.io.Resource pdfResource, PdfDocumentReaderConfig config)
-
-
Method Details