Class JsoupDocumentReaderConfig

java.lang.Object
org.springframework.ai.reader.jsoup.config.JsoupDocumentReaderConfig

public final class JsoupDocumentReaderConfig extends Object
Common configuration for the JsoupDocumentReader. Provides options for specifying the character encoding, CSS selector, text separator, and whether to extract all text from the body or specific elements, and handling link extraction.
Author:
Alexandros Pappas
  • Field Details

    • charset

      public final String charset
    • selector

      public final String selector
    • separator

      public final String separator
    • allElements

      public final boolean allElements
    • groupByElement

      public final boolean groupByElement
    • includeLinkUrls

      public final boolean includeLinkUrls
    • metadataTags

      public final List<String> metadataTags
    • additionalMetadata

      public final Map<String,Object> additionalMetadata
  • Method Details