Link
Skip to main content

Experimental Structured Output

Note: This is an experimental setting, we welcome you to try this new mode and provide us with any feedback you may have. The setting/output will be iterated & improved upon.

This setting adds tagged content from the PDF to the HTML to improve document accessibility. This tagged content can be used to better support accessibility features, such as screen readers.

With this setting enabled, each page will output an element with the structTree class, this contains the structure of that page which is generated using the defined structured content inside the PDF (if present). This output is connected to the relevant text elements with aria-owns attributes, which tools such as screen readers use to parse the content on the page in the intended order defined in the PDF.

This setting improves accessibility software’s ability to handle several unique document structures, like lists and tables, by mapping them to their respective HTML equivalent inside the structTree output.

It also allows for handling some text properties like /ActualText and /Lang values, to be mapped to their respective HTML equivalent.

This setting has no behaviour when converting documents to SVG.

Expected values:

  • true
  • false

Default: false

Usage Examples

Command Line

-Dorg.jpedal.pdf2html.experimentalStructuredOutput=true

Cloud

{ "org.jpedal.pdf2html.experimentalStructuredOutput": true }

What's included in your BuildVu trial?

  • Access to download the SDK and run it locally.
  • Access to the cloud trial to convert documents in the IDR cloud.
  • Access to the Docker image to set up your own trial server in the cloud.
  • Communicate with IDR developers to ask questions & get expert advice.
  • Plenty of time to experiment and build a proof of concept.
  • Over 100 articles to help you get started and learn about BuildVu.
  • An exceptional PDF to HTML converter that took over 20 years to build!

Learn more about BuildVu

Start Your Free Trial

Trial License Type