Experimental Structured Output
Note: This is an experimental setting, we welcome you to try this new mode and provide us with any feedback you may have. The setting/output will be iterated & improved upon.
This setting adds tagged content from the PDF to the HTML to improve document accessibility. This tagged content can be used to better support accessibility features, such as screen readers.
With this setting enabled, each page will output an element with the structTree class, this contains the structure of that page which is generated using the defined structured content inside the PDF (if present). This output is connected to the relevant text elements with aria-owns attributes, which tools such as screen readers use to parse the content on the page in the intended order defined in the PDF.
This setting improves accessibility software’s ability to handle several unique document structures, like lists and tables, by mapping them to their respective HTML equivalent inside the structTree output.
It also allows for handling some text properties like /ActualText and /Lang values, to be mapped to their respective HTML equivalent.
This setting has no behaviour when converting documents to SVG.
Expected values:
truefalse
Default: false
Usage Examples
Command Line
-Dorg.jpedal.pdf2html.experimentalStructuredOutput=true
Cloud
{ "org.jpedal.pdf2html.experimentalStructuredOutput": true }