1. Automatic paragraph detection for accessible PDF documents
- Abstract
This paper describes a new algorithm for the automatic detection and tagging of paragraphs in PDF documents. This is an important feature of the PDF Accessibility Validation Engine (PAVE), which is an open-source web application for the analysis and semi-automatic correction of accessibility issues in PDF documents. The tool is currently used by a large number of users, and their feedback is collected and evaluated. The evaluation so far revealed some major usability issues mainly due to the missing paragraph detection functionality. After an introduction in PDF accessibility this paper discusses the current usability issues with PAVE and describes the newly proposed algorithm to alleviate them. A first evaluation and conclusion of the results will be provided in the final paper.
- Published
- 2023