Remediating Inaccessible Static PDF Documents

Automated Accessibility Checking

Many pieces of software contain built-in accessibility checkers that check for various issues, including images not having image descriptions, a lack of tags, color contrast issues, and other problems. Before diving into anything else, run the checker to see what issues it finds. Keep in mind, though, that these automated tools may not catch all the problems, so manual testing throughout the process of remediation is still required. You might also want to periodically run the checker again throughout the remediation process. If the checker finds issues with color contrast, that is a change that needs to be made in the source document.

Tagging PDF Documents

Before looking at standard tags you can apply to documents, it’s important to understand that while the way you apply these tags is software dependent, the functionality and semantics are the same. The PDF standard has a standard set of tags you can apply to content to make your document accessible.

Now that we’ve got your expectations aligned, what exactly is tagging? Tagging is the process of specifying the structure and semantic intent of the document’s content, so it can be properly interpreted and communicated to assistive technology.

The Root of the Document

When tagging a document, we need to have a root for what is called the structure tree. The root of this tree must always be the “document” tag. This tag will contain all other content tags for the document, including paragraphs, headings and images.

Headings

Headings play a crucial role in logically structuring documents. When reviewing documents, screen reader users often prefer to quickly go through documents and find relevant sections by headings. To allow this in a PDF document, you need to use heading tags. There are 6 different heading tags that each represent a different section level in the document’s hierarchical structure. Not all documents use all six. The important thing is to order them logically and ideally not skip a level. Without properly tagged headings, screen readers will see an unstructured block of text.

Most government documents consist of images, lists and generic text content. The tags you use for this are the following:

Tables

Use tables when your document presents tabular data. When tagging tables in your document, there are a few tags that you’ll be using:

Reading Order

Now that we understand tagging and the various tags we can apply to a PDF document, the next crucial concept to understand is the reading order of the document. Even if the content appears visually correct, that may not be the case after applying tags to your document. After tagging, you must ensure that the actual reading order matches the intended reading order and remains logically consistent.  Software used to apply tags to PDF documents often provides the ability to view and modify the reading order. However, adjusting the reading order in software isn’t enough. It also must be tested with assistive technology, such as screen readers.