Terminology

A

Accessible name

An accessible name, often referred to as an 'accessibility identifier' or 'accessibility label,' is a text attribute associated with a UI (User Interface) element in a mobile app. This label provides information about the purpose or nature of the UI element. It is crucial to make the app accessible to users who rely on assistive technologies.

Examples of UI components that require an accessible name include but are not limited to buttons, links, form elements, and UI controls.

The importance of accessibility labels lies in their role in making mobile apps usable for individuals with visual or cognitive impairments. Users who rely on screen readers, voice commands, or other assistive technologies depend on these labels to understand and interact with the app's interface.

The content of accessibility labels should be clear, concise, and descriptive. It should convey the essential information about the UI element's purpose without being overly verbose.

Accessibility Navigation Menus (on Screen Readers)

"Screen Readers' Accessibility Navigation Menus" encompasses interface components or controls typically available in screen readers, such as VoiceOver Rotor in iOS or TalkBack Reading Control in Android. These menus offer users customizable tools and options to navigate interfaces effectively. They provide shortcuts, settings adjustments, and navigation controls tailored for accessibility needs, enhancing user interaction and enabling seamless access to digital content on various devices.

The VoiceOver Rotor set on Speaking Rate
The TalkBack Reading control set on Headings

Audio descriptions

Audio descriptions, also known as descriptive audio or video descriptions, are spoken narrations that provide additional information about visual content. These descriptions are specifically designed to assist individuals who are blind or visually impaired in comprehending visual elements that are not otherwise conveyed through dialogue or sound.

Audio descriptions verbally convey essential visual elements, such as actions, settings, gestures, scene changes, and expressions, which are not evident from the audio track alone. They are typically inserted during natural pauses in dialogue or sound elements to avoid interrupting the original audio content. They integrate seamlessly with the audio or video, ensuring a cohesive viewing or listening experience.

Audio descriptions are commonly found in TV shows, movies, educational videos, and cultural events. They can be provided as a separate audio track or through specialized devices and services.

C

Captions

Captions are text-based representations of the audio content in a video, audio, or multimedia presentation. They display the spoken dialogue, sound effects, and other auditory information as text on the screen, synchronized with the corresponding visuals.

There are two types of captions. The first type is the Closed Captions (CC). This caption type is not embedded into the media file and can be toggled on and off. Open Captions are permanently embedded into the video and cannot be turned off. They are always visible when the video is played.

Captions play a crucial role in making multimedia content accessible, providing an alternative means of understanding the auditory information presented in videos and ensuring a more inclusive experience for all viewers.

E

Essential content

"Essential content" refers to critical information necessary to understand or operate the UI. Without it, users may have difficulty or be unable to perceive the content's context and purpose or operating controls.

F

Full justify (text alignment)

"Full justify" alignment is a type of text alignment where the text block's left and right sides are aligned evenly. In full justify alignment, spaces between words and characters are adjusted dynamically to ensure that each line of text reaches both the left and right margins of the text block, creating a neat and straight edge on both sides.

This alignment style can enhance the visual appeal of text, presenting a clean and formal appearance. However, it can occasionally create uneven spacing between words, which might affect readability, especially in narrow columns or when dealing with irregular word lengths. In terms of accessibility, full justify alignment might pose challenges for users with dyslexia or visual impairments, as uneven spacing can make it more difficult to track lines while reading.

Focus indication

Focus indication is a visual cue or indicator that highlights the currently focused element on the screen when using the app. It is critical to make apps accessible to users who navigate through interfaces using assistive technologies like voice control or keyboard navigation.

The focus indication helps users understand where they are within the app and which element will respond to their input next as they navigate through interactive elements.

Effective focus indication provides a clear and intuitive user experience for those who rely on keyboard navigation or other assistive technologies, ensuring equal access to app functionality for all users.

Focus sequence

Focus sequence refers to the order in which interactive elements or controls receive focus when using navigation methods like keyboard navigation or other input methods. It denotes the logical flow or sequence in which these elements are selected or activated as users navigate the app's interface. A well-organized focus sequence ensures that the focus order aligns with the app's structure and content flow, allowing users to navigate and interact with app elements coherently and predictably. This is particularly crucial for users relying on keyboard navigation or assistive technologies to access and interact with mobile app interfaces.

I

Images of text

Images of text refer to readable content essential for comprehending the content and its context presented within and as a part of an image media.

Image types

Complex

Complex images refer to visuals that contain intricate or detailed content, such as diagrams, charts, graphs, or illustrations with multifaceted information. Text alternatives for complex images aim to provide a concise yet comprehensive description that conveys the crucial information within the image. It should offer a textual representation of the image's content, structure, and relationships between various elements within the visual, ensuring that users relying on screen readers or other assistive technologies can grasp the essential details presented by the complex image. This text alternative intends to offer an equivalent understanding of the image's information without overwhelming the user with excessive details.

Decorative

Decorative images refer to images that serve a solely aesthetic or decorative purpose, providing no significant content or information relevant to understanding the app's functionality or content. Text alternatives for decorative images are typically left empty or use a null value, indicating to assistive technologies that the image is purely decorative and does not convey any meaningful information necessary for understanding the app's content or functionality. This approach helps screen readers focus on essential content without disrupting the user experience with irrelevant descriptions for purely ornamental visuals.

Functional

Functional images refer to images that represent controls, links, or buttons essential for users to interact with the app's functionality. Text alternatives for functional images should convey the function or purpose of the control, aiding users who rely on screen readers or assistive technologies to understand the interactive nature of the image. This alt text should succinctly describe the action or outcome triggered by interacting with the image, ensuring users comprehend its role in the app's functionality. Providing accurate text alternatives for functional images is crucial for users who rely on assistive technologies to navigate and interact effectively within the mobile application.

Informative

Informative images are images that convey essential information, context, or conveyance of function within the app's interface. Text alternatives for informative images are descriptive text alternatives that succinctly describe the content or message conveyed by the image, aiding users who rely on screen readers or other assistive technologies to understand the significance of the visual element. The text alternatives for informative images should be informative and concise. They should clearly represent the image's purpose or content to ensure users receive the relevant information conveyed by the image.

M

Media

In the context of mobile applications, the term "media" refers to various types of content that can be embedded or integrated into the app to enhance its visual or auditory experience. Media elements can be, for example, a static image, a video audio-only or audio-only tracks, or synchronized tracks that include both video and audio.

Since media elements, by nature, are oriented to the user's sensory receptors (primarily visual or auditory), their content must have an alternative that can be perceived by all users and read programmatically by assistive technologies.

Because of the diversity of media types and their nature, the alternative type may differ between different media types. In addition, because media alternatives must provide a solution for people with a variety of impairments, sometimes more than one alternative is required to ensure that all users can accurately perceive the entire media content.

See also: Static media, Time-based media, and Synchronized media

Multipoint gestures

Multipoint gestures involve interactions on a touchscreen using multiple contact points simultaneously, typically using more than one finger.
These gestures typically involve using two or more fingers or touch points on the screen simultaneously to perform actions or trigger specific functionalities within the app.
When offering advanced interaction methods, developers should ensure these gestures are intuitive, discoverable for users, and provide alternatives for individuals with motor limitations or other disabilities.

P

Path-based gestures

Path-based gestures refer to actions in the application that require users to draw or trace specific paths, shapes, or gestures on the screen following predefined patterns recognized by the app.
Path-based gestures often enable custom or specialized actions beyond standard tapping or swiping, allowing users to interact with the app uniquely. However, when offering non-standard interaction methods, developers must ensure these gestures are intuitive and accessible to users with diverse abilities, providing alternative methods for those who might find tracing gestures challenging.

Points (measurement unit)

"Points" refers to a measurement unit used typically to determine text size. Points are a standard unit of measurement for font size where 1 point (pt) is approximately equal to 1/72 of an inch.

On iOS, font size is usually measured with points. For example, UIFont in iOS uses point sizes for text.

Android, however, uses "scaled pixels" (sp). To calculate points from scaled pixels, use this formula:

points = scaledPixels / displayDensity

Where scaledPixels is the text size in "sp", and displayDensity is the device's screen density.

Programmatically read/determined

This term refers to content that user agents, like assistive technologies, can extract, interpret, and present to users in different modalities.

R

Reading sequence

Reading sequence refers to the programmatic order of elements in the UI, as assistive technologies such as screen readers interpret and read them.

Real-time updates

Real-time updates refer to information that dynamically and continuously refreshes and displays changing data or statuses to users. These updates provide current, up-to-the-moment details on a particular event, process, or status relevant to the user's interaction with the application.

Real-time updates provide users with information that directly impacts their current interaction or experience with the application. For instance, in a delivery app, real-time updates might display the estimated time of arrival (ETA) or the current location of the delivery person.

A few examples of application types that commonly use real-time updates are public transportation apps, food delivery services, stock market updates, and more.

In essence, real-time updates in applications serve to keep users informed and engaged by continuously delivering timely and relevant information about ongoing processes or events, enriching the overall user experience.

Role

The term "Role" refers to the type of component or widget an element represents on a webpage or native app, like a button, link, heading, or menu.

Web and native platforms build roles into native components. Using these components will provide "out of the box" support of the essential attributes required by the component type, such as support of states, returning a value, and the implied behavior of the component type.

Custom components use dedicated attributes to define the component’s role. Platforms have specific attributes for these roles so that assistive technologies interpret the component correctly.

The component’s role is separate from its accessible name. A component’s role is not added to the accessible name because assistive technologies may not correctly identify it for users.

Web and native app platforms usually have a dedicated attribute when defining custom roles or types. Custom roles typically use familiar and unambiguous terms that reflect the nature of the component and how it is used. Custom roles are uncommon and used sparingly on custom components as most roles can be defined through dedicated attributes.

S

Single-pointer gestures

Single-pointer gestures are a single point of contact, typically using one-finger interactions like a tap or double-tap that perform actions or trigger specific functionalities within the app's interface. These gestures are often the primary input method for users with a variety of disabilities. Developers must ensure they are responsive and compatible with assistive technologies.

Static media

"Static media" refers to non-text content that doesn't change or progress over time (like audio or video tracks would). Examples of static media include images, infographics, diagrams, or other non-text elements that do not rely on a time-based sequence for comprehension.

See also: Media, Time-based media, and Synchronized media

Synchronized media

"Synchronized media" refers to a video track with a synchronized corresponding audio track that auditory reflects the actions on the video.

See also: Media, Static media, and Time-based media

T

Text blocks

Text blocks refer to sections or segments of textual content. They typically contain at least three full rows of text in the app's body text font size.

Text alternatives

Text alternatives refer to descriptive text or content provided alongside visual elements such as images, graphics, or media (audio or video) to ensure that users, particularly those using screen readers or other assistive technologies, can access and comprehend the content. These text alternatives convey the essential information presented by the visual content, enabling users who cannot perceive the visual elements to understand their purpose, function, or information conveyed, thus fostering a more inclusive and accessible mobile app experience.

Since images and graphic elements may have different roles and purposes (such as an icon on a button, adding information to the written text, or graphs and charts). The phrasing of the text alternative should optimally serve the purpose and content that the image represents. An excellent place to learn about the optimal ways to formulate text alternatives is the W3C's Images Tutorial, which also includes a decision tree to help users determine what text alternative fits specific use cases best.

Time-based media

"Time-based media" refers to non-text content that unfolds or changes over time, such as audio or video files. These media types inherently have a duration and evolve as the content progresses, presenting the information within a specific timeframe.

See also: Media, Static media, and Synchronized media

Time-sensitive updates

"Time-sensitive" updates refer to information or notifications that require immediate attention or action from the user within a specific timeframe. These updates are designed to convey important or critical information with urgency or a limited window of time.

Time-sensitive updates convey information that requires prompt user action or attention. The information of time-sensitive updates could be related to security alerts, critical reminders, or real-time changes in a situation (e.g., location-based alerts). Time-sensitive notifications are often crucial for the user to maintain an app's service's functionality, security, or timeliness. For example, notifications about account security breaches or transaction verifications are time-sensitive due to their critical nature.

Toasts

"Toasts" are small, non-modal messages that provide user feedback, notifications, or alerts. Toast messages are typically displayed for a limited time before they automatically disappear. The fact that Toast messages are non-modal means that they don't require user interaction to dismiss them, and assistive technologies should announce their content without shifting focus to them.

Toast messages serve as unobtrusive, transient notifications or feedback mechanisms that inform users about specific events, updates, or actions within the app interface without disrupting the user experience.

Transcript

A transcript is a written or typed record of spoken words, dialogue, or text from a specific audio or video file. It represents a textual version of the content spoken or presented within the audio or video recording.

Unlike captions, transcripts do not provide descriptions of other relevant sounds. Transcripts can be time-indexed but are not synchronized with their associated audio or video tracks.