WAI-Adapt: Symbols Module Explainer

Authors

Matthew Tylee Atkinson (@matatk), Samsung Electronics

Participate

Issues: https://github.com/w3c/adapt/issues
Discussions: https://github.com/w3c/adapt/discussions

Introduction

Some people find graphical symbols easier to interpret than written text. They may find that when symbols are presented alongside text content, that content is easier to understand. But there are many different symbol sets in use, and people don't tend to learn more than one.

We propose the adapt-symbol attribute, which allows content authors to mark up the concepts relevant to their content, so that the appropriate symbol(s) for that concept can be rendered for the user, using their chosen symbol set.

We use the set of concepts maintained by Blissymbolics Communication International (BCI). These concepts underpin the Blissymbolics (or "Bliss") language—though our use of the concepts is strictly for mapping from a concept to the appropriate aymbol(s) for the user, and is not grammatical in nature.

We're working closely with BCI on this specification, and the W3C AAC Symbol Registry (more details below).

Demo

A proof-of-concept authoring tool demo can be found at: http://matatk.agrip.org.uk/adaptable-content-authoring-tool/editor/

Please note the following limitations:

It only supports Bliss symbols.
The given example does not represent the typical use case of sparse use of symbols, mainly to annotate content such as media chapters.

Goals

Allow content authors to add element-level metadata that link parts of the content to well-known concepts. This, in turn, supports the user need of having content annotated with symbols that the user can understand.

Out of scope

Mapping from the concepts to the appropriate symbols for the user, and rendering those symbols (this is left to the UA, or an extension).
Providing translation—the concepts are specified by the author to match the language of the page's content; if the page content were to be translated, the concepts would need to be translated too.
Providing an exhaustive list of concepts (the W3C AAC Symbol Registry, described below, aims to do this).

Important notes on symbols and rendering

Though rendering is out of scope, it's important to be aware of the nature of symbols, and different symbol sets:

Symbols are graphical objects (i.e. vector or bitmap images).
A concept (e.g. "tea") may map to zero or more symbols in any given symbol set (zero is a possibility because the symbol set may not have a symbol for a particular concept).
A concept will not necessarily map to the same number of symbols across symbol sets.

User research

Note

This work has been developed over several years, with input from the Cognitive Accessibility TF, and experts from BCI. We will add references to some key elements of that background work here.

The `adapt-symbol` attribute

The intent of the adapt-symbol attribute is to link a concept (for which the UA will render an appropriate symbol for the user) to some content (usually text) on the page.

The value of the adapt-symbol attribute is a representation of a concept, which will be rendered as one or more symbols by the UA. There are several ways that the concepts may be encoded, which is the subject of current discussion.

Mapping concepts to symbols (in general)

There are a number of ways we may identify, or "key", concepts—some alternatives are discussed below.

Regardless, the overall appraoch for using the attribute would be the same:

The goal of the adapt-symbol attribute is to match some content (e.g. a word, or a video chapter title) to the appropriate symbol(s) for the user.
This is done by having the author specifcy the concept that the content relates to.
The concept may map, in any given symbol set, to one or more symbols (or zero symbols, if the set has no symbol for that concept).
The rendering of symbols would be down to the UA—we have made a demo, and are working on a visual layout test suite.

Therefore, setting aside the means of identifying concepts, the basic markup would be as follows.

<p>Would you care for some <span adapt-symbol="CONCEPT_ID">tea</span>?</p>

The following sections compare ways to identify concepts. For now, let's assume they will be integers.

Here are three examples of how the adapt-symbol attribute could be used.

Symbols for individual words.

<span adapt-symbol="13621">Cup</span> of <span adapt-symbol="17511">Tea</span>

Symbols used with an image (alt text represented as a symbol).
```
<img src="cup.png" adapt-symbol="13621" alt="Cup"/>
```
Symbols with conjugation. In this example a symbol is used for "her name" for the conjugated Hebrew word, שמה. The comma is used to join the conjugated values, "her" (14707) and "name" (15691). If the gender is not important, you can just use the value for name (15691).
```
<img src="her-name.png" alt="שמה" adapt-symbol="15691, 14707"/>
```

Concept IDs: keying schemes

There is ongoing discussion on how the concepts should be expressed in the HTML markup in issue 240. This section makes three suggestions.

BCI concept IDs as attribute values

This keying scheme maps one integer (the BCI concept ID) to a concept.

BCI maintains a dictionry of concepts, with corresponding Bliss symbols, and written-language definitions.

Advantages

Simple—provides a 1:1 mapping between concept and key that identifies the concept.
Does not expose implementation details of Bliss symbols to content authors.
Relatively minimal lag time between a concept addition/update and availability via authoring tools, or the W3C AAC Symbol Registry.

Disadvantages

Allows us to only specify concepts available in the Bliss language. (But we can still map to any symbol set, based on those concepts.)

Bliss characters' Unicode representations as attribute values

This maps one or more representations of Bliss characters (symbols) to a concept.

Instead of BCI concept IDs (integers), we could use:

Unicode code points for Bliss symbols (directly), or
Hex (or other) representations of Unicode code points that correspond to Bliss symbols.

Important

Basing the key into the concept dictionary on Bliss characters means that:

Regardless of the symbol set being used for output, the concept is expressed in terms of Bliss symbols.
The values used relate to the Unicode code points for these Bliss characters. Because there are approximately 6,500 Bliss concepts, but only around 1,400 Bliss characters being added to Unicode, this means that some concept identifiers will need to contain multiple Bliss character representations.

Note

Further details can be found in our comment on issue 240 (this comment only suggest the user of code points directly, though): #240 (comment).

Advantages

Based on an existing standard (Unicode).

Disadvantages

Exposes the implementation details of Bliss to someone writing this markup.
- As part of this, it's more complex than using atomic keys (such as BCI concept IDs): some concepts that would be represented by one BCI ID would need more than one Bliss character representation to identify them.

Unknown factors

Unclear as to what the process for adding additional Bliss characters to Unicode would be.
The time between new concepts being added to Bliss, and them being available via Unicode would likely be significant, due to the release cadence of Unicode.

Multiple concepts per attribute value

Though it is not expected to be used extensively, we have considered how multiple concepts may be referenced within one attribute value.

As separate Bliss characters (or their representations) are space-separated, it is proposed that if multiple concepts were to be included in a single adapt-symbol attribute value, they would be comma-separated. For example:

<span adapt-symbol="0x4242, 8857, 0x4444 0x2222, 3856"

In this example, there are 4 concepts identified, via...

hex representation of a single Bliss Unicode code point;
Bliss Concept ID;
hex representation of a concept that is identified by two Bliss Unicode code points; or
another Bliss Concept ID.

Note

We want to maintain consitency with how other parts of the platform handle this—we're very-much open to using other delimiters if needed.

Looking up concepts

The content author needs to be able to find known concepts, and their associated identifier. This is addressed in the next section.

The W3C AAC Symbol Registry

Note

We are planning to split this explainer off into a separate file, to avoid explainers that are too long.

The registry brings BCI's dictionary of concepts into W3C space. Each record in the registry contains:

A uniquely-identifying key.
A description of the concept in a written language (e.g. English).
The Bliss symbol(s) that embody this concept in the Bliss language.

The registry can be found at: https://www.w3.org/TR/aac-registry/

Note

The registry's key for identifying concepts is presently the concepts' BCI concept ID (an integer). However, as discussed above, we are in discussions with potential implementers on whether the corresponding Bliss Unicode code point(s) for a given concept could be used instead.

Privacy considerations

Note

This section is to be expanded.

Because the rendering of symbols is expected to be done by injecting them into the HTML, the site could determine that the user is using symbols, and which symbol set is in use.

Considered alternatives

Note

This section is to be added.

Stakeholder feedback/opposition

Our work is currently focused on working with BCI within W3C to solidify our recommendations for the syntax of the adapt-symbol attribute. We have run several breakouts on the work, and will more actively seek feedback once this is decided.

Through prior TPAC meetings, and issue 240, we have been discussing authoring consdierations with WHATWG.

We plan to seek input from implementers following the resolution of the concept keying issue.

We have engaged with experts in the COGA TF regarding the appropriateness of building upon the concepts identified by BCI—this work actually began within the COGA TF, with the input of renowned experts on AAC and symbols. Bliss is used because it is comprehensive, and has a mature process for the addition and updating of concepts.

References

Note

This section to be added.

Acknowledgments

Lisa Seeman, COGA TF
Russell Galvin, BCI
WAI-Adapt TF participants

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

symbols.md

symbols.md

WAI-Adapt: Symbols Module Explainer

Authors

Participate

Contents

Introduction

Demo

Goals

Out of scope

Important notes on symbols and rendering

User research

The `adapt-symbol` attribute

Mapping concepts to symbols (in general)

Concept IDs: keying schemes

BCI concept IDs as attribute values

Advantages

Disadvantages

Bliss characters' Unicode representations as attribute values

Advantages

Disadvantages

Unknown factors

Multiple concepts per attribute value

Looking up concepts

The W3C AAC Symbol Registry

Privacy considerations

Considered alternatives

Stakeholder feedback/opposition

References

Acknowledgments

Files

symbols.md

Latest commit

History

symbols.md

File metadata and controls

WAI-Adapt: Symbols Module Explainer

Authors

Participate

Contents

Introduction

Demo

Goals

Out of scope

Important notes on symbols and rendering

User research

The adapt-symbol attribute

Mapping concepts to symbols (in general)

Concept IDs: keying schemes

BCI concept IDs as attribute values

Advantages

Disadvantages

Bliss characters' Unicode representations as attribute values

Advantages

Disadvantages

Unknown factors

Multiple concepts per attribute value

Looking up concepts

The W3C AAC Symbol Registry

Privacy considerations

Considered alternatives

Stakeholder feedback/opposition

References

Acknowledgments

The `adapt-symbol` attribute