Position
Paper for UPA 2001 Workshop #6:
Exploring
Measurement and Evaluation Methods for Accessibility
Stephanie
Rosenbaum,
Tec-Ed,
Inc.
As
founding principal of a usability research firm, I have spent a large portion
of my professional activities over the past ten years developing and adapting
methodologies for usability evaluation.
I participate in many of the initial planning meetings with client
organizations needing usability research, and I work with these clients to
identify the research methods appropriate to achieve their goals. Often that process involves modifying
existing methods and evolving new ones (see also my UPA 2000 paper, "Not
Just a Hammer: When and How to Apply Multiple Methods in Usability Programs").
Although
most of Tec-Ed's usability research projects are not focused primarily on
accessibility, we are increasingly able to include an accessibility component,
most often by including one or two disabled participants in usability tests. Occasionally an entire study addresses a
particular accessibility issue, for example, a study of a new laptop screen
where we recruited only participants who wore eyeglasses. Tec-Ed’s staff members are familiar with Section
508 and are comfortable working with clients who want to test for
accessibility.
Need for Ethnographic and
Longitudinal Data in Accessibility Research
The
challenge for the usability community, as described by the workshop organizers,
is to develop techniques for designing and testing products for
accessibility--in particular, low-cost test processes to measure users'
performance with the user interface (UI).
One important approach for developing these techniques and processes is
to consult ethnographic and longitudinal data.
Just
as in usability research with non-disabled users, in accessibility research
we often concentrate on measuring user efficiency, effectiveness, and
satisfaction under controlled circumstances such as usability testing. A shortcoming of this approach is that, when
we design the tasks and their context, we can't know what circumstances of
actual use we are not observing. For
this reason, a successful usability research program should include a
combination of techniques: usability testing, inspection methods as appropriate,
and ethnographic studies to inform the other techniques about the real-world
context of use.
For
us to be successful in creating techniques that both measure Section 508
compliance and improve product design, we need to include ethnographic studies
to inform the development of standard tasks and performance methods. Ethnographic data, especially longitudinal
data, will help us produce better evaluation methods both for user testing and
for user-free evaluations.
Example of Ethnographic Accessibility
Observations
Although
Tec-Ed has not yet been commissioned by clients to perform longitudinal studies
of accessibility, we work regularly with a technical consultant (a network
software specialist) who has retinopathy and consequently extremely low
vision. For the past two years, we've
observed this person's computer use for several hours about twice a month, and
we learned about several accessibility issues we probably would not have seen
during usability testing.
Our
subject sets all software to display text in a large type size and a readable
type face, which he then reads one word at a time with a hand-held
magnifier. He also occasionally uses
Jaws screen-reading software, but the issues described here involve attempts to
read text on-screen (an accessibility concern of increasing interest to the
aging U.S. population).
His
major concern during software use is not the primary functionality of an
application, which usually displays in the large type size as instructed
(although it does not always allow him to change displays to a more distinct
and readable type face). Rather,
whenever any problems occur, the error messages display in a default type size
and type face that the user cannot adjust, even with the accessibility
settings. This behavior occurs both for
error messages from most applications and those from the Windows operating
system.
Thus
at unpredictable times, an otherwise competent computer user must ask for help
from a physically present person with better vision. (Problems also occur when Jaws attempts to read error messages; I
can describe these during the workshop if they're of interest.) Can the Help system provide assistance? Rarely, because most applications also have
their Help messages programmed to display a fixed text size.
Usability
problems occur with software applications that would otherwise be excellent
accessibility tools, if a low-vision person could gain access to the tool. Our subject attempted to use IBM's dictation
product, because dictation would be much faster than finding a letter at a time
on the keyboard. To "train"
the software to recognize the user's voice, one must read sample text out
loud--and the sample text is displayed in a type face our subject can't
read. (The product apparently offers the
ability to change type sizes and type faces, but only after the start-up
activities are completed.)
Similarly,
a year or two ago I saw a demonstration of a new Adobe product that would
enable users to change the type size of a pdf file, and the text would
"rewrap" to create a readable page in the new type size. Unfortunately, the menus and icons to
specify the larger type were themselves too small for a low-vision person to
read--and I was told these could not be changed.
Some
quite basic software bugs raise questions about QA of accessibility options.
Windows 98 had an accessibility feature that magnified the whole screen by a
specified percentage. About two years
ago, our subject tried setting this feature on his new laptop to 200%, at which
point only a quarter of the screen was visible (the top-left quarter). When he wanted to turn the feature off, or
try a different magnification, all the controls he needed were on undisplayed
parts of the screen. But the screen
displayed no scroll bars (perhaps because this was a "setting," not a
central part of the operating system).
After many unsuccessful calls to the Microsoft Help desk, our subject
returned the laptop to his vendor, who also couldn't solve the problem. Their solution was to replace the hard disk
of the laptop with a new one!
Windows
also offers a "hand magnifier," which you can move around to magnify
selected portions of a screen. When our
subject clicks on this tool, it works for about 15 minutes and then fails (with
another unreadable error message).
Probably most users of the magnifier don't try to use it for hours at a
time. Again, the Help desk hasn't been able to help. Our subject pleads strongly for major software companies having
accessibility specialists in tech support, so that Help desk questions about
accessibility options can easily be escalated to someone knowledgeable about
these features.
Issues Resulting from
Ethnographic and Longitudinal Observations
The
example above involves just one subject and only one disability condition
(although we expect problems with low-vision users to become increasingly
prevalent), yet it provided many insights.
Our observations led to ideas, implications, and issues that we can
usefully address as we think about developing tasks and performance measures
for accessibility:
·
Even
inspection methods can improve accessibility far more than they currently do,
simply by extending accessibility standards to the entire product—including the
installation or start-up sequences (or wizards), the error messages, and the
help system. Perhaps I’m wrong in
saying “simply”; the technical challenge is straightforward, but the
organizational challenges in requiring many development teams to cooperate may
be greater. Still, user-free methods
can be developed to ensure that every part of a user interface can be displayed
in large, distinct type.
·
The
process of developing “standard” tasks for accessibility usability testing
should be an iterative one, rather than performed once and never
revisited. An ongoing ethnographic,
longitudinal research program should be carried out with representatives of the
major groups of people with disabilities.
The information gleaned from this longitudinal research should be used
to update any standardized tasks, at least yearly (probably more often).
·
Although
an ideal user interface would not require tech support, we won’t achieve such
perfection in product design in our lifetimes (considering the diversity of
user audiences and the trend toward more complex products). Given the requirement to provide user
support for products, we can improve accessibility dramatically by providing
specialized user support for accessibility features.
Note
that, as Gregg Vanderheiden from TRACE R&D Center said during his CHI 2001
talk, most of these approaches to improving accessibility also result in more
usable products for their wider audiences, providing an additional
justification for investing in the required resources. For example, our low-vision subject
commented that he could use Jaws more effectively if its help index included
more “ordinary-language” entries rather than only the terminology used in the
product; and usability studies of indexes show that adding “see also” entries
supplied by users is an effective—and low-cost—way to improve product
usability.