Open Access
Article | September 2020
Everyday visual demands of people with low vision: A mixed methods real-life recording study
Author Affiliations
  • Sandra D. Starke
    Aston Business School, Aston University, Birmingham, UK
    Previously School of Engineering (Honorary Research Fellow), University of Birmingham, Birmingham, UK
    Previously GiveVision, iCentrum, Birmingham, UK
    s.starke@aston.ac.uk
  • Eugenie Golubova
    Aston Business School, Aston University, Birmingham, UK
    Previously GiveVision, iCentrum, Birmingham, UK
    e.golubova@aston.ac.uk
  • Michael D. Crossland
    NIHR Biomedical Research Centre for Ophthalmology at Moorfields Eye Hospital and UCL Institute of Ophthalmology, London, UK
    mcrossland@nhs.net
  • James S. Wolffsohn
    Optometry and Vision Science, Aston University, Birmingham, UK
    j.s.w.wolffsohn@aston.ac.uk
Journal of Vision September 2020, Vol. 20, 3. https://doi.org/10.1167/jov.20.9.3
Abstract

Research has demonstrated that low vision aids (LVAs) can have a positive impact on the functional sight of those living with sight loss. Step changes in technology are now enabling new wearable LVAs with greater potential than those previously available. For these novel devices to gain acceptance, and therefore adoption, by those with sight loss, visual task demands have to be understood more clearly so that device design can be better aligned with user requirements. The aim of this study was to quantify these requirements. Thirty-two participants aged 18 to 87 wore a spectacle-mounted video camera for 1 week to capture and narrate all everyday situations in which they would use a “perfect” sight aid. Captured scenes were analyzed through categorization and computational image analysis. Results showed large variation in activities and lifestyles. Participants reported no available sight aid or coping strategy for 57% of the recorded activities. Reading made up 49% of all recorded tasks; the other half comprised non-textual information. Overall, 75% of captured activities were performed ad hoc (duration of 0–5 minutes), 78% occurred indoors, 58% occurred at home, 48% were lit by natural light, 68% included the object of interest within reach, and 69% required a single focus plane only. Around half of captured objects of interest had a size of 2 degrees visual angle (2.08 logarithm of the minimum angle of resolution [logMAR]) or smaller. This study highlights the need for a sight aid that can make both textual and non-textual scenes accessible while offering flexibility to accommodate individual lifestyles.

Introduction
Globally, approximately 216 million people live with moderate to severe visual impairment (Bourne, Flaxman, Braithwaite, Cicinelli, Das, Jonas, et al., 2017), with these numbers predicted to rise over coming decades due to an aging population (Access Economics Pty Ltd, 2009; Pezzullo, Streatfeild, Simkiss, & Shickle, 2018). Although for many people sight loss cannot be reversed or cured, low vision aids (LVAs) can have a positive impact on residual functional sight (Crossland, Starke, Imielski, Wolffsohn, & Webster, 2019; Binns, Bunce, Dickinson, Harper, Tudor-Edwards, Woodhouse, et al., 2012; Hooper, Jutai, Strong, & Russell-Minda, 2008; Peterson, Wolffsohn, Rubinstein, & Lowe, 2003; Virgili, Acosta, Bentley, Giacomelli, Allcock, & Evans, 2018), with a prominent research focus on restoring the ability to read (Virgili et al., 2018). Optical low vision aids tend to be task specific, with constraints on magnification, field of view, working distance, and ease of use: stronger optical magnifiers typically have a smaller field of view, are harder to align, and are more affected by hand movement than weaker devices. However, with the recent advent of high-resolution digital screens and miniaturization of sensing and computing devices, new wearable sight enhancement devices have become available. These devices are suitable for a broader range of activities beyond reading and overcome several challenges associated with high levels of optical magnification (Deemer et al., 2018; Ehrlich et al., 2017; Peterson, Wolffsohn, Rubinstein, & Lowe, 2003). 
Capitalizing on these new technological developments requires that the actual visual task demands of people living with low vision be understood better, so that devices can be designed for their specific needs through patient-centric design. This would help avoid the poor user acceptance exhibited by many current devices (Gori, Cappagli, Tonelli, Baud-Bovy, & Finocchietti, 2016). The lack of comprehensive evidence for the LVA needs and requirements of different patient groups has been highlighted through several reviews of the current low vision rehabilitation landscape (Binns et al., 2009; Binns et al., 2012; Dickinson et al., 2011; Markowitz, 2006; Markowitz, 2016). Among the few studies that do exist, the Royal National Institute for the Blind (RNIB) conducted one of the most comprehensive surveys on the self-reported impact of sight loss through the 2015 “My Voice” study with more than 1200 participants (Slade & Edwards, 2015). Similarly, the Massof inventory provides an extensive list of tasks relevant for people with low vision in the early 2000s (Massof, Hsu, Baker, Barnett, Park, & Deremeik, 2005a; Massof, Hsu, Baker, Barnett, Park, & Deremeik, 2005b; Massof et al., 2007). Other studies reported tasks that patients with macular degeneration struggled with most (Taylor, Hobby, Binns, & Crabb, 2016), potential use cases for LVAs (Palmer, 2005), or psychological factors for device uptake among the elderly (Horowitz, Brennan, Reinhardt, & MacMillan, 2006). Further, several studies offering indirect clues on device requirements have examined quality of life in patients with low vision (Berdeaux, Nordmann, Colin, & Arnould, 2005; Frick et al., 2012; Lamoureux et al., 2008; Murphy et al., 2007; Tadic, Cooper, Cumberland, Lewando-Hundt, & Rahi, 2016), requirements for LVAs (Sandnes, 2016), or self-support strategies (Smallfield, Berger, Hillman, Saltzgaber, Giger, & Kaldenberg, 2017). To date, no study has systematically captured daily needs through live recordings rather than introspection. 
Recording of everyday activities is a common technique applied across disciplines such as Human Computer Interaction (HCI), Human Factors/Ergonomics, and Psychology, allowing researchers to assess behavior, interventions, and concept designs. Diary studies (Brown, Costley, Friend, & Varey, 2010; Carter & Mankoff, 2005; Karisalmi & Nieminen, 2017) utilizing electronic recording devices are popular for capturing data with minimum disruption. In photo elicitation studies, participants capture imagery of activities and/or environments, commenting on it in a subsequent interview (Clark-Ibáñez, 2004; Harper, 2002; Richard & Lahman, 2015; Van House, 2006). “Ecological momentary assessment” (Shiffman, Stone, & Hufford, 2008) and “experience sampling” (Barrett & Barrett, 2001; Christensen, Barrett, Bliss-Moreau, Lebo, & Kaschub, 2003; Intille, Kukla, & Ma, 2002) similarly capture behaviors and experiences through image recordings. Since the emergence of small wearable cameras, automated recording of daily routines and experiences has been facilitated through devices such as SenseCam, Autographer, Narrative, or Google Glass (Mavoa, Oliver, Kerr, Doherty, & Witten, 2013; Shipp, Skatova, Blum, & Brown, 2014; Wilson, Jones, Schofield, & Martin, 2016). These devices have been used in studies of social behavior and behavior change (Doherty et al., 2013; Fleck, 2012; Fleck & Fitzpatrick, 2009; Gemming, Doherty, Utter, Shields, & Ni Mhurchu, 2015; Mavoa et al., 2013; Wilson, Jones, Schofield, & Martin, 2016), with a well-established ethical framework for such recordings (Doherty et al., 2013; Kelly et al., 2013; Kienzler, 1997; Kwok, Skatova, Shipp, & Crabtree, 2015; Mok, Cornish, & Tarr, 2015; Shipp, Skatova, Blum, & Brown, 2014; Skatova et al., 2015). 
The objective of this study was to develop a comprehensive list of design input requirements for future LVAs that can address the real-life challenges of people with visual impairment. The aim was to understand daily visual demands with regard to image content, desired activities in need of support, task characteristics, and personal priorities. The study used a mixed methods design combining self-recording of tasks through a spectacle-mounted video camera, including live narration, with analysis that was both qualitative and quantitative, the latter including computational image analysis. 
Materials and methods
Participants
A total of 32 participants aged 18 to 87 were recruited from the Aston Low Vision Clinic and GiveVision's volunteer network. Adult (age 18 years and over) participants with a visual impairment were included in the study. All participants met the definition of visual impairment by Leat et al. (1999): best monocular or binocular visual acuity of worse than 6/7.5, horizontal visual field of < 146 degrees to Goldmann III4e targets, or contrast sensitivity worse than 1.5 log units (Leat, Legge, & Bullimore, 1999). 
Participation was voluntary and participants were free to withdraw at any point. This research was conducted in compliance with the Declaration of Helsinki and Aston University's Review Board requirements, and was granted a favorable ethical opinion and governance approval by the Aston University Ethics Committee (no. 1280). Written informed consent was obtained from participants after explanation of the nature and possible consequences of the study. At the conception stage, the study was discussed and piloted with two visually impaired volunteers, and adjustments based on feedback and observations were made prior to commencing the study. 
Task
The study task was to record, for approximately 1 week, all scenarios during which the participant would use “a perfect sight aid” (see Figure 1 for an example of such recordings). Recordings were facilitated through a wearable, glasses-mounted miniature video camera. Participants were given the instructions to capture the object of interest, narrate the intended activity, and comment on the related difficulty. They were asked to record all scenarios that they struggled with, even if they had a coping strategy, such as the use of a magnifier. 
Figure 1.
 
Example of scene recordings taken with the spectacle-mounted camera.
Participants were equipped with recording spectacles (SunnyCam Sport; image resolution 1920 × 1080 pixels, 30 fps, 120 degrees horizontal field of view, stereo audio recording, operation through a single button, and vibration feedback). For improved accessibility, a Bumpon sticker (3M, Two Harbors, MN, USA) was affixed with superglue (Loctite, Westlake, OH, USA) to both the recording button and the USB charger connectors. The device could not be worn in conjunction with spectacles due to its close fit; however, none of the participants frequently wore spectacles. Participants who used yellow tinted lenses for management of light sensitivity had the device's default lenses replaced by custom yellow inserts available via the device manufacturer. 
After fitting the device, a detailed practical induction on how to operate and charge it was given, verified by the participant executing set tasks, and reviewing these with the instructor. To extend battery life, participants were equipped with and trained to use a portable USB charger (Anker 2-port USB wall charger, China). Explanatory handouts for the public were provided in line with recommendations made for recording studies (Kelly et al., 2013) to move the explanatory burden from the participant to the study team. Participants were further instructed to secure verbal consent from any persons recorded in private spaces. 
On return of the recording spectacles, a debrief was conducted during which participants could review their recordings and answer a structured questionnaire. Results of this questionnaire will be reported separately. 
Visual assessment
During the briefing session, the participant's best corrected distance binocular visual acuity (early treatment diabetic retinopathy study [ETDRS] logarithm of the minimum angle of resolution [logMAR] chart) and contrast sensitivity (Pelli-Robson chart) were measured. Sight loss condition, onset, and demographic details were recorded. 
Data analysis
All recordings were downloaded from the recording glasses in mp4 format and stored with an anonymous identifier. The research team watched the recordings in real time, extracted representative image frames for all captured scenes/subtasks, and classified these as described below. 
Activity classification
Activities were written down as tasks as narrated by the participant. Where the narrative only described a top-level task (e.g. “I am making cake”), the study team broke this task down into subtasks described in simple language. For task difficulty, the study team converted the narrative to a rating on the six-point scale of very easy, easy, average, difficult, very difficult, and unable. Repeatability analysis showed that this classification had poor repeatability between observers, and it was hence simplified to “able to complete alone” and “unable to complete alone.” 
Recorded subtasks were then mapped to the 48-item Veterans Affairs Low Vision Visual Functioning Questionnaire (VA LV VFQ-48). This tool was chosen from seven validated questionnaires as it demonstrated the best fit (80%) for the captured tasks. Twelve of the VA LV VFQ-48 tasks were adapted to reflect the captured activities. An additional, bespoke task list (identified as the category ‘Bespoke Tasks’) was created specifically to capture those activities that could not be mapped to the VA LV VFQ-48. 
Scene classification
Scene classification focused on 14 characteristics related to environment, object location, and task requirements (Table 1), coded by the study team. Repeatability (see Table 1) was calculated for each of these classifiers across two authors, who independently coded the same 27 snapshots. 
Table 1.
 
Scene classification.
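The paper does not state which agreement statistic was used for this repeatability analysis; as one plausible choice, a minimal Matlab sketch of Cohen's kappa for two raters, using hypothetical category codes:

r1   = [1 1 2 3 2 1];                     % rater 1 codes for the snapshots (hypothetical)
r2   = [1 2 2 3 2 1];                     % rater 2 codes (hypothetical)
cats = union(r1, r2);                     % set of observed categories
C    = zeros(numel(cats));                % confusion matrix between raters
for i = 1:numel(r1)
    C(cats == r1(i), cats == r2(i)) = C(cats == r1(i), cats == r2(i)) + 1;
end
po    = trace(C) / sum(C(:));                       % observed agreement
pe    = sum(sum(C, 2) .* sum(C, 1)') / sum(C(:))^2; % agreement expected by chance
kappa = (po - pe) / (1 - pe);                       % Cohen's kappa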
Computational image analysis
Exported video frames were undistorted, white-balanced, and de-noised in Lightroom 5 (Adobe Systems, Mountain View, CA, USA) and saved as 16-bit tiff images. Computerized image analysis was then conducted using Matlab 2018b (The MathWorks) with the Image Processing Toolbox through the steps described below (see Figure 2 for an illustration of the processing steps). 
Figure 2.
 
Digital image processing for the analysis of image characteristics. Top row: Left - original recording, middle - undistorted, white balanced, and de-noised image, and right - linearized greyscale image. Bottom row: Left - local entropy (texture), middle - high frequencies, and right - edges.
Preprocessing
Recordings were preprocessed for further analysis by linearizing the sRGB color images using function “rgb2lin” and then converting the linear red, green, and blue (RGB) image to greyscale (luminance) using function “rgb2gray.” Subsequent computational image analysis was performed on this single-channel greyscale image. 
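As a minimal Matlab sketch of this preprocessing step (not the authors' original code; the file name is hypothetical and the Image Processing Toolbox is assumed):

I   = imread('frame_0001.tif');   % 16-bit sRGB tiff exported from Lightroom
lin = rgb2lin(I);                 % linearize the sRGB tonal curve
g   = rgb2gray(lin);              % single-channel linear greyscale (luminance)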
Contrast
Image contrast, as the percentage of total tonal value range captured, was calculated from the average of the 0.5% brightest (Imax) and 0.5% darkest (Imin) pixels for the 16-bit tiff images as  
\begin{equation}
\mathrm{Contrast} = \left( \frac{I_{\mathrm{max}} - I_{\mathrm{min}}}{2^{16}} \right) \times 100. \tag{1}
\end{equation}
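A minimal Matlab sketch of Equation 1 (not the authors' original code), assuming g is the 16-bit greyscale image from the preprocessing step:

v    = sort(double(g(:)));                 % all pixel values, ascending
k    = max(1, round(0.005 * numel(v)));    % number of pixels in 0.5% of the image
Imin = mean(v(1:k));                       % mean of the darkest 0.5% of pixels
Imax = mean(v(end-k+1:end));               % mean of the brightest 0.5% of pixels
contrast = (Imax - Imin) / 2^16 * 100;     % percentage of the total tonal range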
 
Global and local entropy
Global and local entropy were calculated to characterize the texture of the image, with entropy also regarded as a measure of its randomness. Global entropy was computed using Matlab's “entropy” function, normalized to a 0 to 1 output space (with boundaries of 0, no randomness, and 1, maximum randomness). Local entropy was computed using Matlab's “entropyfilt” function (9-by-9 neighborhood) to characterize what area of the scene (Atextured) may attract attention due to textural content. This area was calculated from the derived entropy matrix by counting the number of pixels with a value larger than three (pixtexture; cut-off set experimentally as the value adequately separating texture from noise) as a proportion of all pixels (pixall) as 
\begin{equation}
A_{\mathrm{textured}} = 100 \times \frac{\mathrm{pix}_{\mathrm{texture}}}{\mathrm{pix}_{\mathrm{all}}}. \tag{2}
\end{equation}
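A minimal Matlab sketch of both measures (not the authors' original code); dividing by 8 bits is our assumption for the 0-to-1 normalization, since Matlab's “entropy” histograms data at 8-bit resolution:

Eglobal   = entropy(g) / 8;                          % global entropy, 0 (no randomness) to 1 (maximum)
Elocal    = entropyfilt(g, true(9));                 % local entropy, 9-by-9 neighborhood
Atextured = 100 * nnz(Elocal > 3) / numel(Elocal);   % Equation 2, cut-off of 3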
 
High frequency content and edges
High spatial frequency content was approximated using Matlab's “rangefilt” function, which computes the tonal range (maximum minus minimum luminance) for a 3-by-3 neighborhood around any given pixel. The proportion of the image containing high spatial frequency content was calculated based on the derived matrix as the area (AhighSpatialFrequency) containing pixel values larger than 10% of the total possible range (pixhighFrequency; threshold = 0.1 × 2^16 ≈ 6554) as a proportion of all pixels (pixall) as 
\begin{equation}
A_{\mathrm{highSpatialFrequency}} = 100 \times \frac{\mathrm{pix}_{\mathrm{highFrequency}}}{\mathrm{pix}_{\mathrm{all}}}. \tag{3}
\end{equation}
Edge detection was performed through Matlab's “edge” function, and the area containing edges (Aedges) was calculated as the proportion of edge pixels (pixedge) among all pixels (pixall) as 
\begin{equation}
A_{\mathrm{edges}} = 100 \times \frac{\mathrm{pix}_{\mathrm{edge}}}{\mathrm{pix}_{\mathrm{all}}}. \tag{4}
\end{equation}
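A minimal Matlab sketch of Equations 3 and 4 (not the authors' original code); the paper does not name the edge detection method, so the Matlab default (Sobel) is assumed:

R       = rangefilt(g, true(3));             % local maximum minus minimum, 3-by-3 neighborhood
AhighSF = 100 * nnz(R > 6554) / numel(R);    % Equation 3, threshold = 0.1 * 2^16
E       = edge(g);                           % logical edge map, default Sobel method
Aedges  = 100 * nnz(E) / numel(E);           % Equation 4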
 
Object size estimation
Object size (in degrees of visual angle) within the field of view was calculated from the horizontal and vertical distances that the overall object of interest spanned within the recorded snapshot. For written content, the size of a single letter was estimated. 
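The exact conversion formula is not given in the paper; below is a sketch assuming a linear mapping of pixels to degrees across the camera's stated 120-degree horizontal field of view (1920 pixels), with a logMAR conversion that reproduces the equivalents reported in the Results (e.g. 2 degrees corresponds to log10(120) = 2.08 logMAR):

px_span = 64;                      % hypothetical measured object span in pixels
deg     = px_span / 1920 * 120;    % angular size in degrees (here, 4 degrees)
logmar  = log10(deg * 60);         % log10 of the size in arcmin (here, 2.38 logMAR)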
Results
Participation
Of the 32 participants, 15 were women and 17 were men; 26 participants still had vision in both eyes and 6 had vision in one eye only; 16 participants had adult onset of visual impairment, 8 had childhood onset, and 8 had been visually impaired from birth. Mean (SD) age of participants was 47.5 (21.9) years (range: 18 to 87 years), binocular distance acuity was 1.2 (0.3) logMAR (range: 0.36 to hand movements [HMs]), binocular contrast sensitivity was 0.8 (0.4) log units (range: 0.1 to 1.5 log units), and time since diagnosis was 22 (16) years (range: 2 to 64 years). Initially, a further four participants had joined the study; however, one dropped out during the study and three had missing recording data due to errors made by the dispensing team. These participants are not included in the N = 32 cohort presented in this paper. 
Across all recordings and participants, a total of 612 subtasks were extracted. Individual participants captured a mean (SD) of 18.9 (18.7) of these subtasks (median: 12, range: 3 to 68 per participant). Recordings varied in length from around 10 seconds to over 25 minutes; long recordings typically contained multiple tasks and subtasks as well as long un-narrated periods. 
Activity classification
Of all recorded subtasks, the most frequently used words were “read” (299 of 605 words, 49%), “find” (46 words, 8%), “see” (36 words, 6%), “identify” (31 words, 5%), “bus” (29 words, 5%), “operate” (26 words, 4%), “labels” (21 words, 3%), and “TV” (20 words, 3%). 
Across all 612 captured scenes, the most commonly categorized subtasks fell into the “bespoke” category (27%, Figure 3). The remaining subtasks were distributed between the four top-level categories of the VA LV VFQ-48 (see Figure 3), ranging from 14% (issues of mobility) to 23% (reading/near distance tasks). Within each of these categories, a small number of tasks were substantially more frequent than the majority of others (see Figure 3). 
Figure 3.
 
Breakdown of subtasks as mapped to the Veterans Affairs Low Vision Visual Functioning Questionnaire and bespoke tasks. Percentages are expressed relative to all tasks in a given category. For absolute values, please refer to Table 2.
The three most commonly reported tasks across participants were finding something on a crowded shelf, reading package labels, and using appliance dials, buttons, and remotes; 59% of participants reported each of these. Table 2 shows the ranking across all snapshots and across participants. 
Table 2.
 
Task rankings.
Scene classification
Participants reported use of a sight aid or other coping strategy (such as help from others or touch) for 43% of recorded scenes, and no coping strategy for the remaining 57%. Participants were able to complete 71% of recorded scenes alone; the remaining 29% could not be completed at all or only with help from others (examples were retrieving money from a cash point, crossing the road, choosing the correct bus, or selecting items while shopping). 
Classification of recorded scenes (Figure 4) showed that 75% of activities were performed ad hoc (duration of 0–5 minutes), 78% occurred indoors, and 58% occurred at home. Of all scenes, 48% were lit by natural light, 68% included the object of interest within reach, and 69% required a single focus plane only. Performing the activity required one or two hands in 60% of scenes and walking in 17%. The majority of activities occurred during the morning (6:00–12:00, 42%) and afternoon (12:00–18:00, 37%). 
Figure 4.
 
Scene characteristics of the captured recordings with regard to task context, lighting, visual and focus requirements, as well as practical requirements.
Computational image analysis
Across the 612 snapshots, computational image analysis (Figure 5) showed that the mean (SD) contrast range was 93.1% (12.3%) with a range from 28.5% to 100%, entropy was 0.8 (0.1) out of 1.0 with a range from 0 to 1, local entropy reflecting notable texture covered 44.0% (16.6%) of the image with a range from 1% to 92.4%, high frequency content covered 13.2% (8.5%) of the image with a range from 0.2% to 58.9%, and edges covered 2.5% (1.0%) of the image with a range from 0.2% to 6.1%. 
Figure 5.
 
Image characteristics of all captured scenes as quantified through computational image analysis. The probability (0–1 scale) refers to the proportion of images with a given characteristic.
Object size estimation
Objects of interest had a mean (SD) size of 5.6 (7.7) degrees visual angle horizontally, with a range from 0.1 to 45.4 degrees (Figure 6). The logMAR equivalent is 2.53 (2.66) logMAR, ranging from 0.78 to 3.44 logMAR. Outcomes for vertical angles were similar at 4.7 (5.8) degrees visual angle, with a range from 0.2 to 37.8 degrees; the logMAR equivalent is 2.45 (2.54) logMAR, ranging from 1.08 to 3.36 logMAR. These measures do not consider texture within the object of interest, only the dimensions of its overall size. The distribution was highly skewed toward small angles: across all snapshots, 50.4% of objects of interest had a size of less than 2.0 degrees visual angle (2.08 logMAR) horizontally and 49.8% less than 2.0 degrees vertically. 
Figure 6.
 
Object of interest sizes (in degrees of the field of view, FOV) across all participants and captured objects. Note that the size relates to the object as a whole, not the textures contained within (e.g. a jar of jam). For textual information, the size of a single letter was approximated.
Discussion
General summary
This study quantified the breadth of LVA requirements from both behavioral and technical perspectives. The activities recorded here compared well with previous work (Palmer, 2005; Royal National Institute for the Blind, 2015; Taylor, Hobby, Binns, & Crabb, 2016; see also Table 3), albeit with some variation in the detail of descriptions by authors and in the priority ranking of reported activities by participants. Most importantly, we found that the activities reported in this study were not fully captured by any of the existing inventories. This highlights the need to update inventories to include daily activities that are relevant now, because behaviors, hobbies, and technology change constantly (e.g. the use of smartphones, tablets, laptops, and/or PCs is now common across age groups). One of the largest activity inventories in the field identified 337 tasks and ordered them by difficulty based on 600 participants (Massof et al., 2005b), with good overlap with the tasks reported in the present study. Echoing our finding that many specific activities were each reported by only a few individuals, a smaller study (Palmer, 2005) with 25 participants (most aged 75+ years) likewise found many specific activities reported by relatively few people. 
Table 3.
 
 
Overall, our results illustrate that LVAs need to accommodate individual lifestyles and hobbies in order to be relevant for people living with low vision. A paradigm shift away from satisfying “average” users and/or focusing on single tasks, and toward flexibly designing for the individual needs of diverse groups, is both the opportunity and the challenge for future device developers. We have identified many diverse tasks that people with visual impairment need help with and accept that it may not be feasible for one device to help with all of them. Moving ahead, the factors of task frequency, importance, and difficulty should be carefully explored further: a frequent task may not be important or difficult; an important task may not be frequent; and a difficult task may not be important. Our hope is that this study is seen as a starting point for much needed further research into the needs of potential subgroups. 
What everyday activities require support through LVAs?
In this study, reading text in a vast variety of contexts comprised almost half of all recorded scenes. The other half, however, contained non-textual information, highlighting the need for LVAs to make scenes other than text accessible. Comparing our findings to the independent study outcomes above, it is apparent that people with low vision require an LVA for a very varied task inventory: indoor and outdoor tasks, recognizing many different objects, reading many different types of information presented in numerous ways at different distances, recreational and professional tasks, and tasks in both familiar and unfamiliar/public environments. In developing LVAs, designers have to embrace that different people lead different lives in which different activities can make a crucial difference. Rather than developing an LVA that supports only a few very common tasks, it would hence be beneficial to develop for as broad a task range as possible to maximize uptake. It remains uncertain which factors are most important when predicting device uptake; there is, for example, disagreement between studies on the predictive power of use frequency for perceived benefit (Lorenzini & Wittich, 2019). Ensuring that LVA design maps onto people's individual lifestyles may be a key factor leading to uptake. 
This study showed that almost a third (29%) of desired activities could not be completed by participants alone. Similarly, participants did not report having a coping strategy for more than half (57%) of recorded subtasks. This illustrates the loss of independence that sight loss currently causes. Likewise, the 2015 RNIB “My Voice” study (Royal National Institute for the Blind, 2015) found that the majority of its over 1200 participants required help in the home, whilst two-thirds of working-age people and one-third of retirees had recently experienced collisions with obstacles on the pavement (one in three incidents leading to injury). It is important to note that the inability to complete individual subtasks, such as operating dials on a cooker, can prevent people with low vision from completing whole activities, such as meal preparation. Other examples include the inability to access information about buses preventing independent use of public transport, or the inability to cross a road preventing leaving the house. Developing LVAs that support these subtasks may open up vastly improved independence. Future work should identify such critical subtasks that present barriers to general activities of daily living and focus on making them accessible. 
What do the characteristics of recorded scenes tell us about LVA design requirements?
Our computational image analysis showed that modern LVAs have to accommodate a broad range of scenes with sometimes highly variable characteristics. Analysis of image contrast showed that the majority of scenes use the full contrast range, requiring technology with a large dynamic range to capture all tonal values in the mostly high-contrast scenes of everyday life. A move to high dynamic range (HDR) processing might prove beneficial here. Brightness varied greatly between outdoor and indoor activities; whereas we classified the majority of scenes as having “medium” brightness (e.g. a normally lit home environment or a cloudy day outside), any modern LVA will have to accommodate the extremes in order to offer support throughout the day. Based on the recordings, we estimate that luminance values ranging from 20 cd/m2 (scene at sunrise or sunset) to 5000 cd/m2 (scene in full sunlight) should be reproduced without noticeable image artifacts, such as clipping (bright regions) or noise (dark regions), with optimal performance at around 700 cd/m2 (scene on an overcast day). 
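As a quick check of what these bounds imply (our own arithmetic in Matlab, using the luminance values stated above):

lum_min  = 20;                    % cd/m2, scene at sunrise or sunset
lum_max  = 5000;                  % cd/m2, scene in full sunlight
ratio    = lum_max / lum_min;     % 250:1 luminance ratio
stops    = log2(ratio);           % roughly 8 photographic stops
logunits = log10(ratio);          % roughly 2.4 log units of dynamic range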
The vast majority of scenes (75%) fell into the “ad hoc” category, with an expected duration of up to 5 minutes. This requires LVAs with a quick start-up time and almost immediate access and responsiveness, as otherwise the effort and wait of engaging the LVA may prevent people from using it. It is important to note that some activities, such as cooking, may require a succession of such ad hoc tasks (e.g. reading a recipe, finding ingredients, assembling a kitchen tool, and setting oven dials). The user may wish to wear the sight aid continuously for such tasks or use it only where needed, and this should be facilitated through the design. Equally, tasks such as watching TV or reading require an LVA that is comfortable and safe to wear for hours. 
Based on our scene classification results, modern LVAs should accommodate quickly changing focus planes and object distances, although three-quarters of tasks required a single focus plane for an object within reach. LVAs would benefit from allowing peripheral vision, especially in the lower periphery, and should allow for visual scanning, use of the hands while wearing the LVA, and ideally walking. Given that just over half (58%) of activities were performed at home, design acceptability in public spaces, work environments, and schools is important. Finally, we found that the size of the attended object of interest was typically less than around 2 degrees visual angle horizontally and vertically. This implies a need for substantial magnification, with related requirements such as image quality and stability, which will further scale with the user's acuity. 
Study limitations
In this study, a number of approximations had to be made given technical constraints. Contrast calculations were relative to the system's dynamic range and exposure adjustments, so they do not represent absolute contrast estimates. Calculations of texture and of areas with high spatial frequencies depended on set thresholds and should be regarded as approximations. Estimates of object size were based on the camera's field of view and the measured object size in pixels, and are subject to measurement error, especially for small objects. 
Comprehensive briefing sessions were conducted to ensure participant compliance (practical exercises, review of recorded examples, and verification of the ability to control the recording spectacles independently). In addition, large-print instructions for use were provided and a technical helpline was offered. Although this led to all participants capturing recordings, the duration and style of recordings still varied noticeably between participants. Future studies should plan for such variation and review ways to reduce it further. To an extent, we believe that the difference in the number of recordings between participants reflects individual lifestyles; however, some participants may have recorded less because they felt uncomfortable recording in public, for example. This should be taken into consideration when interpreting our results. 
As part of this study, we did not specifically enquire about compliance. One possible explanation for 58% of recordings taking place at home could be forgetting to wear the device when going out, or not wanting to be seen wearing it in public. Another is that 58% of the activities for which participants would use a “perfect sight aid” genuinely do take place in the home. This leaves a level of uncertainty regarding the location of device usage when interpreting the results. An interesting question is whether, even if outdoor use was under-represented due to forgetting to take the device along or not wanting to be seen with it, the same bias would also apply to a real device. Designing for increased uptake outside the home would then become an additional behavior change and design challenge. 
Acknowledgments
The authors thank Clare Hayes and Elizabeth Bartlam from the Aston University Low Vision clinic as well as Elodie Draperi for their support in recruitment and administration of the study, Chris Banks for help with administrative tasks, and two reviewers for their thoughtful comments. This work would not have been possible without our participants who volunteered their time; we are very grateful for their support. 
This study was part-funded through Innovate UK grant number 103865. 
Commercial relationships: none. 
Corresponding author: Sandra D. Starke. 
Email: s.starke@aston.ac.uk. 
Address: Aston Business School, Aston University, Aston Triangle, Birmingham, B4 7ER, UK. 
References
Access Economics Pty Ltd. (2009). Future sight loss UK (1): The economic impact of partial sight and blindness in the UK adult population. Full report. Royal National Institute for the Blind. Retrieved from https://www.rnib.org.uk/sites/default/files/FSUK_Report.pdf.
Barrett, L. F., & Barrett, D. J. (2001). An introduction to computerized experience sampling in psychology. Social Science Computer Review, 19, 175–185. [CrossRef]
Berdeaux, G. H., Nordmann, J.-P., Colin, E., & Arnould, B. (2005). Vision-related quality of life in patients suffering from age-related macular degeneration. American Journal of Ophthalmology, 139, 271–279. [CrossRef]
Binns, A. M., Bunce, C., Dickinson, C., Harper, R., Tudor-Edwards, R., Woodhouse, M., et al. (2012). How effective is low vision service provision? A systematic review. Survey of Ophthalmology, 57, 34–65. [CrossRef]
Binns, A., Bunce, C., Dickinson, C., Harper, R., Tudor-Edwards, R., Woodhouse, M., et al. (2009). Low vision service outcomes: A systematic review. Low Vision Service Model Evaluation (LOVSME) Project. Retrieved from https://www.rnib.org.uk/sites/default/files/LOVESME_summary_lit_review.pdf.
Bourne, R. R. A., Flaxman, S. R., Braithwaite, T., Cicinelli, M. V., Das, A., Jonas, J. B., et al. (2017). Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis. The Lancet. Global Health, 5, e888–e897. [CrossRef]
Brown, C., Costley, C., Friend, L., & Varey, R. (2010). Capturing their dream: Video diaries and minority consumers. Consumption Markets and Culture, 13, 419–436. [CrossRef]
Carter, S., & Mankoff, J. (2005). When participants do the capturing: the role of media in diary studies. In: Proceedings of the SIGCHI conference on Human factors in computing systems. New York, NY: ACM. pp. 899–908.
Christensen, T. C., Barrett, L. F., Bliss-Moreau, E., Lebo, K., & Kaschub, C. (2003). A practical guide to experience-sampling procedures. Journal of Happiness Studies, 4, 53–78. [CrossRef]
Clark-Ibáñez, M. (2004). Framing the social world with photo-elicitation interviews. The American Behavioral Scientist, 47, 1507–1527. [CrossRef]
Crossland, M., Starke, S. D., Imielski, P., Wolffsohn, J., & Webster, A. (2019). Benefit of an electronic head-mounted Low Vision Aid. Ophthalmic and Physiological Optics, 39, 422–431. [CrossRef]
Deemer, A. D., Bradley, C. K., Ross, N. C., Natale, D. M., Itthipanichpong, R., Werblin, F. S., et al. (2018). Low vision enhancement with head-mounted video display systems: Are we there yet? Optometry and Vision Science: Official Publication of the American Academy of Optometry, 95, 694–703. [CrossRef]
Dickinson, C., Linck, P., Tudor-Edwards, R., Binns, A., Bunce, C., Harper, R., et al (2011). A profile of low vision services in England: The Low Vision Service Model Evaluation (LOVSME) project. Eye, 25, 829–831. [CrossRef]
Doherty, A., Williamson, W., Hillsdon, M., Hodges, S., Foster, C., & Kelly, P. (2013). Influencing health-related behaviour with wearable cameras: strategies & ethical considerations. In: Proceedings of the 4th international SenseCam & pervasive imaging conference. New York, NY: ACM. pp. 60–67.
Ehrlich, J. R., Ojeda, L. V., Wicker, D., Day, S., Howson, A., Lakshminarayanan, V., et al. (2017). Head-mounted display technology for low-vision rehabilitation and vision enhancement. American Journal of Ophthalmology, 176, 26–32. [CrossRef]
Fleck, R. (2012). Rating reflection on experience: A case study of teachers’ and tutors’ reflection around images. Interacting with Computers, 24, 439–449. [CrossRef]
Fleck, R., & Fitzpatrick, G. (2009). Teachers’ and tutors’ social reflection around SenseCam images. International Journal of Human-Computer Studies, 67, 1024–1036. [CrossRef]
Frick, K. D., Drye, L. T., Kempen, J. H., Dunn, J. P., Holland, G. N., Latkany, P., Rao, N. A., et al. (2012). Associations among visual acuity and vision- and health-related quality of life among patients in the Multicenter Uveitis Steroid Treatment Trial. Investigative Ophthalmology & Visual Science, 53, 1169. [CrossRef]
Gemming, L., Doherty, A., Utter, J., Shields, E., & Ni Mhurchu, C. (2015). The use of a wearable camera to capture and categorise the environmental and social context of self-identified eating episodes. Appetite, 92, 118–125. [CrossRef]
Gori, M., Cappagli, G., Tonelli, A., Baud-Bovy, G., & Finocchietti, S. (2016). Devices for visually impaired people: High technological devices with low user acceptance and no adaptability for children. Neuroscience & Biobehavioral Reviews, 69, 79–88. [CrossRef]
Harper, D. (2002). Talking about pictures: A case for photo elicitation. Visual Studies, 17, 13–26. [CrossRef]
Hooper, P., Jutai, J.W., Strong, G., & Russell-Minda, E. (2008). Age-related macular degeneration and low-vision rehabilitation: a systematic review. Canadian Journal of Ophthalmology, 43, 180–187. [CrossRef]
Horowitz, A., Brennan, M., Reinhardt, J. P., & MacMillan, T. (2006). The impact of assistive device use on disability and depression among older adults with age-related vision impairments. The Journals of Gerontology: Series B, Psychological Sciences and Social Sciences, 61, S274–S280. [CrossRef]
Van House, N. A. (2006). Interview viz: visualization-assisted photo elicitation. In: CHI’06 Extended Abstracts on Human Factors in Computing Systems. New York, NY: ACM. pp. 1463–1468.
Intille, S., Kukla, C., & Ma, X. (2002). Eliciting user preferences using image-based experience sampling and reflection. In: CHI’02 Extended Abstracts on Human Factors in Computing Systems. New York, NY: ACM. pp. 738–739.
Karisalmi, N., & Nieminen, M. P. (2017). Selfies of sickness: The use of video diaries with chronically ill children. In: Building Capacity for Health Informatics in the Future. Studies in Health Technology and Informatics, pp. 172–177.
Kelly, P., Marshall, S. J., Badland, H., Kerr, J., Oliver, M., Doherty, A. R., et al. (2013). An ethical framework for automated, wearable cameras in health behavior research. American Journal of Preventive Medicine, 44, 314–319. [CrossRef]
Kienzler, D. S. (1997). Visual ethics. Journal of Business Communication, 34, 171–187. [CrossRef]
Kwok, S. Y., Skatova, A., Shipp, V., and Crabtree, A. (2015). The ethical challenges of experience sampling using wearable cameras. In: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct. New York, NY: ACM. pp. 1054–1057.
Lamoureux, E. L., Pesudovs, K., Pallant, J. F., Rees, G., Hassell, J. B., Caudle, L. E., et al. (2008). An evaluation of the 10-item vision core measure 1 (VCM1) scale (the Core Module of the Vision-Related Quality of Life scale) using Rasch analysis. Ophthalmic Epidemiology, 15, 224–233. [CrossRef]
Leat, S. J., Legge, G. E., & Bullimore, M. A. (1999). What is low vision? Optometry and Vision Science: Official Publication of the American Academy of Optometry, 76, 198–211. [CrossRef]
Lorenzini, M.-C., & Wittich, W. (2019). Factors related to the use of magnifying low vision aids: a scoping review. Disability and Rehabilitation, https://doi.org/10.1080/09638288.2019.1593519.
Markowitz, S. N. (2006). Principles of modern low vision rehabilitation. Canadian Journal of Ophthalmology, 41, 289–312. [CrossRef]
Markowitz, S. N. (2016). State-of-the-art: Low vision rehabilitation. Canadian Journal of Ophthalmology, 51, 59–66. [CrossRef]
Massof, R. W., Ahmadian, L., Grover, L. L., Deremeik, J. T., Goldstein, J. E., Rainey, C., et al. (2007). The Activity Inventory: An adaptive visual function questionnaire. Optometry and Vision Science: Official Publication of the American Academy of Optometry, 84, 763. [CrossRef]
Massof, R. W., Hsu, C. T., Baker, F. H., Barnett, G. D., Park, W. L., Deremeik, J. T., et al. (2005a). Visual disability variables. I: The importance and difficulty of activity goals for a sample of low-vision patients. Archives of Physical Medicine and Rehabilitation, 86, 946–953. [CrossRef]
Massof, R. W., Hsu, C. T., Baker, F. H., Barnett, G. D., Park, W. L., Deremeik, J. T., et al. (2005b). Visual disability variables. II: The difficulty of tasks for a sample of low-vision patients. Archives of Physical Medicine and Rehabilitation, 86, 954–967. [CrossRef]
Mavoa, S., Oliver, M., Kerr, J., Doherty, A., and Witten, K. (2013). Using SenseCam images to assess the environment. In: Proceedings of the 4th International SenseCam & Pervasive Imaging Conference. New York, NY: ACM. pp. 84–85.
Mok, T. M., Cornish, F., & Tarr, J. (2015). Too much information: Visual research ethics in the age of wearable cameras. Integrative Psychological and Behavioral Science, 49, 309–322. [CrossRef]
Murphy, C. C., Greiner, K., Plskova, J., Frost, N. A., Forrester, J. V., & Dick, A. D. (2007). Validity of using vision-related quality of life as a treatment end point in intermediate and posterior uveitis. The British Journal of Ophthalmology, 91, 154–156. [CrossRef]
Palmer, S. (2005). Factors which influence the use of low vision aids. Visibility.
Peterson, R. C., Wolffsohn, J. S., Rubinstein, M., & Lowe, J. (2003). Benefits of electronic vision enhancement systems (EVES) for the visually impaired. American Journal of Ophthalmology, 136, 1129–1135. [CrossRef]
Pezzullo, L., Streatfeild, J., Simkiss, P., & Shickle, D. (2018). The economic impact of sight loss and blindness in the UK adult population. BMC Health Services Research, 18, 63. [CrossRef]
Richard, V. M., & Lahman, M. K. E. (2015). Photo-elicitation: Reflexivity on method, analysis, and graphic portraits. International Journal of Research & Method in Education, 38, 3–22. [CrossRef]
Royal National Institute for the Blind (RNIB). (2015). My Voice 2015 summary: The views and experiences of blind and partially sighted people in the UK. Key messages. Retrieved from https://www.rnib.org.uk/knowledge-and-research-hub-research-reports-general-research/my-voice.
Sandnes, F. E. (2016). What do low-vision users really want from smart glasses? Faces, text and perhaps no glasses at all. In: Computers Helping People with Special Needs. ICCHP 2016. Lecture Notes in Computer Science, vol. 9758. Cham: Springer. pp. 187–194.
Shiffman, S., Stone, A. A., & Hufford, M. R. (2008). Ecological momentary assessment. Annual Review of Clinical Psychology, 4, 1–32. [CrossRef]
Shipp, V., Skatova, A., Blum, J., & Brown, M. (2014). The ethics of wearable cameras in the wild. In: Ethics in Science, Technology and Engineering, 2014 IEEE International Symposium on Ethics in Engineering, Science, and Technology, IEEE. May 23–24, 2014 Chicago Marriott O'Hare, Chicago, IL, USA. pp. 1–5.
Skatova, A., Shipp, V. E., Spacagna, L., Bedwell, B., Beltagui, A., & Rodden, T. (2015). Datawear: Self-reflection on the go or how to ethically use wearable cameras for research. In: Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems. New York, NY: ACM. pp. 323–326.
Slade, J., & Edwards, R. (2015). My Voice 2015: The views and experiences of blind and partially sighted people in the UK. Royal National Institute for the Blind. Retrieved from https://www.rnib.org.uk/sites/default/files/My%20Voice%202015%20-%20Full%20report%20-%20Accessible%20PDF_0.pdf.
Smallfield, S., Berger, S., Hillman, B., Saltzgaber, P., Giger, J., & Kaldenberg, J. (2017). Living with low vision: Strategies supporting daily activity. Occupational Therapy in Health Care, 31, 312–328. [CrossRef]
Tadic, V., Cooper, A., Cumberland, P., Lewando-Hundt, G., & Rahi, J. S. (2016). Measuring the quality of life of visually impaired children: First stage psychometric evaluation of the novel VQoL_CYP instrument. PLoS One, 11, e0146225. [CrossRef]
Taylor, D. J., Hobby, A. E., Binns, A. M., & Crabb, D. P. (2016). How does age-related macular degeneration affect real-world visual ability and quality of life? A systematic review. BMJ Open, 6, e011504. [CrossRef]
Virgili, G., Acosta, R., Bentley, S. A., Giacomelli, G., Allcock, C., & Evans, J. R. (2018). Reading aids for adults with low vision. The Cochrane Database of Systematic Reviews, 4, CD003303.
Wilson, G., Jones, D., Schofield, P., & Martin, D. J. (2016). Experiences of using a wearable camera to record activity, participation and health-related behaviours: Qualitative reflections of using the Sensecam. Digital Health, 2, 2055207616682628. [CrossRef]