The impact of annotation on concrete and abstract visual representations in science education: testing the expertise reversal effect

Zheng, Robert; Cordner, Holly; Spears, Jeffrey

doi:10.1186/s41039-022-00194-y

Research
Open access
Published: 08 May 2022

The impact of annotation on concrete and abstract visual representations in science education: testing the expertise reversal effect

Research and Practice in Technology Enhanced Learning volume 17, Article number: 18 (2022) Cite this article

2790 Accesses
1 Citations
5 Altmetric
Metrics details

Abstract

This study investigates the effects of annotation on abstract and concrete visual representations in science education. Two studies were conducted: Study 1 investigated the interaction between annotation and visual representations. The results of this study demonstrated that in science learning, annotation with abstract visual representations was superior to annotation with concrete visual representations. Study 2 tested the expertise reversal effect in a three-factor design where the interaction among annotation, visual representation, and prior knowledge was measured. The findings showed that high-prior-knowledge learners performed better in the annotation-abstract visual condition than in the annotation-concrete visual condition where low-prior-knowledge learners showed the opposite outcome—confirming that the expertise reversal effect is at play in how science learners utilize visual information. The study has clarified the roles of prior knowledge, visual representation, and instructional strategy on learner cognitive processing in science education. This knowledge should prove useful for educators as they engage in the design and development of computer-based science learning.

Introduction

Research has demonstrated that visual resources, when optimally designed, can significantly improve learners’ understanding (Moreno & Mayer, 1999; Sweller & Chandler, 1994). Evidence from empirical research further suggests that different forms of visual representation may differently influence learners’ cognitive information processes (Mason et al., 2013). Studies also show that concrete visual representations can influence the way learners utilize their prior knowledge when processing science content (Moreno et al., 2011). Abstract visual representations, on the contrary, are likely to facilitate deep-level processing such as knowledge transfer in learning (Easterday et al., 2009; Kaminski et al., 2008). In addition, visual cues like annotation have been found to significantly affect learners’ information processing in visual learning by redirecting their attention, reducing the search path, and alleviating the cognitive load (de Koning et al., 2010; Madsen et al., 2013). Despite these advances in the knowledge of the relationship between visual representations and visual cues, little research has been conducted to understand how visual cues like annotation may influence learners’ information processing in abstract and concrete visual representations. Thus, the first goal of the current study was to examine the relationship between annotation and abstract/concrete visual representations in learning.

Studies have shown that the way visual representation is represented can have different effects on learners, depending on their level of expertise in knowledge domain (Gegenfurtner et al., 2011). In their study on diagrams and expertise, Kalyuga et al. (2012) found that novices performed well in diagrams with text, whereas experienced learners worked better with diagrams only, resulting in an expertise reversal effect. Given this empirical evidence of the functional role of expertise in visual learning, it is evident that research on visual representation and annotation should also consider the mediating effect of expertise. Therefore, the second goal of this study was to investigate the impact of expertise on visual learning by studying a 3-way interaction between visual representations (abstract vs. concrete), annotation (annotation vs. non-annotation), and prior knowledge (high vs. low) in science learning.

To understand the complexity of visual representations and annotation and their impact on learners’ cognitive processes, the current study is informed by several theories and framework that include cognitive load theory, research in abstract and concrete visual representations, role of annotation, and expertise reversal effect in visual learning.

Cognitive load theory

Cognitive load theory (CLT), which was conceived in the 1980s by Sweller and colleagues, has become one of the most important theories in learning and psychology (Greenberg et al., 2021; Kalyuga & Plass, 2018; Plass & Kalyuga, 2019). According to CLT, three types of cognitive load can significantly influence learners’ cognitive processes in learning (Sweller et al., 1998). They are intrinsic, extraneous, and germane cognitive load. The intrinsic load refers to the difficulty of the learning material as defined by its element interactivity, that is, “the level of interconnectedness between the information elements that need to be processed … to make sense of the learning tasks or materials” (Plass & Kalyuga, 2019, p. 342). For example, memorizing a list of random words requires a lot of effort but does not necessarily pose high intrinsic load. Reading a dense paragraph of text, on the other hand, may induce high intrinsic load, since the learner needs to figure out the semantic structure among the words, the grammatical relationship between parts of speech, and so forth. All of this indicates a high level of element interaction. Thus, the higher the element interactivity is, the more difficult learning the material becomes, and the higher the intrinsic cognitive load will be (Sweller & Chandler, 1994). The extraneous cognitive load is caused by inappropriate design in instruction, such as redundancy or split attention in learning. For instance, teachers may unwittingly increase learners’ extraneous load by presenting materials that “require students to mentally integrate mutually referring, disparate sources of information” (Sweller & Chandler, 1991, p. 353). Therefore, the extraneous cognitive load is considered a hindrance to learning and should be minimized. Finally, the germane cognitive load is induced by learners’ efforts to process and comprehend material in order to construct new knowledge. This type of cognitive load is relevant to learning and reflects learners’ cognitive engagement.

There has recently been a growing interest in the affective aspects in cognitive load research (Moreno & Mayer, 2007; Plass & Kalyuga, 2019; Plass & Kaplan, 2016). In an early study, Moreno (2010) stressed the importance of motivation in determining the cognitive resources allocated to learning tasks (the resources that are associated with germane load). Plass and Kalyuga (2019) argued that cognitive load, and germane load in particular, may be related to, or involve, affective processes. They contended that activating motivation would lead to effortful cognitive engagement during the learning process. In a study on cognitive load, motivation and prior knowledge in mathematics problem solving, Gupta and Zheng (2020) found germane load was significantly positively correlated with the learner’s interest (r = 0.221, p < 0.01), confirming Plass and Kalyuga’s argument about the relationship between motivation and germane cognitive load in learning.

Abstract and concrete visual representations

Visual resources play an important role in the formation and development of mental representations (Crisp & Sweiry, 2006). According to Crisp and Sweiry, there are a number of possible reasons for the apparent superiority of visual over non-visual representations in mental representation formation (Also see Greenberg et al., 2021, Greenberg & Zheng, 2022). First, processing visual material may require less cognitive effort. The learner can grasp the general meaning of an image in as little as milliseconds. Second, visual and non-visual materials may be processed in different cognitive systems. The dual-coding theory (Paivio, 1986) explains why memory for images may be better than memory for non-image information—it is a result of them being encoded both as images and verbal labels, while text is only encoded verbally.

Due to the above reasons, visual representations have been widely applied to learning across domains, especially in STEM education (Frey et al., 2016; Fuchsova & Korenova, 2019; Rau, 2017; Soong et al., 2020). Fuchsova and Korenova (2019) investigated the effects of visual representation on elementary school teachers’ human biology training. They noticed that by using augmented reality, the learners demonstrated deeper understanding, greater motivation, and more creativity in learning.

In addition, visual representations can be classified into abstract and concrete visual representations. Abstract visual representations refer to visuals that use conventional symbols (e.g., lines, nodes, boxes, etc.) to represent the relevant elements of a problem where concrete visual representations are those that depict the real-life objects corresponding to a problem (Moreno et al., 2011). Figures 1 and 2 are examples of abstract and concrete visual representations, respectively. In the abstract visual representations, lines and symbols are used to represent the electric circuitry. In contrast, in the concrete visual representations, actual images (e.g., bulb, battery, wire, etc.) are used to represent the electric circuitry.

Mason et al. (2013) differentiate the cognitive functioning between abstract and concrete visual representations by showing that concrete visuals “may make information more available in long-term memory,” whereas abstract visuals “may lead to more efficient processing of learning material” (p. 377). In a separate study, Moreno et al. (2011) looked at the differences between abstract and concrete visual representations in electric circuitry. They found that concrete visual representations promoted better comprehension and problem solving by depicting a close correspondence between the representations and the concrete objects that they intended to represent, which made concrete visual representations rely less on knowledge conventions for their interpretations. They claimed that concrete visual representations help build the connection between learners’ prior knowledge and the information to be learned. Abstract visual representations were found to promote problem solving by focusing learners’ attention on structural rather than superficial problem information. In other words, abstract visual representations generalize the common underlying structure of the problems that are superficially dissimilar. This knowledge of the general underlying structure of the problems is important in supporting transfer in learning. Moreno et al. therefore concluded that abstract visuals facilitate knowledge transfer while concrete visuals connect with schemata. However, de Bock et al. (2011) compared abstract and concrete visual representations in mathematics learning and came to the conclusion that there were no differences between abstract and concrete visual representations. They found both abstract and concrete visual representation groups performed equally well in knowledge transfer. They speculated that the non-significance in different visuals may be accounted for by learners’ abilities to derive the underlying structure in the mathematics problems in both groups regardless of the forms of visual representation. Given the equivocal findings from abstract and concrete visual representation research, further investigation on abstract and concrete visual representations is warranted.

Role of annotation in visual learning

Research in neuroscience has demonstrated that visual stimuli like visual cues can significantly influence brain functions in the domains of attention and executive functioning (Heinrich et al., 2007). Madsen et al. (2013) examined the effects of visually cued and uncued diagrams on learners’ cognitive processes in aerodynamics learning. They found that compared to the visually uncued condition, learners in the visually cued condition spent less time looking at “novicelike” areas of the diagram and more time at the “expertlike” areas of the diagram in transfer problem solving.

As a visual cueing strategy, annotation has recently received significant attention among researchers and educators who view annotation as a way to augment the learning process—particularly in science education. Studies have shown that annotation provokes individuals to consider and weigh new perspectives. It supports critical thinking and augments deep processing in learning (Samuel et al., 2011; Wallen et al., 2005). Wallen et al. (2005) note that annotation aids the process of selecting relevant information, organizing the information in memory, and integrating new information with prior knowledge. Moreover, it redirects learners’ attention and reduces their visual search, resulting in a reduced extraneous cognitive load in learning.

There are many ways to instantiate annotation in learning. The current study chose to provide an annotation tool with which learners could, for instance, review information about electric circuit design and the formula for calculating electric resistance. They could then make annotations as he/she solves the problem. Figures 1 and 2 illustrate the annotations made by the learners as they calculate the electric resistance in a parallel electric circuit in both abstract and concrete visual representations.

Despite what is known about annotation in learning, little research has been conducted to understand how it supports learners’ cognitive processes when different visual representations (e.g., abstract vs. concrete) are used. The current study aimed to understand the differing roles of annotation in the context of abstract and concrete visual representations in science learning.

Prior knowledge and expertise reversal effect

The expertise reversal effect pertinent to visuals in learning has gained attraction among the researchers over the past decade (Zheng & Greenberg, 2018, Zheng & Gardner, 2020; Gupta & Zheng, 2020; Kalyuga et al., 2012), largely because of the fascinating observations about how expertise may influence the outcome of visual learning (Gegenfurtner et al., 2011). Kalyuga (2009) pointed out that learning procedures and techniques that are beneficial for low-prior-knowledge learners may become relatively inefficient for high-prior-knowledge learners. In a longitudinal study, Kalyuga et al. (1998) investigated the redundancy effect between high- and low-prior-knowledge learners. They found that when diagrams were embedded in the text, novices learned the content well. After the learners underwent intensive training, however, a reversal effect was observed: Diagram-alone materials generated a higher level of performance on the subsequent tests. Kalyuga et al. explained that at the beginning, novices may not have constructed schemata to understand the complex content. Therefore, the text with the diagram helped the novices comprehend the content. However, as the learners gained more knowledge, their learning actually became hindered when additional text was added, since it was then unnecessary and redundant in learning. Kalyuga et al. (2012) argued that when this redundant information cannot be ignored, interference with learning occurs resulting in high extraneous cognitive load as well as a misalignment between learners’ effort and task difficulty. Lee et al. (2006) conducted a study on task complexity (high vs. low), visual representation (symbol vs. icon), and prior knowledge (high vs. low). An expertise reversal effect was observed. Low-prior-knowledge learners performed better with symbolic and iconic visual representations, whereas high-prior-knowledge learners performed better with symbolic visual representations only. Lee et al. explained that the multiple visuals were necessary scaffolds for low-prior-knowledge learners who lacked adequate schemata. However, these same scaffolds became redundant to high-prior-knowledge learners and consequently hindered their learning. The current study therefore investigated the role of expertise in visual learning.

Three predictions were made to guide the present study.

Prediction 1: Learners with annotation will outperform these without annotation as measured by learners’ comprehension, problem solving, and three types of cognitive load.

Prediction 2: There will be an interaction between annotation and abstract/concrete visual representations as measured by learners’ comprehension, problem solving, and three types of cognitive load.

Prediction 3: Learners’ performance in annotation and abstract/concrete visual representations will be affected by their expertise in the domain area. Specifically, high-prior-knowledge learners will perform better in the abstract visual representation with annotation (AA) condition, since abstract visuals facilitate the understanding of the underlying structure of the problems. In contrast, low-prior-knowledge learners will perform better in the concrete visual representation with annotation (CA) condition, because the extra visual support from both concrete visuals and annotation will facilitate schema development for the novices.

Two studies were conducted to test the above predictions. Study 1 explored a two-way interaction between annotation and visual representation with a focus on the difference between annotation and non-annotation in abstract and concrete visual representations. Study 2 tested a three-way interaction between visual representation (abstract vs. concrete), annotation (annotation vs. non-annotation), and prior knowledge (high vs. low). The purpose was to understand how visual representation and annotation may be influenced by learners’ prior knowledge.

Study 1

To test predictions 1 and 2, Study 1 investigated (a) the differences between annotation and non-annotation and (b) the relationship between annotation (annotation vs. non-annotation) and visual representations (abstract vs. concrete) in science learning.

Methodology

Subjects and design

Participants (N = 108) were recruited from a Research I university in the western USA. The average age of the subjects was 21.5 (SD = 1.60). Of 108 subjects, 49% (n = 53) were males and 51% (n = 55) were females. About 57.4% (n = 62) were white, 4.6% (n = 5) were African American, 20.4% (n = 22) were Hispanic, 14.8% (n = 16) were Asian, and 2.8% (n = 3) were other. A 2 × 2 between-subjects factorial design was employed with visual representation (abstract vs. concrete) and annotation (annotation vs. non-annotation) as the independent variables, and comprehension, problem solving, and cognitive load scores (intrinsic, extraneous, and germane) as dependent variables. The pretest scores on learners’ prior knowledge of electrical circuitry were entered as a covariate. Four conditions were created. They included Abstract visual + Annotation (AA), Abstract visual + Non-annotation (ANA), Concrete visual + Annotation (CA), and Concrete visual + Non-annotation (CNA). Subjects were randomly assigned to one of the four conditions. A family-wise alpha level of 0.05 was adopted for all analyses.

Learning materials

The learning materials were created with Adobe DX to support interactive computer-based learning in electrical circuitry. The content was adapted from a textbook by Herman (2016). The learning materials covered the concepts of electric circuit (e.g., parallel and multiple resistance in an electric circuit) with electric circuit problems requiring the learners to solve them with Ohm’s law. The learning materials had built-in annotation support where the participants were able to click the annotation button to get the resources (e.g., formula for calculating the resistance) and enter their own notes if needed.

Measurement

The measurement for Study 1 included prior knowledge test (PKT), the posttest, and cognitive load questionnaire (CLQ), the details of which are described as follows.

Prior knowledge test (PKT) The PKT consisted of 10 items aiming to test learners’ prior knowledge in electric circuitry. The test included basic concepts from elements of electric circuit (e.g., current, voltage) to the types of electric circuits (e.g., series, parallel). The maximum score one could obtain was 10 points. The PKT was adapted from a screen test in Herman’s (2016) textbook. The items were reviewed by a panel of experts whose feedback was incorporated in the finalization of the instrument. The internal consistency for the current study for the PKT was α = 0.806, suggesting good reliability for measuring the prior knowledge on the subject.

The posttest The posttests consisted of a comprehension test and problem-solving test. The comprehension test had ten questions assessing learners’ understanding of the concepts and principles related to electrical circuit such as explaining the difference between parallel and multiple resistance in electric circuit. The maximum score one could obtain on the comprehension test was 10 points. The problem-solving test included five near transfer problems with a maximum of 10 points possible for the entire test. In the problem-solving test, learners were asked to solve a problem based on a given condition. The learner would calculate, for example, the level of resistance in the electric current using Ohm’s law and then find a solution for the proper functioning of the electric circuit. The inter-item reliabilities for comprehension and problem solving were α = 0.812 and α = 0.721, respectively, showing medium to high reliabilities. Figure 3 provides an example of a problem-solving test item.

Cognitive load questionnaire (CLQ) A self-report questionnaire that evaluates the cognitive load in learning was used. The 11-point Likert-scale questionnaire (N = 10) was adapted from Leppink et al. (2013) to measure three types of cognitive load: intrinsic load (IL) items 1–3, extraneous load (EL) items 4–6, and germane load (GL) items 7–10. Examples of the questions include "The topic covered in the electric circuit material was very complex" (IL), "The instruction and explanation during the learning were very ineffective" (EL), and "The annotation with visuals really enhanced my understanding of the content covered" (GL) (see Appendix for the whole questionnaire).

The instrument reports medium to high reliabilities with intrinsic load α = 0.81, extraneous load α = 0.75, and germane load α = 0.82. The current study reported similar reliabilities with intrinsic load α = 0.81, extraneous load α = 0.76, and germane load α = 0.84, all indicating good reliability.

Procedure

Participants were informed of the nature of the study and completed the consenting process before participating in the study. They were then randomly assigned to one of the four learning conditions: AA, ANA, CA, and CNA. The participants completed a demographic survey and a prior knowledge test. They were then told to log onto the computer to start the learning session that included the electrical circuitry materials followed by a practice session. At the end of practice session, the participants were asked to complete a posttest that consisted of comprehension and problem-solving subtests. Finally, the CLQ was administered. The entire study took about one and a half hours. The data were aggregated for final analyses.

Results

Statistical assumptions were evaluated and met. The MANCOVA was performed using SPSS v. 26 with annotation (annotation vs. non-annotation) and visualization (abstract vs. concrete) as independent variables and comprehension, problem solving, and three cognitive load scores as dependent variables. As the raw scores for the three cognitive load measures varied due to the differing number of questions in each load category (IL = 3, EL = 3, GL = 4), Z-scores were calculated to allow for meaningful comparisons among the outcomes. Prior knowledge scores were entered as a covariant in the final analyses. Table 1 presents the descriptive statistics with means and standard deviations.

Table 1 Descriptive statistics with means and standard deviations for study 1 (N = 108)

Full size table

The results of multivariate tests show prior knowledge as a covariant was significant λ = 0.59, p < 0.001, ƞ² = 0.40. Main effects were observed for annotation λ = 0.69, p < 0.001, ƞ² = 0.30 and visual representation λ = 0.70, p < 0.001, ƞ² = 0.29. The follow-up between-subjects tests revealed that there was a significant difference in annotation measured by problem solving F(1, 107) = 7.38, p < 0.01, ƞ² = 0.06, but not by comprehension F(1, 107) = 3.33, p = 0.07.

It was found that learners who studied in the AA condition generally outperformed these in the CA condition (Fig. 4). There was a significant interaction between visual representation and annotation λ = 0.76, p < 0.001, ƞ² = 0.23 as measured by comprehension F(1, 107) = 11.73, p < 0.001, ƞ² = 0.10, problem solving F(1, 107) = 6.41, p < 0.05, ƞ² = 0.05 and germane cognitive load F(1, 107) = 8.89, p < 0.01, ƞ² = 0.07, but not by intrinsic cognitive load F(1, 107) = 1.56, p = 0.213 and extraneous cognitive load F(1, 107) = 3.83, p = 0.053, suggesting a connection between germane cognitive load and performance.

As expected, learners experienced higher extraneous cognitive load without annotation than with annotation F(1, 107) = 39.11, p < 0.001, ƞ² = 0.27. A significant difference in intrinsic cognitive load was observed for visual representation F(1, 107) = 5.48, p < 0.05, ƞ² = 0.05 where learners experienced higher intrinsic load in concrete visual representation than in abstract visual representation conditions. Finally, germane cognitive load was significant for annotation F(1, 107) = 8.17, p < 0.01, ƞ² = 0.07. The interaction between annotation and visual representation was significant F(1, 107) = 8.89, p < 0.01, ƞ² = 0.07.

As expected, learners experienced higher extraneous cognitive load without annotation than with annotation F(1, 107) = 39.11, p < 0.001, ƞ² = 0.27. A significant difference in intrinsic cognitive load was observed for visual representation F(1, 107) = 5.48, p < 0.05, ƞ² = 0.05, indicating learners experienced higher intrinsic load in concrete visual representation than in abstract visual representation conditions. Finally, germane cognitive load was significant for annotation F(1, 107) = 8.17, p < 0.01, ƞ² = 0.07 revealing the relationship between germane load and annotation. There was a significant interaction between annotation and visual representation F(1, 107) = 8.89, p < 0.01, ƞ² = 0.07 suggesting that the types of visual representation were related to the presence of annotation in science learning.

Regardless of the significant interaction between visual representation and annotation, the results, however, remained inconclusive. As Kalyuga (2007) noted, the effects of instructional strategies may differ relative to learners’ prior knowledge. Given the significance of prior knowledge as a covariant in Study 1, a follow-up study that examined the impact of prior knowledge on visual representation and annotation was called for.

Study 2

Two hundred and twenty-seven participants were recruited from the same university. Of 227 participants, 59% (n = 135) were females and 41% (n = 92) were males. The average age of the subjects was 22.5 (SD = 1.72). About 63% (n = 143) were white, 7% (n = 16) were African American, 9.7% (n = 22) were Hispanic, 15% (n = 34) were Asian, and 5.3% (n = 12) were other.