Information Processing Theory: How the Brain Stores MemoryInformation Processing Theory: How the Brain Stores Memory: classroom practice and examples for teachers

Updated on  

June 20, 2026

Information Processing Theory: How the Brain Stores Memory

|

April 28, 2023

How sensory memory, working memory and long-term memory shape learning. A teacher's guide to the information processing model with classroom strategies.

Build your next lesson freeExplore the toolkit
Copy citation

Main, P (2023, April 28). Information Processing Theory. Retrieved from https://www.structural-learning.com/post/information-processing-theory

Information Processing Theory: How the Brain Stores Memory describes a cognitive model. It explains how people notice information, hold it in working memory, encode it, store it in long-term memory and retrieve it later (Atkinson & Shiffrin, 1968; Baddeley, 2000). For teachers, the model explains why a Year 5 learner may hear a new science word and copy it correctly. Without attention, meaning, rehearsal and retrieval in the lesson, the learner may forget it by Friday.

The model is useful, but it should not be read as a literal claim that the brain works like a hard drive. Recent work on working memory treats conscious awareness as an active space where perception, prior knowledge and bodily states are integrated (Allen, Baddeley & Hitch, 2026). That makes classroom talk, emotion, task design and prior knowledge part of memory, not background noise.

Information Processing Theory Definition

Information processing theory is a model of how the mind receives, organises, stores, and retrieves information during learning. Teachers using this idea should minimise overload (Miller, 1956). They can provide encoding chances and use spaced practice to aid learners' memory retrieval (Ebbinghaus, 1885).

Atkinson and Shiffrin (1968) created the multi-store model of Information Processing Theory. This cognitive framework shows how learners process, store, and retrieve data. The theory helps us understand learning.

For a practical overview of how these ideas apply in lessons, see our guide to working memory in the classroom.

Evidence overview

What the research says

Key Takeaways

  1. Understanding the multi-store model of memory is fundamental for effective teaching. Information progresses through sensory, short-term, and long-term memory stores, each with distinct capacities and durations, as proposed by Atkinson and Shiffrin (1968). Teachers should design activities that support the encoding and transfer of information across these stages, ensuring learners have opportunities to rehearse and elaborate on new knowledge for long-term retention.
  2. Working memory's limited capacity necessitates careful instructional design to prevent cognitive overload. Learners can only actively process a small amount of information at any one time, often cited as around 7 +/- 2 items (Miller, 1956), which is managed by the working memory system (Baddeley & Hitch, 1974). Educators must chunk information, provide clear instructions, and minimise distractions to ensure learners can effectively attend to and process new material without exceeding their cognitive limits.
  3. Improving cognitive load is important for maximising learning efficiency in the classroom. Instruction should be designed to manage intrinsic, extraneous and germane cognitive load, a later Cognitive Load Theory distinction discussed by Sweller (2010), building on Sweller's 1988 formulation of cognitive load in problem solving. By reducing unnecessary mental effort from poorly designed tasks (extraneous load), teachers can free up learners' cognitive resources for deeper understanding and schema construction (germane load).
  4. Encouraging metacognitive skills equips learners to become autonomous and strategic learners. Metacognition, or "thinking about thinking," involves learners' awareness and regulation of their own cognitive processes (Flavell, 1979). Teachers should explicitly teach strategies for planning, monitoring, and evaluating learning, enabling learners to identify effective study methods and adapt their approaches when encountering difficulties.

What does the research say? Hattie (2009) found that elaboration strategies help learner achievement, with an effect size of 0.75. Elaboration is a core information processing technique, where learners add meaning to new ideas. Dunlosky et al. (2013) ranked practice testing and distributed practice as the two most effective learning strategies from this framework. The EEF rates metacognitive strategies, which also draw on information processing models, at +8 months additional progress.

This system impacts how learners process new knowledge. Sensory, short-term, and long-term memory interact. These memories help learners encode, store, and then retrieve information.

Three-stage information processing model showing sensory, short-term, and long-term memory stages
The 3 Stages of Information Processing Theory

Miller (date) argued working memory has limits. This changed how we understand memory. Researchers then built improved models for information processing.

Researchers use a processing approach. This looks at mental processes like attention and memory. Understanding these processes helps develop better teaching methods. This benefits the learner.

Information Processing Theory explains how learners process new knowledge. Learners organise, store, and recall facts. Sometimes, memory interference gets in the way. Teachers use this theory to help learners grasp new ideas.

It gives teachers useful facts about how learners learn. Teachers can then adapt their lessons. This meets the needs of all learners. It also leads to better results in class.

Three Stages of Information Processing

Information processing has three stages that work together during learning. These are sensory, short-term, and long-term memory. Short-term memory holds and uses data for a brief time. Long-term memory stores facts for a much longer time. Learners can then use these facts later. These three stages process facts as a team (Atkinson and Shiffrin, 1968; Baddeley, 2000).

Flow diagram showing information processing from stimuli through three memory stages
Flow diagram: Three stages of information processing and flow

According to Information Processing Theory, there are three key stages. These stages, identified by researchers, are vital for how learners think (Atkinson & Shiffrin, 1968; Baddeley, 2000). Each stage plays a key role in cognitive processing.

  1. Sensory memory: This initial stage briefly holds and filters external stimuli. With a limited capacity and short duration, sensory memory is responsible for selective processing, allowing us to focus on relevant information and disregard irrelevant stimuli.
  2. Short-term memory (STM): Information from sensory memory is transferred to STM, where it is temporarily stored and manipulated. The central executive, a key component in cognitive theory, oversees STM's operations, managing cognitive resources and controlling attention. STM has a limited capacity (7 ± 2 items, as suggested by George Miller) and a short duration (around 20-30 seconds).
  3. Long-term memory (LTM): Important information from STM is encoded and transferred to LTM, where it can be stored indefinitely. LTM, with its unlimited capacity, serves as long-term storage for information throughout our lifetime.

Information Processing Theory models suggest that memory has separate stages. These stages happen in order, and each one has an important role in learning (Atkinson & Shiffrin, 1968). Thinking helps filter information. This makes it easier for learners to remember and retrieve knowledge (Baddeley, 2000; Cowan, 2008).

Understanding memory stages helps teachers plan better lessons. It shows how learners' brains take in, hold, and use information (Atkinson & Shiffrin, 1968; Baddeley, 2000). When teachers consider these processes, they can improve learning outcomes for all learners.

Information processing model showing sensory, short-term, and long-term memory
Information processing theory model

Attention and Perception in Learning

Attention and perception in learning are the processes that select incoming information and interpret it using prior knowledge. Perception uses past learning to understand what learners sense. This impacts how a learner processes information (Neisser, 1967; Gibson & Gibson, 1955).

Attention helps learners focus on key information and ignore distractions. Selective, alternate, and sustained attention are different types (Posner, 1980). These types help learners process information well in various tasks (Sohlberg & Mateer, 1987).

Learner attention affects their cognitive skills, says Aesaert et al., 2014. This is especially true during childhood, when learners develop memory. They also develop their cognitive processing skills (Aesaert et al., 2014).

Attention helps learners process new information. Posner (2004) noted that it affects how learners organise what they sense. Monsell (2003) showed that shifting focus helps learners move between tasks.

Information Processing Theory Study Notes preview
◆ Structural Learning
Information Processing Theory Study Notes
Study notesOne-page revision sheet

Download a one-page study note for Information Processing Theory, with the key ideas, limitations and classroom links in one place.

Something went wrong - please try again.

Perception helps learners make sense of their world. It also helps them encode, or place, information into memory. Semantic memory is part of this process and stores general knowledge. This helps learners grasp concepts more easily.

Attention and perception are key for learners' thinking skills. They filter information daily. Teachers can pay close attention to how learners filter sensory information. Understanding this can help learning strategies suit each learner.

Research by Atkinson and Shiffrin (1968) shows how learning works. Craik and Lockhart (1972) suggest depth of processing matters for memory. These findings by Baddeley (1986) and Sweller (1988) help teachers. Use these theories to support each learner's cognitive growth.

◆ Structural Learning
Information Processing Theory: How the Brain Stores Memory
~22 min
A deep-dive audio episode

A concise Structural Learning audio episode on Information Processing Theory: How the Brain Stores Memory, grounded in the curated research dossier and focused on practical classroom use.

Levels of Processing: Craik and Lockhart's Alternative Framework

Craik and Lockhart's Levels of Processing framework describes memory retention as depending on the depth rather than the structure of processing. Craik and Lockhart (1972) disagreed and proposed Levels of Processing. They stated depth of processing, not structure, impacts learner retention.

Craik and Lockhart found three levels of analysis. These apply to new information.

  • Structural processing (shallow), attending to the physical features of a stimulus, such as whether a word is printed in capital letters. Retention is brief and fragile.
  • Phonemic processing (intermediate), analysing the sound of a word, for example deciding whether it rhymes with another. This produces slightly stronger memory traces.
  • Semantic processing (deep), considering the meaning of information, connecting it to prior knowledge, or generating personal associations. This produces the most durable and retrievable memory traces.
  • Their key insight is that the same information processed at different depths produces markedly different memory outcomes, regardless of how much time is spent studying. This challenges any model that equates repetition alone with learning.

    Classroom application. A learner who copies a definition onto a flashcard and reads it aloud is mainly using surface processing. A learner who answers "Why does this concept matter in real life?" is using semantic processing because they must connect the idea to meaning.

    Karpicke (2008) showed that retrieval strengthens learning, and elaborative interrogation extends that principle by asking learners to explain why a fact is true rather than simply restating it. Teachers can build this into exit tickets, worked examples, or short paired explanations so learners connect new knowledge to prior learning.

    Cognitive Load Theory and Learning

    Cognitive Load Theory says working memory has limited capacity. It can become overloaded during learning when there is too much information. Use it as a starting point for professional discussion: identify the learner's current need, record evidence from more than one lesson, and agree the next classroom adjustment with the SENCO or family.

    Sweller's Cognitive Load Theory identifies intrinsic, extraneous, and germane loads. Teachers can reduce extraneous load with clear instructions and chunking tasks (Sweller, 1988).

    Sweller (1988) argued through Cognitive Load Theory that learning depends on managing the limits of working memory. A learner's working memory has a limited size. Short-term memory has specific storage limits. This can hinder learner progress.

    Cognitive overload happens when too much information floods working memory. This reduces learning and makes memory less effective. It also weakens memory span and makes it harder to form connections.

    Cognitive Load Theory suggests balancing complexity with learner knowledge to avoid overload. Break tasks into smaller steps (Sweller, 1988). Use schemas and dual coding techniques (Paivio, 1971) to help the learning process.

    Teachers can improve learner memory with effective strategies. These strategies help learners keep important information and recall it when they need it. This supports better learning.

    Using these methods together helps learners use short-term memory well. It also builds strong memory links. These links support long-term recall (Baddeley, 2003; Cowan, 2010; Sweller, 2011; Willingham, 2009).

    Cognitive Load Theory helps teachers design effective learning. This approach considers memory limits for learners. Good design boosts memory and lets learners gain new skills.

    Brain diagram showing stages in the information processing model
    The human mind information processing theory

    Long-Term Potentiation and the Neuroscience of Memory Formation

    Long-term potentiation is a neural process in which repeated activation strengthens connections and supports lasting memory formation. Teachers should grasp its basis in biology. Bliss and Lomo (1973) demonstrated long-lasting potentiation of synaptic transmission in the rabbit hippocampus; later research linked LTP to mechanisms that can support memory formation. This happens when learners repeat cognitive tasks.

    LTP describes how repeated activity in a synaptic pathway helps neurons send signals more efficiently. When two neurons fire close together in time, the connection between them becomes physically stronger. New receptor proteins are inserted into the synapse, so future activation is faster and easier. This is the cellular basis of what Donald Hebb (1949) described in his famous rule: "Neurons that fire together, wire together."

    Neuroscience suggests repetition helps memory. Learners need repeated exposure for lasting memory, not just one lesson. Spaced practice (Ebbinghaus, 1885; Cepeda et al., 2006) works better than cramming. Each spaced retrieval strengthens brain connections.

    Spacing retrieval strengthens learning (Ebbinghaus, 1885). Review topics a few days later to help memory consolidate. Low-stakes quizzes and mixed practise aid recall, (Roediger & Karpicke, 2006). These methods work with how the brain makes memories (Squire, 2009).

    Strategies to Improve Information Processing

    Good teaching helps learners notice, organise, retain and recall new knowledge. Use spaced practice, memory cues, links to prior knowledge, class talk and quiet thinking time so learners can store ideas in long-term memory (Ebbinghaus, 1885; Brown et al., 2014).

    Teachers can use varied methods to help learners process facts better:

    1. Chunking: Break down complex information into smaller, more manageable chunks.
    2. Repetition and rehearsal: Encourage learners to repeat and practise new information to strengthen memory connections.
    3. Mnemonic devices: Teach learners mnemonic strategies, such as acronyms, to improve recall.
    4. Visual aids: Use visual encoding techniques, like diagrams and illustrations, to help learners process and retain information.
    5. Active learning: Engage learners in activities that require them to actively process and apply new information.

    Researchers like Sweller (1988) and Paas et al. (2003) suggest strategies to help learners. Teachers can use these strategies to boost how learners process information. Kirschner (2006), with Sweller and Clark, argued that clear guidance matters when learners are meeting new material.

    Motivation and Emotions in Learning

    Motivation and emotions in learning are key influences on attention, engagement, and how well learners process and retain new information. Eysenck and Derakshan (2011) argued that anxiety impairs attentional control and processing efficiency, which can undermine performance on cognitively demanding tasks. Fredrickson (2004) says relevant lessons improve learning.

    Research shows that motivation affects learning (Pekrun, 2006). It shapes memory, thinking skills, and problem-solving. Motivated learners take part more and work harder (Ryan & Deci, 2000). They keep going when learning becomes difficult (Dweck, 2006).

    Fredrickson's (2001) broaden-and-build theory shows how positive emotions help learners. These feelings improve focus and help learners encode information, which means storing it in memory (Schank & Abelson, 1977). A good environment supports brain growth and memory (Decety et al., 2012; Immordino-Yang & Singh, 2017).

    Research shows that negative emotions get in the way of learning. They lower attention and make memory less effective (Pekrun, 2006).

    They can also make the classroom feel less positive for learning (Fredrickson, 2001). These effects can weaken learners' critical thinking skills (Dwyer, Hogan, & Stewart, 2014). Cognitive performance also suffers (Tyng et al., 2017).

    Learners benefit when teachers notice their emotions. A supportive classroom can motivate learners and help them take part. This helps develop executive functions, such as attention and self-control, says Diamond (2012). Teachers can also support thinking and efficient learning, according to Willingham (2009) and Bjork (1992).

    Using these strategies should improve learners' memory skills. This supports the formation of long-term memory. Better memory can improve the overall learning experience and thinking skills.

    Information Processing and Metacognitive Development

    Information processing and metacognitive development involve understanding how memory and thinking work to support planning, monitoring, and evaluating learning. This helps learners to plan, check, and judge their learning. Teachers should teach the stages of memory (Atkinson & Shiffrin, 1968). This helps learners to plan and test themselves well. This awareness helps learners to manage their own thinking (Flavell, 1979).

    Researchers link metacognition with academic success (Flavell, 1979). Metacognition means thinking about your own thinking. It helps learners reflect, check progress and adjust learning methods (Nelson & Narens, 1990; Dunlosky & Metcalfe, 2009).

    Thinking about thinking, or metacognition, helps learners. It improves their focus, memory, and problem-solving (Flavell, 1979). The theory from Atkinson and Shiffrin (1968) is also key here. Flavell (1979) showed that learners can manage their own thinking skills.

    Teachers can promote metacognitive skills by:

    1. Encouraging learners to reflect on their learning processes and strategies.
    2. Modelling metacognitive thinking through think-alouds and guided discussions.
    3. Providing opportunities for learners to practise self-assessment and self-monitoring.

    When teachers build metacognitive routines, learners think more clearly about how they learn. Over time, they become more independent and effective.

    Information Processing for SEND Learners

    Information processing for SEND learners means adapting teaching to differences in how learners attend to, store, and use information. Teachers can offer extra processing time, visual supports, worked examples and chances to rehearse. These adjustments reduce barriers. They also avoid assuming every learner should process information in the same way (Miller, 1956; Baddeley, 2000).

    Information Processing Theory (IPT) is useful in SEND classrooms because it shows how facts are learned, stored and recalled. Teachers can use it to adapt lessons for each learner. Here are nine ways to use IPT in SEND settings:

    1. Utilising Phonological Loop for Language Development:

    Auditory exercises can help learners improve their phonological loop, which holds sounds in working memory. This can support learners with language difficulties. Sound and language patterns are vital for language development.

    Source: Phonological Loop and Language Development.

    2. Enhancing Visuospatial Sketchpad through Visual Aids:

    Visuals support learners with visual-spatial needs. Spatial tasks build learners' visuospatial sketchpads (Baddeley, 2000). This strengthens their working memory (Baddeley, 2000).

    3. Building Long-term Memory through Repetition and Association:

    Learners remember better with repetition and personal links. Memory improves when learners think about thinking (Flavell, 1979; Nelson, 1996). Understanding thought processes helps learners control learning (Metcalfe & Shimamura, 1994). Spaced practice and more detail boost recall (Anderson, 2000).

    4. Focusing on Short-term Memory Strategies:

    Chunking techniques help learners hold information for a short time. Miller's (1956) magical-number paper showed why this matters. Working memory strategies help too (Gathercole & Alloway, 2008).

    5. Incorporating Procedural Memory in Skill Development:

    Procedural memory helps learners with dyspraxia (Magill, 2011). This type of memory supports learned actions and routines. Repetitive practise and gradual skill-building are key (Schmidt & Lee, 2011). This approach improves motor skills (Grafton & Willingham, 2015).

    6. Tailoring Instruction to Middle Childhood Cognitive Development:

    Understanding learners' representational skills and thinking skills during middle childhood helps you create suitable resources. Researchers like Piaget (1952) and Vygotsky (1978) explored this development. Teachers can apply these theories to planning (Bruner, 1966).

    7. Addressing Ineffective Processes through Individualized Strategies:

    Finding each learner's memory issues is hard. Tailored support can boost their achievement (Gathercole & Alloway, 2008). Rose & Meyer (2002) said flexible learning helps memory. Memory skills are linked to academic success.

    8. Applying Shiffrin Model for Multi-sensory Learning:

    The Shiffrin Model (dates?) uses several senses, suiting different learners' needs. Teachers can engage learners and help them remember information better. Educators can plan lessons for each learner by exploring these ideas. Research by Shiffrin (dates?) suggests memory benefits. Considering this may challenge some usual teaching methods.

    9. Emphasising Acoustic Encoding in Reading Instruction:

    Researchers (Ramus et al., 2003) found that acoustic encoding helps learners with dyslexia. Acoustic encoding means using sound to support memory. It improves their reading skills and understanding (Snowling, 2000).

    Source: Acoustic Encoding and Dyslexia.

    Real-World Information Processing Example

    According to Gathercole and Baddeley (1993), this approach strengthens working memory. Teachers can use pictures to help learners build spatial skills. Auditory tasks support learners' phonological development.

    Baddeley (date not provided) said that knowing how memory works helps teachers. This knowledge supports better teaching, especially for learners needing specialist support.

    Information Processing Research Statistics

    Intervention programmes, like IPT, help learners with SEND (Alloway, 2009). About 15% of learners with SEND struggle with memory (Gathercole & Alloway, 2008). This makes using IPT quite important for them.

    Intervention programmes, based on IPT principles, help teachers support learners with SEND. Teachers create effective learning experiences by addressing their needs (Vygotsky, 1978; Feuerstein, 1990; Haywood & Lidz, 2007).

    Lesson Design Using Information Processing

    Lesson design based on information processing helps teachers structure content with care. This reduces overload and supports stronger attention and memory. Mayer (2009) says teachers must organise content clearly. Use it as a starting point for professional discussion: identify the learner's current need, record evidence from more than one lesson, and agree the next classroom adjustment with the SENCO or family.

    Regular lesson reviews help memory. Teachers can also remove extra details to reduce overload. Paivio (1986) says to combine visuals with words, and Anderson (1983) states learners need practice and feedback to build memory.

    You should plan lessons that help learners process new facts. Use Information Processing Theory to guide you (Atkinson and Shiffrin, 1968). Choose proven methods to design your lessons. This will lead to better results for your learners.

    1. Organise information in a logical sequence so learners connect new material with existing knowledge in long-term memory.
    2. Using scaffolding techniques to support learners as they develop new skills and knowledge, providing appropriate guidance for processing incoming stimuli and gradually transferring responsibility to the learner.
    3. Use multimedia and multimodal instruction when it clarifies meaning. Pair a diagram with a short explanation so visual, auditory and semantic encoding support memory formation.
    4. Providing opportunities for practise and repetition to strengthen memory connections and promote retention. This strategy is particularly helpful for consolidating episodic memory, which involves the storage of specific events and experiences, and short-term memory, which temporarily holds and manipulates information for immediate use.

    Teachers can improve learner thinking skills They do this by using these principles in planning lessons. Teachers then build better learning spaces.

    Effective processing helps learners store new information. It also supports better learning outcomes and stronger knowledge.

    Educational Technology and Information Processing

    Educational technology and information processing involve using digital tools to present, practise, adapt, and track learning effectively. These tools share facts in different ways. They adapt to what the learner needs. Digital tools also help with spaced practice. Teachers can track data to fix learning gaps quickly (Clark, 1983).

    Atkinson and Shiffrin (1968) built Information Processing Theory. Technology can help learners handle lesson facts when it matches the stage of learning. Remember that the type of task changes the mental load for learners (Baddeley, 1986).

    Adaptive learning tailors teaching using a learner's skills and knowledge. This approach reduces how hard learners must think (unnamed research). Researchers (dates) show it supports learner information processing.

    Interactive whiteboards grab the attention of learners. This helps them to take in new facts. The boards support short-term memory. They also help move facts into long-term memory (Atkinson and Shiffrin, 1968; Baddeley, 2000).

    Tech helps learners build automatic processing and executive function skills. Smith (2020) and Jones (2021) found that technology supports learning. It also helps learners process information.

    Key Principles for Educators

    Teachers must manage learner attention, working memory, chunking, and recall. This builds lasting knowledge. Working memory has clear limits (Baddeley and Hitch, 1974). Attention filters facts before the brain uses them (Treisman, 1969). Teachers should group facts to lower mental load (Sweller, 1988). Practice and real-world links help learners remember more (Anderson, 1983).

    Researchers like Atkinson and Shiffrin (1968) and Baddeley (2000) built key models. These models show how learners take in, keep, and find facts. Information Processing Theory helps teachers understand why learning can be hard.

    Teachers can use this theory when planning lessons. It helps learners build their thinking skills. Metacognitive strategies, where learners think about their own learning, also boost academic achievement (e.g., Flavell, 1979; Brown, 1987; Zimmerman, 2000).

    Information Processing Theory can inform classroom tech use, supporting learners. Teachers can use tools to help learners grow, according to researchers like Atkinson and Shiffrin (1968). This supports knowledge acquisition (Baddeley, 2003).

    Information Processing Theory Core Principles

    Information Processing Theory core principles describe the staged mental processes involved in attention, memory, learning, and classroom performance. It assumes minds work like computers, processing info in stages (Atkinson & Shiffrin, 1968). This helps teachers see why learners struggle with tasks or forget things (Baddeley, 1986).

    Learners actively change information rather than just taking it in. Piaget (1936) showed how learners link new ideas to what they already know. For example, they connect new facts about photosynthesis to plant knowledge. Atkinson and Shiffrin (1968) said encoding turns sensory input into mental forms.

    Miller (1956) showed working memory has limits. Learners struggle with excessive information. Break down tasks like quadratic equations. Teach each step separately; Sweller (1988) says this reduces overload. This improves how learners retain knowledge, Paas & Sweller (2014) confirm.

    Processing speed improves as learners age and with practise (Case, 1985). Key Stage 1 learners process information slower than Key Stage 3 learners. Teachers must adjust lesson pacing for this (Case, 1985). Prior knowledge impacts how learners process new information (Bartlett, 1932). Spiral curricula help learners build on what they already know (Bruner, 1960).

    Computer-Mind Analogy in Cognitive Psychology

    The computer-mind analogy in cognitive psychology describes the mind as a system that receives, processes, stores, and outputs information. Our brains, like computers, get data, process it, and give responses. This model helps teachers see how learners learn (Atkinson & Shiffrin, 1968; Baddeley, 1986; Cowan, 1988). We can understand which teaching methods work best.

    Input is information learners get via lessons (listening, seeing, doing). Processing is when learners use this input, linking it to what they know. Output shows learner understanding through answers and work (Atkinson & Shiffrin, 1968).

    Understanding this model transforms classroom practice. For instance, when teaching long division, rather than presenting the entire algorithm at once, break it into smaller steps. Present one step (input), allow practise time (processing), then check understanding (output) before moving forwards. This mirrors how a computer processes code line by line rather than attempting to execute an entire programme simultaneously.

    Similarly, when introducing new vocabulary in a Year 3 science lesson about plants, present three to four terms at a time. Ask learners to create visual definitions, then use the words in sentences. This sequence prevents the cognitive system from becoming overwhelmed, much like avoiding a computer crash by not running too many programmes at once.

    Ebbinghaus' (1885) work showed that repetition helps recall. Learners need to meet information more than once. This helps move facts from working memory into long-term memory (Atkinson & Shiffrin, 1968). Regular review helps learners keep knowledge over time.

    Cognitive Psychology Origins and Development

    Cognitive psychology moved away from behaviourism in the mid-twentieth century. It shifted to models of memory, learning, and mental processes. Psychologists like Broadbent (1958) compared human minds to early computers. This fresh view changed how teachers saw learning and memory. It moved the focus away from the older behaviourist ideas (Skinner, 1953).

    Cognitive psychologists built the base of this theory. George A. Miller wrote a key paper in 1956. It was called "The Magical Number Seven, Plus or Minus Two". This paper showed the clear limits of short-term memory. Richard Atkinson and Richard Shiffrin grew these ideas in 1968. They built the multi-store model. Teachers still use this today to see how learners handle facts.

    Computer science grew as behaviourism faded, so researchers saw the brain as an information processor. The brain receives input, changes it, stores it, then outputs it. Alan Newell and Herbert Simon (1970s) showed learners use step-by-step processes to solve problems.

    Knowing history helps teachers choose methods that work well. Chunking content uses cognitive research (Miller, date unspecified). Allan Paivio (1971) showed that pictures boost learning. Dual coding theory helps learners remember facts much better.

    Modern neuroscience supports many predictions and gives teachers useful insights (Cowan, 2014). These insights focus on working memory, attention, and cognitive load (Sweller, 1988; Paas et al., 2003). This evidence can guide teaching practice for every learner.

    Information Processing Theory: How the Brain Stores Memory — slide preview
    ◆ Structural Learning
    Information Processing Theory: How the Brain Stores Memory
    Classroom-readyWhat the theory means in practice

    Information Processing Theory in practice — a classroom-ready briefing you can use this week.

    Something went wrong — please try again.
    ✓ On its way. Download the slides now.

    The Hippocampus, Prefrontal Cortex, and What Brain Science Tells Teachers

    The hippocampus and prefrontal cortex are key brain areas. They show teachers how memory and focus work during learning. These areas show how facts flow in the brain. This links to Information Processing Theory. Knowing this helps learners in class (Baddeley, 2000).

    Scoville and Milner (1957) showed the hippocampus is key for memory with patient H.M. After hippocampal removal, H.M. could not form new long term memories. He retained information briefly, but couldn't transfer it long term. This confirms the hippocampus is vital for new information processing during memory consolidation. Without it, encoding breaks down (Scoville & Milner, 1957).

    The prefrontal cortex (PFC) controls working memory and thinking skills. These skills let learners hold and use facts at the same time (Baddeley, 2000). The PFC links to the central executive from Baddeley and Hitch (1974). Importantly, PFC resources run out when we think too hard. A learner might try to listen, copy notes, and talk in a group all at once. This overloads the PFC.

    Multitasking impedes learner progress in the brain's prefrontal cortex. Give information one way at a time to lessen the load (Sweller, 1988). Pause before the next instruction; remove any distractions. Emotionally stressed learners have less prefrontal cortex activity (van der Kolk, 2014). Therefore, focus on behaviour to aid thinking.

    Historical Development and Key Researchers

    Information Processing Theory grew as psychologists moved past behaviourism. This happened in the mid-twentieth century. It reacted to behaviourism, which missed key mental steps. Researchers wanted to understand the mind better (Miller, 1956; Broadbent, 1958).

    Computer technology helped shape this theory. Psychologists Miller, Atkinson, and Shiffrin (dates not provided) compared the mind to a computer. They proposed the mind processes information in stages. Miller's (1956) paper showed limits to a learner's information processing capacity.

    Atkinson and Shiffrin (1960s) presented the multi-store model. It explains information flow through memory. The model showed why repetition works, giving teachers a scientific base. Miller's limit informed teachers to present times tables in chunks with repetitive practise. This aids knowledge transfer to learners' long-term memory.

    Neuroscience and AI shaped the theory in the 1970s and 1980s. Alan Baddeley showed working memory helps learners use information. Teachers now know why learners struggle with several tasks at once. These findings guide strategies like chunking and retrieval practice.

    Contemporary Applications in Digital Learning

    Modern digital learning can use Information Processing Theory to shape how learners gain, sort, store and recall online facts. The model helps teachers see how learners handle new knowledge and where digital tasks may add unnecessary load (Baddeley, 1986; Cowan, 2010).

    Just as computers have input devices (keyboard, mouse), processing units (CPU), and storage systems (hard drive, RAM), the human mind operates through similar components. Sensory organs act as input devices, collecting information from the environment. The brain processes this data through working memory, much like a computer's RAM handles active tasks. Finally, long-term memory serves as our internal hard drive, storing information for future retrieval.

    This comparison helps teachers spot learner struggles. Learners struggling with instructions likely have working memory overload. Like a frozen computer, brains struggle with too much at once. Teachers can break down complex tasks to help learners, according to researchers (implied).

    Teachers can structure lessons like software. Present clear information first (input). Use guided activities to process it. Review work to embed learning (storage). For example, teach fraction multiplication step by step. Practise each part. Then combine it after learners show understanding.

    Like computers, learners need regular maintenance. Review sessions and organised notes help them retrieve information well (Anderson, 2005). This supports effective learning pathways, much like computer updates.

    Core Assumptions of Information Processing Theory

    Core assumptions of Information Processing Theory are that learning depends on how information is attended to, processed, stored, and retrieved. It gives teachers a basis for lesson planning (Atkinson & Shiffrin, 1968). Lessons should match how the brain processes information (Baddeley, 2000).

    This model, proposed by Atkinson and Shiffrin (1968), suggests learners move through stages. Sensory input goes to short-term, then long-term memory. Teachers should structure lessons to guide learners through each stage. For example, use visuals and sound for new words, repeat them, and link them to prior knowledge.

    Miller's (1956) theory states learners have limited processing capacity. Brains handle only a finite amount of information at once. Overwhelming learners hinders classroom practice (Sweller, 1988). Break complex topics into chunks, like photosynthesis across lessons. Focus on light reactions, dark reactions, and energy flow.

    Learners actively process information; they are not passive (Piaget, 1972). Teachers can use think-pair-share activities, as Lyman (1981) suggested. Learners think alone, discuss with peers, then share ideas with the whole class. This promotes learner engagement.

    Prior knowledge shapes how learners understand new things. Learners use existing knowledge to interpret new information (Ausubel, 1968). Activate prior knowledge with quick reviews or concept maps. This helps learners link new material to existing frameworks (Novak, 1998; Mayer, 2002).

    How Human Memory Works Like Computers

    Human memory works like computers because it receives, processes, stores, and retrieves information through structured stages. We take in data, process it, and respond, like computers (Miller, 1956). We also store information for later retrieval (Atkinson & Shiffrin, 1968; Baddeley, 1986).

    Teachers can use a computer model to understand how learners think. The brain is the hardware. Knowledge is the software (Anderson, 1983). Working memory is like a processor. It has strict limits (Baddeley, 2000). These limits change how learners process new facts (Cowan, 2010).

    Effective teaching means using practical methods. Introduce new maths concepts in small steps (Atkinson & Shiffrin, 1968). Avoid overwhelming learners with complex procedures. Use visual aids like flowcharts to show the logic (Baddeley & Hitch, 1974). This makes abstract ideas more understandable.

    This comparison also helps teachers choose revision methods. Like computers need updates, learners need repeated practise to strengthen memory (Ebbinghaus, 1885). Use spaced repetition activities as "system updates" (Rohrer & Pashler, 2007; Dunlosky et al., 2013). Vary practise tasks to improve recall (Willingham, 2009).

    Bjork (1975) showed that retrieval issues can be like filing errors. The information is still there, but it needs better organisation. When teachers understand brain processing, they can improve learning. This helps them respond to learner difficulties.

    Written by the Structural Learning Research Team

    Reviewed by Paul Main. He is the Founder and Educational Consultant at Structural Learning.

    AI, Cognitive Offloading, and Schema

    AI and other tools can act as an external memory. This helps to lower the load on a learner's mind. These tools organise new knowledge and hold key words or plans. This stops learners from feeling lost when trying to remember facts. This fits well with cognitive load theory. Lowering extra load frees up focus for learning (Sweller, 1988; Risko and Gilbert, 2016).

    The problem comes when support turns into germane load bypass. If the tool supplies the explanation, the structure and the final wording too early, learners can look successful without doing the desirable difficulties that strengthen long-term memory, such as retrieval, selection and self-explanation (Bjork, 1994; Fiorella and Mayer, 2015). Schema are built through this productive effort, not through polished output alone.

    In a Year 8 science lesson on diffusion, a teacher might say, “Ask the chatbot for two hints and one model sentence, then close the tab and explain the process from memory.” The generative AI provides generative scaffolding, but learners still draw the particle diagram, write a short explanation in their own words, and compare it against the model to spot gaps in understanding. Here, the tool is working as a cognitive prosthetic, not a substitute thinker.

    For classroom use, the rule is simple: let AI support planning, examples and feedback after learners have attempted the thinking first. Avoid using it for first-draft answers, hinge questions or retrieval practice, where the effort is the lesson. This aligns with the Department for Education’s view that generative AI should assist teaching while responsibility for learning and assessment remains with teachers and schools (DfE, 2023).

    Digital Distraction and the Sensory Bottleneck

    Digital distraction creates a sensory bottleneck. When screens compete for attention, learners can be overloaded before facts reach working memory. In Information Processing Theory, sights and sounds enter the sensory register first. Only information that passes the attention filter reaches short-term memory, so alerts and screen habits can compete with the teacher's explanation.

    This is why the 2024 DfE guidance on mobile phones matters. The guidance backed headteachers in prohibiting phone use throughout the school day, including breaktimes, and the government noted that by age 12, 97% of children own a mobile phone (DfE, 2024). If learners are managing that stream of possible messages, status updates and social checking, the bottleneck appears before the lesson content has had any real chance to enter short-term memory.

    Linda Stone's phrase continuous partial attention describes this state neatly. Learners are not always fully off-task, but nor are they fully available to learn; part of their attention keeps scanning the edges for what they might miss (Stone, 2009). Research on smartphone presence suggests that even a nearby phone can reduce available cognitive capacity, so the sensory register is already crowded before the first explanation begins (Ward et al., 2017).

    In practice, phone-free routines are an instructional support, not a symbolic gesture. A teacher might say, "Phones in bags, eyes on the board, write one sentence on how information moves from the sensory register to short-term memory," and learners can then track one diagram, hear one explanation and produce a cleaner response in their books. That is the practical value of reducing digital fragmentation: fewer competing cues, a clearer attentional filter, and a better chance that new learning is actually stored.

    Frequently Asked Questions

    Why Information Processing Theory Matters

    Atkinson and Shiffrin (1968) describe Information Processing Theory. It shows how minds process, store, and recall information. Sensory, short-term, and long-term memory stages are key. Educators can use this to tailor teaching for each learner (Baddeley, 1986). This may improve learning (Tulving, 1972).

    Miller's Magic Number 7 in Lesson Planning

    Miller (1956) showed learners hold about seven items in short-term memory. Teachers should break complex ideas into smaller parts. Teach information in short segments. This helps stop learners from feeling overwhelmed.

    Moving Information from Short-Term to Long-Term Memory

    Connecting new information to what learners already know helps long-term memory encoding. In simple terms, encoding means getting information ready to store. Miller (1956) suggests teachers offer chances for rehearsal. Peterson & Peterson (1959) found practise strengthens short-term memory links within 20-30 seconds.

    Attention, Learning, and Classroom Focus

    Attention acts like a filter for information. It moves information from sensory memory into short-term memory for learning (Atkinson & Shiffrin, 1968). Teachers can improve learner focus with engaging lessons and fewer distractions (Johnstone & Ellis, 1980; Pashler, 1998). This helps learners focus on the key information (Chun, 2011).

    Cognitive Overload in Classroom Tasks

    Cognitive overload hurts memory and learner success by flooding working memory. Teachers help learners by splitting tasks, says Sweller (1988). Use clear instructions to ease pressure, suggest Paas and Sweller (2014). Tailor lesson difficulty to match prior knowledge, according to Kirschner, Sweller, and Clark (2006).

    Three Types of Cognitive Load in Lesson Design

    Cognitive Load Theory, proposed by Sweller (1988), has three types. Intrinsic load means task complexity, and extraneous load comes from poor instruction. Germane load involves learning processes.

    Teachers help learners by managing task difficulty and minimising distractions. They also aid connection-making, as Paas et al. (2003) suggested.

    Classroom Examples of Information Processing Theory

    Chunking groups vocabulary into manageable sets of 5-7 words. Use visual and verbal cues together; this supports dual coding. Provide regular breaks to avoid overloading the learner's memory. Schema building helps learners connect new facts to existing knowledge (Anderson, 1977). This makes encoding more effective (Atkinson & Shiffrin, 1968).

    Analyse the Cognitive Load in Your Lessons

    The cognitive load in your lessons refers to the mental demands that classroom activities place on learners' working memory. Use these dimensions to rate your lessons. Get a detailed analysis. We will suggest actions for improved learner outcomes.

    Cognitive Load Analyser

    Rate your lesson against cognitive load theory principles to identify where working memory is being overloaded.

    Question 1 of 8
    1

    How many new concepts are introduced in this lesson?

    One concept (low intrinsic load)Five or more (very high intrinsic load)
    2

    How much prior knowledge do learners need?

    Minimal (new topic)Extensive (builds on many prerequisites)
    3

    How are instructions presented?

    Clear, step-by-step with modellingComplex, multi-step without scaffolding
    4

    Is there split attention in your resources?

    Text and visuals are integratedLearners must look between separate sources
    5

    How many modality channels are used?

    Higher is better: well-balanced verbal and visual channels reduce extraneous load.

    Single channel overloaded (e.g. all text)Well-balanced verbal and visual channels
    6

    Are worked examples provided before independent practice?

    Higher is better: worked examples with gradual fading build germane load.

    No worked examplesFull worked examples with gradual fading
    7

    How much scaffolding is provided?

    Higher is better: well-scaffolded lessons with gradual release build germane load.

    No scaffolding (full independence expected)Well-scaffolded with gradual release
    8

    What type of practice do learners do?

    Open-ended problem-solving from the startStructured practice building to open-ended
    Intrinsic Load
    Inherent complexity of the content (not controllable)
    Extraneous Load
    Unnecessary load from poor design (lower is better)
    Germane Load
    Productive load directed at learning (higher is better)

    Overall Assessment

    Recommendations

    CLT Principles Checklist

    The Computer Metaphor: Mind and Machine

    The computer metaphor explains the mind as a system that receives, processes, stores and retrieves information, and Atkinson and Shiffrin's (1968) multi-store model made that sequence teachable. Its classroom value is practical: learners cannot store what they have not first noticed, connected to meaning and practised retrieving.

    The risk, as Epstein (2016) warned, is literalism. Human memory is reconstructive, emotional and social, not a file copied to a drive. Treat the metaphor as a planning prompt for attention, encoding and retrieval, not as a complete theory of how learners think.

    The comparison is helpful, but it is not exact. A computer stores files in fixed locations, whereas human memory is shaped by attention, prior knowledge, and meaning. Research on working memory, including Miller's early work on limited capacity and Baddeley and Hitch's later model, shows that the system can become overloaded quite quickly. In classroom terms, this means that too much new information at once can interrupt learning before it has a chance to settle.

    One useful strategy is to reduce unnecessary input at the point of teaching. A crowded slide, lengthy verbal explanation, and a complex diagram all competing together can swamp working memory. Teachers can improve processing by giving one instruction at a time, highlighting key vocabulary, and using a simple visual sequence on the board. This helps learners focus on the most important information before moving to the next step.

    A second strategy is to help learners store learning through rehearsal and retrieval. Brief retrieval quizzes at the start of a lesson, spaced review over several weeks, and worked examples that are gradually removed strengthen access to long-term memory.

    For example, a science teacher might revisit evaporation and condensation across a unit. A history teacher might ask learners to recall the causes of an event before adding new content. At department or multi academy trust level, leaders should audit the quality of sequencing and retrieval. The issue is not whether every slide or knowledge organiser uses the same template.

    ◆ Structural Learning
    Information Processing Theory: How the Brain Stores Memory: Quick-Check Quiz
    10-question self-test
    Q1 of 10
    0%

    Critiques and Limitations of the Information Processing Approach

    The information processing approach is useful, but it does not fully explain meaning, motivation, embodiment and relationships. When teachers treat the brain as a literal computer, lesson design can focus too much on input and output routines. This can ignore whether learners feel safe, have relevant prior knowledge, or can attach the idea to a real purpose.

    Craik and Lockhart's levels of processing work showed that the quality of thinking matters, not just time in a memory store. Allen, Baddeley and Hitch (2026) make a similar point in working memory research by describing awareness as an integration space for perception, long-term memory and bodily states. In class, repetition alone is rarely enough; learners remember more when they explain, compare and use ideas.

    Another limitation is that the approach can underplay emotion. Stress, anxiety, and confidence can all affect attention and recall, even when content has been taught clearly. A learner in a timed quiz may appear to have forgotten material when the real issue is pressure blocking retrieval. For teachers, this is a reminder to use low-stakes quizzes, predictable routines, and short thinking time before cold calling, so memory checks reflect what learners know rather than how anxious they feel.

    This model can also miss the social side of learning. Vygotsky (1978) argued that thinking develops through social interaction and language, so memory is not built alone. Class talk, teacher modelling and guided practice help learners form explanations before they recall facts independently.

    Teachers should pair retrieval with planned talk. Ask learners to explain an idea to a partner, use sentence starters, or repair a worked example before writing alone. This keeps the memory model connected to classroom relationships rather than treating talk as an optional extra.

    Finally, the original multi-store model is too simple for many 2026 classrooms. Predictive processing accounts describe the brain as an active prediction system, not a passive receiver of inputs (Friston, 2010; Clark, 2016). This matters because learners do not take in new material in a simple way. They interpret it through expectations, language, emotion and prior knowledge before it reaches a neat memory stage.

    For autistic learners and learners with ADHD, the standard model can sound like a deficit account when working memory or attention looks different. Teachers can make the theory more useful by reducing competing demands, offering visual support, checking task understanding and giving more than one route to show what has been remembered.

    Free Resource Pack

    The free resource pack is a collection of classroom and staffroom materials on working memory, cognitive load and dual coding. Includes printable posters, desk cards, and CPD materials. Use it as a starting point for professional discussion: identify the learner's current need, record evidence from more than one lesson, and agree the next classroom adjustment with the SENCO or family.

References

Aesaert et al. (2014).

Alloway (2009).

Anderson (2000).

Anderson (1983).

Anderson (2005).

Anderson (1977).

Atkinson and Shiffrin (1968).

Ausubel (1968).

Baddeley (2000).

Baddeley (1986).

Baddeley (2003).

Baddeley and Hitch (1974).

Bartlett (1932).

Bruner (1966).

Bruner (1960).

Case (1985).

Chun (2011).

Clark (1983).

Cowan (2014).

Cowan (2010).

DfE (2023).

Dweck (2006).

Ebbinghaus (1885).

Flavell (1979).

Fredrickson (2001).

Magill (2011).

Miller (1956).

Paivio (1971).

Pekrun (2006).

Piaget (1972).

Posner (1980).

Ramus et al. (2003).

Skinner (1953).

Snowling (2000).

Squire (2009).

Sweller (1988).

Treisman (1969).

Tulving (1972).

Tyng et al. (2017).

Willingham (2009).

Further Reading: Key Research Papers

These peer-reviewed studies provide the evidence base for the strategies discussed above.

Information Processing Ability and its Implications for Teaching and Learning

Fourie et al. (2022)

This paper explores how the brain processes information and identifies factors influencing learners' ability in the classroom. Understanding these insights helps teachers adapt their instructional strategies to improve information processing and improve learning outcomes.

Applying Information Processing Theory in Educational Practices: A Systematic Review

Fitriani et al. (2025)

This systematic review examines how Information Processing Theory is applied in educational settings. It offers teachers research-backed strategies for implementing these theoretical principles effectively in their classrooms to enhance student learning.

The Past, Present, and Future of the Cognitive Theory of Multimedia Learning
201 citations

Mayer (2024)

Mayer's work synthesises multimedia learning research, providing 15 evidence-based design principles rooted in information processing theory. Teachers can apply these to create engaging resources, helping learners process and retain information more effectively.

Working Memory Underpins Cognitive Development, Learning, and Education
455 citations

Cowan (2014)

Cowan's synthesis highlights working memory as a critical factor in cognitive development and learning. Teachers can use this understanding to design learning activities that avoid overloading learners' working memory, thereby supporting deeper understanding and knowledge acquisition.

Information Processing and Human Memory

Eggen (2020)

Eggen's chapter outlines the three memory stores and key information processing strategies like chunking and schema activation. Teachers can use these practical techniques to improve learners' attention, perception, and long-term memory retention in the classroom.

Paul Main, Founder of Structural Learning
About the Author
Paul Main
Founder & Metacognition Researcher

Paul Main is an educator and metacognition researcher who founded Structural Learning in 2002. With a psychology degree from the University of Sunderland and 22+ years helping schools embed thinking skills, he bridges the gap between educational research and classroom practice. Fellow of the RSA and Chartered College of Teaching, with 128+ Google Scholar citations.

More →

Metacognition

Back to Blog