ICER 2026
Tue 11 - Fri 14 August 2026 Uppsala, Sweden

This program is tentative and subject to change.

Wed 12 Aug 2026 11:20 - 11:45 at Main conference room - Measures

Background and Context: Code complexity measures have been used to guide the design of various activities within computing education, such as instructional sequencing and assessment. However, empirical evidence for the link of these measures to actual cognitive difficulties remains mixed, with studies suffering from small sample sizes and non-controlled experimental design.

Objectives: We sought to investigate how code complexity measures predict the cognitive load of university students when tracing code and whether their predictive power is moderated by computer science achievement. We also compared how these measures stacked up against the comparative judgment from an 18-member expert panel.

Methods: We conducted a preregistered laboratory study to investigate the strength of code complexity measures identified in a recent neuroimaging study as predictors of cognitive load. In this controlled study, N=551 university students traced a random selection of 24 expert-curated code snippets in Java, Python and C++, and then reported their cognitive load using two validated measures of cognitive load. We assessed preregistered hierarchical regression models with respect to the predictive strength of the code complexity measures and possible moderation.

Findings: Contrary to the findings from the previous neuroimaging study, we could not confirm data-flow complexity to be the strongest predictor of measured cognitive load; instead, the simple source lines of code measure dominated all other static measure. A recent, more sophisticated measure also fared poorly, while experts ratings were strongly predictive of cognitive load.

Implications: In the educational context studied, measuring source lines of code is a simple and effective heuristic for ordering tracing tasks by difficulty and outperforms more sophisticated efforts involving data and control flow. The unexpected finding that easy-to-obtain rankings based on pairwise-comparison sessions involving experts have a much stronger predictive power than static metrics opens up avenues for follow-up research.

This program is tentative and subject to change.

Wed 12 Aug

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

11:20 - 12:35
11:20
25m
Talk
How (and How Not) Do Code Complexity Measures Predict Cognitive Load?
Research Papers
Sverrir Thorgeirsson ETH Zurich, Jan Vahrenhold University of Münster
11:45
25m
Talk
Measuring Student Self-Efficacy for Programming with Generative AI
Research Papers
James Prather Abilene Christian University, Lauren Margulieux Georgia State University, Yekaterina Kharitonova University of California, Santa Barbara, Yonggao Yang Texas A&M University, Prairie View, Brent Reeves Abilene Christian University, Paul Denny The University of Auckland, Jamie Gorson Benario Google, Ernest D.V. Holmes Google, Erin Spaulding Google, Gweneth Barbre Abilene Christian University, Musa Blake Abilene Christian University, Juho Leinonen Aalto University
12:10
25m
Talk
The Use of Computational Thinking Skills, Difficulties, and Strategies of Introductory Programming Students Solving Bebras Tasks
Research Papers
Enrico Benedetti Utrecht University, Isaac Alpizar-Chacon Utrecht University, Johan Jeuring Utrecht University