Assessment

PISA, TIMSS and PIRLS: What actually are they and what do they tell us?

According to the latest PISA results, England’s science scores are still on a downward trajectory that started a decade ago. Yet TIMSS, another respected study, has science performances rising. Which of them is right? Is one more valid than the other? In this blog AQi examines three International Large-Scale Assessments and finds that, although they may look the same from a distance, get up close and you’ll find they are very different beasts.

Are all measures equal?

PISA – The Programme for International Student Assessment

PIRLS - The Progress in International Reading Literacy Study

TIMSS – The Trends in International Mathematics and Science Study

What are the differences between PISA, PIRLS and TIMSS?

How comparable are the results from PISA, PIRLS and TIMSS?

What can we conclude about PISA, TIMSS from all of this?

Read More:

Download PDF

Date20/02/24

AuthorAQI team

Are all measures equal?

The PISA (Programme for International Student Assessment) rankings for maths, reading and science are some of the most eagerly anticipated data in the education world. Their publication prompts headlines, applause, criticism and sometimes soul searching.

The same goes for two other well-respected International Large-Scale Assessments - TIMSS (Trends in International Mathematics and Science Study) and PIRLS (Progress in International Reading Literacy Study).

You might think that these three assessments would pick up similar trends in each country where they are used, but that is not always the case.

For example, PISA’s last report showed that English students’ maths had declined, and that a downward trend in science performances, which started in 2012, was continuing.

But analyse TIMSS’ data and you’ll see that maths performances have been on an upward trend for both Year 5 and Year 9 students since 2011.

Not only that, when it comes to science, TIMSS recorded eight years of continual improvement by Year 5 students.

PISA’s findings on reading skills also seemingly conflict with those of PIRLS.

In its 2023 report, PISA recorded a 10point drop for England on its previous study (roughly equivalent to six month’s schooling).

Compare that with PIRLS’ 2023 report which showed England’s literacy score had fallen a statistically insignificant one point, even though the country had been through a challenging lockdown.

So, why do separate studies of the same subjects come up with different findings? Can they be compared?

To answer these questions let’s examine PISA, TIMSS and PIRLS, see how they work and what they measure.

PISA – The Programme for International Student Assessment

PISA has been conducted by the OECD since 2000 and is the most high-profile International Large-Scale Assessment.
It relies on a minimum of 150 schools with 4,500 to 10,000 students in each country/economy. The last survey involved 81 countries and almost 700,000 students.
It evaluates 15-year-olds’ ability to use their reading, maths and science skills and knowledge to solve ‘real life’ problems and is produced every three years.
Students sit computer-based assessments over two hours with a mix of multiple-choice and open-ended questions. They also complete questionnaires about themselves and their school, including behaviour and discipline.
Teachers, headteachers and parents also complete questionnaires about lessons and the wider school/home environment.

PISA's Selling point

Pisa is the biggest global measure of education performance levels.
The breadth of its reach and depth of its testing goes beyond the other international studies.
It measures attainment in a wider range of subjects than PIRLS and TIMSS.
The statistics it produces allow policymakers, educationalists and others to draw comparisons between education systems, examine elements within those systems and track changes over time.

PIRLS - The Progress in International Reading Literacy Study

PIRLS was first carried out in 2001 and is run by the International Association for the Evaluation of Educational Achievement – a co-operative of national research institutions, government agencies, educational researchers and analysts in more than 60 countries.
The IEA (CORR) conducts its study every five years on pupils aged 9-10.
PIRLS 2021 results were based on data from around 400,000 students, 20,000 teachers, 13,000 schools and 57 education systems.
The study, which students can sit either digitally or using pen and paper, uses multiple-choice and open-ended questions to assess the reading attainment of pupils focusing on reading for literary experience and reading to acquire and use information.
PIRLS assesses pupils in two 40-minute sessions. They also complete a questionnaire about their attitudes to learning, while teachers, headteachers and parents submit one about lessons and the wider school/home environment.

PIRLS' Selling point

The assessment is designed in collaboration between PIRLS and national bodies to fit each country’s curriculum. Each assessment is linked to the preceding one which means policymakers can compare their own results over time, make global comparisons and set benchmarks.
Texts used are constantly being reviewed to ensure they reproduce the student’ typical reading experience, are clear and coherent, and culturally relevant to each country.
By assessing literacy skills in younger students, PIRLS identifies potential problems early which allows for targeted interventions and support.
Policymakers can use PIRLS findings to share best practice, shape policy and allocate resources.

TIMSS – The Trends in International Mathematics and Science Study

TIMSS has been going since 1995 and is also run by the International Association for the Evaluation of Educational Achievement.
It is typically conducted every four years and taken by two groups of students - one aged around 10 and another around 14.
The 2019 study involved 580,000 students in 64 countries and 8 benchmarking systems.
The 90-minute assessment is available in digital or paper formats. It is split into two sessions and uses multiple-choice and open-response questions to evaluate maths and science knowledge as well as reasoning and application skills. There are also wider context questionnaires for students, teachers, headteachers and parents.

TIMSS' Selling point

TIMSS designs the assessment in collaboration with national bodies to fit each country’s curriculum. It refines or adapts assessments for each cycle to account for changes to the curriculum or other areas. This means policymakers can compare their own results over time, make global comparisons and set benchmarks.
By assessing two age groups four years apart and in four-year cycles, TIMSS shows how students are progressing. It also measures the effectiveness of interventions and identifies potential problems early in students’ education journey which allows for targeted support or curriculum improvement.

What are the differences between PISA, PIRLS and TIMSS?

They test different things

PIRLS/TIMSS are designed to evaluate students’ command of the curriculum and ability to apply their learning and skills.
PISA on the other hand aims to evaluate the country’s education system and emphasises socio-economic and equity factors. For this reason it is important that a representative selection of students sit the assessment which is not always the case.

These Large-Scale Assessments assess the abilities of different aged pupils

Reading
PIRLS looks at reading for enjoyment and information retrieval and use in Primary pupils.
PISA measures a wider range of literacy skills in secondary students, from basic decoding of text to “knowledge about the world.”

Maths/Science
TIMSS assesses students’ understanding of concepts in maths and science and their application mainly using multiple-choice and open-response questions.
PISA uses interactive computer-based assessment and performance-based tasks as well as traditional multiple-choice and open-response questions. The framework is based on the Real Mathematics Education (RME) philosophy which means that education systems using standards that match RME are more likely to score highly on PISA but not TIMSS.

How comparable are the results from PISA, PIRLS and TIMSS?

Research has shown that TIMSS and PISA provide similar pictures of student achievement at the country level. A recent assessment found close alignment between country mean scores from both studies. That study of the 2015 TIMSS and PISA results found the higher achieving East Asian countries did better in TIMSS maths than their PISA results predicted, while some Nordic and English-speaking countries did a little better in PISA.
We should note here that the TIMSS and PISA scales are different even though both are centred around 500 with a standard deviation of 100. More information on assessment scales can be found on the TIMSS and PISA websites.

What can we conclude about PISA, TIMSS from all of this?

What this shows is that PISA, TIMSS and PIRLS, taken together, can be good indicators of country achievement - on the proviso that the data is robust, and the trend is clear.
Most of the time, however, country-level data is messy and fine details get lost.
These surveys are not granular enough to be of use for policy setting.
Sure, sometimes their findings point in the same direction, but other times they don’t. So how are we supposed to know whether the similarities or differences are real or due to each survey’s own idiosyncrasies?
International assessments set the policy discourse and often pave the way in terms of carrying out surveys at scale, but lack in nuance when there are specific policy issues requiring an answer.
The best way to get those answers is through national assessments tailored to the country and the questions that need to be tackled.
All that said, the vast range of data PISA, TIMSS, PIRLS and others generate can be a boon to researchers in the field.

Bacc again: A policy briefing on Baccalaureate curriculum models
Levers of change: How policymakers can shape the education system
Post exam analysis: How to improve student experience

Computing

Could girls be the secret to boosting the UK’s growth as a technology superpower?

What if women played a more central role in responding to the rapid technological changes in our world? Girls in England outperform boys at every grade level but disproportionately don’t take Computer Science at GCSE.

Assessment

What can this year’s GCSE entries tell us to look for in tomorrow’s results?

With GCSE Results published on 22nd August, Dr Chinwe Njoku looks into the underlying data on what subjects this year's cohort took and how this has changed from previous years.

Assessment

A-level maths students hit six figures – what’s behind its popularity?

On Thursday, more than 100,000 A-level maths students in England will find out their results – 11.4% more than last year. Why the upturn? Dr Chinwe Njoku, AQA Education Insights Lead and former maths teacher, was heartened by the news and keen to look at the story behind the data.

Education Policy

What comes after ‘urgent’ for the new Education Secretary?

After the burning issues are addressed, what should come next for the new Education Secretary?

Education Policy

Labour’s oracy plans: They need clear goals

Sir Keir Starmer has said he wants to boost students’ confidence by raising the importance of speaking skills – oracy. In this previously published blog, Reza Schwitzer, AQA’s director of external affairs, applauds the ambition but warns there needs to be clear goals

Education Policy

Through the looking glass: How polling the public can help policymakers learn about themselves

Public attitude data is key to effective policymaking. Proper polling can reveal what people think about existing policies and what they want for the future. But, if looked at from a different angle, it can also help policymakers question themselves and their assumptions about the public. In this blog, AQA’s Policy and Evidence Manager Adam Steedman-Thake, reveals the lessons he learned about himself while reading a recent public attitude survey.

Assessment

Assessing oracy: Is Comparative Judgement the answer?

Oracy skills are vital to success in school and life. And yet, for many children, opportunities to develop them are missed. Educationalists are engaging in a growing debate about where oracy fits into the school system. Labour has put it at the heart of its plans to improve social mobility and an independent commission is looking at how it is taught in the classroom. This renewed focus on oracy means it is more important than ever that teachers have a way to reliably assess and understand their students’ attainment and progression. Amanda Moorghen of oracy education charity Voice 21 explains how Comparative Judgement can help with that and why it may be a game changer.

Education

TV subtitles as an aid to literacy: What does the research say?

Jack Black is probably best known in educational circles for playing a renegade substitute teacher in School of Rock. But the Hollywood star has made a more conventional foray into education by backing the use of TV subtitles to improve child literacy. Stephen Fry and the World Literacy Foundation also want parents to use their TV remotes to get children reading. So, could this simple click of a button be a solution to boost pupils’ reading skills? AQA’s resident expert on language teaching, Dr Katy Finch, casts her eye over the research to see whether it stacks up.

Data Analysis

What is left behind now education’s Data Wave has receded?

Is data the solution to all education’s issues? About a decade ago the prevailing wisdom said it was. Advocates of this Data Wave argued that harvesting internal statistics would help schools solve issues such as teacher accountability and attainment gaps. As with all waves, after crashing onto the beach they recede, leaving space for another to roll in. In this blog, teacher, author and data analyst Richard Selfridge looks at the legacy of the Data Wave to see what schools can take from it.

International Approaches

Finland & PISA – A fall from grace but still a high performer?

Finland was once recognised as one of the most successful educational systems in the world. At the turn of the millennium, it topped the PISA rankings in reading, maths and science. But by 2012, decline set in. The last set of results showed performances in maths, reading and science were at an all-time low. In this blog Dr Jonathan Doherty of Leeds Trinity University outlines some reasons that may account for the slide.

Download a PDF version.

Download a copy of this content to your device as a PDF file. We generate PDF versions for the convenience of offline reading, but we recommend sharing this link if you'd like to send it to someone else.

PISA, TIMSS and PIRLS: What actually are they and what do they tell us?

Are all measures equal?

PISA – The Programme for International Student Assessment

PISA's Selling point

PIRLS - The Progress in International Reading Literacy Study

PIRLS' Selling point

TIMSS – The Trends in International Mathematics and Science Study

TIMSS' Selling point

What are the differences between PISA, PIRLS and TIMSS?

They test different things

These Large-Scale Assessments assess the abilities of different aged pupils

How comparable are the results from PISA, PIRLS and TIMSS?

What can we conclude about PISA, TIMSS from all of this?

Read More:

Could girls be the secret to boosting the UK’s growth as a technology superpower?

What can this year’s GCSE entries tell us to look for in tomorrow’s results?

A-level maths students hit six figures – what’s behind its popularity?

What comes after ‘urgent’ for the new Education Secretary?

Labour’s oracy plans: They need clear goals

Through the looking glass: How polling the public can help policymakers learn about themselves

Assessing oracy: Is Comparative Judgement the answer?

TV subtitles as an aid to literacy: What does the research say?

What is left behind now education’s Data Wave has receded?

Finland & PISA – A fall from grace but still a high performer?

Join the conversation on Twitter

Download a PDF version.

Sign up to our newsletter