Data Catalogue

Variable NPD HESA LEO UCAS
Attainment
Key Stage 1 Yes No Yes No
Key Stage 2 Yes No Yes No
Key Stage 3 Yes No Yes No
Key Stage 4 Yes No Yes No
Key Stage 5 Yes No Yes Yes1
Measures of disadvantage
Free School Meal (FSM) eligibility Yes2 No3 Yes Yes
POLAR Quintile No No Yes Yes
IMD Decile No No Yes Yes
Lower layer Super Output Areas (LSOAs) Yes No Yes No
Income Deprivation Affecting Children Index (IDAC) Yes No Yes No
Student characteristics
Sex Yes4 Yes No Yes
Gender Yes Yes Yes Yes
Disability/SEN status Yes Yes Yes Yes
Age/Date of Birth Yes Yes Yes Yes
Ethnicity Yes Yes Yes Yes
Domicile status No No5 No Yes
Educational establishment type Yes No6 No Yes7
Region No No8 Yes Yes
Home postcode Yes Yes No9 Yes
Parental Education No Yes Yes Yes
Care-experienced Yes Yes Yes Yes
Religion No Yes No Yes
Higher Education Outcomes
Course(s)/Subject studied No Yes Yes Yes
Higher Education Provider attended No Yes Yes10 Yes
Higher Education Provider Tariff Group No No Yes Yes
Mode of study (Full time/Part-time) No Yes Yes Yes
Enrolment to Higher Education No No11 Yes Yes
Continuation (retention) No No12 No No
Completion No No13 Yes No
Type of qualification (pre-entry) No Yes Yes Yes
Type of qualification awarded on exit No Yes Yes No
Application and Offer Behaviour
Predicted Grade No No Yes Yes
Number of UCAS applications No No Yes Yes
Offers Received No No Yes Yes
Acceptance Route (e.g. firm choice, insurance choice, direct clearing, adjustment etc) No Yes Yes Yes
Post Higher Education Outcomes
Employment status No Yes Yes No
Earnings (salary) No Yes Yes No
Industry/Occupation No Yes Yes No
Sustained Employment/Further Study No Yes Yes No
1 UCAS records KS5 attainment only from qualifications that applicants self-declare in their application (e.g., A levels, BTECs, other Level 3 awards). It does not contain verified KS5 results or any official attainment data held in the NPD.
2 The NPD contains numerous FSM indicators. The EverFSM variable provides a broader measure of disadvantage, indicating whether a pupil has ever been FSM-eligible within the last 3, 6, or all-year windows.
3 HESA collects socio-economic classification (SEC), which refers to the socio-economic background of students aged 21 and over at the start of their course, or, for students under 21, the socio-economic background of their parent, step-parent, or guardian.
4 UCAS collected sex (Male/Female) prior to 2015 and gender from 2015 onward. For time-series consistency, values are treated as sex before 2015 and gender thereafter. From 2024, additional options (‘I use another term’, ‘I prefer not to say’) were introduced.
5 Permanent home address (PERMADDPOSTCODE) and Country of permanent address (PERMADDCOUNTRY) can be treated as relevant domicile proxy for domicile status.
6 HESA ccollects 'Previous provider' (PREVIOUSPROVIDER), which distinguishes the broad type of last provider attended (e.g., UK state school, independent school, FE college), and may be used to describe post-16 entry routes. However, it cannot be interpreted as a full indicator of “school type,” as it does not capture pre-16 schooling or detailed school-sector characteristics.
7 UCAS records the type of school or centre most recently attended or used to submit the application. This field is only populated for applicants aged 19 and under; anyone older is recorded as ‘Not Applicable'.
8 HESA collects the country of permanent address, which refers to the country the student lived in for non-educational purposes before starting HE.
9 LEO hosts other geographic identifiers to show where learners live, including local authority (anonymised), census output area, region, country, and parliamentary constituency. Each is linked to the learner and the relevant tax year.
10 LEO does not store provider name, only anonymised identifiers and location (pseudonymised HESA provider ID).
11 HESA contains no single variable that explicitly represents “enrolment” or “entry to higher education.” Instead, entry must be derived using Engagement-level data, particularly the Engagement start date (ENGSTARTDATE), the appearance of a new Engagement number (NUMHUS) for a student (SID), and the Programme year (YEARPRG), which typically equals 1 in the first year of study.
12 HESA contains no single variable representing “continuation”. Instead, continuation can be inferred from using a combination of Engagement number (NUMHUS), Student identifier (SID), Programme year (YEARPRG) and the presence/absence of Engagement end date (ENGENDDATE). [LEO] No single variable can dervice continuation, but continuation can be deriverd from HE_DATELEFT / HE_ENDDATE – end/withdrawal date, HE_RSNEND – reason for ending an instance, HE_NOTACT – indicates suspension, HE_FUNDCOMP – completion status of that academic year.
13 HESA contains no single variable representing 'completion.' Instead, continuation can be inferred using a combination of Qualification awarded result (QUALAWARDRESULT), Reason for engagement ending (RSNENGEND) and Engagement end date (ENGENDDATE).
Back to top