
The CSO Data Protocol, which sets out how the CSO manages the combining of CSO and non-CSO data, came into effect in May 2005. The Protocol covers any work undertaken within the CSO to match the individual records contained in two or more data holdings, at least one of which originates outside the Office.

It also covers any assistance the CSO may give to other public authorities to enable them to link data holdings under their control for statistical purposes.

The tables below detail CSO Divisions engaged in data matching, datasets matched and outputs obtained.

Queries may be e-mailed to Dataoffice@cso.ie.

Register of Data Matching Activities

CSO Division: Administrative Data Governance and Analysis

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

CSO: Mortality Data; Census 2016 data

DEASP: CRS Data

To produce an updated version of the Mortality Differentials in Ireland release using the 2016 Mortality and Census data

Every 5 years in line with Census

Tabular

Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables

Pseudonymised Income Tax Form 11, Business Details Data (Revenue)

To examine the possibility of compiling a dataset from which the average weekly wage for full-time equivalent farm employees can be calculated for the Agricultural Accounts.

Ongoing

The output will be in tabular format.

Pseudonymised Person Income Register Data (PIR), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35)

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Grant Application and Payment Data (SUSI)

This project investigates the possibility of utilising data obtained from administrative sources in order to identify links between individuals and create an experimental household register.

Annual

Outputs will be varied, ranging from an experimental household composition register to a quality report which compares the distribution of households from the register with the 2016 Census results.

Address Matching Tool Sets using GeoDirectory (GeoDirAMToolSets)

Directory of Irish Property Addresses, including Eircodes (GeoDir), Registered Deaths Data (GRO)

The purpose of the project is to understand if there are variations in mortality in the Mid-West Region.

One-Off

The researchers are analysing the mortality data to see if there are any variations in mortality in the Mid-West. They intend to produce a report.

Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis)

Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare)

To check the accuracy of the Administrative Data Centre's (ADC) geocoding process and to identify issues that may need improvement. This is to be done by matching persons in both datasets to see if their place of residence appears in the same geographical areas, i.e. small areas (SA), electoral divisions (ED), etc.

One-Off

Report or paper in aggregated tabular form.

Census of Population 2016 Data

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE)

To produce a socio-economic analysis of the COVID-19 pandemic and COVID-19 mortality differentials using the Census of Population 2016, the anonymised Central Record System datasets and data sources of the Health Service Executive (HSE) which have been supplied to the CSO to support analysis of COVID-19 related issues.

One-Off

Tabular output.

Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

This is an update of a previous project to include PMODAnalysis, as SPP35 is no longer being updated. The aim is to build a register from already pseudonymised data sources in order to allow the creation of a single source income register.

Ongoing

Creates the dataset called PIR, which is used as an input into various statistical outputs.

Business Register Data, Earnings Hours and Employment Costs Survey Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue)

The PMOD Analysis Group aims to utilise Revenue's PAYE Modernisation data along with other administrative and survey datasets to develop a standardised approach to the analysis of PMOD linked data and to produce a range of new, timely and informative outputs for the CSO. The output envisaged will be a continuation of the following: https://www.cso.ie/en/releasesandpublications/fp/fp-c19issse/impactofcovid-19incomesupportsonemployees

Ongoing

Population Pyramid
Cohort analysis: An employment and earnings analysis of joiners, leavers and stayers.
Monthly trends in earnings and employment
Aggregated statistics presented in tabular form by various economic and demographic characteristics including Economic Sector, Size class, Income Distribution, Gender, Age group, Region.
https://www.cso.ie/en/releasesandpublications/fp/fp-c19issse/impactofcovid-19incomesupportsonemployeesq42020-insightsfromrealtimeadministrativesourcesseries2/

Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data

Building Energy Rating details for domestic premises (SEAI), Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Stamp Duty on Property Transactions Data (Revenue), Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

This is an update of a previous project to include an additional data source and update the project purpose. The sole purpose of the project is to create a publication which examines the types of cohort who purchase properties in Ireland. CCRAnalysis is included to investigate the potential of including buyers with and without a mortgage.

One-off

Frontier Publication, PxStat files and also potentially a value added dataflow on the ADC portal or a new analysis tier data flow which has been created by pseudonymising eStamping.

None

Pseudonymised DSP Extract About Ukrainian Refugees (Welfare), Retail PostOffice List (An Post)

The Analysis version of the Payments data set of the DSP extract concerning Ukrainian Refugees (UKR_DSP_Extract) will be enriched with Post Office geo-location by linking the data set with the list of Post Offices provided by An Post (Retail PostOffice List (An Post)). This will enable the geo-location of DSP payments to act as a proxy for the distribution of Ukrainian refugees around the State. This will also aid the policy response to the refugee influx.

Ongoing

The ADC is not creating any Statistical Outputs. The ADC will be performing the matching on behalf of staff from the SSCU and Methodology sections of the CSO and will store the result sets in the Analysis tier of the ADC repository.

The statistical outputs expected from the matching are:
- Tabular output of head counts aggregated to geographic area.
- Highly aggregated summary geographic statistics will be published to GeoHive via CSO AGOL platform.

None

Building Energy Rating details for domestic premises (SEAI), Central Bank Central Credit Register on loans data (CBI), Central Record System - Client, Payment and Employment Details (Welfare), COVAX Vaccination Data (HSE), Directory of Irish Property Addresses, including Eircodes (GeoDir), ESB Networks electricity consumption and customer data (ESBNetwk), Grant Application and Payment Data (SUSI), Higher Education Student and Course Details (HEA), Housing Assistance Payment (LCouncil), Landlord and Tenant Details from the Register of Tenancies (RTB), Local Property Tax Returns (Revenue), National Vehicle and Driver File, Driver Details (DTTAS), Post Primary Pupil Details (DES), PPSN and Personal Details Data (Revenue), Primary Care Reimbursement Service Data (HSE), Primary Pupil Details (DES), Registered Births Data (GRO), Registered Deaths Data (GRO), Registered Marriages Data (GRO), Stamp Duty on Property Transactions Data (Revenue)

To create geospatial reference data to be added to admin datasets.

Ongoing

This product will enable the compilation of small area statistics from admin data including Census-like population estimates. Pseudonymised geospatial info will be generally available within the CSO for appropriate statistical projects.

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare)

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare)

ANNUAL

Electronic publication (text, tabular, graphic) PxStat tables

 

 


 

CSO Division: Agriculture, Transport and Tourism

CSO Dataset Matched

 

Non-CSO Dataset Matched

Reason

 

Frequency

 

Statistical Outputs Obtained

Agriculture Register

Farm database from Department of Agriculture and Food

Update CSO Agriculture register

Annual

Details of farm 'births'

Census of Agriculture 2010

Survey of Agricultural Production Methods 2010

Farm Structure Survey 2013

Annual June Crops & Livestock Survey

Animal Identification & Movement database for cattle & the Single Payment Scheme for crops

December Sheep & Goat Census (DAFM)

To enable CSO to fulfil requirements for Agriculture data under Regulation 2018/1091, Regulation 1166/2008 and Regulation 543/2009.

Annual

Census of Agriculture; 

Annual June Crops & Livestock Results;

Farm Structure Survey  Results 

Annual June Crops & Livestock Survey

CRS Client, ITForm11Per_Analysis, AgriSingleFarm, ITForm11Bus_Analysis, SPP35

Match the Annual June Crops & Livestock Survey 2016 returns with CRS Client data to check the age, marital status and gender of the farm holder in these returns. This happens every 3/4 years during FSS or CoA processing.

Ongoing

Farm Structure Survey dataset

Vehicle Licensing

National Vehicle and Driver File, Driver Details (DTTAS)

The aim of the project is to calculate the total vehicle-kilometres from odometer readings, which is a business requirement for the transport section.

ANNUAL

The following PxStat tables will be published:
- THA10 Road traffic volumes by type of vehicle and year
- THA17 Road traffic volumes by fuel type, county, year of registration and type
- THA18 Road traffic volumes of cars by type of ownership, engine capacity, fuel type, country and year of registration
- THA19 Road traffic volumes of goods vehicles
- THA20 Road traffic volumes of small public service vehicles


CSO Division: Applications

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Business Register Data 

Address Matching Tool Sets using GeoDirectory (CSO) 

To geocode the geographical locations of Local Units within a Business Register structure or NACE classification.

Ongoing

Visualisation of geographical units within the Central Business Register.
Business Statistics statistical outputs using this matched data have not been fully defined.


CSO Division: Balance of Payments

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Balance of Payments Data

Section 110 Revenue Data

To match BOP data to ADL Section 110 company data in order to identify SPEs

Once Off

Tabular

Balance of Payments Data, BOPSMS

CRO Accounts Details Data (DandB), Annual B1 Company Returns Data (CRO)

The project proposes to link CRO accounts and ownership data to Balance of Payments data and register information for the purposes of validating respondent data, monitoring survey coverage and informing survey recruitment.

Ongoing

The project will facilitate the production of spreadsheets detailing like-for-like comparisons between BOP data and CRO accounts and ownership data.

Balance of International Payments, Structural Business Statistics - Industrial, Structural Business Statistics - Services, Business Register data

Annual Business Survey of Economic Impact (DETE)

Derivation of grossing factors of both Balance of Payments (BOP) profits and services exports/imports using ABSEI, CIP and ASI data. In Balance of Payments we collect detailed information on all BOP-relevant enterprises. Profits and trade in services of the relevant manufacturing and non-financial service companies not covered by the BOP surveys are currently estimated from Census of Industrial Production and Annual Services Inquiry returns.

ANNUAL

Grossing factors of both Balance of Payments (BOP) profits and services exports/imports using ABSEI, CIP and ASI data.

Structural Business Statistics - Industrial, Trade Enterprise Characteristics (TEC) Data, Business Register data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables

Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

To analyse the effect of the Brexit referendum and Brexit itself on the Irish economy. This includes the geographical source and destination of goods exports and imports, and the effect on the firms that were most exposed to the UK market.

ONE-OFF

A frontier publication on the effect of Brexit on the Irish economy. Aggregate data in charts and tables. Regression results presented in tabular form with coefficients, standard errors, P-values and model metrics.


CSO Division: Business Statistics, Business Register & Purchasing Power Parities

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Business Register

CRO data primarily relating to company ownership and company accounts

Enhance the usefulness of the CRO data by classifying the records by economic activity

Quarterly

Improved economic statistics

Business Register

 

Revenue - VAT; PREM (employer registrations); Income Tax; Corporation Tax; P35 files

Update CSO register

Quarterly and annual

Improved CSO business register

Business Register

 

Companies Registration Office registration file

Improve the quality of the CSO business register

Monthly

Improved CSO business register

Business Register

CRO file containing most recent Annual Return

Help fulfill European requirements and also help with sampling

Continuous

 

Improved quality business register as a basis for statistical surveys, etc.

CSO Business Register

GEO Directory

Improve location of Enterprise

Continuous

Improved quality business register as a basis for statistical surveys, etc.

Business Register

EuroGroups Register

Contribute to the setup and maintenance of the EuroGroups Register as required under EU law.

Continuous

Improved quality of statistical outputs that are affected by multinational groups, e.g. FD statistics, Outward FATS, Inward FATS.

CSO data for BERD, CIS, CIP, ASI and Business Register

The Dept of Jobs, Enterprise and Innovation's Annual Business Survey of Economic Impact (ABSEI)

The purpose of this Data Matching Project is to ascertain from Dept of Jobs, Enterprise and Innovation (DJEI) (formerly Forfás), using data from their Annual Business Survey of Economic Impact (ABSEI), a list of the likely performers of R&D in Ireland. The data matching will be done by CSO in line with the Memorandum of Understanding in place (under the Statistics Act, 1993) between the CSO and DJEI, and the results of the matching will be sent to DJEI.

Ongoing

An anonymised matched file of likely R&D active firms in Ireland.

Business Register Data 

Pseudonymised Integrated Short Term Payment System Data (Welfare), Integrated Short Term Payment System Data (Welfare), Vat Information and Exchange System Acquisitions Data (Revenue), Vat Information and Exchange System Dispatches Data (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue)

To identify signs of activity in the Irish business economy during the COVID-19 period of restrictions on trade and subsequent re-opening.

One-Off

Tabular output, presenting aggregated statistics by various economic and demographic characteristics including economic sector, size class, region.

All outputs will have been checked in line with standard CSO practices regarding confidentiality.

None

Corporation Tax Data (Revenue), CRO Accounts Details Data (DandB)

The purpose of this Data Matching Project is to compare parsed iXBRL company accounts data to statistical returns of the Annual Services Inquiry to determine if it can further enhance current data holdings.

Ongoing

An anonymised matched file of services firms in Ireland.

Structural Business Surveys

Pseudonymised Income Tax Form 11, Business Details Data (Revenue)

The purpose of this Data Matching Project is to compare Form 11 data to statistical returns of the Structural Business Surveys (SBS: ASI, CIP and BCI) to determine if it can further enhance current data holdings.

Ongoing

Improvements in SBS tabular estimation processes

Business Register Data

VAT Registrations Data (Revenue)

Match data from the Business Register with the VAT Register to obtain valid email addresses.

Annual 

The output is purely concerned with the replacement and updating of email addresses to conduct surveys. This singular piece of data will be available within the CSO Data Management System and only available to those with clearance to operate the survey.

None

Pseudonymised VAT Trader Returns (VAT3 and RTD), Data Combined (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue)

Business Statistics DCU is to develop the use of administrative data sources for compliance with the EBS regulation and particularly for the provision of Eurostat SBS early estimates. This project will feed into the development of the SMS, improve the quality of our statistics, reduce the burden on our respondents and possibly generate additional outputs.

Ongoing

The output is concerned with the consolidation of new data sources to the greatest extent possible for additional compliance with the EBS regulation, and the development of a process flow to implement admin data sources for SBS early estimates. The data will be housed securely within the CSO Data Management System and only available to those with clearance to operate the survey.


 

 CSO Division: Census Management

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

None

Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB)

This project uses remote sensing techniques and data to aid census enumeration. A model has been developed which analyses high resolution aerial imagery and returns the precise location of all objects the model believes are buildings in the image it consumes. The intention of this DMP is to verify if this model can be used to confirm that secondary residential units exist by comparing its results to the locations of known tenancies. The result will be a statistical measure of the model's accuracy.

One-Off

By comparing the coordinates returned by the deep-learning model with the locations of possible secondary dwellings on RTB and LPT, it will be possible to count the number of coincident pairs. This can be taken (somewhat) as a statistical measure of accuracy and is the only anticipated output.

Census of Population Data

Income Tax Form 11 Data (Revenue), PPSN and Personal Details Data (Revenue), Building Energy Rating details for domestic premises (SEAI), QQI Course and Award Details Data (QQI), Child Benefit Data (Welfare), Central Record System - Client, Payment and Employment Details (Welfare), Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB), SOLAS PLSS Client and Course Details (SOLAS), Long and Short Term Social Welfare Payments Data (Welfare), Water Consumption Details for Residential Properties (IrishWat), Central Record System - Client Details (Welfare), Higher Education Student and Course Details (HEA), PAYE Real Time Data (Revenue)

The purpose of the work is to improve the quality of the Census 2022 dataset by linking and imputing records from administrative datasets to reduce household, person and item non-response. Additionally, creating this link will allow the CSO to publish cross-sectional publications such as the Geographical Profiles of Income in Ireland, Offenders 2016 and Tenure and Households in Ireland releases.

One-Off

A series of thematic publications using census data with accompanying interactive tables and maps covering topics such as population, housing, families, employment and education. This project will improve the quality of these reports by reducing non-response.
It will also allow the publication of cross-sectional publications including the Geographical Profiles of Income in Ireland, Offenders 2016 and Tenure and Households in Ireland releases.

Census of Population Results

Post Primary Pupil Details (DES), Primary Pupil Details (DES)

Improve the quality of census commuting data using place of school data for students.

ONE-OFF

The data will be used in the Place of Work, School, College - Census of Anonymised Records (POWSCAR) research microdata file. This file is used for the analysis of commuting patterns by authorized researchers. This file is scheduled to be made available on 19/10/2023. Census 2022 Profile 7 - Employment, Occupations and Commuting will be published on 30/11/2023. This release will contain charts, maps and tabular data of commuting patterns and related variables such as means of travel.


 

CSO Division: Ecosystems Accounts

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Business Register data

VAT Registrations Data (Revenue)

The CSO Ecosystems Accounts Division will conduct the Waste Generation Survey of enterprises. This survey is carried out on a bi-annual basis to fulfil waste generation reporting requirements for Eurostat, in accordance with the Regulation on waste statistics (EC) No. 2150/2002, amended by Commission Regulation (EU) No. 849/2010. A link between the Business Register and the VAT Register is required to provide email addresses for enterprises in the sample, to which the survey is sent.

ONGOING

Statistical outputs include aggregated data in tabular form which is delivered to Eurostat on a bi-annual basis.

Business Register data

EPA Pollution Release and Transfer Register (EPA)

The CSO Ecosystem Accounts Division conducts the Waste Generation Survey of enterprises. This survey is carried out on a bi-annual basis to fulfil waste generation reporting requirements to Eurostat, in accordance with the Regulation on waste statistics (EC) No. 2150/2002, amended by Commission Regulation (EU) No. 849/2010. The Environmental Protection Agency (EPA) Pollution Release and Transfer Register is matched to the CSO Business Register to exclude EPA facilities from the survey sample.

ONGOING

Statistical outputs include aggregated data in tabular form which is delivered to Eurostat on a bi-annual basis.


CSO Division:  Environment & Climate

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Business Energy Use Survey Data, Census of Industrial Production Data, Annual Services Inquiry Data

 

 

Emissions Trading Scheme Data (EPA), Large Industry Energy Network Data (SEAI), Administrative Business Files for CSO Business Register (Revenue)

To manage the survey response burden by using administrative data where possible.

Annual

Annual Statistical Release

CSO Business Register

Irish Water non-domestic database

The purpose is to obtain data on water consumption by NACE sector to meet Eurostat and other requirements on water statistics e.g. Inland Waters questionnaire and Water Framework Directive.

Ongoing

The output will be in tabular format.

CSO Business Register and CSO Trade Register

EPA Pollution Release and Transfer Register.

 

Dublin City Council National Trans Frontier Shipment Office.

Matched for Waste Statistics in the Environmental Statistics Division

Ongoing

Linkage created between EPA PRTR register and CSO Business Register.

NTFSO matched to CSO external trade statistics register.

Survey on Income and Living Conditions Data, Household Budget Survey Data, Census of Population 2011 Data, Census of Population 2016 Data

Better Energy Warmer Homes Data (SEAI), Electric Meter Data (ESB), Air Quality Data (EPA), Building Energy Rating Details (SEAI), Long and Short Term Social Welfare Payments Data (Welfare), Gas Usage Details for Residential and Commercial Customers (GasNetwk)

To analyse the factors leading to energy poverty; the impact of the environment on health; and related issues.

Ongoing

The Statistical outputs will be a statistical release with an analysis of factors leading to energy poverty, the impact of the environment on health and related issues.

Census of Population 2011 Data, Census of Population 2016 Data, Census 2011 Housing Data, Census 2016 Housing Data

Building Energy Rating Details (SEAI)

The objective of this data matching project is to facilitate research to assess the extent of residential solid fuel use in Ireland and identify the factors that determine households' use of solid fuels.

One-Off

An anonymised RMF will be produced. Access to the RMF has already been requested by University College Cork, for research into residential solid fuel use. Proposed outputs include reports, policy briefs and academic papers.

Survey on Income and Living Conditions Data

ESB MPRN, Building Energy Rating details for domestic premises (SEAI)

The objective of the data matching project is to facilitate research into the causes of energy poverty. Traditionally households were classified as in energy poverty based on income and heating and other fuel costs. More recently, the energy efficiency of the household has been recognised as an important determinant. The data matching project would combine the most important variables for energy poverty research from the CSO Survey on Income and Living Conditions and SEAI Building Energy Rating audits.

ANNUAL

CSO is a member of an energy poverty research group along with the Department of Environment, ESRI, SEAI, Social Protection, CRU, and Gas Networks Ireland.
The RMF file will allow researchers to analyse energy poverty in a broader context by incorporating the energy efficiency of the dwelling into the analysis.

Business Register data

VAT Registrations Data (Revenue)

The CSO Environment and Climate Division conducts several surveys including the Environmental Expenditure Survey, Roundwood Removals Survey, Wood Inputs Survey and the Green Economy Survey. The purpose of the surveys is to compile data to fulfil EU regulations and questionnaires regarding various environmental statistics. This data matching project allows the Division to obtain updated email addresses for enterprises, which are then used for survey post outs.

ONGOING

Expected final outputs from surveys include:
1. Reporting of aggregated data in tabular form as required by Eurostat
2. CSO statistical releases containing aggregated data in tabular form


CSO Division: Government Accounts Compilation & Outputs

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

None

List of Inspected Nursing Homes and Bed Numbers (HIQA), Income Tax Form 11 Data (Revenue),  Corporation Tax Data (Revenue)

Matching HIQA list of inspected nursing homes and bed numbers to Revenue Data Files (CT File & IT form 11 data file) to estimate average cost of nursing home beds.

Annual

Tabular Output 


 

CSO Division: Growing Up In Ireland

 

CSO Dataset Matched

Non-CSO Dataset Matched 

Reason

Frequency

Statistical Outputs Obtained

Growing up in Ireland, Business Register data 

Directory of Irish Property Addresses, including Eircodes (GeoDir), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA)

The purpose of the proposed data matching is to augment the GUI Cohort 98 dataset with administrative data relating to education and employment.

ONGOING

Tabular data in statistical releases.

Growing up in Ireland - '08 Cohort, Growing up in Ireland - '24 Cohort, Growing up in Ireland - '98 Cohort 

Water Quality by water supply zone (TCD), Water supply zones shape files (Irish Water)

In agreement with researchers from TCD, the Growing up in Ireland division is to facilitate a research project linking GUI's longitudinal data on the residences of survey participants with the quality of water they would have had access to as children. Of special interest is whether the water participants consumed was fluoridated or not, and tracing the effects of this on participants' health through all cohorts and waves of GUI data collection, including the new birth cohort (Cohort 24).

ONGOING

Eventual outputs from this project will include a series of RMFs, one per wave and cohort, with relevant data from the three data sources listed together. In future GUI data releases, data on water supply zone will be added to the AMFs and RMFs. TCD may publish research from this project, which the CSO will monitor and support. This will be the first project for Growing Up in Ireland in CSO to use geospatial data and geographical information systems.

Growing up in Ireland - '98 Cohort, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Activity Register Data, Pseudonymised Person Income Register Data 

COVAX Vaccination Data (HSE), Directory of Irish Property Addresses, including Eircodes (GeoDir), GRO_Deaths (GRO) Registered Deaths Data (GRO), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Live Register Claims Data from DEASP Integrated Short Term System (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS Apprentice Data (SOLAS), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA)

UPDATE TO DMPs 11018 and 11265, merging all existing DMPs into one data matching protocol. The purpose of all the proposed data matching is to augment the GUI dataset with administrative data to lessen the respondent burden and enrich the dataset.

ONGOING

Tabular data in statistical releases. The data may also be included in a Research Microdata File (RMF), available to approved users subject to the CSO RMF Policy.

Growing up in Ireland - '08 Cohort

Central Record System - Client, Payment and Employment Details (Welfare)

The purpose of the proposed project is to apply a CSOPPSN to all GUI Cohort ‘08 survey participants (parents, young person and twins/triplets of young person) for both the pilot and the main sample to allow for future linkage to administrative datasets.

ONE-OFF

This data matching project will generate a CSOPPSN for each individual in the cohort. This will allow for the matching of the GUI cohort '08 sample to administrative data sets to supplement survey data in GUI analysis in line with the CSO Data Protocol.


 

 CSO Division: Income, Consumption and Wealth

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

SILC dataset

SUSI data

 

To assess whether administrative data can be used to replace variables on the Survey on Income and Living Conditions (SILC) to reduce the burden on respondents, particularly with respect to education grants.

Initially once off, maybe annually depending on the results.

Tabular, diagrams and written comment

HFCS Household Finance and Consumption Survey

AgriSingleFarm – Pseudonymised Single Farm Payment Data
Agri - Basic Payment Scheme Area file
DAFM - Sheep and Goat Census
AIMS Analysis – Pseudonymised Animal Identification and Movement Data
BER Analysis – Pseudonymised Building Energy Rating Details
CensusAnalysis – Pseudonymised Census of Population 2016 with Geodirectory and DEASP Variables
CRS_Client – Pseudonymised Central Record System – Client Details
DSPpayments – Pseudonymised Long and Short Term Social Welfare Payments Data
RTBAnalysis – Pseudonymised Landlord and Tenant Details from the Register of Tenancies
SUSIAnalysis – Pseudonymised Grant Application and Payment Data
Revenue’s P35L: “SPP35 – P35L dataset for analysis”
Revenue’s Form 11: “ITForm11Per_Analysis - Income Tax Form11 Person Analysis files”

Verification of data in the Household Finance and Consumption Survey (HFCS), possible imputation of missing values.

We will also assess whether administrative data can be utilised to replace some survey questions and thus lessen the burden on respondents.

It is anticipated that the HFCS will be produced every 3 years from 2020.

Tabular, diagrams, written comment.

Survey on Income and Living Conditions Data

Pseudonymised Housing Assistance Payment - Analysis Tier (HAP)

To assess whether administrative HAP data can be used to replace variables on the Survey on Income and Living Conditions (SILC) to reduce the burden on respondents, and for data validation. Annual Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements.
Queries requested will be provided within CSO guidelines for confidentiality.

HFCS Household Finance and Consumption Survey

Housing Assistance Payment (HAP)

To verify data provided by respondents in the Household Finance and Consumption Survey (HFCS) and to match data in cases of non-response. Ongoing The data will provide the monthly rent paid by a HAP household and also the amount paid on their behalf to the landlord. These are core variables in the HFCS used to calculate expenditure and social transfers. 
Survey on Income and Living Conditions Data  Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) PMOD will replace P35 administrative data for employee income from SILC 2019.  This matching project is for the use of PMOD income data in SILC processing, reducing the burden on survey respondents and increasing the accuracy of SILC data. Annual Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements.
Queries requested will be provided within CSO guidelines for confidentiality.
Household Finance and Consumption Survey Data, Survey on Income and Living Conditions Data, Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables  Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised DEASP Covid 19 Illness Claims (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue)

Update of previous project to include EWSS and COVID-19 Illness Benefits data.

Analyse the effect COVID-19 has had on the financial viability of Irish households and assess the impact income support schemes (TWSS, EWSS & PUP) have had in supporting households.

Ongoing

Impact of COVID-19 on financial viability of households including Debt Sustainability Rates, Income to Loan Ratios, Negative Equity Rates.
Aggregated statistics presented in tabular form by various economic and demographic characteristics including Economic Sector, Size class, Income Distribution, Gender, Age group, Region.
Regression results presented in tabular form with coefficients, standard errors, P-values and model metrics.

Household Finance and Consumption Survey Data Pseudonymised Central Bank Central Credit Register on loans data (CBI)

Data from the CCR is matched to respondents of the Household Finance and Consumption Survey (HFCS) in order to accurately estimate debt at the household level in Ireland. In order to populate some debt variables of the HFCS, pseudonymised CCR data will be matched to HFCS respondents using CSOPPSN as a linking variable.

Ongoing

The data will fill core variables of the HFCS including data on mortgages, personal loans, credit cards and overdrafts

Household Finance and Consumption Survey Data Central Bank Central Credit Register on loans data (CBI) Data from the CCR is matched to respondents of the Household Finance and Consumption Survey (HFCS) in order to accurately estimate debt at the household level in Ireland. For these cases only, personal data of the individuals from the HFCS sample frame is matched to data on the CCR source tier. One-off The data will confirm the identity of certain HFCS respondents in the CCR in order to fill core variables of the HFCS including data on mortgages, personal loans, credit cards and overdrafts.
Survey on Income and Living Conditions Data Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue)

The primary purpose of this project is to allow SILC RAP to perform final edit checks on income variables in the SILC survey, as well as coherence checks.

Annual

No outputs are directly expected from this project. If errors or discrepancies are spotted in the final income data through this matching, they will be referred back to the SILC DCU team for correction. The second objective centers around exploring potential uses of the administrative datasets, but any updates to be incorporated into the actual processing of SILC data will be done by SILC DCU and covered by a separate matching request.

Household Budget Survey Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil)

To obtain rent details for HBS respondents who are renting their homes through the HAP system.
The cost to the householder of renting their home and the financial value of the benefit to the householder of being on the HAP scheme will also be obtained. These feed into data on housing costs and housing benefits in the overall HBS results.

Annual

The outputs will be:
(1) the amount of rent paid by HAP tenants who responded to HBS and
(2) the value of the benefit to these HAP tenants of being on the HAP scheme.
This data will not be used alone, it will be included in the HBS results as a whole.

Household Budget Survey Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB)

The matching exercise will be done for 2 different reasons.
1. To obtain details of rent paid by HBS respondents who are renting their homes.
2. To obtain details of rent received by HBS respondents who are landlords.

Annual

The outputs will be:
1. The amount of rent paid by HBS respondents who are renting their dwellings.
2. The amount of rent received by HBS landlords who are letting dwellings.

Household Budget Survey

Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

 

The purpose of the matching exercise is to obtain income and deductions made at source i.e. tax, USC, PRSI, pension contributions etc. for HBS respondents who are in the PAYE system. Annual Gross income and deductions from income made at source (e.g. tax, PRSI, USC, pension contributions, etc.) for HBS respondents who are paid through the PAYE system.
Household Budget Survey Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised Corporate Customer System Data (DAFM) The purpose of this work is to link respondents to the HBS with their Basic Farm Payments. The Basic Farm Payments are used in the calculation of farm income. Basic Farm Payments are also known as "Single Farm Payments". Annual We expect to obtain the Basic Farm Payment component of income for farm households included in the HBS
Household Budget Survey Pseudonymised Income Tax Form 11, Person Details Data (Revenue) The IT Form 11 data will be used to provide details of income from self employment for HBS respondents Annual Income from self employment for respondents to the HBS
Household Budget Survey Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) To obtain details of social welfare income for respondents to the HBS Annual Income from social welfare sources for respondents to the HBS
Household Budget Survey Pseudonymised Grant Application and Payment Data (SUSI) The purpose of the matching exercise is to link HBS respondents to any income they may have obtained through Education grants from SUSI.  The HBS collects information on all household income of which education grants may be a component. Annual The output will be the component of household income that is obtained from SUSI education grants.
Household Budget Survey Central Record System - Client Details (Welfare) The CRS_Client_Source file is used to verify PPSNs that are collected in the HBS Annual No actual data outputs from the matching procedure.  We will simply obtain confirmation of which PPSNs are correctly assigned to HBS respondents.
Household Finance and Consumption Survey Data Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) Verification and supplementation of data in the Household Finance and Consumption Survey (HFCS) Ongoing The data will provide PAYE income and pension amounts for the survey reference period of HFCS respondents. These are core variables in the HFCS used to estimate household PAYE income amounts and pension contributions.
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Animal Identification and Movement Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Sheep and Goat census data (DAFM), Pseudonymised Single Farm Payment Data (DAFM) The supplementing of survey data with administrative data for the Household Finance and Consumption Survey (HFCS). ONGOING Tabular, diagrams and written comment
Survey on Income and Living Conditions Vital Statistics data (Bfacts) Research project exploring excess mortality amongst people at risk of poverty or living in consistent poverty. ONE-OFF CSO Frontier publication
Survey on Income and Living Conditions Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Person Income Register Data (CSO), Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised VAT Registrations Data (Revenue), Pseudonymised VAT Activity Analysis (VAT3 and RTD) Data (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue) The purpose of this project is to allow SILC RAP to estimate current income for the Survey on Income and Living Conditions. Using the latest administrative income data, and modeling and estimating other income components, will allow the estimation of current income for individuals and households, thus allowing the production of "Flash" income estimates and poverty and deprivation rates. ANNUAL State level: - gross, net, disposable and equalised income. - Poverty rates - Deprivation rates
Census of Population Results Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Single Farm Payment Data (DAFM) First, the Social Data Design Division requires an updated Census-linked income sampling frame for all surveys that now require income for the sample design. Secondly, it is required to update the publication "Geographical Profiles of Income in Ireland 2016", for which there is high demand from both internal and external users. Finally, there has been an urgent request from the Department of Housing, which requires this updated data for its Housing Need and Demand Assessment tool. One-off There will be two statistical outputs from this project: 1. A dataset with combined pseudonymised Census 2022 data and income, which is to be used as a sampling frame by the Social Data Design division for CSO surveys. 2. A publication which will be an update of "Geographical Profiles of Income in Ireland 2016" based on Census 2022 and calendar year income 2022. This will provide income data at electoral division and local authority level as well as other demographics.
Census of Population Results Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) Creation of benchmarks for the Survey on Income and Living Conditions (SILC) data following the publication of Census 2022. One-off Benchmark files for use in the calibration of weights for the Survey on Income and Living Conditions (SILC) and possibly other ICW household surveys such as the HBS and the HFCS.

 


 

 CSO Division: International Trade In Goods

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Trade Register Data, INTRASTAT survey data, Received Microdata under European legislation, Customs declarations 

VAT Trader Returns (VAT3 and RTD) Data (Revenue), Vat Information and Exchange System Acquisitions Data (Revenue), Vat Information and Exchange System Dispatches Data (Revenue)

Intrastat survey data, Customs declarations and VAT information are all required as part of the system to produce Intra and Extra EU trade statistics in compliance with European legislation.

Ongoing

Detailed trade in goods statistics in compliance with European legislation

Business Register data

Detailed Trade Statistics

Matching of Business Register with trade statistics detailed microdata required for compliance with TEC (Trade by Enterprise Characteristics) reporting to Eurostat and production of anonymised trade data at enterprise level for production of Researcher Microdata Files (RMF)

ANNUAL

The statistical outputs expected are annual TEC data for Eurostat, in compliance with EBS legislation requirements. This data may also be published domestically on the CSO website and in PX-Stat tables. RMFs (Researcher Microdata File) are produced using the business register as an anonymised company identifier to allow researchers to analyse trade data and link it to other relevant business RMFs.

 


 

CSO Division: Labour Market and Earnings

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

None

DSFA CRS; P35 file from Revenue Commissioners; CSO Central Business Register 

To investigate the extent to which foreign nationals engaged with and remained in employment

Annual

 

 

 

CSO Statistical release; other aggregate tables

 

 

Census 2016 Analysis tier

EAADS (subset of P35 analysis dataset)

To match Census 2016 analysis level data to the data being used to prepare for the Earnings Analysis using Administrative Data Sources (EAADS) release.

Once-off

Tables, charts

CSO’s Earnings, Hours and Employment Costs Survey (EHECS) data
CSO’s Central Business Register
CSO’s Employer Identification Inquiry (EII) – a small survey run specifically for the EAADS to ensure correct alignment of NACE codes.
CSO’s Structure of Earnings Administrative Data Project (SESADP) 2011-14.
CSO’s Census 2016 – approved for matching to the EAADS 2014 and later (DMP118).

Revenue’s P35L: “SPP35 – P35L dataset for analysis” data flow on ADC
Department of Employment Affairs and Social Protection (DEASP) data:
"CRS Client table from DEASP - Analysis" data flow on ADC
"DSP CRS from DEASP - Analysis" data flow on ADC

It is proposed that several data sources (both administrative and survey) will be used in the creation of the Earnings Analysis using Administrative Data Sources (EAADS) release.

The EAADS provides Structure of Earnings Statistics of employees within Ireland and is predominantly an administrative data project. Matching the proposed data sources will allow for an accurate and detailed EAADS to be produced, in alignment with what was previously released for 2011-14.

 Annual Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements.

Queries requested will be provided within CSO guidelines for confidentiality.

Business Register Data, Earnings Hours and Employment Costs Survey Data 

Covid19Refund - Covid 19 Refund scheme (Welfare)
PMOD - PAYE Real Time Data (Revenue)

To match data from EHECS with real time data from Revenue, Business Register data and data in relation to the Temporary Wage Subsidy scheme to investigate the impact of the Covid19 crisis and assess whether administrative data could be used to impute EHECS variables in the context of low response rates. Quarterly Statistical release and aggregate tables.

Labour Force Survey Data, Business Register Data, Earnings Hours and Employment Costs Survey Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables 

Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue)

Analysis of the income support schemes put in place in response to COVID 19. Ongoing Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, Earnings bands, Gender, Age group, Region.

Labour Force Survey Data

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Revenue Employment Wage Subsidy Scheme (EWSS) data (Revenue)

The purpose of this project is to analyse the labour market characteristics (as measured in LFS) of persons in receipt of the PUP, TWSS or EWSS pandemic income support schemes. Quarterly Table showing the labour market status (ILO) and Principal Economic Status (PES) of recipients of PUP/EWSS.

Labour Force Survey Data

Central Record System - Client Details (Welfare)

The purpose of this project is to collect gross income data from PMOD in order to complete the INCGROSS LFS variable, an annual data requirement of Regulation (EU) 2019/1700 of the European Parliament and of the Council of 10 October 2019.
Due to high item non-response and inconsistencies in collected survey data, it is proposed to use administrative data (i.e. PMOD) as a consistent high-quality source in order to satisfy the variable specification.
Annual

Earnings (i.e. LFS INCGROSS) microdata is included in annual microdata file transmitted to Eurostat in respect of Wave 3 responses in all quarters.

Earnings (i.e. LFS INCGROSS) microdata may be included in national annual RMF file for approved researchers.

Labour Force Survey Data 

Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised DSP Extract About Ukrainian Refugees (Welfare)

Labour Market analysis related to Ukrainian beneficiaries of Temporary Protection. Ongoing

Outputs may include statistical release, bulletin, publication (incl. tables and charts) or aggregated data sent to Eurostat.

Labour Force Survey Data

Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

Enhance the scope of analysis possible from the Labour Force Survey data by adding earnings estimates. Quarterly

It is currently expected that the data would be included in a Research Microdata File (RMF), available to approved users subject to the CSO RMF Policy.

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Quarterly National Household Survey/2017Q3+Labour Force Survey 

Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

Analysis of residency and geographical status of employees.

Ongoing

Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Earnings, Sex, Age group, Region.

Live Register, Business Register Sampling Frame

Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

Analysis of Labour Market Flows over the COVID-19 period.

Ongoing

Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, MNE versus domestic, Earnings bands, Sex, Age group.

Earnings Analysis using Administrative Data Sources, Census of Population, with GeoDirectory and DEASP Variables 

Central Record System - Client Details (Welfare), COVAX Vaccination Data (HSE), Integrated Short Term Payment System Data (Welfare), Local Property Tax Returns (Revenue)

Add names and addresses to sample of employees selected for the Structure of Earnings Survey.

ANNUAL

Survey sample with correspondence details

Live Register Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) Provide greater insight about those joining/leaving the Live Register, in particular information about the economic sectors in which they were/are employed. MONTHLY Tabular output, potentially in existing or new publication, and/or on PxStat. Descriptive statistics, number of individuals by various characteristics/categories.
Earnings Analysis using Administrative Data Sources, Earnings and Labour Costs Quarterly, Labour Force Survey Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) Enhance the scope of analysis possible from the Labour Force Survey by comparing industry sector data (NACE) with other sources QUARTERLY Matched data would be used to supplement survey response in cases of non-response. The project will also consider the use of PMOD as the primary source for NACE categorisation in LFS by comparing with current outputs.

Census of Population Data

Earnings Analysis using Administrative Data Sources To provide an estimate of a monthly reference wage for Ireland to the Department of Social Protection (DSP) in relation to meeting Ireland's requirements under the European Code of Social Security. ANNUAL Tabular outputs

Earnings Analysis using Administrative Data Sources, Business Register data, Census of Population Data, Earnings Hours and Employment Costs Survey, Structural Earnings Statistics data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables 

Central Record System - Client Details (Welfare), PAYE Real Time Data (Revenue), Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) It is proposed that several data sources (both administrative and survey) will be used in the creation of the Earnings Analysis using Administrative Data Sources (EAADS) release. ANNUAL Tabular, diagrams and written comment. Information will be published in web, electronic and paper formats as well as standard EU templates for Eurostat requirements.

Business Register data, Business Register Sampling Frame 

Covid 19 Refund scheme (Revenue), PAYE Real Time Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) To match real time data from Revenue with the Business Register and data in relation to the Temporary Wage Subsidy scheme to investigate the impact of the Covid19 crisis on earnings and to produce comparative statistics on domicile of enterprise ownership. Monthly Statistical release and aggregate tables.

Irish Population Estimates from Administrative Data Sources

PAYE Real Time Data (Revenue) It is proposed that the county and nationality variables included in IPEADS are matched to PMOD data for use in the analysis of regional distribution of earnings in the Earnings Analysis using Administrative Data Sources (EAADS) release. Annual Tabular, diagrams and written comment. Information will be published in web, electronic and paper formats.

Earnings and Labour Costs Quarterly

PAYE Real Time Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) To match data from the real time Revenue data flow with the Earnings and Labour costs quarterly survey data to explore the coherence between the two sources and frequency of irregular payments by economic sector. Monthly No statistical outputs. Explore coherence and consistency of outputs in advance of publication of the frontier monthly earnings series publication.

Earnings Analysis using Administrative Sources

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables  It is proposed that administrative and survey data will be used in the creation of thematic reports relating to earnings analysis. By matching variables available in COP datasets, in-depth earnings and distributional analysis by demographic characteristics may be provided. Ongoing Electronic publication (text, tabular, graphic) PxStat tables

Earnings Analysis using Administrative Data Sources

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables It is proposed that administrative and survey data will be used in the creation of thematic reports relating to earnings analysis. By matching variables available in COP datasets, in-depth earnings and distributional analysis by demographic characteristics may be provided. ONGOING Electronic publication (text, tabular, graphic) PxStat tables


 

CSO Division: Life Events and Demography

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Vital Statistics Quarterly Address Matching Tool Sets using GeoDirectory  It is planned to update Vital statistics publications with more standard geographic units and move away from the custom vital statistics 3 digit address codes. Access is required in this case to help revise historic data and also help cut down on manual work within the section to geocode address strings. Please note this application refers to Births, Deaths and Marriages addresses. This job would be run on a yearly basis in order to ensure data is fit to publish for annual reports. ONGOING With small area codes assigned vital statistics will be able to produce new life events PxStat tables with the first providing breakdowns of births, deaths and marriages at local electoral level.

 


 

CSO Division: Methodology

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables European 1km-square grid (ETRS) To match Census 2016 analysis level dataset to European 1km-square grid for Eurostat transmission adjustment process. One-Off ETRS grid joined with HRN_PIK variable (from Census 2016 (analysis tier) dataset).
Spatial Data Directory of Irish Property Addresses, including Eircodes (GeoDir) This project is a proof of concept of adding CSO statistical geographies to the GeoDirectory source and analysis tiers. This removes the need for users to do spatial analysis requiring access to source tiers of Geo-Directory and matching dataset. Moreover, the availability of this data on the analysis tier may encourage new uses of the Geo-Directory data. Proof of concept expired Data Matching ID:10918 QUARTERLY The output will be a lookup table between eircodes and geographic ID
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables GRO_Deaths (GRO), Registered Deaths Data (GRO) The project purpose is to match GRO deaths data to census data to provide additional information which is not available on the GRO flow. An example of such would relate to ethnicity which is currently available on the census flow but not on the GRO. By matching the two flows it would be possible to provide a breakdown of deaths by ethnicity or some other variable on the census (level of education, underlying cause of death etc.). ONGOING The plan is to create a publication and PxStat tables for users which will provide greater insights into mortality.


 

 

CSO Division: National Accounts

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Business Register; Balance of Payments; Census of Industrial Production; Annual Services Inquiry

Revenue Corporation Tax files

Examine consistency between Revenue profits data and relevant data from CSO surveys; derive additional NA variables

Annual

Improved estimates for NA variables (mainly profits)

 

 

Business Register Data

P35LF, Employer Level Data (Revenue)

This project is in place to obtain estimates of wages and salaries, ECSI and Other Labour Costs in the National Income Accounts at A64 and 2-digit Nace level.

Annual

Annual Compensation of Employees estimates at overall and detailed Nace level, numbers employed, average wage/ECSI/COE per employee at overall and detailed levels.

Business Register

Revenue Commissioners P35 file and DSFA CRS files

To obtain county based average income data

Annual

To produce regional accounts and county household income

CSO Business Register, CIP, ASI, Trade and BOP data

Revenue Commissioners P35, Corporation Tax files and Dun & Bradstreet (details of all companies on the CRO register) files

To create a datafile for use internally by CSO’s National Accounts and BOP divisions.

Annual and twice yearly.

The data will be disseminated in National Accounts, Financial Accounts and Balance of Payments related aggregate tables.

Business Register Data

Pensions Authority Source Dataset (Pen_Auth) To code Pensions contributions paid by Employers to Institutional Sector and Nace activity Annual Improved estimates for National Accounts CoE (D1) and labour costs (D12)

Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data 

Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (Revenue, DEASP, CSO) To improve and extend the National Accounts supply and use tables by reconciling differences between the CSO Business Statistics and the National Accounts income estimates. Ongoing Improved data quality, distributional national accounts, dis-aggregated supply and use tables, economic growth accounts.

Business Register Data

Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue) The process uses PMOD and the other datasets to estimate monthly Compensation of Employees in the National Accounts Quarterly and Annual outputs. Compensation of Employees by NACE activity and Institutional Sector is also produced and provided to Government Accounts. Ongoing The results are part of the Quarterly National Accounts results sent to Eurostat at T+60 days after the end of the Quarter (ESA T0103). The results are also used in the benchmark Annual National Accounts (National Income and Expenditure) released nationally. Related annual results are sent to Eurostat (ESA T0103).
The results are also part of the Output and Value Added By Activity annual release.

Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35)

CT-CRO Linking File (Revenue), CRO Accounts Details Data (DandB), PAYE Real Time Data (Revenue) The aim of the project is to examine productivity of companies at a micro level in Ireland. This will entail company by company analysis and will require access to all of the above data sets due to the cross cutting nature of this project. Productivity is an analysis of the change in production over time achieved by the employees of an enterprise. All of this information is essential to derive the productivity indicators/assessments at an entity by entity analysis. Annual The statistical outputs will follow the annual productivity publication published by National Accounts- Labour productivity and GVA breakdowns/ nominal unit labour cost/ multifactor-productivity/capital deepening/capital services/hours worked/ tangible capital deepening/intangible capital deepening. See Link: https://www.cso.ie/en/releasesandpublications/ep/p-pii/productivityinireland2019/

Business Expenditure on Research & Development (BERD) Data (CSO), Census of Industrial Production Data (CSO), Annual Services Inquiry Data (CSO), Trade Register Data (CSO), Business Register Data (CSO), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO)

Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Corporation Tax Data (Revenue), Non-Profit Account Details Data (Bfacts), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) To improve the National Accounts by reconciling differences between the CSO Business Statistics, Balance of Payments, Trade Statistics and the National Accounts income estimates. Ongoing Improved Supply and Use Table estimates, improved National Accounts accuracy, publications on trade and global value chains, similar in nature to the National Accounts publication on “Food and Agriculture: A Value Chain Analysis”.

Census of Agriculture, Industrial Production and Turnover - RAP, Labour Force Survey, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables 

Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare),  Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) The aim of the project is to calculate average income and employment statistics required in the yearly regional accounts.  Annual I will be producing county-by-county breakdowns on numbers of employees/employers and compensation of employees for use in the regional accounts.


 CSO Division: Prices

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Labour Force Survey Data

Directory of Irish Property Addresses, including Eircodes (GeoDir), Live Register Analysis (Welfare), Central Record System - Client, Payment and Employment Details (Welfare), Central Record System - Client Details (Welfare)

This project explores small area estimation that combine data from administrative and survey sources to produce estimates for small areas or domains.

Quarterly

Dissemination of details on births by nationality

None

Stamp Duty Returns, Business Energy Rating Certificates, Geodirectory, Pobal Haase-Praschke Deprivation Index

 

 

The purpose of this data matching is to produce linked data on residential property transactions in Ireland. This data is used to calculate the statistics for the monthly Residential Property Price Index (RPPI)

Ongoing

(i) Monthly national and regional price indices (ii) monthly indicators on the volume, value and price of residential property (iii) quarterly price indices on new and existing dwellings (iv) annual information on non-household residential property transactions.

Labour Force Survey Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables 

Live Register Analysis (Welfare) 

The purpose of the project is to estimate unemployment rates at county level using small area estimation techniques. Ongoing Unemployment rates by county

Census of Agriculture 

Stamp Duty Returns, Property Registration Authority of Ireland Data, Geo Directory

The purpose of this Data Matching Project is to calculate agricultural land prices by region and land type. Annual  Tables for Eurostat and possible future CSO release.

Business Register Data

Pseudonymised Corporation Tax Data (Revenue)

Biennial access is required to the Research & Development fields on the CTAnalysis file to identify potential enterprises carrying out R&D in Ireland, to produce statistics in accordance with European Commission Regulation (EC) No 995/2012. Annual Biennial results. Principal Variables: Detailed information on research and development expenditure; Sources of funds for research and development expenditure; Detailed information on research and development personnel; Recruitment of researchers; Research and development collaboration.
 

Directory of Irish Property Addresses, including Eircodes (GeoDir), Stamp Duty on Property Transactions Data (Revenue), Pobal Deprivation Indices Data (TrutzHaa), Building Energy Rating details for non-domestic premises (SEAI)

The purpose of this data matching is to produce linked data on Commercial Real Estate (CRE) transactions in Ireland. This is an analysis to explore data quality issues, the potential for data linking and whether it is possible to produce statistical outputs for CRE. Ongoing This is an exploratory analysis to look at the potential to produce statistical outputs on the volume, value and price of Commercial Real Estate (CRE).
Residential Property Price Index

ESB Networks electricity consumption and customer data (ESBNetwk)

The project will attempt to match the CUSTOMER dataset from the flow above to the Building Energy Rating (BER) dataset by MPRN, and increase the number of Eircodes available on BER. Ongoing The expected output will be an enhanced BER dataset with additional Eircodes assigned to the property addresses.



 

 

CSO Division: Secondary Data Sources & Innovation

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Census of Population 2016, Person and Dwelling Data (CensusNameData)

Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare)

To explore the feasibility of using administrative data lists to evaluate Census coverage

Annual

The expected statistical output is a report and possibly a dataset of aggregated coverage indicators.

Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35)

Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Early Childcare and Education Scheme Data (Children), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Linked PAYE Real Time Data Test Data with extra DEASP Variables (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE)

To create a Person Activity Register to provide structural analysis of populations and sub-populations, over time.

On-going

Populate activity indicator dataset

Used in Population estimates (PECADO) as input for admin census

None

Directory of Irish Property Addresses, including Eircodes (GeoDir), Central Record System - Client, Payment and Employment Details (Welfare), Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB) The purpose of the project is to develop a dataset with the potential to be used as an occupied residence sampling frame. Such a dataset could be an option as a sampling frame for CSO postal household surveys or could be used as an indicator of occupied properties, to assist Census 2021 enumerators.

On-going

The statistical output will be a property dataset containing addresses and names of occupiers. The dataset will be an occupied residence dataset, as indicated by the latest LPT and RTB data instances. The dataset will have the potential to be used as an occupied residence household survey sampling frame and Census 2021-oriented indicator of occupied properties. Whether the output is used for such purposes, and, if so, how it is used, is outside the scope of the current project. 

 

 None

PPSN and Personal Details Data (Revenue), Household Sampling Frame (Revenue)

Provide home Addresses to DCU for a selection of individuals sampled for the Structure of Earnings Survey to facilitate post out of survey notices to those individuals at their place of residence.

 

 One-Off

Dataset containing CSO_ID (identifier created by ADC for SES survey) and home address

None 

Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), Pseudonymised HSE coronavirus test referrals and test facilities (HSE), Pseudonymised Hospital Inpatient Discharge Data (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE)

Pseudonymised  COVID-19 person based HSE datasets are linked by CSO staff and permitted researchers to undertake statistical analysis to inform the national response to COVID-19.

Ongoing

Statistical outputs that have value in informing the public and national response to COVID-19

Vital Statistics, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables

Pobal Deprivation Indices Data (TrutzHaa)

The purpose of this project is to investigate possible effects of deprivation on cause of death, and also to produce statistics on causes of death on smaller geographical areas.

Ongoing

We will produce tables of aggregated information on causes of death. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld 
Pseudonymised Person Income Register Data

Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised HSE Drugs Payment Scheme Data (HSE)

The purpose of this project is to explore the possible relationships between persons' income, household composition and engagement with the Drugs Payment Scheme.

ONGOING

The expected output will be the provision of research in the form of aggregated data to support decision making in the Health Sector. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld.

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data

Pseudonymised Central Bank Central Credit Register on loans data (CBI)

The purpose is to enhance the statistical potential of an existing project to compile short-term indicators that explore the dynamics within the consumer credit market. These indicators are primarily based on quantifying active contracts, customers, and borrowers in the consumer credit market with respect to different population cohorts and types of credit. Matching data sources will enable cohorts to be defined by age, gender, employment, location of residence, household structure, and income.

Ongoing

The expected outputs are enhanced statistical indicators to provide information on the Consumer Credit Market. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld.

Business Demography, Structural Business Statistics 

Corporation Tax Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)

The project involves the exploration of the statistical potential of XBRL files. The initial target use cases to explore include: cost of business information (structured data items); business risk perception by business (text fields); description of business activity; and gender composition of directors. The project involves parsing structured XML files to extract numeric and text data. It is expected that text analytics will play a significant role in this project.

ANNUAL

The initial target use cases to explore include: cost of business information (structured data items); business risk perception by business (text fields); description of business activity; and gender composition of directors.


 

CSO Division: Social Analysis

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Prison Releases Report 

 

Central Record System - Client Details (Welfare)

Individuals released from custodial sentences between 2011 and 2018 will be matched to the CRS client details to establish a pseudonymised PPSN. The PPSN linking identifier will then be used to link the population of interest to the analysis of earnings, social welfare, employment and housing indicators that is conducted using administrative data sources by existing CSO divisions.

Annual

 

Earnings estimates: Median earnings prior and post custodial sanctions
Employment estimates: Time spent in employment prior and post custodial sanctions
Social welfare: Level of social welfare support prior and post custodial sanctions

Tables will be classified by year of custody, age at time of release, gender, offence type, re-offending indicator

Irish Health Survey

Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Primary Care Reimbursement Service Data (HSE)

The CSOPPSN will be appended to the Irish Health Survey 2019 data in order to match to the PCRS datasets. This will allow us to cross reference medical card and drug payment scheme data against the health data supplied in the Irish Health Survey.

Annual

Statistical outputs will include cross tabulations of optical, dental, doctor and pharmacy claims against the Irish Health Survey data which includes data on health conditions, disability, activity limitations, access to medical specialists and frequency of visits.

Prison Reoffending Statistics, Probation Reoffending Statistics 

Central Record System - Client Details (Welfare)

The project will match data provided by the Justice agencies on individuals who have been released from custodial sanctions or received a probation order between 2011 and 2021 to the Central Record System. Once the records are linked and a CSOPPSN is assigned, the personal identification characteristics (PPSN, name, detailed address) of the individuals will be removed from the data so that statistical analysis relating to employment participation can be carried out using CSO's analysis tier data.

Annual

The data matching project will allow CSO to carry out monthly employment estimates of individuals who have links to custodial sanctions


 

CSO Division: Social Data Collection

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Survey on Income and Living Conditions Data

 

 

Revenue P35 file, Revenue form 11 file 

 

Verification of income data

 

Ongoing

 

Anonymised micro data and aggregated output tables

Survey on Income and Living Conditions Data

Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Single Farm Payment Data (DAFM)

The purpose of this work is to link respondents to the SILC survey with their Basic Farm Payments.   The Basic Farm Payments are used in the calculation of farm income.   Basic Farm payments are also known as "Single Farm Payments"

Annual

We expect to obtain the Basic Farm Payment component of income for farm households in the SILC (the main aim of which is to collect all household income).

Survey on Income and Living Conditions Data Pseudonymised Housing Assistance Payment - Analysis Tier (HAP) To obtain rent details for SILC respondents who are renting their homes through the HAP system.  The cost to the householder of renting their home and the financial value of the benefit to the householder of being on the HAP scheme will also be obtained.  These feed into data on housing costs and housing benefits in the overall SILC results. Annual The outputs will be: (1) the amount of rent paid by HAP tenants who responded to the SILC and (2) the value of the benefit to these HAP tenants of being on the HAP scheme.  This data will not be used alone, it will be included in the SILC results as a whole.
Survey on Income and Living Conditions Data Pseudonymised Grant Application and Payment Data (SUSI) The purpose of the matching exercise is to link SILC respondents to any income they may have obtained through Education grants from SUSI.  The SILC collects information on all household income of which education grants may be a component. Annual The output will be the component of household income that is obtained from SUSI education grants.
Survey on Income and Living Conditions Data Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB) The matching exercise will be done for two different reasons: (1) to obtain details of rent paid by SILC respondents who are renting their homes, and (2) to obtain details of rent received by SILC respondents who are landlords. Annual The outputs will be: (1) the amount of rent paid by SILC respondents who are renting their dwellings and (2) the amount of rent received by SILC landlords who are letting dwellings.
Survey on Income and Living Conditions Data Pseudonymised Income Tax Form 11, Person Details Data (Revenue) The IT Form 11 data will be used to provide details of income from self employment for SILC respondents Annual Income from self employment for respondents to the SILC survey
Survey on Income and Living Conditions Data Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) The purpose of the matching exercise is to obtain income and deductions made at source i.e. tax, USC, PRSI, pension contributions etc. for SILC respondents who are in the PAYE system. Annual Gross income and deductions from income made at source (e.g. tax, PRSI, USC, pension contributions, etc.) for SILC respondents who are paid through the PAYE system.
Survey on Income and Living Conditions Data Pseudonymised Local Property Tax Returns (Revenue) The matching exercise will be done to obtain figures for Local Property Tax paid by SILC respondents. Annual The outputs will be the amount of LPT due on respondents' properties.
Survey on Income and Living Conditions Data Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) To obtain details of social welfare income for respondents to the SILC survey Annual Income from social welfare sources for respondents to the SILC survey.
Survey on Income and Living Conditions Data Central Record System - Client Details (Welfare) The CRS_Client_Source file is used to verify PPSNs that are collected in the SILC survey Annual No actual data outputs from the matching procedure.  We will simply obtain confirmation of which PPSNs are correctly assigned to SILC respondents.
Labour Force Survey Data, Pseudonymised Quarterly National Household Survey/2017Q3 Labour Force Survey Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) Under the Integrated European Social Statistics Regulation, NACE categories are now required to be coded at a 3-digit level. The purpose of this project is to improve the coding of NACE categories in the LFS from 2-digit to 3-digit. Quarterly An output SAS dataset containing primary and NACE 3-digit variables. 
Survey on Income and Living Conditions Data Pseudonymised Higher Education Student and Course Details (HEA) To match the SILC Educational Attainment Levels data of SILC respondents to their respective educational attainment levels on the HEA Higher Education Student and Course Details Administrative dataset. SILC Publication provides a comparative analysis of the equivalized income by highest level of educational attainment of the head of household. The HEA administrative data would greatly enhance the quality of this analysis.
Annual
An output SAS dataset will be produced which will then be used by RAP to produce the SILC Publication which is disseminated in tabular format, diagrams, written comment. SILC provides a comparative analysis of the equivalized income by highest level of educational attainment of the head of household. The administrative data would greatly enhance the quality of this analysis.

Survey on Income and Living Conditions

Pseudonymised Central Bank Central Credit Register on loans data (CBI) Data from the CCR is matched to respondents of the Survey on Income and Living Conditions (SILC) in order to accurately estimate debt at the household level in Ireland. ONGOING The data will fill core variables of the SILC namely data on mortgages.

Household Budget Survey

Central Record System - Client, Payment and Employment Details (Welfare) PPSNs are collected in the HBS to allow respondents to be linked up with their data from administrative sources. CRS_src will be used to verify the PPSNs to ensure that the PPSNs collected are valid PPSNs and that the correct PPSN is entered for each respondent. ANNUAL No actual data outputs from the matching procedure. We will simply obtain confirmation of which PPSNs are correctly assigned to HBS respondents.

 


 

CSO Division: Social Data Design

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Census of Population Data Directory of Irish Property Addresses, including Eircodes (GeoDir) The intended sampling frame for the PIAAC 2022 main study is the Census sampling frame. Due to the Census delay, the Census 2016 frame will be used for the PIAAC main study sample. The purpose of this project is to enhance the 2016 Census sampling frame by adding new households from the Geo-Directory. One-off We expect to obtain an enhanced 2016 Census sampling frame, that includes new households not originally included in the 2016 Census of Population.
Census of Population Data Pseudonymised Person Income Register Data  The purpose of this project is to add income to the Census file by matching Census to the Person Income Register. This project is being conducted in Social Data Collection as part of the new sampling approach being carried out for the Q1/Q2 2022 SILC sample. As recommended by methodology, the 2022 SILC sample will be chosen using a stratified simple random sample, with income as the stratification variable. To do this, income must first be added to SDC's Census frame. One-Off Social Data Collection expects to obtain an enhanced Census frame that includes income, allowing the 2022 SILC sample to be chosen using a stratified simple random sample approach.
IPEADS_src - Irish Population Estimates from Administrative Data Sources, Census of Population Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables  COVAX Vaccination Data (HSE), Integrated Short Term Payment System Data (Welfare) The purpose of this project is to improve address quality and coverage on the IPEADS sample frame. The address coverage and quality of the sampling frame are not currently sufficient. The objective of this project is to improve the Eircode coverage and address quality on the sample frame significantly, by linking with other administrative and census datasets. One-off It is expected to obtain an enhanced sampling frame, with significantly higher Eircode coverage on the sample. Currently the sample has Eircode coverage in the region of 50% – 55%. It is hoped that this project will achieve 80%+ based on exploratory analysis and summary statistics already done on the datasets for other outputs.
IPEADS_src - Irish Population Estimates from Administrative Data Sources, Census of Population Data  Pseudonymised Census of Population, with GeoDirectory and DEASP Variables (CSO) The purpose of this project is to enhance the IPEADS sample frame for use in household surveys (such as Safety of a Person (SOP) and Adult Education Survey (AES)). Currently the frame lacks household characteristic variables that would typically be available on the census household frame such as Deprivation Index, Urban/Rural, small area. The objective of this project is to enhance the IPEADS frame for post sampling purposes, by linking with the census dataset. Ongoing It is expected to obtain an enhanced sampling frame, with household characteristic variables, that can be used to facilitate GIS mapping, non-response adjustment, weighting and calibration, for upcoming household surveys such as SOP and AES.

Measuring Mortality Using Public Data Sources, Census of Population Data

Central Record System - Client Details (Welfare) The purpose of this project is to better identify signs of life on the Census Household Sampling Frame. Social Data Collection intend to enhance our Census sampling frame by adding a flag for deceased persons from the CRS source flow and/or Rip.ie data. Monthly Social Data Collection expect to obtain an enhanced Census Household Sampling frame, which will include a flag for deceased persons. This process will be run before a sample is distributed to the field, to take into account quarterly CRS updates and the latest Rip.ie data.

None

Central Record System - Client Details (Welfare), Child Benefit Data (Welfare), Registered Births Data (GRO) The purpose of this project is to build a sampling frame for the new infant GUI cohort. The pilot will run in 2023 and the main sample in 2024. As part of this project, ADC is providing multiple drafts of the sampling frame, in order to design the sample for the next GUI infant survey. This involves a matching exercise of the ADC GRO Births, CRS and Child Benefit ADC flows. ONGOING A sampling frame for the new infant cohort (both pilot and main samples).

Growing up in Ireland - '98 Cohort 

GRO_Deaths (GRO), Registered Deaths Data (GRO), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Springboard and ICT Student and Course Details (HEA) The purpose of the proposed data matching is to enhance the GUI Cohort 98 sampling frame to better identify signs of life for the current and future waves. ONGOING The expected output is an enhanced GUI Cohort 98 sampling frame, which will include a flag for deceased persons. This process will update cases distributed to the field for the current wave, as well as identifying cases in advance for future waves before they are distributed to the field.

 


 

CSO Division: Statistical Systems Co-Ordination Unit 

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Census 2011 - Census Main Persons Dataset, Census 2016 - COP2016_NDI_DATA_V1

ESB new connections, LPT - Local Property Tax, HTB - Help-to-buy Scheme, BER - Building Energy Rating file, Geodirectory

To produce new experimental building completions statistical series using additional data from ESB, Census, Revenue and Geodirectory data sets

Quarterly

Aggregate tabular format

None

Post primary Pupils Database
SPP35 linked employer employee file
IT form 11 (subset to indicate type of activity/trade)
SOLAS PLSS database of further training
QQI analysis dataset of awards
HEA Student Records System
DSP CRS and Jobseekers Longitudinal Database (JLD)

At the request of SOLAS, the CSO and SOLAS have agreed to collaborate on a project to evaluate outcomes of graduates of SOLAS-funded further education courses. This data is held by SOLAS in the Programme Learner Support System (PLSS). A statistical product detailing this outcomes analysis will be jointly produced.

Annual

Report, tabular/aggregated, publication of findings

None

DES Post Primary and Exam Datasets
SPP35 linked employer employee file
IT form 11
SOLAS/FAS database of further training and PLSS
QQI analysis dataset of awards
HEA Datasets on Student Enrolment and Graduation
DSP (JLD, CRS, DSP Payments and unemployment data)
SUSI Dataset

The CSO has recently undertaken a statistical collaboration with the HEA to analyse the outcomes for graduates of higher education courses, in particular mature students and graduates of “Springboard” courses.
Linking data across the datasets described below will allow us to develop profiles of the activities of these graduates from higher education courses, in terms of their employment, unemployment, continued education, earnings, etc.

Annual/Biannual

Report (either hard copy or electronic T4 release), tabular/aggregated

Census 2016 dataset

Revenue P35 file, Revenue form 11 file, Revenue Local Property Tax file, Revenue PPSN details file

DSP Integrated Short Term Payment System; DSP Central Record System
Residential Tenancies Board File

The project will create two new data products based on linking administrative files to the Census file to demonstrate the advantage of linkable data.

This project is in the early stages and is ongoing

Analysis of Vacant Housing.
Income and welfare dependency maps for small areas
Micro data file available for statistical purposes within the CSO only.

Labour Force Survey Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables,  Pseudonymised Person Income Register Data   Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Domestic Wastewater Treatment System Registrations (LGMA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Meath County Council iHouse (LGMA), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Property Registration Authority (PRA) folio, consideration, and other data (PRA), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous) We plan to create report(s) on social housing and their occupants - including social renters - through the use of public sector administrative data in order to provide evidence and insights for policy makers in the sector, as well as providing statistical information to assist with the Rebuilding Ireland project. 
Ongoing  Report/publication(s) on social housing in Ireland. 
Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables  Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir),  Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Domestic Wastewater Treatment System Registrations (LGMA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Property Registration Authority (PRA) folio, consideration data (Bfacts), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Networks electricity consumption and customer data (ESBNetwk) We plan to create report(s) and analysis on the rental sector in Ireland, looking at its participants (landlords, renters) and rental properties. 
This will be undertaken through the use of public sector administrative data and will provide evidence and insights for policy makers in the sector. Ongoing Report/publication(s) on the rental sector in Ireland
None Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE) To gain further insight into the Covid-19 pandemic. The anticipated outputs from this Data Matching Request are reports and analysis that will, amongst other things, identify the sectors of the economy most affected by the disease. This will be undertaken through the use of public sector administrative data, currently available on the ADC, and will seek to provide insights to decision makers and members of the public. Ongoing The anticipated outputs from this Data Matching Request are reports with graphs and tables of aggregate data (including tables available on PxStat) as part of the COVID-19 Insight Bulletins: Deaths and Cases series of outputs.
Business Register Data, Vital Statistics Data, Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables  Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE) Project to produce aggregate reports, in line with the purpose of DPIA 1156, on the Covid-19 vaccinated population by characteristics such as age, gender, location, socioeconomic profile, economic status and industry. This will allow the CSO to make available to the public statistics about the progress of the vaccination programme and could also be used to assist health services in maximising vaccination uptake. Ongoing Statistical Bulletin with tables on the Covid-19 Information Hub on the CSO website.
Bespoke reports for stakeholders.
New Dwelling Completions, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables  Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Networks electricity consumption and customer data (ESBNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB)

This project will seek to establish trends and/or levels in housing vacancy using utility data as a proxy for housing occupancy. Assessments of vacancy will be made at different geographical levels as well as other levels of disaggregation.

Ongoing Report/publication(s) on housing vacancy in Ireland. This will be in the form of Frontier publication(s). Tables will be provided through PxStat.
Business Register Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Person Income Register Data Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue),
Pseudonymised Central Record System - Payment and Employment Details (Welfare),
Pseudonymised Central Record System - Client Details (Welfare),
Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare),
Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue),
Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue),
Pseudonymised Integrated Short Term Payment System Data (Welfare),
Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare)
This project is expected to provide aggregate statistical data as evidence on the potential impact that the expansion of paid parental leave may have on businesses. One-off The final output will be disseminated as a report/paper (tabular/aggregated).
Irish Population Estimates from Administrative Data Sources, Business Register data Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) Statistical Analysis to aid and inform policy at DSP ONGOING Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, MNE versus domestic, Earnings bands, Sex, Age group

Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data 

Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Primary Pupil Details (DES)

Local Area Data Analysis to support government departments in needs assessment and service provision for disadvantaged areas

ONE-OFF Dashboard for use by government departments

Business Register data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data

Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Department of Education teaching and other staff information (DES), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Jobseekers Longitudinal Dataset (Welfare), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Pupil Details (DES), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Teaching Council Register of Teachers (DES)

The aim of this project is to explore the discrepancy between the number of teachers registered on the Teaching Council Register and those actually employed as teachers, and to generate statistics that will support Teacher Supply and Demand initiatives. We wish to ascertain “Signs of Life” using the pseudonymised PPSNs of teachers who are registered but not employed as teachers by the Department of Education.

ANNUAL Reports with aggregated data in graphs and tables will be produced.

Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables 

Consolidated (ITForm11/P35L) Income dataset Migration tier (Revenue), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Property Registration Authority (PRA) folio, consideration data (PRA), Pseudonymised Stamp Duty on Property Transactions Data (Revenue)

The CSO will undertake a data matching exercise to better understand and quantify as much as possible the reasons for the differences between the number of households who rented from a private landlord published in Census 2022 and the number of registered tenancies at the end of 2021 published by the Residential Tenancies Board (RTB).

ONGOING Report/publication(s) on the rental sector in Ireland, statistical tables on PxStat

Business Register data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data 

Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Department of Education teaching and other staff information (DES), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Jobseekers Longitudinal Dataset (Welfare), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Pobal Programmes Implementation Platform - Childcare Providers (POBAL), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS Apprentice Data (SOLAS), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Teaching Council Register of Teachers (DES)

The Educational Longitudinal Database (ELD) is a statistical framework for the compilation and analysis of learner outcomes over many years. The ELD provides the basis for a series of projects that the CSO has established in collaboration with Irish public sector bodies to examine learner outcomes across a range of educational levels and programmes.

ONGOING Reports with aggregated data in graphs and tables will be produced, as will some tables for PxStat. Reports may be produced in collaboration with other agencies or by agencies working alone (but with oversight from CSO for quality and data protection matters).

 


 CSO Division: Statistical Systems Co-Ordination Unit - Horizontal Reports

 

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Pseudonymised Person Income Register Data,  Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables 

Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA)

This project will complement previous reports produced by the Department of Education and Skills related to Early School Leavers.

One-Off

Report. Tabular/Aggregated. Publication of findings.

Census of Population 2011 Data, Census of Population 2016 Data, Census 2011 Housing Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Census 2016 Housing Data,Pseudonymised Person Income Register Data

Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Grant Application and Payment Data (SUSI)

 

This project aims to provide insight into social and economic characteristics of individuals living across a range of six geographical urban/rural defined areas, defined by population density and access to services and amenities.  CSO data will be the starting point (and make up the majority of the report) but by matching with non-CSO data, additional insights will be achieved.

One-Off

Report. Tabular/Aggregated. Publication of findings. 

Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Pseudonymised Person Income Register Data (PIR), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35), Annual Business Survey of Economic Impact (ABSEI) Data Corporation Tax Historical Tax Year (April to April) Returns Data (Revenue), Income Tax Form 11 Data (Revenue), Pseudonymised Corporation Tax Data (Revenue), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised SOLAS Client and Course Details (SOLAS), CRO Accounts Details Data (DandB), Pseudonymised Corporation Tax Historical Tax Year (April to April) Returns Data (Revenue), Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Springboard and ICT Student and Course Details (HEA), CT-CRO Linking File (Revenue), Pseudonymised Grant Application and Payment Data (SUSI)  Analysis on skills by sector:
The objective of this project is to identify the key skills and education of workers by the sector in which they work. The sectors will also be subdivided between companies considered productive and non-productive at an aggregate level. This will help identify where there are potential skill gaps/shortages, or where certain skills are oversubscribed in unrelated sectors.
Ongoing  Publication/report (tabular/aggregated) 
Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Pseudonymised Person Income Register Data (PIR), Annual Business Survey of Economic Impact (ABSEI) Data   Pseudonymised Flows of Jobs and Persons Data (DEASP), Revenue Sources (REVENUE) 

A Network Analysis of Productivity Spillovers via Labour Mobility:

The objective of this research project is to analyse clusters of firms, in terms of their knowledge and skill flows, when workers switch jobs between multinational enterprises and domestic firms (and vice-versa) and assess to what extent positive or negative productivity spillovers may occur, if any.

 
Ongoing  Report/paper (tabular/aggregated), including peer-reviewed working paper. 
Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables  Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) 

The goal of this data matching project is to identify and analyse migration flows using administrative data sources. 

One-Off  Report. Tabular/Aggregated. Publication of findings. 
Censuses of Population 2011 and 2016 Data, Census 2016 Homeless Project, Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables  Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised P35LF, Employer Level Data (Revenue), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue),  Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare),  Pseudonymised Central Record System - Payment and Employment Details (Welfare), Housing Assistance Payment (HAP), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Property Registration Authority (PRA), consideration data (Bfacts), Housing Agency social housing waiting lists (DeptHous)

This horizontal report will examine the characteristics around housing tenure type and family composition. It will look at:
- The make-up of family composition in Ireland
- The diversity of tenure and dwelling types
- Socio-economic analysis of families receiving state housing assistance payments and rent supplements

The purpose of the project is to contribute to the evidence-base for the development of housing policy.

One-Off  Report/Paper (tabular aggregated data) 
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables,  Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables   Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS),  Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) The goal of this data matching project is to obtain population activity counts and identify and analyse migration flows using administrative data sources.
Note that this project expands the aims of project ID 1126 (above) by including population counts and including the dataset SPP35 in the data matching proposal.
One-Off Report. Tabular/Aggregated. Publication of findings. 
Pseudonymised Flows of Jobs and Persons Data from DEASP and Revenue Sources, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables  Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS),  Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Stamp Duty (1980-2009) Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Vehicle Registrations Data (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue),Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised  Vehicle Licencing Data (DTTAS), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous) This project will develop and build a social and economic aggregate statistical analysis of offenders (before and after prison). It will help with:
- Understanding offenders' interactions, at an aggregate level, with the State before and after release, e.g. whether they are registering for welfare support, housing or education
- Measuring/gauging reintegration into the community after prison
This information will be used to help inform policy discussions and development regarding the offender population.
One-Off Report/Paper (tabular aggregated data)
Pseudonymised Person Income Register Data Pseudonymised Post Primary Pupil Details (DES), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS) The CSO wishes to collaborate further with the Road Safety Authority (RSA). This project will demonstrate the value of ADC data to the RSA and the mutual benefit of further collaborative projects. One-Off Report. Tabular/Aggregated. Publication of findings.
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO), Pseudonymised Person Income Register Data (CSO), Pseudonymised Census of Population, with GeoDirectory and DEASP Variables (CSO)

Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised DSP Extract About Ukrainian Refugees (Welfare), Pseudonymised Ukrainian Primary Pupil Details (DES), Pseudonymised Ukrainian Post-Primary Pupil Details (DES), Ukraine Driver Licence Exchanges (DTTAS), Pseudonymised Ukrainian employees under the Temporary Protection Directive Data (Revenue), Pseudonymised PREM Registrations Data (Revenue)

Update of and extension to DMP 1405. Includes new datasets on driving licences and employment (PREM register for NACE and PAYE data).

The goal of this data matching project is to obtain statistical information and insight around the circumstances and integration of migrants in Ireland, in particular refugees from Ukraine.

Ongoing

Outputs will take the form of one or more reports with graphs and tables of aggregate data (including tables available on PxStat).
Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data 

Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Primary Pupil Details (DES), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS)

This project is expected to provide aggregate statistical data exploring the social and economic lives of one-parent families in Ireland, including income, employment and welfare.

One-off

The final output will be disseminated as a Frontier or Pathfinder release including tables, graphs and an infographic.

 

 

CSO Division: Sustainable Development Goals & Indicator Reports

 

CSO Dataset Matched

Non-CSO Dataset Matched

Reason

Frequency

Statistical Outputs Obtained

Census of Population 2016 Data

Directory of Irish Property Addresses, including Eircodes (GeoDir), OSi National Mapping Database (PRIME 2) (OSi)

 

To use the coordinates of the Census 2016 geography dataset and the coordinates of a number of destination points to calculate the shortest-path distance of residential dwellings to various services and infrastructure. This is to examine the effect of proximity to certain day-to-day services relative to where people are living.

One-Off

It is proposed to produce a publication on proximity containing, inter alia, average distance by county and urban-rural, an investigation of settlements with core services and an analysis of isolated dwellings in rural areas.

 

Address Matching Tool Sets using GeoDirectory (GeoDirAMToolSets) (CSO), Census 2016 Housing Data (CSO), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CSO)

OSi National Mapping Database (PRIME 2) (OSi)

The objective of this project is to continue the work on the examination of the proximity of the population to everyday services and infrastructure by measuring the shortest-path distance from an origin (the coordinate of a residential dwelling on the Census 2016 dataset) to a destination (the coordinate of a particular facility or infrastructure).

Ongoing

The CSO has a central role in the production of indicators for the Sustainable Development Goals (SDGs). Three indicators are involved: 11.2.1 (Proportion of population that has convenient access to public transport, by sex, age and persons with disabilities), 11.7.1 (Average share of the built-up area of cities that is open space for public use for all, by sex, age and persons with disabilities), and 9.1.1 (Proportion of the rural population who live within 2 km of an all-season road).

 


 

 Archive of completed Data Matching Activities