A CSO Data Protocol for how the CSO manages the combining of CSO and non-CSO data came into effect in May 2005. The Protocol covers any work undertaken within the CSO to match the individual records contained in two or more data holdings, at least one of which originates outside the Office.
It also covers any assistance the CSO may give to other public authorities to enable them to link data holdings under their control for statistical purposes.
The tables below detail CSO Divisions engaged in data matching, datasets matched and outputs obtained.
Queries may be e-mailed to Dataoffice@cso.ie.
CSO Division: Administrative Data Governance and Analysis
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
CSO: Mortality Data;Census 2016 data | DEASP:CRS DATA | To produce an updated version of the Mortality Differentials in Ireland release using the 2016 Mortality and Census data | Every 5 years in line with Census | Tabular |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables | Pseudonymised Income Tax Form 11, Business Details Data (Revenue) | To examine the possibility of compiling a dataset from which the average weekly wage for full-time equivalent farm employees can be calcualted for the Agricultural Accounts. | Ongoing | The output will be in tabular format. |
Pseudonymised Person Income Register Data (PIR), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35) | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Grant Application and Payment Data (SUSI) | This project investigates the possibility of utilising data obtained from administrative sources in order to identify links between individuals and create an experimental household register. | Annual | Outputs will be varied ranging from an experimental household composition register to a quality report which compares the distribution of households from the register with the 2016 Census results. |
Address Matching Tool Sets using GeoDirectory (GeoDirAMToolSets) | Directory of Irish Property Addresses, including Eircodes (GeoDir), Registered Deaths Data (GRO) | The purpose of the project is to understand if there are variations in mortality in the Mid-West Region. | One-Off | The researchers are analysing the mortality data to see if there are any variations in mortality in the Mid-West - They intend producing a report. |
Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis) | Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | To check the accuracy of the Administrative Data Centre’s (ADC) geocoding process and to identify issues that may need improvement. This to be done by matching persons in both datasets to see if their place of residence appears in the same geographical areas i.e. small areas (SA), electoral divisions (ED) etc. | One-Off | Report or paper in aggregated tabular form. |
Census of Population 2016 Data | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE) | To produce a socio-economic analysis of the COVID-19 pandemic and COVID-19 mortality differentials using the Census of Population 2016, the anonymised Central Record System datasets and data sources of the Health Service Executive (HSE) which have been supplied to the CSO to support analysis of COVID-19 related issues. | One-Off | Tabular output. |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | This is an update of a previous project to include the inclusion of PMODAnalysis as SPP35 is no longer being updated. To build a register from already pseudonymised data sources in order to allow the creation of single source income register. | Ongoing | Creates the dataset called PIR which is used as an input into various statistical outputs. |
Business Register Data, Earnings Hours and Employment Costs Survey Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue)~ Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue) | The PMOD Analysis Group aims to utilise the Revenue's PAYE Modernisation data along with other administrative and survey datasets to develop a standardised approach to the analysis of PMOD linked data and to produce a range of new, timely and informative outputs for the CSO. The output envisaged will be a continuation of the following - https://www.cso.ie/en/releasesandpublications/fp/fp-c19issse/impactofcovid-19incomesupportsonemployees | Ongoing | Population Pyramid Cohort analysis: An employment and earnings analysis of joiners, leavers, stayer. Monthly trends in earnings and employment Aggregated statistics presented in tabular form by various economic and demographic characteristics including Economic Sector, Size class, Income Distribution, Gender, Age group, Region. https://www.cso.ie/en/releasesandpublications/fp/fp-c19issse/impactofcovid-19incomesupportsonemployeesq42020-insightsfromrealtimeadministrativesourcesseries2/ |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data | Building Energy Rating details for domestic premises (SEAI), Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Stamp Duty on Property Transactions Data (Revenue), Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | This is an update of previous project to include an additional data source and update project purpose. The project purpose is to solely create a publication which examines the types of cohort who purchase properties in Ireland. The inclusion of the CCRAnalysis will be used to investigate the potential of including buyers with and without a mortgage. | One-off | Frontier Publication, PXStat files and also potentially a value added dataflow on the ADC portal or a new analysis tier data flow which has been created by pseudonymising eStamping. |
None | Pseudonymised DSP Extract About Ukrainian Refugees (Welfare), Retail PostOffice List (An Post) | The Analysis version of the Payments data set of the DSP extract concerning Ukrainian Refugees (UKR_DSP_Extract) will be enriched with Post Office geo-location by linking the data set with the list of Post Offices as provided by An Post (Retail PostOffice List (An Post) ). This will enable geo-location of DSP payments to act as proxy for Ukrainian refugees distribution around the state. This will also aid policy response to refugee influx. | Ongoing |
The ADC is not creating any Statistical Outputs. The ADC will be performing the matching on behalf of staff from the SSCU and Methodology sections of the CSO and will store the result sets in the Analysis tier of the ADC repository. The statistical outputs expected from the matching are: |
None | Directory of Irish Property Addresses, including Eircodes (GeoDir), PPSN and Personal Details Data (Revenue), Stamp Duty on Property Transactions Data (Revenue), Building Energy Rating details for domestic premises (SEAI), Registered Births Data (GRO), Registered Deaths Data (GRO), Registered Marriages Data (GRO), Primary Care Reimbursement Service Data (HSE), Central Record System - Client, Payment and Employment Details (Welfare), Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB), National Vehicle and Driver File, Driver Details (DTTAS), Higher Education Student and Course Details (HEA), Grant Application and Payment Data (SUSI), Housing Assistance Payment (LCouncil), COVAX Vaccination Data (HSE) | To create geospatial reference data to be added to admin datasets. The processing involves passing the address information of all relevant data flows through a geocoding facility and collating the resulting geospatial information for statistical purposes. This will be an ongoing activity to be applied to different datasets | Ongoing |
This product will enable the compilation of small area statistics from admin data including Census-like population estimates. Pseudonymised geospatial info will be generally available within the CSO for appropriate statistical projects. |
CSO Division: Agriculture, Transport and Tourism
CSO Dataset Matched
|
Non-CSO Dataset Matched |
Reason
|
Frequency
|
Statistical Outputs Obtained |
Agriculture Register |
Farm database from Department of Agriculture and Food |
Update CSO Agriculture register |
Annual |
Details of farm 'births' |
Census of Agriculture 2010 Survey of Agricultural Production Methods 2010 Farm Structure Survey 2013 Annual June Crops & Livestock Survey |
Animal Identification & Movement database for cattle & the Single Payment Scheme for crops December Sheep & Goat Census (DAFM) |
To enable CSO to fulfil requirements for Agriculture data under Regulation 2018/1091, Regulation 1166/2008 and Regulation 543/2009. |
Annual |
Census of Agriculture; Annual June Crops & Livestock Results; Farm Structure Survey Results |
Annual June Crops & Livestock Survey |
CRS Client ITForm11Per_Analysis AgriSingleFarm ITForm11Bus_Analysis SPP35 |
Match the Annual June Crops & Livestock Survey 2016 returns with CRS Client data to check the age, marital status and gender of the farm holder in these returns. This happens every 3/4 years during FSS or CoA processing |
Ongoing |
Farm Structure Survey dataset |
Vehicle Licensing |
National Vehicle and Driver File, Driver Details (DTTAS) |
The aim of the project is to calculate the total vehicle-kilometres from odometer readings which is a business requirement for the transport section. |
ANNUAL |
The following PXstat tables will be published: THA10 Road traffic volumes by type of vehicle and year THA17 Road traffic volumes by fuel type, county, year of registration and type THA18 Road traffic volumes of cars by type of ownership, engine capacity, fuel type, country and year of registration THA19 Road traffic volumes of goods vehicles THA20 Road traffic volumes of small public service vehicle |
CSO Division: Applications
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Business Register Data |
Address Matching Tool Sets using GeoDirectory (CSO) |
To GEO Code geographical locations of Local Units within a Business Register structure or NACE classification. |
Ongoing |
Visualisation of geographical units within the Central Business Register. |
›
CSO Division: Balance of Payments
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Balance of Payments Data | Section 110 Revenue Data | To match BOP data to ADL Section 110 company data in order to identify SPEs | Once Off | Tabular |
Balance of Payments Data, BOPSMS | CRO Accounts Details Data (DandB), Annual B1 Company Returns Data (CRO) | The project proposes to link CRO accounts and ownership data to Balance of Payments data and register information for the purposes of validating respondent data, monitoring survey coverage and informing survey recruitment. | Ongoing | The project will facilitate the production of spreadsheets detailing like-for-like comparisons between BOP data and CRO accounts and ownership data. |
Balance of International Payments, Structural Business Statistics - Industrial, Structural Business Statistics - Services,) Business Register data | Annual Business Survey of Economic Impact (DETE) | Derivation of grossing factors of both Balance of Payments (BOP) profits and services exports/imports using ABSEI, CIP and ASI data. In Balance of Payments we collect detailed information on all bop relevant enterprises. Profits and trade in services of the relevant manufacturing and non-financial service companies not covered by the BOP surveys are currently estimated from Census of Industrial Production and Annual Services Inquiry returns. | ANNUAL | Grossing factors of both Balance of Payments (BOP) profits and services exports/imports using ABSEI, CIP and ASI data. |
Structural Business Statistics - Industrial, Trade Enterprise Characteristics (TEC) Data, Business Register data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | To analyse the effect of the Brexit referendum and Brexit itself on the Irish economy. This includes, the geographical source and destination of goods exports and imports, and the effect on the firms that were most exposed to the UK market. | ONE-OFF | A frontier publication on the effect of Brexit on the Irish economy. Aggregate data in charts and tables. Regression results presented in tabular form with coefficients, standard errors, P-values and model metrics. |
CSO Division: Business Statistics, Business Register & Purchasing Power Parities
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Business Register |
CRO data primarily relating to company ownership and company accounts |
Enhace the usefulness of the CRO data by classifying the records by economic activity |
Quarterly |
Improved economic statistics |
Business Register
|
Revenue - VAT; PREM (employer registrations); Income Tax; Corporation Tax; P35 files |
Update CSO register |
Quarterly and annual |
Improved CSO business register |
Business Register
|
Companies Registration Office registration file |
Improve the quality of the CSO business register |
Monthly |
Improved CSO business register |
Business Register |
CRO file containing most recent Annual Return |
Help fulfill European requirements and also help with sampling |
Continuous
|
Improved quality business register as a basis for statistical surveys, etc. |
CSO Business Register |
GEO Directory |
Improve location of Enterprise |
Continuous |
Improved quality business register as a basis for statistical surveys, etc. |
Business Register |
EuroGroups Register |
Contribute to the setup and maintenance of the EuroGroups Register as required under EU law. |
Continuous |
Improved quality of statistical outputs that are affected by multinational groups, e.g. FD statistics, Outward FATS, Inward FATS. |
CSO data for BERD, CIS, CIP, ASI and Business Register |
The Dept of Jobs, Enterprise and Innovation's Annual Business Survey of Economic Impact (ABSEI) |
The purpose of this Data Matching Project is to ascertain from Dept of Jobs, Enterprise and Innovation (DJEI) (formerly Forfás), using data from their Annual Business Survey of Economic Impact (ABSEI), a list of the likely performers of R&D in Ireland. The data matching will be done by CSO in line with the Memorandum of Understanding in place (under the Statistics Act, 1993) between the CSO and DJEI, and the results of the matching will be sent to DJEI. |
Ongoing |
An anonymised matched file of likely R&D active firms in Ireland. |
Business Register Data |
Pseudonymised Integrated Short Term Payment System Data (Welfare), Integrated Short Term Payment System Data (Welfare), Vat Information and Exchange System Acquisitions Data (Revenue), Vat Information and Exchange System Dispatches Data (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue) |
To identify signs of activity in the Irish business economy during the COVID-19 period of restrictions on trade and subsequent re-opening. |
One-Off |
Tabular output, presenting aggregated statistics by various economic and demographic characteristics including economic sector, size class, region. All outputs will be have checked in line with standard CSO practices regarding confidentiality. |
None |
Corporation Tax Data (Revenue), CRO Accounts Details Data (DandB) |
The purpose of this Data Matching Project is to compare parsed iXRBL company accounts data to statistical returns of the Annual Services Inquiry to determine if it can further enhance current data holdings. |
Ongoing |
An anonymised matched file of services firms in Ireland. |
Structural Business Surveys |
Pseudonymised Income Tax Form 11, Business Details Data (Revenue) |
The purpose of this Data Matching Project is to compare Form 11 data to statistical returns of the Structural Business Surveys (SBS: ASI, CIP and BCI) to determine if it can further enhance current data holdings. |
Ongoing |
Improvements in SBS tabular estimation processes |
Business Register Data |
VAT Registrations Data (Revenue) |
Match data from the Business Register with the VAT Register to obtain valid email addresses. |
Annual |
The output is purely concerned with the replacement and updating of email addresses to conduct surveys. This singular piece of data will be available within the CSO Data Management System and only available to those with clearance to operate the survey. |
None |
Pseudonymised VAT Trader Returns (VAT3 and RTD), Data Combined (Revenue) Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue) | Business Statistics DCU is to develop the use of administrative data sources for compliance with the EBS regulation and particularly for the provision of Eurostat SBS early estimates. This project will feed into the development of the SMS, improve the quality of our statistics, reduce the burden on our respondents and possibly generate additional outputs. | Ongoing | The output is concerned with the consolidation of new data sources to the greatest extent possible for additional compliance with the EBS regulation, and the development of a process flow to implement admin data sources for SBS early estimates. The data will be housed securely within the CSO Data Management System and only available to those with clearance to operate the survey. |
Production in Building and Construction Index |
Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue) | VAT returns and PMOD data will be linked to the PBCI returns to train various machine learning models to develop a model-assisted estimator for construction | ONE-OFF | The initial research will focus on the feasibility of using model-assisted estimation (MAE). MAE is where we train auxiliary information from the survey returns to estimate our variable of interest (in this case, the monthly Value of Work done in Construction sector). The model assisted estimator is expressed as the sum of the population total of predictions and an adjustment term that accounts for model misspecification. |
CSO Division: Census Management
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
None | Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB) | This project uses remote sensing techniques and data to aid census enumeration. A model has been developed which analyses high resolution aerial imagery and returns the precise location of all objects the model believes are buildings in the image it consumes. The intention of this DMP is to verify if this model can be used to confirm secondary residential units exist by comparing it to locations of known tenancies. The result will be a statistical measure of the models accuracy. | One-Off | By comparing the coordinates of the deep-learning model with locations of possible secondary dwellings on RTB and LPT, it will be possible to count the number of coincident pairs, this can be taken (somewhat) as a statistical measure of accuracy and this is the only anticipated output. |
Census of Population Data | Income Tax Form 11 Data (Revenue), PPSN and Personal Details Data (Revenue), Building Energy Rating details for domestic premises (SEAI), QQI Course and Award Details Data (QQI), Child Benefit Data (Welfare), Central Record System - Client, Payment and Employment Details (Welfare), Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB), SOLAS PLSS Client and Course Details (SOLAS), Long and Short Term Social Welfare Payments Data (Welfare), Water Consumption Details for Residential Properties (IrishWat), Central Record System - Client Details (Welfare), Higher Education Student and Course Details (HEA), PAYE Real Time Data (Revenue) | The purpose of the work is to improve the quality of the census 2022 dataset by linking and imputing records from administrative datasets to reduce household, person and item non-response. Additionally by creating this link it will allow the CSO to publish cross sectional publications such as the Geographical Profiles of Income in Ireland, Offenders 2016 and Tenure and Households in Ireland releases. | One-Off | A series of thematic publications using census data with accompanying interactive tables and maps covering topics such as population, housing, families, employment and education. This project will improve the quality of these reports by reducing non-response. Also it will allow the publication of cross sectional publications including the Geographical Profiles of Income in Ireland, Offenders 2016 and Tenure and Households in Ireland releases. |
Census of Population Results | Post Primary Pupil Details (DES), Primary Pupil Details (DES) | Improve the quality of census commuting data using place of school data for students. | ONE-OFF | The data will be used in the Place of Work, School, College - Census of Anonymised Records (POWSCAR) research microdata file. This file is used for the analysis of commuting patterns by authorized researchers. This file is scheduled to be made available on 19/10/ 2023. Census 2022 Profile 7 - Employment, Occupations and Commuting will be published on the 30/11/2023. This release will contain charts, maps and tabular data of commuting patterns and related variables such as means of travel. |
CSO Division: Ecosystems Accounts
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Business Register data |
VAT Registrations Data (Revenue) |
The CSO Ecosystems Accounts Division will conduct the Waste Generation Survey of enterprises. This survey is carried out on a bi-annual basis to fulfil waste generation reporting requirements for Eurostat, in accordance with the Regulation on waste statistics (EC) No. 2150/2002, amended by Commission Regulation (EU) No. 849/2010. A link between the Business Register and the VAT Register is required to provide email addresses for enterprises in the sample, to which the survey is sent. |
ONGOING |
Statistical outputs include aggregated data in tabular form which is delivered to Eurostat on a bi-annual basis. |
Business Register data |
EPA Pollution Release and Transfer Register (EPA) |
The CSO Ecosystem Accounts Division conducts the Waste Generation Survey of enterprises. This survey is carried out on a bi-annual basis to fulfil waste generation reporting requirements to Eurostat, in accordance with the Regulation on waste statistics (EC) No. 2150/2002, amended by Commission Regulation (EU) No. 849/2010. The Environmental Protection Agency (EPA) Pollution Release and Transfer Register is matched to the CSO Business Register to exclude EPA facilities from the survey sample. |
ONGOING |
Statistical outputs include aggregated data in tabular form which is delivered to Eurostat on a bi-annual basis. |
›
CSO Division: Environment & Climate
CSO Dataset Matched |
Non-CSODataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Business Energy Use Survey Data, Census of Industrial Production Data, Annual Services Inquiry Data
|
Emissions Trading Scheme Data (EPA), Large Industry Energy Network Data (SEAI), Administrative Business Files for CSO Business Register (Revenue) | To manage the survey response burden by using administrative data where possible. | Annual | Annual Statistical Release |
CSO Business Register |
Irish Water non-domestic datasbase |
The purpose is to obtain data on water consumption by NACE sector to meet Eurostat and other requirements on water statistics e.g. Inland Waters questionnaire and Water Framework Directive. |
Ongoing |
The output will be in tabular format. |
CSO Business Register and CSO Trade Register |
EPA Pollution Release and Transfer Register.
Dublin City Council National Trans Frontier Shipment Office. |
Matched for Waste Statistics in the Environmental Statistics Division |
Ongoing |
Linkage created between EPA PRTR register and CSO Business Register. NTFSO matched to CSO external trade statistics register. |
Survey on Income and Living Conditions Data, Household Budget Survey Data, Census of Population 2011 Data, Census of Population 2016 Data |
Better Energy Warmer Homes Data (SEAI), Electric Meter Data (ESB), Air Quality Data (EPA), Building Energy Rating Details (SEAI), Long and Short Term Social Welfare Payments Data (Welfare), Gas Usage Details for Residential and Commercial Customers (GasNetwk) |
To analyse the factors leading to energy poverty; the impact of the environment on health; and related issues. |
Ongoing |
The Statistical outputs will be a statistical release with an analysis of factors leading to energy poverty, the impact of the environment on health and related issues. |
Census of Population 2011 Data, Census of Population 2016 Data, Census 2011 Housing Data, Census 2016 Housing Data | Building Energy Rating Details (SEAI) | The objective of this data matching project is to facilitate research to assess the extent of residential solid fuel use in Ireland and identify the factors that determine households' use of solid fuels. | One-Off | An anonymised RMF will be produced. Access to the RMF has already been requested by University College Cork, for research into residential solid fuel use. Proposed outputs include reports, policy briefs and academic papers. |
Survey on Income and Living Conditions Data | ESB MPRN, Building Energy Rating details for domestic premises (SEAI) | The objective of the data matching project is to facilitate research into causes of energy poverty. Traditionally households were classified as in energy poverty based on income and heating and other fuel costs. More recently, energy efficiency of the household is recognised as an important determinant. The data matching project would combine the most important variables for energy poverty research from the CSO Survey on Income and Living Conditions and SEAI Building Energy Rating audits. | ANNUAL | CSO is a member of an energy poverty research group along with the Department of Environment, ESRI, SEAI, Social Protection, CRU, and Gas Networks Ireland. The RMF file will allow researchers to analyse energy poverty in a broader context by incorporating the energy efficiency of the dwelling into the analysis. |
Business Register data | VAT Registrations Data (Revenue) | The CSO Environment and Climate Division conducts several surveys including the Environmental Expenditure Survey, Roundwood Removals Survey, Wood Inputs Survey and the Green Economy Survey. The purpose of the surveys are to compile data to fulfill EU regulations and questionnaires regarding various environmental statistics. This data matching project allows for the Division to obtain updated email addresses for enterprises which are then used for survey post outs. | ONGOING | Expected final outputs from surveys include: 1. Reporting of aggregated data in tabular form as required by Eurostat 2. CSO statistical releases containing aggregated data in tabular form |
CSO Division: Government Accounts Compilation & Outputs
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
None |
List of Inspected Nursing Homes and Bed Numbers (HIQA), Income Tax Form 11 Data (Revenue), Corporation Tax Data (Revenue) |
Matching HIQA list of inspected nursing homes and bed numbers to Revenue Data Files (CT File & IT form 11 data file) to estimate average cost of nursing home beds. |
Annual |
Tabular Output |
CSO Division: Growing Up In Ireland
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Growing Up in Ireland |
Central Record System - Client Details (Welfare), Higher Education Student and Course Details (HEA) |
To apply a CSO PPSN to all person records in the Growing Up in Ireland Cohort ’98 Sample File. |
One-Off |
The output will be a file matching the GUI survey ID to the CSO PPSN. |
Growing Up in Ireland |
COVAX Vaccination Data (HSE), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA) Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil) Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) Pseudonymised Post Primary Pupil Details (DES) Pseudonymised QQI Course and Award Details Data (QQI) Pseudonymised SOLAS Apprentice Data (SOLAS) Pseudonymised SOLAS PLSS Client and Course Details (SOLAS) Pseudonymised Springboard and ICT Student and Course Details (HEA) |
The purposes of the proposed data matching is to augment the GUI dataset with administrative data to lessen the respondent burden and enrich the dataset. |
Ongoing |
Tabular data in statistical releases. The data may also be included in a Research Microdata File (RMF), available to approved users subject to the CSO RMF Policy. |
Growing up in Ireland, Business Register data |
Directory of Irish Property Addresses, including Eircodes (GeoDir), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA) |
The purposes of the proposed data matching is to augment the GUI Cohort 98 dataset with administrative data relating to education and employment. |
ONGOING |
Tabular data in statistical releases. |
Growing up in Ireland - '98 Cohort, Pseudonymised Person Activity Register Data, Pseudonymised Person Income Register Data |
Central Record System - Client, Payment and Employment Details (Welfare), Higher Education Student and Course Details (HEA), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare) Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Live Register Claims Data from DEASP Integrated Short Term System (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Primary Care Reimbursement Service Data (HSE), Registered Deaths Data (GRO) GRO_Deaths (GRO) |
This is an extension of the already agreed DMP 11018, as in that data matching protocol the purpose of this DMP is to augment the GUI Cohort 98 dataset with administrative data and to expand the scope of possible analyses that can be carried out on the data. |
ONGOING |
Tabular data in statistical releases. The data may also be included in a Research Microdata File (RMF), available to approved users subject to the CSO RMF Policy. |
Growing up in Ireland - '08 Cohort, Growing up in Ireland - '24 Cohort, Growing up in Ireland - '98 Cohort |
Water Quality by water supply zone (TCD), Water supply zones shape files (Irish Water) |
In agreement with researchers from TCD, the Growing up in Ireland division is to facilitate a research project linking GUI's longitudinal data on the residencies of survey participants with the quality of water they would have had access to as children. Of special interest is whether the water participants consumed was fluoridated or not and tracing the effects of this on participants health through all cohorts and waves of GUI data collection, including the new birth cohort (Cohort 24). |
ONGOING |
Eventual outputs from this project will include a series of RMFs; one per wave and cohort with relevant data from the three data sources listed together. For future GUI data releases published on future GUI survey data, data on water supply zone will be added to the AMFs and RMFs. Publications on this research may be published by TCD for which CSO will monitor and support. This will be the first project for Growing Up in Ireland in CSO to use geospatial data and geographical information systems. |
›
CSO Division: Income, Consumption and Wealth
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
SILC dataset |
SUSI data
|
To assess whether adminsitrative data can be used to replace variables on the Survey on Income and Living Conditions (SILC) to reduce the burden on respondents, particularly with respect to education grants. |
Initially once off, maybe annually depending on the results. |
Tabular, diagrams and written comment |
HFSC Household Finance and Consumption Survey |
AgriSingleFarm – Pseudonymised Single Farm Payment Data |
Verification of data in the Household Finance and Consumption Survey (HFCS), possible imputation of missing values.
We will also assess whether administrative data can be utilised to replace some survey questions and thus lessen the burden on respondents. |
It is anticipated that the HFCS will be produced every 3 years form 2020 | Tabular, diagrams, written comment. |
Survey on Income and Living Conditions Data |
Pseudonymised Housing Assistance Payment - Analysis Tier (HAP) |
To assess whether administrative HAP data can be used to replace variables on the Survey on Income and Living Conditions (SILC) to reduce the burden on respondents, and for data validation. | Annual | Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements. Queries requested will be provided within CSO guidelines for confidentiality. |
HFSC Household Finance and Consumption Survey |
Housing Assistance Payment (HAP) |
To verify data provided by respondents in the Household Finance and Consumption Survey (HFCS) and to match data in cases of non-response. | Ongoing | The data will provide the monthly rent paid by a HAP household and also the amount paid on their behalf to the landlord. These are core variables in the HFCS used to calculate expenditure and social transfers. |
Survey on Income and Living Conditions Data | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | PMOD will replace P35 administrative data for employee income from SILC 2019. This matching project is for the use of PMOD income data in SILC processing, reducing the burden on survey respondents and increasing the accuracy of SILC data. | Annual | Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements. Queries requested will be provided within CSO guidelines for confidentiality. |
Household Finance and Consumption Survey Data, Survey on Income and Living Conditions Data, Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised DEASP Covid 19 Illness Claims (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue) |
Update of previous project to include EWSS and COVID-19 Illness Benefits data. Analyse the affect COVID-19 has had on the financial viability of Irish households and assess the impact income support schemes (TWSS, EWSS & PUP) have had in supporting households. |
Ongoing |
Impact of COVID-19 on financial viability of households including Debt Sustainability Rates, Income to Loan Ratios, Negative Equity Rates. |
Household Finance and Consumption Survey Data | Pseudonymised Central Bank Central Credit Register on loans data (CBI) |
Data from the CCR is matched to respondents of the Household Finance and Consumption Survey (HFCS) in order to accurately estimate debt at the household level in Ireland. In order to populate some debt variables of the HFCS, pseudononymised CCR data will be matched to HFCS respondents using CSOPPSN as a linking variable. |
Ongoing |
The data will fill core variables of the HFCS including data on mortgages, personal loans, credit cards and overdrafts |
Household Finance and Consumption Survey Data | Central Bank Central Credit Register on loans data (CBI) | Data from the CCR is matched to respondents of the Household Finance and Consumption Survey (HFCS) in order to accurately estimate debt at the household level in Ireland. For these cases only, personal data of the individuals from the HFCS sample frame is matched to data on the CCR source tier. | One-off | The data will confirm the identity of certain HFCS respondents in the CCR in order to fill core variables of the HFCS including data on mortgages, personal loans, credit cards and overdrafts. |
Survey on Income and Living Conditions Data | Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue) |
The primary purpose of this project is to allow SILC RAP to perform final edit checks on income variables in the SILC survey, as well as coherence checks. |
Annual |
No outputs are directly expected from this project. If errors/discrepancies are spotted in the final income data through this matching, it will be reverted to the SILC DCU team for correction. The second objective centers around exploring potential uses of the administrative datasets, but any updates to be incorporated into the actual processing of SILC data will be done by SILC DCU & covered by a separate matching request. |
Household Budget Survey | Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil) |
To obtain rent details for HBS respondents who are renting their homes through the HAP system. |
Annual |
The outputs will be: |
Household Budget Survey | Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB) |
The matching exercise will be done for 2 different reasons. |
Annual |
The outputs will be: |
Household Budget Survey |
Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue)
|
The purpose of the matching exercise is to obtain income and deductions made at source i.e. tax, USC, PRSI, pension contributions etc. for HBS respondents who are in the PAYE system. | Annual | Gross income and deductions from income made at source (e.g. tax, PRSI, USC, pension contributions, etc.) for HBS respondents who are paid through the PAYE system. |
Household Budget Survey | Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised Corporate Customer System Data (DAFM) | The purpose of this work is to link respondents to the HBS with their Basic Farm Payments. The Basic Farm Payments are used in the calculation of farm income. Basic Farm payments are also known as "Single Farm Payments" | Annual | We expect to obtain the Basic Farm Payment component of income for farm households included in the HBS |
Household Budget Survey | Pseudonymised Income Tax Form 11, Person Details Data (Revenue) | The IT Form 11 data will be used to provide details of income from self employment for HBS respondents | Annual | Income from self employment for respondents to the HB |
Household Budget Survey | Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | To obtain details of social welfare income for respondents to the HBS | Annual | Income from social welfare sources for respondents to the HBS |
Household Budget Survey | Pseudonymised Grant Application and Payment Data (SUSI) | The purpose of the matching exercise is to link HBS respondents to any income they may have obtained through Education grants from SUSI. The HBS collects information on all household income of which education grants may be a component. | Annual | The output will be the component of household income that is obtained from SUSI education grants. |
Household Budget Survey | Central Record System - Client Details (Welfare) | The CRS_Client_Source file is used to verify PPSNs that are collected in the HBS | Annual | No actual data outputs from the matching procedure. We will simply obtain confirmation of which PPSNs are correctly assigned to HBS respondents. |
Household Finance and Consumption Survey Data | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | Verification and supplementation of data in the Household Finance and Consumption Survey(HFCS) | Ongoing | The data will provide PAYE income and pension amounts for the survey reference period of HFCS respondents. These are core variables in the HFCS used to estimate household PAYE income amounts and pension contributions. |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, | Pseudonymised Animal Identification and Movement Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Sheep and Goat census data (DAFM), Pseudonymised Single Farm Payment Data (DAFM) | The supplementing of survey data with administrative data for the Household Finance and Consumption Survey (HFCS). | ONGOING | Tabular, diagrams and written comment |
Survey on Income and Living Conditions | Vital Statistics data (Bfacts) | Research project exploring excess mortality amongst people at risk of poverty or living in consistent poverty. | ONE-OFF | CSO Frontier publication |
Survey on Income and Living Conditions | Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Person Income Register Data (CSO), Pseudonymised Single Farm Payment Data (DAFM), Pseudonymised VAT Registrations Data (Revenue), Pseudonymised VAT Activity Analysis (VAT3 and RTD) Data (Revenue), Pseudonymised VAT Trader Returns (VAT3 and RTD) Data (Revenue) | The purpose of this project is to allow SILC RAP estimate current income for the Survey on Income and Living Conditions. Using the latest administrative income and modeling and estimating other income components will allow the estimation of current income for individuals and households thus allowing the production of "Flash" income estimates and poverty and deprivation rates. | ANNUAL | State level: - gross, net, disposable and equalised income. - Poverty rates - Deprivation rates |
Census of Population Results | Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Single Farm Payment Data (DAFM) | It is required by the Social Data Design Division who require an updated Census linked to income sampling frame for all surveys that now require income for the sample design. Secondly it is required to update the publication "Geographical Profiles of Income in Ireland 2016" which there is a high demand for by both internal and external users. Finally, there has been an urgent request from the Department of Housing which require this updated data for their Housing Need and Demand Assessment tool. | One-off | There will be two statistical outputs from this project: 1. A dataset with combined pseudonymised Census 2022 data and income, which is to be used as a sampling frame by the Social Data Design division for CSO surveys. 2. A publication which will be an update of "Geographical Profiles of Income in Ireland 2016" based on Census 2022 and calendar year income 2022. This will provide income data at electoral division and local authority level as well as other demographics. |
Census of Population Results | Pseudonymised Central Bank Central Credit Register on loans data (CBI), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | Creation of benchmarks for the Survey on Income and Living Conditions (SILC) data following the publication of Census 2022. | One-off | Benchmark files for use in the calibration of weights for the Survey on Income and Living Conditions (SILC) and possibly other ICW household surveys such as the HBS and the HFCS. |
›
CSO Division: International Trade In Goods
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Trade Register Data, INTRASTAT survey data, Received Microdata under European legislation, Customs declarations |
VAT Trader Returns (VAT3 and RTD) Data (Revenue), Vat Information and Exchange System Acquisitions Data (Revenue), Vat Information and Exchange System Dispatches Data (Revenue) |
Intrastat survey data, Customs declarations and VAT information are all required as part of the system to produce Intra and Extra EU trade statistics in compliance with European legislation. |
Ongoing |
Detailed trade in goods statistics in compliance with European legislation |
Business Register data |
Detailed Trade Statistics |
Matching of Business Register with trade statistics detailed microdata required for compliance with TEC (Trade by Enterprise Characteristics) reporting to Eurostat and production of anonymised trade data at enterprise level for production of Researcher Microdata Files (RMF) |
ANNUAL |
The statistical outputs expected are annual TEC data for Eurostat, in compliance with EBS legislation requirements. This data may also be published domestically on the CSO website and in PX-Stat tables. RMFs (Researcher Microdata File) are produced using the business register as an anonymised company identifier to allow researchers to analyse trade data and link it to other relevant business RMFs. |
›
CSO Division: Labour Market and Earnings
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
None |
DSFA CRS; P35 file from Revenue Commissioners; CSO Central Business Register |
To investigate the extent to which foreign nationals engaged with and remained in employment |
Annual
|
CSO Statistical release; other aggregate tables
|
Census 2016 Analysis tier |
EAADS (subset of P35 analysis dataset) |
To match Census 2016 analysis level data to the data being used to prepare for the Earnings Analysis using Adminsitrative Data Sources (EAADS) release. |
Once-off |
Tables, charts |
CSO’s Earnings, Hours and Employment Costs Survey (EHECS) data |
Revenue’s P35L: “SPP35 – P35L dataset for analysis” data flow on ADC |
It is proposed that several data sources (both administrative and survey) will be used in the creation of the Earnings Analysis using Administrative Data Sources (EAADS) release.
The EAADS provides Structure of Earnings Statistics of employees within Ireland and is predominantly an administrative data project. Matching the proposed data sources will allow for an accurate and detailed EAADS to be produced, in alignment with what was previously released for 2011-14. |
Annual | Tabular, diagrams, written comment. All information will be published within CSO guidelines for web, electronic and paper dissemination & standard EU templates for Eurostat requirements.
Queries requested will be provided within CSO guidelines for confidentiality. |
Business Register Data, Earnings Hours and Employment Costs Survey Data |
Covid19Refund - Covid 19 Refund scheme (Welfare) |
To match data from EHECS with real time data from Revenue, Business Register data and data in relation to the Temporary Wage Subsidy scheme to investigate the impact of the Covid19 crisis and assess whether administrative data could be used to impute EHECS variables in the context of low response rates. | Quarterly | Statistical release and aggregate tables. |
Labour Force Survey Data, Business Register Data, Earnings Hours and Employment Costs Survey Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables |
Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue) |
Analysis of the income support schemes put in place in response to COVID 19. | Ongoing | Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, Earnings bands, Gender, Age group, Region. |
Labour Force Survey Data |
Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Revenue Employment Wage Subsidy Scheme (EWSS) data (Revenue) |
The purpose of this project is to analyse the labour market characteristics (as measured in LFS) of persons in receipt of the PUP, TWSS or EWSS pandemic income support schemes. | Quarterly | Table showing the labour market status (ILO) and Principle Economic Status (PES) of recipients of PUP/EWSS. |
Labour Force Survey Data |
Central Record System - Client Details (Welfare) |
The purpose of this project is to collect gross income data from PMOD in order to complete the INCGROSS LFS variable, an annual data requirement of Regulation (EU) 2019/1700 of the European Parliament and of the Council of 10 October 2019. Due to high item non-response and inconsistencies in collected survey data, it is proposed to use administrative data (i.e. PMOD) as a consistent high-quality source in order to satisfy the variable specification. |
Annual |
Earnings (i.e. LFS INCGROSS) microdata is included in annual microdata file transmitted to Eurostat in respect of Wave 3 responses in all quarters. Earnings (i.e. LFS INCGROSS) microdata may be included in national annual RMF file for approved researchers. |
Labour Force Survey Data |
Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised DSP Extract About Ukrainian Refugees (Welfare) |
Labour Market analysis related to Ukrainian beneficiaries of Temporary Protection. | Ongoing |
Outputs may include statistical release, bulletin, publication (incl. tables and charts) or aggregated data sent to Eurostat. |
Labour Force Survey Data |
Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue |
Enhance the scope of analysis possible from the Labour Force Survey data by adding earnings estimates. | Quarterly |
It is currently expected that the data would be included in a Research Microdata File (RMF), available to approved users subject to the CSO RMF Policy. |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Quarterly National Household Survey/2017Q3+Labour Force Survey |
Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) |
Analysis of residency and geographical status of employees. |
Ongoing |
Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Earnings, Sex, Age group, Region. |
Live Register, Business Register Sampling Frame |
Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) |
Analysis of Labour Market Flows over the COVID-19 period. |
Ongoing |
Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, MNE versus domestic, Earnings bands, Sex, Age group. |
Earnings Analysis using Administrative Data Sources, Census of Population, with GeoDirectory and DEASP Variables |
Central Record System - Client Details (Welfare), COVAX Vaccination Data (HSE), Integrated Short Term Payment System Data (Welfare), Local Property Tax Returns (Revenue) |
Add names and addresses to sample of employees selected for the Structure of Earnings Survey. |
ANNUAL |
Survey sample with correspondence details |
Live Register | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | Provide greater insight about those joining/leaving the Live Register, in particular information about the economic sectors in which they were/are employed. | MONTHLY | Tabular output, potentially in existing or new publication, and/or on PxStat. Descriptive statistics, number of individuals by various characteristics/categories. |
Earnings Analysis using Administrative Data Sources, Earnings and Labour Costs Quarterly, Labour Force Survey | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | Enhance the scope of analysis possible from the Labour Force Survey by comparing industry sector data (NACE) with other sources | QUARTERLY | Matched data would be used to supplement survey response in cases on non-response. The project will also consider the use of PMOD as the primary source for NACE categorisation in LFS by comparing with current outputs. |
Census of Population Data |
Earnings Analysis using Administrative Data Sources | To provide an estimate of a monthly reference wage for Ireland to the Department of Social Protection (DSP) in relation to meeting Irelands requirements under the European Code of Social Security. | ANNUAL | Tabular outputs |
Earnings Analysis using Administrative Data Sources, Business Register data, Census of Population Data, Earnings Hours and Employment Costs Survey, Structural Earnings Statistics data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables |
Central Record System - Client Details (Welfare), PAYE Real Time Data (Revenue), Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | It is proposed that several data sources (both administrative and survey) will be used in the creation of the Earnings Analysis using Administrative Data Sources (EAADS) release. | ANNUAL | Tabular, diagrams and written comment. Information will be published in web, electronic and paper formats as well as standard EU templates for Eurostat requirements. |
Business Register data, Business Register Sampling Frame |
Covid 19 Refund scheme (Revenue), PAYE Real Time Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | To match data from real time data from Revenue with the Business Register and data in relation to the Temporary Wage Subsidy scheme to investigate the impact of the Covid19 crisis on earnings and to produce comparative statistics on domicile of enterprise ownership. | Monthly | Statistical release and aggregate tables. |
Irish Population Estimates from Administrative Data Sources |
PAYE Real Time Data (Revenue) | It is proposed that the county and nationality variables included in IPEADS is matched to PMOD data for use in the analysis of regional distribution of earnings in the Earnings Analysis using Administrative Data Sources (EAADS) release. | Annual | Tabular, diagrams and written comment. Information will be published in web, electronic and paper formats. |
Earnings and Labour Costs Quarterly |
PAYE Real Time Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | To match data from the real time Revenue data flow with the Earnings and Labour costs quarterly survey data to explore the coherence between the two sources and frequency of irregular payments by economic sector. | Monthly | No statistical outputs. Explore coherence and consistency of outputs in advance of publication of the frontier monthly earnings series publication. |
CSO Division: Life Events and Demography
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Vital Statistics Quarterly | Address Matching Tool Sets using GeoDirectory | It is planned to update Vital statistics publications with more standard geographic units and move away from the custom vital statistics 3 digit address codes. Access is required in this case to help revise historic data and also help cut down on manual work within the section to geocode address strings. Please note this application refers to Births, Deaths and Marriages addresses. This job would be run on a yearly basis in order to ensure data is fit to publish for annual reports. | ONGOING | With small area codes assigned vital statistics will be able to produce new life events PxStat tables with the first providing breakdowns of births, deaths and marriages at local electoral level. |
›
CSO Division: Methodology
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | European 1km-square grid (ETRS) | To match Census 2016 analysis level dataset to European 1km-square grid for Eurostat transmission adjustment process. | One-Off | ETRS grid joined with HRN_PIK variable (from Census 2016 (analysis tier) dataset). |
Spatial Data | Directory of Irish Property Addresses, including Eircodes (GeoDir) | This project is a proof of concept of adding CSO statistical geographies to the GeoDirectory source and analysis tiers. This removes the need for users to do spatial analysis requiring access to source tiers of Geo-Directory and matching dataset. Moreover, the availability of this data on the analysis tier may encourage new uses of the Geo-Directory data. | One-off | The output will be a lookup table between eircodes and geographic ID: Proposed name: cso_eircode2geography |
Spatial Data | Directory of Irish Property Addresses, including Eircodes (GeoDir) | This project is a proof of concept of adding CSO statistical geographies to the GeoDirectory source and analysis tiers. This removes the need for users to do spatial analysis requiring access to source tiers of Geo-Directory and matching dataset. Moreover, the availability of this data on the analysis tier may encourage new uses of the Geo-Directory data. Proof of concept expired Data Matching ID:10918 | QUARTERLY | The output will be a lookup table between eircodes and geographic ID |
›
CSO Division: National Accounts
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Business Register; Balance of Payments; Census of Industrial Production; Annual Services Inquiry |
Revenue Corporation Tax files |
Examine consistency between Revenue profits data and relevant data from CSO surveys; derive additional NA variables |
Annual |
Improved estimates for NA variables (mainly profits)
|
Business Register Data |
P35LF, Employer Level Data (Revenue) |
This project is in place to obtain estimates of wages and salaries, ECSI and Other Labour Costs in the National Income Accounts at A64 and 2-digit Nace level. |
Annual |
Annual Compensation of Employees estimates at overall and detailed Nace level, numbers employed, average wage/ECSI/COE per employee at overall and detailed levels. |
Business Register |
Revenue Commissioners P35 file and DSFA CRS files |
To obtain county based average income data |
Annual |
To produce regional accounts and county household income |
CSO Business Register, CIP, ASI, Trade and BOP data |
Revenue Commissioners P35, Corporation Tax files and Dunn & Bradstreet (details of all companies on the CRO register) files |
To create a datafile for use internally by CSO’s National Accounts and BOP divisions. |
Annual and twice yearly. |
The data will be disseminated in National Accounts, Financial Accounts and Balance of Payments related aggregate tables. |
Business Register Data |
Pensions Authority Source Dataset (Pen_Auth) | To code Pensions contributions paid by Employers to Institutional Sector and Nace activity | Annual | Improved estimates for National Accounts CoE (D1) and labour costs (D12) |
Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (Revenue, DEASP, CSO) | To improve and extend the National Accounts supply and use tables by reconciling differences between the CSO Business Statistics and the National Accounts income estimates. | Ongoing | Improved data quality, distributional national accounts, dis-aggregated supply and use tables, economic growth accounts. |
Business Register Data |
Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue) | The process uses PMOD and the other datasets to estimate monthly Compensation Of Employees in the National Accounts Quarterly and Annual outputs. Compensation Of Employees by NACE activity and Institutional Sectors is made also and provided to Government Accounts. | Ongoing | The results are part of the Quarterly National Accounts results sent to Eurostat at T+60 days after the end of the Quarter (ESA T0103) . The results are used also in the benchmark Annual National Accounts (National Income and Expenditure) released nationally. Related annual results are sent to Eurostat (ESA T0103). The results are also part of the Output and Value Added By Activity annual release. |
Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Earnings Analysis using Administrative Data Sources Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35) ( |
CT-CRO Linking File (Revenue), CRO Accounts Details Data (DandB), PAYE Real Time Data (Revenue) | The aim of the project is to examine productivity of companies at a micro level in Ireland. This will entail company by company analysis and will require access to all of the above data sets due to the cross cutting nature of this project. Productivity is an analysis of the change in production over time achieved by the employees of an enterprise. All of this information is essential to derive the productivity indicators/assessments at an entity by entity analysis. | Annual | The statistical outputs will follow the annual productivity publication published by National Accounts- Labour productivity and GVA breakdowns/ nominal unit labour cost/ multifactor-productivity/capital deepening/capital services/hours worked/ tangible capital deepening/intangible capital deepening. See Link: https://www.cso.ie/en/releasesandpublications/ep/p-pii/productivityinireland2019/ |
Business Expenditure on Research & Development (BERD) Data (CSO), Census of Industrial Production Data (CSO), Annual Services Inquiry Data (CSO), Trade Register Data (CSO), Business Register Data (CSO), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO) |
Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Corporation Tax Data (Revenue), Non-Profit Account Details Data (Bfacts), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | To improve the National Accounts by reconciling differences between the CSO Business Statistics, Balance of Payments, Trade Statistics and the National Accounts income estimates. | Ongoing | Improved Supply and Use Table estimates, improved National Accounts accuracy, publications on trade and global value chains, similar in nature to the National Accounts publication on “Food and Agriculture: A Value Chain Analysis”. |
Census of Agriculture, Industrial Production and Turnover - RAP, Labour Force Survey, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables |
Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | The aim of the project is to calculate average income and employment statistics required in the yearly regional accounts. | Annual | I will be producing county-by-county breakdowns on numbers of employees/employers and compensation of employees for use in the regional accounts. |
›
CSO Division: Prices
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Labour Force Survey Data |
Directory of Irish Property Addresses, including Eircodes (GeoDir), Live Register Analysis (Welfare), Central Record System - Client, Payment and Employment Details (Welfare), Central Record System - Client Details (Welfare) |
This project explores small area estimation that combine data from administrative and survey sources to produce estimates for small areas or domains. |
Quarterly |
Dissemination of details on births by nationality |
None |
Stamp Duty Returns, Business Energy Rating Certificates, Geodirectory, Pobal Haase-Praschke Deprivation Index
|
The purpose of this data matching is to produce linked data on residential property transactions in Ireland. This data is used to calculate the statistics for the monthly Residential Property Price Index (RPPI) |
Ongoing |
(i) Monthly national and regional prices indices (ii) monthly indicators on the volume, value and price of residential property (iii) quarterly prices indices on new and existing dwellings (iii) annual information on non-household residential property transactions. |
Labour Force Survey Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables |
Live Register Analysis (Welfare) |
The purpose of the project is to estimate unemployment rates at county level using small area estimation techniques. | Ongoing | Unemployment rates by county |
Census of Agriculture |
Stamp Duty Returns, Property Registration Authority of Ireland Data, Geo Directory |
The purpose of this Data Matching Project is to calculate agricultural land prices by region and land type. | Annual | Tables for Eurostat and possible future CSO release. |
Business Register Data |
Pseudonymised Corporation Tax Data (Revenue) |
Biennial access is required to the Research & Development fields on the CTAnalysis file to identify potential enterprises carrying out R&D in Ireland, to produce statistics in accordance with European Commission Regulation (EC) No 995/2012. | Annual | Biennial results. Principal Variables: Detailed information on research and development expenditure; Sources of funds for research and development expenditure; Detailed information on research and development personnel; Recruitment of researchers; Research and development collaboration. |
Directory of Irish Property Addresses, including Eircodes (GeoDir), Stamp Duty on Property Transactions Data (Revenue), Pobal Deprivation Indices Data (TrutzHaa), Building Energy Rating details for non-domestic premises (SEAI) |
The purpose of this data matching is to produce linked data on Commercial Real Estate (CRE) transactions in Ireland. This is an analysis to explore data quality issues, the potential for data linking and whether it is possible to produce statistical outputs for CRE. | Ongoing | This is an exploratory analysis to look at the potential to produce statistical outputs on the volume, value and price of Commercial Real Estate (CRE). | |
Residential Property Price Index |
ESB Networks electricity consumption and customer data (ESBNetwk) |
The project will attempt to match the CUSTOMER dataset from the flow above to the Building Energy Rating (BER) dataset by MPRN, and increase the number of Eircodes available on BER. | Ongoing | The expected output will be an enhanced BER dataset with additional Eircodes assigned to the property addresses. |
CSO Divsion: Secondary Data Sources & Innovation
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Census of Population 2016, Person and Dwelling Data (CensusNameData) |
Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) |
To explore the feasibility of using administrative data lists to evaluate Census coverage |
Annual |
The statistical outputs expected is a report and possibly a dataset of aggregated coverage indicators. |
Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35) |
Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Early Childcare and Education Scheme Data (Children), Pseudomymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Linked PAYE Real Time Data Test Data with extra DEASP Variables (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE) |
To create a Person Activity Register to provide structural analysis of populations and sub-populations, over time. |
On-going |
Populate activity indicator dataset Used in Population estimates (PECADO) as input for admin census |
None |
Directory of Irish Property Addresses, including Eircodes (GeoDir), Central Record System - Client, Payment and Employment Details (Welfare), Local Property Tax Returns (Revenue), Landlord and Tenant Details from the Register of Tenancies (RTB) | The purpose of the project is to develop a dataset with the potential to be used as an occupied residence sampling frame. Such a dataset could be an option as a sampling frame for CSO postal household surveys or could be used as an indicator of occupied properties, to assist Census 2021 enumerators. |
On-going |
The statistical output will be a property dataset containing addresses and names of occupiers. The dataset will be an occupied residence dataset, as indicated by the latest LPT and RTB data instances. The dataset will have the potential to be used as an occupied residence household survey sampling frame and Census 2021-oriented indicator of occupied properties. Whether the output is used for such purposes, and, if so, how it is used, is outside the scope of the current project. |
None |
PPSN and Personal Details Data (Revenue), Household Sampling Frame (Revenue) |
Provide home Addresses to DCU for a selection of individuals sampled for the Structure of Earnings Survey to facilitate post out of survey notices to those individuals at their place of residence. |
One-Off |
Dataset containing CSO_ID (identifier created by ADC for SES survey) and home address |
None |
Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), Pseudonymised HSE coronavirus test referrals and test facilities (HSE), Pseudonymised Hospital Inpatient Discharge Data (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE) |
Pseudonymised COVID-19 person based HSE datasets are linked by CSO staff and permitted researchers to undertake statistical analysis to inform the national response to COVID-19. |
Ongoing |
Statistical outputs that have value in informing the public and national response to COVID-19 |
Vital Statistics, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables |
Pobal Deprivation Indices Data (TrutzHaa) |
The purpose of this project is to investigate possible effects of deprivation on cause of death, and also to produce statistics on causes of death on smaller geographical areas. |
Ongoing |
We will produce tables of aggregated information on causes of death. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld |
Pseudonymised Person Income Register Data |
Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised HSE Drugs Payment Scheme Data (HSE) |
The purpose of this project is to explore the possible relationships between persons income, household composition and engagement with Drug Payment Scheme. |
ONGOING |
The expected output will be the provision of research in the form of aggregated data to support decision making in the Health Sector. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld. |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data |
Pseudonymised Central Bank Central Credit Register on loans data (CBI) |
The purpose is to enhance the statistical potential of an existing project to compile short-term indicators that explore the dynamics within the consumer credit market. These indicators are primarily based on quantifying active contracts, customers, and borrowers in the consumer credit market wrt different population cohorts and type of credit. Matching data sources will enable cohorts to be defined by age, gender, employment, location of residence, household structure, and income. |
Ongoing |
The expected outputs are enhanced statistical indicators to provide information on the Consumer Credit Market. Statistical disclosure control will be applied to ensure the principle of statistical confidentiality is upheld. |
Business Demography, Structural Business Statistics |
Corporation Tax Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) |
The project involves the exploration of the statistical potential of xbrl files. The initial target use cases to explore include - cost of business information (structured data items) - business risk perception by business (text fields) - description of business activity - gender composition of directors The project involves parsing of structured xml files to extract numeric and text data. It is expected text analytics will play a significant role in this project. |
ANNUAL |
The initial target use cases to explore include - cost of business information (structured data items) - business risk perception by business (text fields) - description of business activity - gender composition of directors |
CSO Divsion: Social Analysis
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Prison Releases Report
|
Central Record System - Client Details (Welfare) |
Individuals released from custodial sentences between 2011 and 2018 will be matched to the CRS client details to establish a pseudo anonymized PPSN. The PPSN linking identifier will then be used to link the population of interest to analysis of earnings, social welfare, employment and housing indicators that is conducted using administrative data sources by existing CSO divisions. |
Annual
|
Earnings estimates. Medium earnings prior and post custodial sanctions Tables will be classified by year of custody, age at time of release, gender, offence type, re-offending indicator |
Irish Health Survey |
Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Primary Care Reimbursement Service Data (HSE) |
The CSOPPSN will be appended to the Irish Health Survey 2019 data in order to match to the PCRS datasets. This will allow us to cross reference medical card and drug payment scheme data against the health data supplied in the Irish Health Survey. |
Annual |
Statistical outputs will include cross tabulations of optical, dental, doctor and pharmacy claims against the Irish Health Survey data which includes data on health conditions, disability, activity limitations, access to medical specialists and frequency of visits. |
Prison Reoffending Statistics, Probation Reoffending Statistics |
Central Record System - Client Details (Welfare) |
The project will match data provided by the Justice agencies of individuals who have been released from custodial sanctions or received a probation order between 2011 and 2021 to the Client Record System. Once linked with a CSOPPSN assigned the personal identification characteristics (PPSN, name, detailed address) of the individuals to be removed from the data so that statistical analysis relating to employment participation can be carried out using CSO's analysis tier data. |
Annual |
The data matching project will allow CSO to carry out monthly employment estimates of individuals who have links to custodial sanctions |
›
CSO Divsion: Social Data Collection
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Survey on Income and Living Conditions Data
|
Revenue P35 file, Revenue form 11 file
|
Verification of income data
|
Ongoing
|
Anonymised micro data and aggregated output tables |
Survey on Income and Living Conditions Data |
Pseudonymised Corporate Customer System Data (DAFM), Pseudonymised Single Farm Payment Data (DAFM) |
The purpose of this work is to link respondents to the SILC survey with their Basic Farm Payments. The Basic Farm Payments are used in the calculation of farm income. Basic Farm payments are also known as "Single Farm Payments" |
Annual |
We expect to obtain the Basic Farm Payment component of income for farm households in the SILC (the main aim of which is to collect all household income). |
Survey on Income and Living Conditions Data | Pseudonymised Housing Assistance Payment - Analysis Tier (HAP) | To obtain rent details for SILC respondents who are renting their homes through the HAP system. The cost to the householder of renting their home and the financial value of the benefit to the householder of being on the HAP scheme will also be obtained. These feed into data on housing costs and housing benefits in the overall SILC results. | Annual | The outputs will be: (1) the amount of rent paid by HAP tenants who responded to the SILC and (2) the value of the benefit to these HAP tenants of being on the HAP scheme. This data will not be used alone, it will be included in the SILC results as a whole. |
Survey on Income and Living Conditions Data | Pseudonymised Grant Application and Payment Data (SUSI) | The purpose of the matching exercise is to link SILC respondents to any income they may have obtained through Education grants from SUSI. The SILC collects information on all household income of which education grants may be a component. | Annual | The output will be the component of household income that is obtained from SUSI education grants. |
Survey on Income and Living Conditions Data | Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB) | The matching exercise will be done for 2 different reasons. To obtain details of rent paid by SILC respondents who are renting their homes. To obtain details of rent received by SILC respondents who are landlords. | Annual | The outputs will be: (1) The amount of rent paid by SILC respondents who are renting their dwellings. The amount of rent received by SILC landlords who are letting dwellings. |
Survey on Income and Living Conditions Data | Pseudonymised Income Tax Form 11, Person Details Data (Revenue) | The IT Form 11 data will be used to provide details of income from self employment for SILC respondents | Annual | Income from self employment for respondents to the SILC survey |
Survey on Income and Living Conditions Data | Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | The purpose of the matching exercise is to obtain income and deductions made at source i.e. tax, USC, PRSI, pension contributions etc. for SILC respondents who are in the PAYE system. | Annual | Gross income and deductions from income made at source (e.g. tax, PRSI, USC, pension contributions, etc.) for SILC respondents who are paid through the PAYE system. |
Survey on Income and Living Conditions Data | Pseudonymised Local Property Tax Returns (Revenue) | The matching exercise will be done to obtain figures for Local Property Tax paid by SILC respondents. | Annual | The outputs will be the amount of LPT due on respondents' properties. |
Survey on Income and Living Conditions Data | Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | To obtain details of social welfare income for respondents to the SILC survey | Annual | Income from social welfare sources for respondents to the SILC survey. |
Survey on Income and Living Conditions Data | Central Record System - Client Details (Welfare) | The CRS_Client_Source file is used to verify PPSNs that are collected in the SILC survey | Annual | No actual data outputs from the matching procedure. We will simply obtain confirmation of which PPSNs are correctly assigned to SILC respondents. |
Labour Force Survey Data, Pseudonymised Quarterly National Household Survey/2017Q3 Labour Force Survey | Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | Under the Integrated European Social Statistics Regulation, NACE categories are now required to be coded at a 3-digit level. The purpose of this project is to improve the coding of NACE categories in the LFS from 2-digit to 3-digit. | Quarterly | An output SAS dataset containing primary and NACE 3-digit variables. |
Survey on Income and Living Conditions Data | Pseudonymised Higher Education Student and Course Details (HEA) | To match the SILC Educational Attainment Levels data of SILC respondents to their respective educational attainment levels on the HEA Higher Education Student and Course Details Administrative dataset. SILC Publication provides a comparative analysis of the equivalized income by highest level of educational attainment of the head of household. The HEA administrative data would greatly enhance the quality of this analysis. |
Annual
|
An output SAS dataset will be produced which will then be used by RAP to produce the SILC Publication which is disseminated in tabular format, diagrams, written comment. SILC provides a comparative analysis of the equivalized income by highest level of educational attainment of the head of household. The administrative data would greatly enhance the quality of this analysis. |
Survey on Income and Living Conditions |
Pseudonymised Central Bank Central Credit Register on loans data (CBI) | Data from the CCR is matched to respondents of the Survey on Income and Living Conditions (SILC) in order to accurately estimate debt at the household level in Ireland. | ONGOING | The data will fill core variables of the SILC namely data on mortgages. |
Household Budget Survey |
Central Record System - Client, Payment and Employment Details (Welfare) | PPSNs are collected in the HBS to allow respondents to be linked up with their data from administrative sources. CRS_src will be used to verify the PPSNs to ensure that the PPSNs collected are valid PPSNs and that the correct PPSN is entered for each respondent. | ANNUAL | No actual data outputs from the matching procedure. We will simply obtain confirmation of which PPSNs are correctly assigned to HBS respondents. |
›
CSO Divsion: Social Data Design
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Census of Population Data | Directory of Irish Property Addresses, including Eircodes (GeoDir) | The intended sampling frame for the PIAAC 2022 main study is the Census sampling frame. Due to the Census delay, the Census 2016 frame will be used for the PIAAC main study sample. The purpose of this project is to enhance the 2016 Census sampling frame by adding new households from the Geo-Directory. | One-off | We expect to obtain an enhanced 2016 Census sampling frame, that includes new households not originally included in the 2016 Census of Population. |
Census of Population Data | Pseudonymised Person Income Register Data | The purpose of this project is to add income to the Census file by matching Census to the Person Income Register. This project is being conducted in Social Data Collection as part of the new sampling approach being carried out for the Q1/Q2 2022 SILC sample. As recommended by methodology the 2022 SILC sample will be chosen using Stratified Simple Random Sample, using Income as the stratification variable. Thus to do this, income must first be added to SDCs census frame. | One-Off | Social Data Collection expect to obtain and enhance Census Frame, which includes income. Allowing for the 2022 SILC sample to be chosen using a stratified simple random sample approach. |
IPEADS_src - Irish Population Estimates from Administrative Data Sources, Census of Population Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | COVAX Vaccination Data (HSE), Integrated Short Term Payment System Data (Welfare) | The purpose of this project is to improve address quality and coverage on the IPEADs sample frame. The address coverage and quality of the sampling frame is not in itself of sufficient quality. The objective of this project is to improve the Eircode coverage and address quality on the sample frame significantly, by linking with other administrative and census datasets. | One-off | It is expected to obtain an enhanced sampling frame, with significantly higher Eircode coverage on the sample. Currently the sample has Eircode coverage in the region of 50% – 55%. It is hoped that this project will achieve 80%+ based on exploratory analysis and summary statistics already done on the datasets for other outputs. |
IPEADS_src - Irish Population Estimates from Administrative Data Sources, Census of Population Data | Pseudonymised Census of Population, with GeoDirectory and DEASP Variables (CSO) | The purpose of this project is to enhance the IPEADs sample frame for the purpose for household surveys (such as Safety of a Person (SOP) and Adult Education Survey (AES)). Currently the frame lacks household characteristic variables that would typically be available on the census household frame such as Deprivation Index, Urban/Rural, small area. The objective of this project is to enhance the IPEADS frame for post sampling purposes, by linking with the census dataset. | Ongoing | It is expected to obtain an enhanced sampling frame, with household characteristic variables, that can be used to facilitate GIS mapping, non-response adjustment, weighting and calibration, for upcoming household surveys such as SOP and AES. |
Census of Population Data |
Pseudonymised Local Property Tax Returns (Revenue) | The purpose of this project is to add the property valuation bands to the Census Household Sampling frame, by matching to the Local Property Tax (LPT) file. This project is being conducted as part of the sample design procedure for the Q3/Q4 2023 Household Finance and Consumption Survey (HFCS). The valuation bands will be used, along with home ownership, as a proxy measure for wealth. This 'wealth' indicator will then be used for stratification purposes. | ONE-OFF | Social Data Collection expect to obtain and enhance Census Sampling Frame, which includes Local Property Tax valuation bands. This will allow for the sample design of the 2023 Household Finance and Consumption Survey to include a 'wealth' indicator as part of the stratification. |
Measuring Mortality Using Public Data Sources, Census of Population Data |
Central Record System - Client Details (Welfare) | The purpose of this project is to better identify signs of life on the Census Household Sampling Frame. Social Data Collection intend to enhance our Census sampling frame by adding a flag for deceased persons from the CRS source flow and/or Rip.ie data. | Monthly | Social Data Collection expect to obtain an enhanced Census Household Sampling frame, which will include a flag for deceased persons. This process will be run before a sample is distributed to the field, to take into account quarterly CRS updates and the latest Rip.ie data. |
None |
Central Record System - Client Details (Welfare), Child Benefit Data (Welfare), Registered Births Data (GRO) | The purpose of this project is to build a sampling frame for the new infant GUI cohort. The pilot will run in 2023 and the main sample in 2024. As part of this project, ADC is providing multiple drafts of the sampling frame, in order to design the sample for the next GUI infant survey. This involves a matching exercise of the ADC GRO Births, CRS and Child Benefit ADC flows. | ONGOING | A sampling from for the new Infant cohort (both pilot and main samples). |
Growing up in Ireland - '98 Cohort |
GRO_Deaths (GRO), Registered Deaths Data (GRO), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Springboard and ICT Student and Course Details (HEA), Registered Deaths Data (GRO), GRO_Deaths (GRO) | The purpose of the proposed data matching is to enhance the GUI Cohort 98 sampling frame to better identify signs of life for the current and future waves. | ONGOING | Expect to obtain an enhanced GUI Cohort 98 sampling frame, which will include a flag for deceased persons. This process will update cases distributed to the field for the current wave as well as identifying cases in advance for future waves before they are distributed to the field |
›
CSO Division: Statistical Systems Co-Ordination Unit
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Census 2011 - Census Main Persons Dataset, Census 2016 - COP2016_NDI_DATA_V1 |
ESB new connections, LPT - Local Property Tax, HTB - Help-to-buy Scheme, BER - Building Energy Rating file, Geodirectory |
To produce new experimental building completions statistical series using additional data from ESB, Census, Revenue and Geodirectory data sets |
Quarterly |
Aggregate tabular format |
None | Post primary Pupils Database SPP35 linked employer employee file IT form 11 (subset to indicate type of activity/trade) SOLAS PLSS database of further training QQI analysis dataset of awards HEA Student Records System DSP CRS and Jobseekers Longitudinal Database (JLD) |
At the request of SOLAS, the CSO and SOLAS have agreed to collaborate on a project to evaluate outcomes of graduates of SOLAS funded further education courses. This data is held by SOLAS in the Programme Learner Support System (PLSS). A statistical product detailing this Outcomes analysis will be jointly produced. | Annual | Report, tabular/aggregated, publication of findings |
None |
DES Post Primary and Exam Datasets |
The CSO has recently undertaken a statistical collaboration with the HEA to analyse the outcomes for graduates of higher education courses, in particular mature students and graduates of “Springboard” courses. |
Annual/Biannual |
Report (either hard copy or electronic T4 release), tabular/aggregated |
Census 2016 dataset |
Revenue P35 file, Revenue form 11 file, Revenue Local Property Tax file,Revenue PPSN details file DSP Integrated Short Term Payment System; DSP Central Record System |
The project will create two new data products based on linking administrative files to the Census file to demonstrate the advantage of linkable data. |
This project is in the early stages and is ongoing |
Analysis of Vacant Housing. |
Labour Force Survey Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Domestic Wastewater Treatment System Registrations (LGMA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Meath County Council iHouse (LGMA), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Property Registration Authority (PRA) folio, consideration, and other data (PRA), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous) | We plan to create report(s) on social housing and their occupants - including social renters - through the use of public sector administrative data in order to provide evidence and insights for policy makers in the sector, as well as providing statistical information to assist with the Rebuilding Ireland project. | Ongoing | Report/publication(s) on social housing in Ireland. |
Business Register Data, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Income Tax Form 11, Business Details Data (Revenue), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Domestic Wastewater Treatment System Registrations (LGMA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Property Registration Authority (PRA) folio, consideration data (Bfacts), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Networks electricity consumption and customer data (ESBNetwk) | We plan to create report(s) and analysis on the rental sector in Ireland - looking at it's participants (landlords, renters) and rental properties. This will be undertaken through the use of public sector administrative data and will look provide evidence and insights for policy makers in the sector. | Ongoing | Report/publication(s) on the rental sector in Ireland |
None | Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE) | To gain further insight into the Covid-19 pandemic. The anticipated outputs from this Data Matching Request is to create reports and analysis in order to, amongst other things, identify sectors of the economy most affected by the disease. This will be undertaken through the use of public sector administrative data, currently available on the ADC, and will seek to provide insights to decision makers and members of the public. | Ongoing | The anticipated outputs from this Data Matching Request is to create reports with graphs and tables of aggregate data (including tables available on PXstat) as part of the COVID-19 Insight Bulletins: Deaths and Cases series of outputs. |
Business Register Data, Pseudonymised Person Income Register Data (PIR) | Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Jobseekers Longitudinal Dataset (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Primary Pupil Details (DES), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Pobal Programmes Implementation Platform - Childcare Providers (POBAL), Pseudonymised Department of Education teaching and other staff information (DES), Pseudonymised Teaching Council Register of Teachers (DES) | The Educational Longitudinal Database (ELD) is a statistical framework for the compilation and analysis of learner outcomes over many years. The ELD provides the basis for a series of projects that the CSO has established in collaboration with Irish public sector bodies to examine learner outcomes across a range of educational levels and programmes. | Ongoing | Reports with aggregated data in graphs and tables will be produced, as will some tables for Statbank. Reports may be produced in collaboration with other agencies or by agencies working alone (but with oversight from CSO for quality and data protection matters). |
Business Register Data, Vital Statistics Data, Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised HSE Computerised Infectious Disease Reporting System (HSE), C19 Covid Care Tracker Application Data Analysis Tier (HSE) | Project to produce aggregate reports, in line with the purpose of DPIA 1156, on Covid-19 vaccinated population by characteristics such as age, gender, location, socioeconomic profile , economic status, industry. This will allow CSO to make available to the public statistics about the progress of the vaccination programme and could also be used to assist Health services in maximising vaccination uptake. | Ongoing | Statistical Bulletin with tables on Covid 19 Information Hub on cso website. Bespoke reports for stakeholders. |
New Dwelling Completions, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables | Pseudonymised Building Energy Rating details for domestic premises (SEAI), Pseudonymised Networks electricity consumption and customer data (ESBNetwk), Pseudonymised New Residential Electricity Network Connections (ESBNetwk), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Gas Usage Details for Residential and Commercial Customers (GasNetwk), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Water Consumption Details for Residential Properties (IrishWat), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB) |
This project will seek to establish trends and/or levels in housing vacancy using utility data as a proxy for housing occupancy. Assessments of vacancy will be made at different geographical levels as well as other levels of disaggregation. |
Ongoing | Report/publication(s) on housing vacancy in Ireland. This will be in the form of Frontier publication(s). Tables will be provided through PxStat. |
Business Register Data , Earnings Analysis using Administrative Data Sources Data, Pseudonymised Person Income Register Data | Pseudonymised Linked Covid Refund Scheme Data with extra DEASP Variable (Revenue), Pseudonymised Central Record System - Payment and Employment Details (Welfare , Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Linked EWSS Data with extra DEASP Variable (Revenue), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Integrated Short Term Payment System Data (Welfare ), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare) |
This project is expected to provide aggregate statistical data focussing evidence on the potential impact that the expansion of paid parental leave may have on businesses. | One-off | The final output will be disseminated as a report/paper (tabular/aggregated). |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register, Data Disability Outcomes Analysis, Irish Population Estimates from Administrative Data Sources | Pobal Deprivation Indices Data (TrutzHaa), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Income Tax Form 1 Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS Apprentice Data (SOLAS) | This project is expected to provide aggregated statistical data on Regional Health Areas (RHAs). | One-off | Statistical Tables Electronic Publication Internal Report for the CSO |
Irish Population Estimates from Administrative Data Sources, Business Register data | Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) | Statistical Analysis to aid and inform policy at DSP | ONGOING | Tabular output, presenting aggregated statistics by various economic and demographic characteristics including Economic Sector, Size class, MNE versus domestic, Earnings bands, Sex, Age group |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables |
Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Garda Employee Management Systems (GEMS) data (Garda), Pseudonymised PPSN and Personal Details Data (Revenue) | This is a pilot project to develop a process to produce statistical analysis of diversity of the workforce in the public service. | ONE-OFF | We expect to obtain: Statistical Tables Internal/ External Report |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data |
Covid 19 Refund scheme (Revenue), Directory of Irish Property Addresses, including Eircodes (GeoDir), Landlord and Tenant Details from the Register of Tenancies (RTB), Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Primary Pupil Details (DES), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA) |
UPDATE TO MATCHING REQUEST 1372- to allow for matching to build RMF based on project This project will explore the economic and social characteristics of individuals with a disability using the Census and administrative data sources, exploring potential themes related to employment, education/training, housing, health and welfare. |
One-Off | An RMF file to support electronic release exploring the social and economic characteristics of individuals with disabilities. |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data |
Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC), Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Primary Pupil Details (DES) |
Local Area Data Analysis to support government departments in needs assessment and service provision for disadvantaged areas |
ONE-OFF | Dashboard for use by government departments |
Business Register data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data |
Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Department of Education teaching and other staff information (DES), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised Jobseekers Longitudinal Dataset (Welfare), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Pupil Details (DES), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Teaching Council Register of Teachers (DES) |
The aim of this project is to explore the discrepancy between the number of Teachers registered on the Teaching Council Register and those actually employed as teachers and generate statistics that will support Teacher Supply and Demand initiatives. We wish to ascertain “Signs of Life” using pseudonymised PPSN numbers of teachers registered but who are not employed as teachers by the Department of Education. |
ANNUAL | Reports with aggregated data in graphs and tables will be produced. |
Pseudonymised Person Income Register Data, Pseudonymised Census of Population, with GeoDirectory and DEASP Variables |
Consolidated (ITForm11/P35L) Income dataset Migration tier (Revenue), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Property Registration Authority (PRA) folio, consideration data (PRA), Pseudonymised Stamp Duty on Property Transactions Data (Revenue) |
The CSO will undertake a data matching exercise to better understand and quantify as much as possible the reasons for the differences between the number of households who rented from a private landlord published in Census 2022 and the number of registered tenancies at the end of 2021 published by the Residential Tenancies Board (RTB). |
ONGOING | Report/publication(s) on the rental sector in Ireland, statistical tables on Pxstat |
›
CSO Division: Statistical Systems Co-Ordination Unit - Horizontal Reports
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables |
Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Springboard and ICT Student and Course Details (HEA) |
This project will compliment previous reports produced by the Department of Education and Skills related to Early School Leavers |
One-Off |
Report. Tabular/Aggregated. Publication of findings. |
Census of Population 2011 Data, Census of Population 2016 Data, Census 2011 Housing Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Census 2016 Housing Data,Pseudonymised Person Income Register Data |
Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Grant Application and Payment Data (SUSI)
|
This project aims to provide insight into social and economic characteristics of individuals living across a range of six geographical urban/rural defined areas, defined by population density and access to services and amenities. CSO data will be the starting point (and make up the majority of the report) but by matching with non-CSO data, additional insights will be achieved. |
One-Off |
Report. Tabular/Aggregated. Publication of findings. |
Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Pseudonymised Person Income Register Data (PIR), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CensusAnalysis), Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (SPP35), Annual Business Survey of Economic Impact (ABSEI) Data | Corporation Tax Historical Tax Year (April to April) Returns Data (Revenue), Income Tax Form 11 Data (Revenue), Pseudonymised Corporation Tax Data (Revenue), Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised SOLAS Client and Course Details (SOLAS), CRO Accounts Details Data (DandB), Pseudonymised Corporation Tax Historical Tax Year (April to April) Returns Data (Revenue), Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Consolidated Income Tax Forms 11 and 12 and P35L Data (Revenue), Pseudonymised Springboard and ICT Student and Course Details (HEA), CT-CRO Linking File (Revenue), Pseudonymised Grant Application and Payment Data (SUSI) | Analysis on skills by sector: The objective of this project would be to identify the key skills and education of workers by the sector in which they work. The sectors would also be subdivided between companies considered productive and non-productive at an aggregate level. It will help identify where there are potential skill gaps/shortages or where certain skills are over subscribed in non-related sectors. |
Ongoing | Publication/report (tabular/aggregated) |
Census of Industrial Production Data, Annual Services Inquiry Data, Business Register Data, Pseudonymised Person Income Register Data (PIR), Annual Business Survey of Economic Impact (ABSEI) Data | Pseudonymised Flows of Jobs and Persons Data (DEASP), Revenue Sources (REVENUE) |
A Network Analysis of Productivity Spillovers via Labour Mobility: The objective of this research project is to analyse clusters of firms, in terms of their knowledge and skill flows, when workers switch jobs between multinational enterprises and domestic firms (and vice-versa) and assess to what extent positive or negative productivity spillovers may occur, if any. |
Ongoing | Report/paper (tabular/aggregated), including peer-review working paper. |
Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables | Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudomymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) |
The goal of this data matching project is to identify and analyse migration flows using administrative data sources. |
One-Off | Report. Tabular/Aggregated. Publication of findings. |
Censuses of Population 2011 and 2016 Data, Census 2016 Homeless Project, Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables | Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised P35LF, Employer Level Data (Revenue), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Housing Assistance Payment (HAP), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Property Registration Authority (PRA), consideration data (Bfacts), Housing Agency social housing waiting lists (DeptHous) |
This horizontal report will examine the characteristics around housing tenure type and family composition. It will look at: The purpose of the project is to to contribute to the evidence-base for the development of housing policy. |
One-Off | Report/Paper (tabular aggregated data) |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables | Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) | The goal of this data matching project is to obtain population activity counts and identify and analyse migration flows using administrative data sources. Note that this project expands the aims of project ID 1126 (above) by including population counts and including the dataset SPP35 in the data matching proposal. |
One-Off | Report. Tabular/Aggregated. Publication of findings. |
Pseudonymised Flows of Jobs and Persons Data from DEASP and Revenue Sources, Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables, Pseudonymised Person Income Register Data, Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables | Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Stamp Duty (1980-2009) Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Vehicle Registrations Data (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue),Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Vehicle Licencing Data (DTTAS), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (HAP), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Housing Agency social housing waiting lists (DeptHous) | This project will develop and build a social and economic aggregate statistical analysis of offenders (before and after prison). It will help with: o Understanding offenders interactions, at an aggregate level, with the State before and after release e.g. are they registering for welfare support, housing, education o Measure/gauge reintegration into the community after prison This information will be used to help inform policy discussions and development regarding the offender population. |
One-Off | Report/Paper (tabular aggregated data) |
Pseudonymised Person Income Register Data | Pseudonymised Post Primary Pupil Details (DES), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS) | The CSO wishes to collaborate further with the Road Safety Authority (RSA). This project will demonstrate the value of ADC data to the RSA and the mutual benefit of further collaborative projects. | One-Off | Report. Tabular/Aggregated. Publication of findings. |
Pseudonymised Linked P35L Employee Level Data with extra DEASP and CSO Variables (CSO), Pseudonymised Person Income Register Data (CSO), Pseudonymised Census of Population, with GeoDirectory and DEASP Variables (CSO) |
Pseudonymised QQI Course and Award Details Data (QQI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS), Pseudonymised Central Record System - Client Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Integrated Short Term Payment System Data (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Stamp Duty on Property Transactions Data (Revenue), Pseudonymised Income Tax Form 11, Person Details Data (Revenue), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Local Property Tax Returns (Revenue), Pseudonymised Help to Buy Scheme Data (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Directory of Irish Property Addresses, including Eircodes (GeoDir), Pseudonymised Springboard and ICT Student and Course Details (HEA), Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised PPSN and Personal Details Data (Revenue), Pseudonymised Primary Pupil Details (DES), Pseudonymised HSE Drugs Payment Scheme Data (HSE), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised DSP Extract About Ukrainian Refugees (Welfare), Pseudonymised Ukrainian Primary Pupil Details (DES), Pseudonymised Ukrainian Post-Primary Pupil Details (DES), Ukraine Driver Licence Exchanges (DTTAS), Pseudonymised Ukrainian employees under the Temporary Protection Directive Data (Revenue), Pseudonymised PREM Registrations Data (Revenue)
|
Update of and extension to DMP 1405. Includes new datasets on driving licenses and employment (PREM register for NACE and PAYE data). The goal of this data matching project is to obtain statistical information and insight around the circumstances and integration of migrants in Ireland, in particular refugees from Ukraine. |
Ongoing | Outputs will be in the form of a report(s) with graphs and tables of aggregate data (including tables available on pxstat) |
Pseudonymised Census of Population, with GeoDirectory and DEASP Variables |
Grant Application and Payment Data (SUSI) Leaving Certificate and Leaving Certificate Applied Results from SEC (SEC) Pseudonymised Central Record System - Client Details (Welfare) Pseudonymised Central Record System - Payment and Employment Details (Welfare) Pseudonymised Child Benefit Data (Welfare) Pseudonymised Grant Application and Payment Data (SUSI) Pseudonymised Higher Education Student and Course Details (HEA) Pseudonymised Income Tax Form 11, Person Details Data (Revenue) Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue) Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare) Pseudonymised Post Primary Pupil Details (DES) Pseudonymised Primary Care Reimbursement Service Data (HSE) Pseudonymised Primary Pupil Details (DES) Pseudonymised QQI Course and Award Details Data (QQI) Pseudonymised SOLAS Apprentice Data (SOLAS) Pseudonymised Springboard and ICT Student and Course Details (HEA) TuslaCiC_ANA (Tusla) |
Statistical analysis of educational attainment of children in care (CiC), who are currently in care or were in care since April 2018 based on Tusla’s National Childcare Information System (NCCIS). |
ONE-OFF | Frontier or Pathfinder release including tables, graphs and an infographic. |
Census of Population, with GeoDirectory and DEASP Variables, Pseudonymised Person Income Register Data |
Pseudonymised all claim and allowance DSP records captured on ISTS and BOMi (Welfare),, Pseudonymised Central Record System - Client Details (Welfare) Pseudonymised Central Record System - Payment and Employment Details (Welfare), Pseudonymised Child Benefit Data (Welfare), Pseudonymised Grant Application and Payment Data (SUSI), Pseudonymised Higher Education Student and Course Details (HEA), Pseudonymised Housing Agency social housing waiting lists (DeptHous), Pseudonymised Housing Assistance Payment - Analysis Tier (LCouncil), Pseudonymised Landlord and Tenant Details from the Register of Tenancies (RTB), Pseudonymised Linked PAYE Real Time Data with extra DEASP Variable (Revenue), Pseudonymised Long and Short Term Social Welfare Payments Data (Welfare), Pseudonymised Maternity Benefit Payments Data from DEASP (Welfare), Pseudonymised National Vehicle and Driver File, Driver Details (DTTAS), Pseudonymised Post Primary Pupil Details (DES), Pseudonymised Primary Care Reimbursement Service Data (HSE), Pseudonymised Primary Pupil Details (DES), Pseudonymised SOLAS PLSS Client and Course Details (SOLAS) |
This project is expected to provide aggregate statistical data exploring the social and economic lives of one parent families in Ireland including income, employment and welfare. |
One-off | The final output will be disseminated as a Frontier or Pathfinder release including tables, graphs and an infographic. |
›
CSO Division: Sustainable Development Goals & Indicator Reports
CSO Dataset Matched |
Non-CSO Dataset Matched |
Reason |
Frequency |
Statistical Outputs Obtained |
Census of Population 2016 Data |
Directory of Irish Property Addresses, including Eircodes (GeoDir), OSi National Mapping Database (PRIME 2) (OSi)
|
To use the coordinates of the Census 2016 geography dataset and the coordinates of a number of destination points to calculate the shortest-path distance of residential dwellings to various services and infrastructure. This is to examine the effect of proximity to certain day-to-day services relative to where people are living. |
One-Off |
It is proposed to produce a publication on proximity containing, inter alia, average distance by county and urban-rural, an investigation of settlements with core services and an analysis of isolated dwellings in rural areas.
|
Address Matching Tool Sets using GeoDirectory (GeoDirAMToolSets) (CSO), Census 2016 Housing Data (CSO), Pseudonymised Census of Population 2016, with GeoDirectory and DEASP Variables (CSO) |
OSi National Mapping Database (PRIME 2) (OSi) |
The objective of this project is to continue the work on the examination of the proximity of the population to everyday services and infrastructure by measuring the shortest-path distance from an origin (the coordinate of a residential dwelling on the Census 2016 dataset) to a destination (the coordinate of a particular facility or infrastructure). |
Ongoing |
CSO has a central role in the production of indicators for the Sustainable Development Goals (SDGs). There are three indicators; 11.2.1 (Proportion of population that has convenient access to public transport, by sex, age and persons with disabilities), 11.7.1 (Average share of the built-up area of cities that is open space for public use for all, by sex, age and persons with disabilities), and 9.1.1 (Proportion of the rural population who live within 2km of an all-season road. |