This release is categorised as a CSO Frontier Series Output. Particular care must be taken when interpreting the statistics in this release as it may use new methods which are under development and/or data sources which may be incomplete, for example, new administrative data sources.
Growing Up in Ireland (GUI) is the national longitudinal study of children and young people in Ireland. The study is a joint project by the Central Statistics Office (CSO) and the Department of Children, Disability and Equality (DCDE). The CSO is responsible for the GUI survey itself: designing and building the survey; collecting, processing, and analysing the data; and facilitating data access to researchers and policy makers. DCDE as the departmental sponsor has responsibility for the wider elements of the GUI study: engaging with policy and scientific stakeholders to understand data requirements; consulting with children/young people; identifying research needs, data priorities and policy objectives, and promoting the use of GUI data for research and policy development. Working closely together ensures seamless integration of the complimenting responsibilities.
Established in 2006, the study originally followed two groups of Irish children: Cohort '98 who joined the study when they were 9-years-old, and Cohort '08 who joined the study when they were 9-months-old. The two cohorts have been surveyed at regular intervals since. Cohort '08 are aged 17/18 in the wave of data collection currently ongoing. Cohort '98 were age 25 at their last wave of data collection (Wave 5) and the results of that exercise are the subject of this report. Only those who took part in Wave 5 of the study were included in the analyses presented in this release. This release analyses data from Wave 1 and Wave 5 of the study.
For further information on the sample frame and design see Background Notes from the Cohort ‘98 Main Results release.
Data collection for Cohort '98 Wave 1 took place between August 2007 and May 2008. Data collection for Cohort '98 Wave 5 took place between April 2023 and April 2024.
Logistic regression analyses were carried out using the survey package in R. Each variable was tested in a logistic regression analysis independently without confounders in the model. These were then combined into one model containing all confounders (see list of variables below) and a stepwise approach was taken whereby the least significant variable in the combined model was removed in the next model and the regression was rerun. The models were then tested using Akaike Information Criterion which compares the relative quality of different statistical models for a given set of data to find the model that best explains the data without being overly complex, balancing goodness of fit with model simplicity.
The marginal effects for each predictor variable in the best fitting model were then calculated, accounting for all other variables in the model. The Wave 5 weights were included in these models (see Background Notes from the Cohort ‘98 main results release for more information).
Below is the full list of variables that were examined against highest education level of the respondent at age 25 prior to the stepwise process being applied. The variables for inclusion as confounders in the model were selected to reduce the risk of overestimation of the effect of the predictor variables.
The variables used in the final model can be found in Table A1 of the Appendix.
Those with no-valid social class were not included in the models with social class.
Below is the full list of variables that were examined against the respondent’s ability to make loan repayments and their ability to make ends meet at age 25 prior to the stepwise process being applied. The variables used in the final model can be found in Table A2 and Table A3 of the Appendix.
A regression model is a tool that uses information to make predictions. In this release, we use logistic regression models. These models use the data provided to predict whether something is more or less likely to happen.
In this release, we report our regression results as average marginal effects. An average marginal effect is a type of probability statistic that tells us how many more or fewer people out of 100 would end up with a certain outcome when one thing changes, on average.
For example, in this release we found that the parent using the public library for the child at age 9 increases the probability of the respondent having a degree at age 25 by 7%, on average in the sample, relative to respondents with parents who did not use the public library for their child. This means that out of 100 similar respondents, about seven more respondents would end up with a degree at the age of 25 if they used the library at age 9 compared with those who did not use the library at age 9, on average.
We say on average as this number will not be exactly the same for every group or every person. Take the example in the previous question and answer, some groups might see more than a 7% increase and some groups might see less than that, but 7% is the typical result overall.
These results do not mean that a certain outcome is guaranteed based on a certain set of factors. If we again refer to the example regarding library use, it does not mean that everyone who used the library at age 9 will get a degree and vice versa.
The results do not mean that the factors examined in this study are the only factors that influence education and financial well-being at age 25. There are likely many other factors that influence these outcomes.
Learn about our data and confidentiality safeguards, and the steps we take to produce statistics that can be trusted by all.