(30.3.2026)
New datasets have been published or will soon be published for several ready-made data modules. The new datasets cover the Finnish Immigration Service's residence permits, road traffic accidents, early childhood education and care, and historical data on higher education students. Detailed information about each data module is given below.
New ready-made datasets connected to the Finnish Immigration Service's residence permits
New ready-made datasets have been published alongside the Finnish Immigration Service's MIGR_OLESK ready-made dataset to complement data on applicants for a residence permit starting from 2011.
The new datasets are
- MIGR_OPISK – data on studies of applicants for a residence permit
- MIGR_PERHE – data on families of applicants for a residence permit
- MIGR_TYO – data on employment of applicants for a residence permit
- MIGR_VOPAL – data on reception services of applicants for a residence permit (to be published later)
In the next stage, the data module will be supplemented with historical data from 1987 to 2010. Older data may be of lower quality and less comparable.
Ready-made data module on road traffic accidents (PATJA)
The ready-made data module on road traffic accidents contains data on road traffic accidents in Finland starting from 1989. The datasets are based on personal injury accidents known to the police (PATJA information system). They are supplemented with the help of Traficom and the Finnish Transport Infrastructure Agency as well as Statistics Finland's statistics on causes of death and population data.
The new datasets are
- TON_ONN – road traffic accidents (accident module)
- TON_ONN_OSALL – road traffic accidents and parties involved (individual level)
The TON_ONN module includes data on accidents and aggregate-level data on the parties involved. It is particularly suitable for studies concerning the number or scale of accidents or their connection to external factors.
Publication: March 2026
The TON_ONN_OSALL module contains data on accidents and the parties involved in them at the individual, vehicle and animal levels. It is suitable for studies and reports requiring individual-level information on accidents and the parties involved in them.
Publication: March 2026
Ready-made data module on early childhood education and care (VARDA)
The ready-made data module on early childhood education and care contains data on participating children, customer relationships, actors and establishments in early childhood education and care starting from 2019.
The new datasets are
- FOLK_VAKA – children participating in early childhood education and care in Finland
- VAKA_ASIAKKUUS and VAKA_ASIAKKUUS_SUPPEA – data on customer relationships in early childhood education and care
- VAKA_TOIMIJA_TOIMIPAIKKA and VAKA_TOIMIJA_TOIMIPAIKKA_SUPPEA – actors and establishments organising and producing early childhood education and care
FOLK_VAKA is a monthly panel dataset containing data on children participating in early childhood education and care in Finland. The dataset is updated yearly.
Publication: March 2026
The VAKA_ASIAKKUUS and VAKA_ASIAKKUUS_SUPPEA modules are suitable for studies examining the duration and type of early childhood education and care and changes in them.
- The more extensive VAKA_ASIAKKUUS is updated monthly and may be revised.
- VAKA_ASIAKKUUS_SUPPEA is updated yearly and is particularly suitable for projects where repeatability of data is essential.
Publication: March 2026
VAKA_TOIMIJA_TOIMIPAIKKA and VAKA_TOIMIJA_TOIMIPAIKKA_SUPPEA contain data on actors and establishments organising and producing early childhood education and care.
- The more extensive VAKA_TOIMIJA_TOIMIPAIKKA is updated monthly and may be revised.
- VAKA_TOIMIJA_TOIMIPAIKKA_SUPPEA is updated yearly and is particularly suitable for projects where repeatability of data is essential.
Estimated publication: March 2026
Historical data on higher education students
EDUC_OPISK_HIST contains historical data on higher education students from 1968 to 1972 and 1975 to 1995. The dataset is based on higher education institutions’ old registers of students and supplements the present student data by extending the time series back to the 1960s.
The first part of the dataset contains registered higher education students, their background data and data related to studies for 1968 to 1972. The second part continues the time series with a more concise data content for 1975 to 1995. The historical data and their descriptions are partly incomplete.
Estimated publication: April 2026
The descriptions of the datasets can be found in the TAIKA data catalogue.