We are currently developing the documentation of statistics. At the moment, the content is not necessarily up-to-date in all respects.

Income distribution statistics: documentation of statistics

The documentation of the statistics describes how the statistics were compiled and what methods were used in the compilation. The data help interpret the figures of the statistics and evaluate their reliability and comparability. The quality report is based on the EU's SIMS model. The documentation also contains change releases describing changes in the statistics and possible specifying methodological descriptions.

If you are looking for statistical figures for these statistics, go to the statistics page: Income distribution statistics

Quality report

Data description (SIMS 3.1)

Income distribution statistics describe distribution of households' annual income, income differentials and low income between population groups. Statistics describe disposable income and its formation taking into account taxation and income transfers. Data are published once a year.

Sector coverage (SIMS 3.3)

The income distribution statistics cover private households in Finland.

The total data on income distribution over population in household-dwelling units in Finland.

Statistical unit (SIMS 3.5)

The statistical units of the income distribution statistics are a private household (a household-dwelling unit and a common housekeeping unit), person and consumption units.

The definition of a household differs between the total data and sample data of the income distribution statistics. In the total data, the household is a household-dwelling unit. A household-dwelling unit is formed of persons living permanently in the same dwelling or at the same address. The household-dwelling unit is used in all register-based statistics of Statistics Finland. In the sample data of the income distribution statistics, the household is defined based on common housekeeping with the help of data collected with interviews. A household is formed of all those persons who live together and have meals together or otherwise use their income together. In the population, the correspondence on the individual level of households in the data has been around 94 to 95 per cent in recent years.

Statistical population (SIMS 3.6)

The target population of the income distribution statistics are private households and their members, i.e. the dwelling population in Finland at the end of the statistical reference year (31 December).

The frame population includes all private households and their members living permanently in Finland at the end of the statistical reference year (31.12., survey year – 1).

The household-dwelling population is formed by all persons living permanently at dwellings. Good two per cent of the entire population are excluded from the statistics. They include persons without a postal address, the institutional population (e.g. long-term residents of old peoples homes, care institutions, prisons or hospitals), persons permanently resident abroad and persons temporarily resident in Finland. Conscripts are regarded as part of the population in these statistics.

Reference area (SIMS 3.7)

Regional classifications corresponding to the EU27 uniform NUTS classification of regional units (NUTS2, or classification of major regions, NUTS3 or classification of regions), wellbeing service county, sub-regional unit and municipality are used in the income distribution statistics.

Time coverage (SIMS 3.8)

The time series data of the total data of the income distribution statistics cover comparable data from year 1995 onwards.

The sample data of the income distribution statistics are available annually from 1986 onwards. The data published for the years 1966, 1971, 1976, and 1981 are based on the Household Budget Survey.

Base period (SIMS 3.9)

The base year for the real values of monetary data in the income distribution statistics is the latest statistical reference year.

Unit of measure (SIMS 4)

The units of measure in the income distribution statistics are euros, %, numbers of households, persons and consumption units.

Reference period (SIMS 5)

The data of the income distribution statistics describe data for the statistical reference year, which is the whole calendar year, and for the end of the statistical reference year (31 December).

Classifications (SIMS 3.2)

The classifications used in the total data on income distribution are: gender, age, level of education, income decile group, income structure, main source of income, dependence on basic social security, region.

The classifications used in the sample data are: gender, age, stage in life, socio-economic group, type of household, income qvintile group, income decile group, at-risk-of-poverty, tenure of dwelling.

Regional classifications corresponding to the EU uniform NUTS classification of regional units (NUTS2, or classification of major regions, NUTS3 or classification of regions), sub-regional unit and municipality are used in the income distribution statistics.

Concepts and definitions (SIMS 3.4)

Consumption unit

Income and consumption expenditure calculated per consumption unit can be used to compare households of different sizes and structures with each other. There are several different ways of calculating consumption units. From 2002, the income distribution statistics and the Household Budget Survey have used the OECD's adjusted consumption unit scale recommended by Eurostat, the Statistical Office of the European Communities, where - the first adult of the household receives the weight 1 - other over 13-year-olds receive the weight 0.5 - children receive the weight 0.3 (0 to 13-year-olds). The selected consumption unit scale has a significant effect on income levels and on placement of different population groups in the income distribution.

Current transfers paid

The household's current transfers paid are mainly formed of direct taxes and social security contributions. In addition, current transfers paid include compulsory pension contributions and unemployment insurance premiums, as well as child maintenance support paid. Taxes paid do not include church tax, voluntary individual insurance premiums (from 2000 regarded as savings in the income distribution statistics) and indirect taxes. Current transfers paid are based on register data, except for withholding taxes paid on interest income. From 2011 onwards, current transfers paid also include part of current transfers between households (e.g. bills paid for other households and money given for studying).

Current transfers received

Current transfers received by households and persons are formed of earnings-related and national pensions and other social security benefits, social allowances and other current transfers received. Other social security benefits are such as rehabilitation allowances, daily and parental allowances, compensations of statutory accident insurance and earnings-related unemployment allowance. Social allowances are such as child benefits, support for care of small children, conscript's allowance, social assistance, general housing allowance, study and research grants, basic unemployment allowance and labour market allowance. Other current transfers received are current transfers received between households.

Disposable income

In the income distribution statistics and in the Household Budget Survey, households' disposable income included all salaries and wages, entrepreneurial income and property income (including imputed rent from owner-occupied dwellings and taxable sales profits from property), benefits in kind and current transfers received, from which sum, current transfers paid were deducted. The formation of disposable income can be described as follows: + Wages and salaries + Entrepreneurial income + Property income (incl. imputed rent from owner-occupied dwellings and sales profits) ----------------------------------------------- = Factor income + Current transfers received (incl. imputed rent from a rental dwelling from another household) --------------------------------------------- = Gross income – Current transfers paid -------------------------------------------- = Disposable income Before the statistical reference year 2011, the income distribution statistics primarily utilised the concept of disposable income. The imputed rent of owner-occupiers was regarded as factor income (property income) and imputed rent for a dwelling rented from another household as current transfers received in the income distribution statistics. Imputed rent is still formed in the income distribution statistics but from the statistical reference year 2011, it is treated as a separate income item (see "Imputed rent"). Similarly, taxable realised capital gains or sales profits are treated as a memorandum item according to international recommendations. When social current transfers in kind are added to income, adjusted disposable income is obtained. This concept is not formed in the income distribution statistics. Wages and salaries include income paid for households as pay - either in money or benefit in kind. Income from incentive stock options is included in the income concept in benefits in kind and thus in wages and salaries. Entrepreneurial income includes income from agriculture and forestry, business activity and business group and copyright fees. Entrepreneurial income in agriculture also contains various subsidies and compensations such as agricultural subsidies, European Union agricultural aid and compensation for harvest losses. Property income is rental, interest and dividend income received by households, imputed net rent from an owner-occupied dwelling, taxable capital gain and pensions based on private insurance and other income. Current transfers received comprise earnings-related pensions and national pensions and other social security benefits, social assistance and other current transfers received. Current transfers paid comprise direct taxes and social security contributions. In addition, current transfers paid comprise compulsory pension and unemployment insurance premiums and in the income distribution statistics also child maintenance support paid. The key income distribution statistics concept, disposable income, is arrived at when current transfers paid are deducted from gross income. The concept of disposable income in the Household Budget Survey is based on register data, and does not, differing from the income distribution statistics, include wages and salaries subject withholding tax and tax-free interest income and current transfers between several households (e.g. child maintenance support).

Disposable money income

Households' disposable money income includes monetary income items and benefits in kind connected to employment relationships. Money income does not include imputed income items, of which the main one is imputed rent. The formation of disposable money income can be described as follows: + wages and salaries + entrepreneurial income + property income (without imputed rent) ----------------------------------------------- = factor income + current transfers received (without imputed rent) --------------------------------------------- = gross money income – current transfers paid -------------------------------------------- = disposable money income When current transfers paid are deducted from gross money income, the remaining income is the household's disposable money income. The primary income concept used in the income distribution statistics is household's disposable money income according to international recommendations, in which case sales profits and taxes paid on them do not belong to the scope of the income concept. Following international recommendations, they are treated as a memorandum item outside the income concept. The concept of disposable money income in the total statistics on income distribution differs from disposable money income in the income distribution statistics. As a conceptual difference, the income concept of the total statistics on income distribution includes taxable realised capital gains. For practical reasons, the total statistics on income distribution do not include the majority of interest income and current transfers received and paid between households (e.g. child maintenance support). Real property tax is not deducted in the total statistics on income distribution either.

Entrepreneurial income

Entrepreneurial income includes income from agriculture and forestry, business activity and business group and copyright fees. Entrepreneurial income in agriculture also contains various subsidies and compensations such as agricultural subsidies, European Union agricultural aid and compensation for harvest losses. Income from agriculture does not include imputed income received from products taken into own use.

Equivalent income

Equivalent income is an income concept by which incomes of households of different types are made comparable by taking account of shared consumption benefits. Equivalent income = the household's income divided by the number of consumption units in the household. From 2002 the income distribution statistics have used the OECD's adjusted consumption unit scale recommended by Eurostat, the Statistical Office of the European Communities, where - the first adult of the household receives the weight 1 - other over 13-year-olds receive the weight 0.5 - children receive the weight 0.3 (0 to 13-year-olds are defined as children) The assumption is that income is evenly distributed inside the household between all household members in relation to the above-mentioned consumption need.

Factor income

In the income distribution statistics, factor income is monetary compensations received by households for participation in the production activity as wages and salaries, entrepreneurial income and property income.

GINI co-efficient

The Gini coefficient is the most common indicator describing income differences. The higher value the Gini coefficient gets, the more unequally is income distributed. The biggest possible value for the Gini coefficient is one. Then the highest earning income recipient receives all the income. The smallest Gini coefficient value is 0, when the income of all income recipients is equal. In the income distribution statistics, Gini coefficients are presented as percentages (multiplied by one hundred). The Gini coefficient describes relative income differences. The Gini coefficient does not change if the incomes of all income earners change by the same percentage.

Gross income

The household's gross income is obtained when current transfers received by the household are added to the household's factor income (wages and salaries, entrepreneurial and property income), but paid current transfers (e.g. taxes and social security contributions) are not deducted.

Household

A household is formed of all those persons who live together and have meals together or otherwise use their income together. The concept of household is only used in interview surveys. Excluded from the household population are those living permanently abroad and the institutional population (such as long-term residents of old-age homes, care institutions, prisons or hospitals). The corresponding register-based information is household-dwelling unit. A household-dwelling unit is formed of persons living permanently in the same dwelling or address. More than one household may belong to the same household-dwelling unit. The concept of household-dwelling unit is used in register-based statistics in place of the household concept.

Housing expenditure

Housing expenditure includes operating expenditure, interests on and amortisations of housing loans, capital charges, and real estate tax for the household's actual dwelling.

Income deciles

The income distribution is described by means of tenths or deciles. Sometimes fifths or quintiles are also used, formed in the corresponding way as deciles. An example of how income deciles are formed: Nowadays the decile groups or income deciles used in the income distribution statistics are formed by dividing first the household's income by the household's consumption units (so-called equivalent income). Each household member will have the same equivalent income. The persons are then arranged in the order of their income and divided into ten groups of equal size. Each income decile then has 10 per cent of the population. The first income decile contains the lowest income tenth and the last one the highest income tenth. The income shares of income deciles show how large share of the total sum of the income in question each decile gets.

Income share of housing costs

Housing costs include operating expenditure, interests on housing loans and real estate tax paid by the household for its actual dwelling. Depending on its tenure status, the dwelling's operating expenditure comprises maintenance charges, rents, water and waste charges, separate energy expenses, costs of maintenance repairs, and other operating and maintenance expenditure of the dwelling. The income share of housing costs (in gross) indicates the share of housing costs in the household's disposable income. In the income share of housing costs in net, housing costs and disposable money income do not include housing benefits received by the household.

Long-term low-income

Long-term low-income earners are those who have belonged to low-income households in two years within the three previous years in addition to the statistical year (see the definition of low income). The definition is based on the recommendations of Eurostat, the Statistical Office of the European Communities.

Low income

Low-income earners (persons at risk of poverty) are considered those whose household's disposable money income per consumption unit (so-called equivalent income) is lower than 60 per cent of the equivalent median money income of all households. The proportion of the population falling below this income limit is called the low income rate (at-risk-of-poverty-rate). The euro-denominated limit for low income varies by year. The definition is based on the recommendations of Eurostat, the Statistical Office of the European Communities. There is no official national definition for low income or poverty line in Finland. From the statistical reference year 2011 onwards, the income distribution statistics started to use the money income concept meeting international recommendations for statistics on low income earning (poverty risk). In reports published before that, a wider income concept was used, that is, households' disposable equivalent income, when income included so-called imputed rent and sales profits.

Money income

Money income is obtained when imputed income items are deducted from household gross income. Imputed items are imputed income obtained from an owner-occupied dwelling in own use. Money income includes benefits in kind connected to employment relationships. Gross money income = the household's factor income (wages and salaries, entrepreneurial and property income) + current transfers received by the household.

Property income

Property income includes rental, interest and dividend income, pensions based on private insurance and other income (from 2000). Interest income subject to the Act on Withholding Tax is included in interest income as gross. Withholding taxes paid on them are included in current transfers paid. In international recommendations, sales profits are not counted as income, so taxable realised capital gains are not included in the income concept in the income distribution statistics. Instead, they are included in income in the total statistics on income distribution. In the statistics published before the statistical reference year 2011, dwelling income and taxable sales profits were included in property income. From the statistical reference year 2011, dwelling income and sales profits were removed from the income concept, because the compilation of statistics is based on the concept of disposable money income fulfilling international recommendations. Data on the previous income concept including dwelling income and sales profits are still formed as reference data and they can be requested from Statistics Finland.

Reference person

In the income distribution statistics and in the statistics of household's assets the person with the highest personal income is chosen as the household's reference person. Personal income is defined according to register data and interview data. Although income is the main criterion determining the reference person, in some cases (e.g. entrepreneur households) the activity of the whole household is taken into account. Households of pensioner parents with children (including those over the age of consent) are also special cases where the parent with the higher income is selected as the reference person if the combined incomes of the parents clearly exceed those of a child.

Reference person

The household member with the highest gross income is selected as the reference person in total statistics on income distribution. Income is determined from register data.

Socio-economic group

In the Household Budget Survey and income distribution survey a socio-economic group is formed for household members on the basis of the person's activity in the last 12 months. For determining the socio-economic group, persons are first divided into economically active and inactive. As a rule, all those who have participated in the production activity for at least six months during the survey year are counted as economically active. Economically active are further divided into self-employed and wage and salary earners on the basis of information reported in the interview. Self-employed are also such persons who have been taxed as employees in taxation (typically entrepreneurs working as employees in their own company). Economically inactive are grouped into students, pensioners, unemployed and others. Unemployed are persons who have been unemployed for at least six months during the year. The socio-economic group of the household is determined by the household's reference person. The classification is based on the Statistics Finland's classification standard of socio-economic groups from 1989. There account is taken of the person's occupation, status in occupation, nature of work and stage in life (Classification of Socio-economic Group 1989. Helsinki. Statistics Finland, Handbooks, 17).

Unemployed

In the income distribution statistics, persons who have been unemployed for at least six months during the year are classified as unemployed. Months of unemployment are asked from persons in the interview. Interview months are checked and where needed, corrected on the basis of register data (the Social Insurance Institution's register data on unemployment allowances and times of receipt, the tax register's unemployment allowances).

Wages and salaries

Wages and salaries include income paid to households as pay – either in money or benefits in kind. In the income concept, income from incentive stock options is included in benefits in kind and thus in wages and salaries. The concept of wages and salaries used in the income distribution statistics includes not only wages and salaries for regular working hours but also overtime compensations and income received from secondary jobs. Realised incentive stock options are also included in wages and salaries in the income concept of the income distribution statistics. Their generating costs are deducted from wages and salaries, but not travel expenses.

Institutional mandate (SIMS 6)

The compilation of statistics is guided by the Statistics Act. The Statistics Act contains provisions on collection of data, processing of data and the obligation to provide data. Besides the Statistics Act, the General Data Protection Regulation, the Data Protection Act and the Act on the Openness of Government Activities are applied to processing of data when producing statistics.

Statistics Finland compiles statistics in line with the EU’s regulations applicable to statistics, which steer the statistical agencies of all EU Member States.

Further information: Statistical legislation

Legal acts and other agreements (SIMS 6.1)

The compilation of statistics is guided by the Statistics Act. The Statistics Act contains provisions on collection of data, processing of data and the obligation to provide data. Besides the Statistics Act, the Data Protection Act and the Act on the Openness of Government Activities are applied to processing of data when producing statistics.

Statistics Finland compiles statistics in line with the EU’s regulations applicable to statistics, which steer the statistical agencies of all EU Member States.

Further information: Statistical legislation

The data content of the sample data of the income distribution statistics is based on framework Regulation 1177/2003 and 1700/2019 of the European Parliament and of the Council concerning Community statistics on income and living conditions (EU-SILC).

Data sharing (SIMS 6.2)

Besides Statistics Finland, regional data from the total data of the income distribution statistics are also published as tabulated data in the statistics and indicator databank SOTKAnet maintained by the National Institute for Health and Welfare (THL).

The income data of the income distribution statistics are used for Statistics Finland's statistics on living conditions. The sample data of the income distribution statistics and the statistics on living conditions are based on the same sample data. The data are used for the international ESS EU-SILC statistics (EU-SILC, Statistics on income and living conditions). Eurostat, the Statistical Office of the European Union, is responsible for compiling statistics on the EU-SILC and for the release of its statistical data for research use. Research use requires an application for licence to use statistical data.

In addition, sample data from the income distribution statistics is supplied to the OECD (OECD IDD) and at set intervals to the Luxembourg Income Studys (LIS) international database. They publish internationally comparable data on their statistical pages.

Cost and burden (SIMS 16)

In Statistics Finland's income distribution statistics, a considerable cost burden is caused by data collected from households with interviews. These data are not available with other methods or there are no administrative data sources available for forming them. The response burden is related to the interview data collection.

Statistics Finland's Data Collection Department is responsible for the interviews. The interviews are computer-assisted and conducted with the help of Blaise questionnaire software mainly as telephone interviews. The interview language is either Finnish, Swedish or English depending on the interviewee’s choice (since the statistical reference year 2014). In 2023, the average duration of an interview was roughly 36 minutes.

Source data (SIMS 18.1)

Source data

The total data of the income distribution statistics are statistical data covering the entire household-dwelling population, which are compiled on the individual level from several administrative files and registers. Thus, the statistics contain detailed data on the income of all household-dwelling units and persons belonging to them.

The following administrative and statistical registers have been used in the compilation of the total data:

The Population Information System of the Digital and Population Data Services Agency and Statistics Finland's population and dwelling data resource the Tax Administration's tax database
The Social Insurance Institution of Finland's pension and benefit database (health insurance compensation and rehabilitation register, registers of child maintenance allowances, financial aid for students and housing allowances)
Data on preventive and supplementary income support collected by the National Institute for Health and Welfare (THL) from municipalities
The register of pension contingency of the Finnish Centre for Pensions
Statistics Finland’s Register of Completed Education and Degrees
The State Treasury's database on the military injuries indemnity system
The Financial Supervisory Authority's data (earnings-related unemployment allowances)
Statistics Finland's Business Register
The Employment Fund’s (formerly the Education Fund) data
Incomes register

The sample data of income distribution statistics is based on a representative sample survey. The basic sample data of the income distribution statistics are compiled by combining the data collected from households by interviews and the register data of total data for the acceptably interviewed sample. A majority of classification data on households and the income data that are not available from registers have been collected by interviews in the Survey on income and living conditions.

Frame

The target population of the total data is Finland's dwelling population at the end of the statistical reference year (31 December). The household-dwelling population is formed by all persons living permanently in dwellings. Good two per cent of the entire population are excluded from the statistics. They include persons registered as permanently resident at institutions (e.g. long-term residents of old people's homes, care institutions, prisons or hospitals), homeless persons, persons residing abroad and persons registered as unknown.

The total data are compiled by combining administrative and register data sources to persons on the basis of personal identity codes. The income of a household-dwelling unit is formed by adding up the income of persons belonging to the same household-dwelling unit.

Sample frame

The target population and reference period (the last day of the statistical reference year) of the sample data are the same as in the total data. The sampling frame consists of total data based on the Population Information System of the Digital and Population Data Services Agency and Statistics Finland's population and dwelling data resource. The reference period of the population of the sample data is 31 December. The sampling frame is formed before the end of the statistical reference year, as a result of which the sampling frame contains slight errors (13.3.1 Coverage error). The sample is checked from the total data updated before data collection and the error caused by over-coverage is corrected before the sample is drawn.

The sampling frame is used for different purposes of sampling, in the sample data of the income distribution statistics for forming household-dwelling units and sampling categories.

The population information system of the Digital and Population Data Services Agency is generally exhaustive and up-to-date as concerns persons. Data on population changes are updated in real time. Statistics Finland uses the data weekly for its personal and dwelling databases, which are used for statistical purposes, e.g. monthly for the publication of preliminary statistics on the population by municipality and sex.

Sample data of the income distribution statistics

The sample survey of the income distribution statistics follows a rotating panel design of four years. Each panel comprises four survey rounds.

The sampling design is stratified sampling. The random sample of persons (5,500) including their household-dwelling units is drawn with non-proportional quota from the strata formed in the overall frame. The frame covers the target population almost without errors (see sampling frame). Until the statistical reference year 2020, the draw was two-phased and the person sample (5,500 or 5,000) was drawn from a so-called master sample. The master sample, which consisted of 50,000 persons (exceptionally 100,000 in 2016, 2019 and 2020), was drawn in the first phase of sampling by systematic sampling from the overall frame.

The strata used are 12 socio-economic groups. Socio-economic groups are formed based on taxable income usually according to the household's (household-dwelling unit) highest earning income type and income level (for example, entrepreneurs are an exception).

In 2019, a draw of an additional sample of 500 persons was included in the first survey round of the sample. Since 2020, the sample size of the first survey round has been 5,500 persons and their household-dwelling units.

As a result of the panel design, an additional sample of 16-year-olds is selected for the second to fourth survey rounds following the sampling of the second phase.

The sample persons (and their household-dwelling units) refer to the population registered as permanently resident in Finland on 31 December. The sample unit is a person aged 16 or over.

Frequency of data collection (SIMS 18.2)

The basic data for the income distribution statistics are collected annually.

Data collection (SIMS 18.3)

The main data collection method for the data collected with the interviews of the statistics on living conditions is a computer-assisted telephone interview (CATI) administered by an interviewer and web interview (CAWI). Only a small part of the interviews (around one to two per cent) are collected with a computer-assisted personal interview (CAPI).

Data validation (SIMS 18.4)

The correctness of the data formed for the total data of the income distribution statistics is ensured by checking the correctness and congruence of the data used from different source data for the derived classifications and variables. Checks are also performed in sample data once the total data have been combined with the sample.

As regards population data, the quality of the total data is examined, for example, in the quality description of Statistics Finland's statistics on dwellings and housing conditions. The coverage of income data in the total data is good relative to the used income concept (disposable monetary income). The data do not include income items that are entirely excluded from registers or that are not considered to be income. The coverage and quality of income data are studied by comparing total data with other statistical sources, such as the statistics of the Tax Administration, the Social Insurance Institution, the Finnish Centre for Pensions and the National Institute for Health and Welfare, and data on the household sector in Statistics Finland's national accounts. Comparisons are conducted regularly every year and more detailed information on them can be requested from Statistics Finland.

The main source of error in the sample data is unit non-response, which is corrected with weighting based on the sampling design. Besides non-response and random variation, the quality of the results is also affected by coverage errors (the frame population differs from the target population) and measurement errors (the measured value of the result variable differs from its actual value). Only a small proportion of income items are collected with interviews (e.g. interest income subject to withholding tax). The electronic data collection form contains plausibility and logicality checks of the data. The data are processed after the data collection with necessary checks and editing at unit level, mainly with automatic procedures, following the standard rules. Item non-response is imputed with the hot deck method. Some of these error sources can cause systematic errors. Systematic errors are estimated by comparing the estimates with the data concerning the entire population available from the total data and other registers and with corresponding data from other statistics. Comparisons are conducted annually and information on them can be requested from Statistics Finland.

Data compilation (SIMS 18.5)

In the sample data of the income distribution statistics, households and persons receive a weighting coefficient with which their data are raised to represent the data of the basic population. First, design weights are formed for households relying on the sample selection probability of the sample person. A non-response correction is performed for the approved sample by inverse inclusion probability. The weights corrected for non-response are calibrated with the CALMAR macro to correspond to the key known distributions of the population from the total data. The procedure aims at reducing the bias caused by the selectivity of non-response and produce as exact estimates as possible for the main income variables.

In the calibration of the weights for the 2023 material, the following data were used:

area (division of regions, where Helsinki and the rest of the Greater Helsinki region separately; statistical grouping of municipalities) size of household-dwelling unit age and gender groups of members level of education of persons aged 16 or over
Total sums of the main income items: wages and salaries, entrepreneurial and property income, unemployment allowances (basic unemployment allowance and labour market allowance, earnings-related share), pensions, interest on housing and student loans, number of income recipients
(earnings-related unemployment allowance, wage and salary income, pension income)
number of persons belonging to low-income household-dwelling units in the household-dwelling population in the total data on income distribution (register-based income concept)

Of the calibration data, the number of persons belonging to low-income household-dwelling units was applied in the statistical reference year 2015 and the level of education in the statistical reference year 2016 to correct the increased bias caused by higher non-response. The effect on the educational distribution of persons aged 16 or over was significant: the number of persons with only comprehensive school or no education data grew and that of persons with university degrees decreased. By contrast, changes in median income and annual changes in population groups were small. The income relations between population groups did not change. The calibration change did not affect the comparability of key indicators.

The sum of the weighting coefficients of the sample households that responded acceptably is an estimate of the total number of households in the population at the end of the statistical reference year. Starting from the statistical reference year 2021, register data sources based on total data are used in the estimation of the total number of households. These are estimated to describe the number and structure of households more accurately than before. Prior to that, a so-called master sample drawn from the population at the end of the statistical reference year was primarily used in the estimation. Due to the new method, the total number of households at the end of the year differs slightly more from the number of household-dwelling units.

Overall accuracy (SIMS 13.1)

Only administrative register data are used as data sources for the total data of the income distribution statistics, so the quality of the statistics depends on the quality of the source data and the error related to the processing of the data. The quality of data sources is good in statistics compilation based on a register system.

The sample data of the income distribution statistics is based on a representative sample survey. Most of the data derive from administrative data sources. Some of the data are collected by interviewing households. The sources of error are sampling error and other error sources are coverage, measurement, non-response and processing errors.

The main sources of error in the sample data of the income distribution statistics are related to non-response. Unit non-response is corrected with weighting based on the sampling design (two-phase sampling design). The design weights are first corrected by stratum with the inverse inclusion probabilities of sample persons. After this, the response-corrected weights are scaled to the number of households and the weights are calibrated to correspond with the population’s key known demographic distributions and income sums in the total data. The error caused by item non-response is minor in the sample data and mostly concerns the interest income subject to withholding tax of the few income data collected with the interview. The item non-response is corrected by imputation.

In addition to non-response and random variation, the quality of the results of the income distribution statistics is also affected by coverage errors (the frame population differs from the target population) and measurement errors (the measured value of the result variable differs from its actual value). These error sources are minor in both datasets (total and sample data).

Some of the error sources in the income distribution statistics can cause systematic errors. Systematic errors are estimated by comparing the estimates with the data concerning the entire population available from the total data and other registers and with corresponding data from other statistics. As regards population data, the quality of the total data is examined, for example, in the quality description of Statistics Finland's statistics on dwellings and housing conditions. The coverage of income data in the total data is good relative to the used income concept (disposable monetary income). The data do not include income items that are entirely excluded from registers or that are not considered to be income. The coverage and quality of income data are studied by comparing total data with other statistical sources, such as the statistics of the Tax Administration, the Social Insurance Institution, the Finnish Centre for Pensions and the National Institute for Health and Welfare, and data on the household sector in Statistics Finland's national accounts. Comparisons are conducted regularly every year and more detailed information on them can be requested from Statistics Finland.

In the sample data of the income distribution statistics, the bias and accuracy of estimates are estimated with the help of standard errors of the data.

Non-sampling error (SIMS 13.3)

Besides sampling errors other sources of error in the sample data of the income distribution statistics are coverage, measurement, non-response and processing errors.

The main sources of error in the sample data of the income distribution statistics are related to non-response. Unit non-response is corrected with weighting based on the sampling design (two-phase sampling design). The design weights are first corrected by stratum with the inverse inclusion probabilities of sample persons. After this, the response-corrected weights are scaled to the number of households and the weights are calibrated to correspond with the population’s key known demographic distributions and income sums in the total data. The error caused by item non-response is minor in the sample data and concerns interest income subject to withholding tax of the few income data collected with the interview. The item non-response is corrected by imputation.

The coverage error of the income distribution statistics is minor. Likewise, the processing error of the statistics compiled annually with an established production process is estimated to be relatively small.

Coverage error (SIMS 13.3.1)

The framework for the total data of the income distribution statistics is the total data based on Statistics Finland's population and dwelling data resource of 31 December. The sources of errors in the data have been checked and the quality is good.

The sampling frame for the sample data of the income distribution statistics consists of total data based on the Population Information System of the Digital and Population Data Services Agency and Statistics Finland's population and dwelling data resource. The reference period of the population of the sample data is 31 December. The sampling frame is formed before the end of the statistical reference year, as a result of which the sampling frame contains slight errors. The sample is checked from the updated total data before the data collection and after that in the interviews, when persons not belonging to the target population of the statistics in the reference period (31 December), so-called over-coverage, are removed from it. Excluded from the sample accepted in interviews are persons temporarily absent from the household, e.g. persons residing abroad for more than a year if their household resident in Finland considers that the person was not part of the household in question during the reference period. The number of sample persons left outside the sampling frame which synchronise with the registers at a delay is small as well.

The population for the statistical reference year is revised after the reference period of the statistics approximately three months later in the data of Statistics Finland's statistics on household-dwelling units and the total data of the income distribution statistics. The data are used in the calibration of the sample data of the income distribution statistics, with which it is made to correspond to the population.

Measurement error (SIMS 13.3.2)

The data of the income distribution statistics are compiled in an integrated manner according to the work stages of the established production process. Changes, for example in data sources or production systems, are tested and possible error sources are checked when forming the data. The measurement error is minor in statistics compilation based on a register system.

In the sample data of the income distribution statistics, the measurement error is primarily connected to data collected with interviews, which is affected by error sources concerning responses, both for the target and the interviewer. The error is estimated to be random for a majority of the data. Measurement errors in the data collection are prevented with interviewer training and instructions for data collection, as well as questionnaire designing and testing. Automatic checks (outlier and data logicality checks) are included in the form. The data obtained from the data collection are checked and errors are corrected in the statistics.

Non-response error (SIMS 13.3.3)

The unit non-response of the income distribution statistics is corrected with weighting, which aims to remove non-response error.

Processing error (SIMS 13.3.4)

Data processing errors in the income distribution statistics are minor. The data are processed in the established production process by work phase.

Model assumption error (SIMS 13.3.5)

The sampling design and estimation of the sample data of the income distribution statistics are based on established methods. Design-based estimation is used, for which the data selection is model-assisted.

Quality assurance (SIMS 11.1)

Quality management requires comprehensive guidance of activities. The European Statistics Code of Practice forms the basis for the common quality system of the European Statistical System.

The Code of Practice is based on 16 principles that concern statistical authorities' independence, accountability and the quality of the processes and data to be published.

The principles are in line with the Fundamental Principles of Official Statistics approved by the United Nations Statistics Commission and are supplementary to them. The quality criteria of Official Statistics of Finland are compatible with the European Statistics Code of Practice.

Further information:

Quality assessment (SIMS 11.2)

Quality assessment, see OSF quality criteria and recommendation on quality description.

Data revision - policy (SIMS 17.1)

Revisions – i.e. improvements in the accuracy of statistical data already published – are a normal feature of statistical production and result in improved quality of statistics. The principle is that statistical data are based on the best available data and information concerning the statistical phenomenon. On the other hand, the revisions are communicated as transparently as possible in advance. Advance communication ensures that the users can prepare for the data revisions.

The reason why data in statistical releases become revised is often caused by the data becoming supplemented. Then the new, revised statistical figure is based on a wider information basis and describes the phenomenon more accurately than before.

Revisions of statistical data may also be caused by the calculation method used, such as annual benchmarking or updating of weight structures. Changes of base years and used classifications may also cause revisions to data.

The preliminary data of the income distribution statistics become revised for the statistical reference year if the data sources used for the statistics are updated, or there is a need for revision due to detected errors or deficiencies before the final data are published.

Methodological changes to the statistical reference year and the revisions to time series data they cause are planned in advance. The time series is revised if the effect on key result data of the statistics is statistically significant.

Data revision - practice (SIMS 17.2)

Comparability - geographical (SIMS 15.1)

The total data of the income distribution statistics describe household-dwelling units; income exhaustively according to the following regional classifications: the EU27 uniform NUTS classification of regional units (NUTS2, or classification of major regions, NUTS3 or classification of regions), sub-regional unit and municipality.

The sample data of the income distribution statistics are based on a nationally representative sample survey. The sample data are nationally regionally comparable according to NUTS2 or the classification of major regions used in the statistics and by municipality group, and internationally by country according to the NUTS2 classification taking into account the difference in the income concept. The income of the sample data in the income distribution statistics corresponds, apart from small exceptions, to the data published by Eurostat and the OECD. Such an exception is caused by fringe benefits included in wages and salaries, which are included exhaustively in income in national statistics, but not in EU-SILC (EU Statistics on Income and Living Conditions).

Comparability - over time (SIMS 15.2)

The time series data from the total dataset of the income distribution statistics are available for the years 1995 onwards. The time series formed based on the total dataset of the income distribution statistics is not completely comparable between the years 1995–2009 and 2010–. Since 2010, changes and adjustments due to technical reasons have also been made in the formation of income, which have not been corrected in the time series. In 2023, tax-free insurance compensations for personal injury were added to income data from the income register.

Time series data from the sample dataset are available from 1986 onwards. In the time series data, efforts have been made to take into account the most significant changes in the formation of incomes. The data from the years 1986-1992 and 1993- are not entirely comparable due to the tax reform of 1993. In the time series data, the information for the years 1966, 1971, 1976, and 1981 is based on the consumption survey.

Coherence – cross domain (SIMS 15.3)

The total data of the income distribution statistics are consistent with Statistics Finland's statistics based on total data. The statistical data of the sample data and the statistics on living conditions and households’ assets have been formed in an integrated manner by means of data collected with interviews and total data in the Survey on income and living conditions.

Besides the income distribution statistics, Statistics Finland's households’ assets, households’ consumption and national accounts also contain income concepts.

There are no considerable conceptual differences between the sample data of the income distribution statistics and households’ consumption. Both follow the definition of disposable income that is accordant with international recommendations. The housing costs in the income distribution statistics and the consumption expenditure of housing in the households’ consumption are congruent. The data of the households’ consumption contain all consumption expenditure related to the housing costs of the household’s actual dwellings and free-time residences (incl. imputed consumption). The statistics use the gross rent principle and the Classification of Individual Consumption by Purpose (COICOP-HBS). In addition to the above-mentioned factors, the data of the statistics may differ for reasons related to sampling and production methods.

When comparing the income sums of the income distribution statistics for the whole country with the items of the national accounts’ income and use of income accounts, the differences in defining the sector, in certain definitions, and in the compilation methods of the statistics should be noted. Due to the differences, the figures of the national accounts and income distribution statistics on, for example, annual changes in households’ disposable income may differ considerably from one another.

In current transfers received in the income distribution statistics, social benefits are divided into target/main groups according to the ESSPROS classification (the European System of Integrated Social Protection Statistics). The classification is consistent with the Finnish Institute for Health and Welfare's statistics on social protection expenditure and financing, and it is used in the EU-SILC statistics. In the income distribution statistics, social assistance is included in other social security. In the Finnish Institute for Health and Welfare's statistics on social protection expenditure and financing, social assistance granted for housing expenditure is included in other income security benefits for housing, and social assistance granted for health expenses is included in other income security benefits during periods of illness as of the statistical reference year 2022. Student benefits are not at all in the ESSPROS statistics.

Coherence - sub-annual and annual statistics (SIMS 15.3.1)

The income distribution statistics are annual statistics.

Coherence - national accounts (SIMS 15.3.2)

The income distribution statistics describe the income and current transfers of the household sector and are thus an extension of the household sector’s income and use of income accounts of the national accounts. When comparing the income sums of the income distribution statistics for the whole country with the items of the national accounts’ income and use of income accounts, the differences in defining the sector, in certain definitions, and in the compilation methods of the statistics should be noted. Due to the differences, the figures of the national accounts and income distribution statistics on, for example, annual changes in households’ disposable income may differ considerably from one another.

In the national accounts, the disposable income includes imputed rent for owner-occupied housing, while the main income concept in the income distribution statistics (disposable cash income) does not include imputed rent.

There are significant conceptual differences in property income. The national accounts do not include holding gains, but they do include taxes paid on taxable realized capital gains. The income distribution statistics (total dataset) include realized capital gains (capital gains minus losses) as property income and the taxes paid on them as paid income transfers. The transaction D412R FISIM adjustment of interest from deposits as income, is not included in the income distribution statistics. The transactions D422R withdrawals from quasi-corporations, and D44R property income attributed to insurance policyholders, do not correspond to the concepts used in the income distribution statistics.

There are also significant methodological and conceptual differences in entrepreneurial income.

Coherence - internal (SIMS 15.4)

The content of the statistics is uniform, except for the effects of differences arising from definitional differences in the data on household and income, and the effects of special sources of error included in the sample data.

The income data of the total data and sample data are otherwise the same, but the sample statistics contain income data missing from registers that are collected with interviews (interest income, certain current transfers between households).

Release calendar (SIMS 8.1)

Statistics Finland publishes new statistical data at 8 am on weekdays in its web service. The release times of statistics are given in advance in the release calendar available in the web service. The data become public after they have been updated in the web service.

Further information: Publication principles for statistics at Statistics Finland

Release calendar access (SIMS 8.2)

Future publications of the statistics can be found on the page of the statistics at: Future publications of the statistics

User access (SIMS 8.3)

The data are released to all users at the same time. Statistical data may be processed at Statistics Finland and information on them may be given before release only by persons involved in the production of the statistics concerned or who need the data of the statistics concerned in their own work before the data are published.

Further information: Publication principles for statistics

Unless otherwise specifically stated in connection with the product, data or service concerned, Statistics Finland is the producer and copyright owner of the data.

Further information: The terms of use for statistical data

Frequency of dissemination (SIMS 9)

The data of the income distribution statistics are disseminated yearly. Possible revisions are made to the time series in connection with annual releases.

News release (SIMS 10.1)

The release is published annually on the home page of the statistics.

Publications (SIMS 10.2)

Information on individual publications of the statistics on living conditions, including articles in Statistics Finland's Tieto&Trendit online periodical, can be found on the home page of the statistics.

Online database (SIMS 10.3)

The database tables of the statistics can be found in the StatFin database.

Micro-data access (SIMS 10.4)

A service data set is compiled annually based on the sample data of the income distribution statistics, and it is released as anonymised unit-level micro data (so-called service data) for scientific research use, statistical surveys and microsimulation through Statistics Finland's research services. The use of service data is subject to licence. The application must contain the purpose for which the data will be used, a research plan and the signed pledges of secrecy from the persons participating in the research. The service data are chargeable.

Data collected for statistical purposes must be kept confidential by virtue of Section 24 of the Act on the Openness of Government Activities (621/1999). The response data are only used for statistical purposes. The research data are protected in accordance with the data protection regulations of Statistics Finland and responses given by individual households cannot be distinguished from the statistical tables. According to Section 13 of the Statistics Act (280/2004), Statistics Finland may, on the basis of a separate application for licence to use statistical data, release data for scientific studies and statistical surveys without data enabling direct identification. The Statistics Act prohibits the use of data collected for statistical purposes in an investigation, surveillance, legal proceedings, administrative decision-making or other similar handling of a matter concerning the enterprise. Guidelines 6 February 2020 10 (16).

National data containing sample data of the income distribution statistics and data of the statistics on living conditions are released to Eurostat, the Statistical Office of the European Union, for the international, comparative ESS EU-SILC micro data. Eurostat releases anonymised micro data (EU-SILC Users' Database) for scientific research use based on an application for licence to use statistical data. The data obtained through Eurostat include data from countries conducting the EU-SILC survey. Finland’s data are available through Eurostat at a longer time lag than from Statistics Finland. Further information about the ESS EU SILC micro data is available on Eurostat's web pages.

Other (SIMS 10.5)

Data on the income distribution statistics are available as chargeable special compilations, such as table data, through Statistics Finland's research services. Data collected for statistical purposes must be kept confidential by virtue of Section 24 of the Act on the Openness of Government Activities (621/1999). The response data are only used for statistical purposes. The research data are protected in accordance with the data protection regulations of Statistics Finland and responses given by individual households cannot be distinguished from the statistical tables.

Documentation on methodology (SIMS 10.6)

The data content of the sample data of the income distribution statistics is based on the ESS EU-SILC statistics (EU-SILC, Statistics on Income and Living Conditions, Regulation No 1177/2003 and 1700/2019 of the European Parliament and of the Council).

The income data used in the classifications are based on data formed for the needs of the income distribution statistics. These income data follow the international recommendations of income distribution statistics: OECD (2013) OECD Framework for Statistics on the Distribution of Household Income, Consumption and Wealth, OECD Publishing; UNECE (2011) Canberra Group Handbook on Household Income Statistics, Second Edition 2011.

Confidentiality - policy (SIMS 7.1)

The data protection of data collected for statistical purposes is guaranteed. The compilation of statistics is guided by the Statistics Act. Alongside the Statistics Act, the EU’s General Data Protection Regulation (eur-lex.europa.eu) and the Finnish Data Protection Act (Finlex.fi) are applied to the processing of personal data. Provisions on the confidentiality of data collected for statistical purposes are laid down in the Act on the Openness of Government Activities (Finlex.fi).

The data are processed only by persons who need the data in their work. The use of data is restricted by usage rights. All persons employed by Statistics Finland have signed a pledge of secrecy, where they have obliged to keep secret the data prescribed as confidential by virtue of the Statistics Act or the Act on the Openness of Government Activities.

Further information: Data protection

Confidentiality - data treatment (SIMS 7.2)

The processing of the data is limited by user licences to the producers of the statistics. All persons employed by Statistics Finland have signed a pledge of secrecy, where they have obliged to keep secret the data prescribed as confidential by virtue of the Statistics Act or the Act on the Openness of Government Activities.

The compilation of statistics is steered by the Statistics Act (280/2004). Alongside the Statistics Act, the EU’s General Data Protection Regulation EU 2016/679 and the national Data Protection Act are applied to the processing of personal data. Confidentiality of data collected for statistical purposes is decreed in the Act on the Openness of Government Activities (621/1999).

Sample data of the income distribution statistics are combined with the service set of Statistics Finland's income distribution statistics. The service data do not contain direct identifiers. To ensure data protection, the values of income variables which make identification easier are made less detailed.

Sample data of the income distribution statistics and statistical data on which the statistics on living conditions are based are released to Eurostat, the Statistical Office of the European Union, for the EU-SILC statistics (EU-SILC, Statistics on Income and Living Conditions). The statistical data do not contain direct identifiers. In addition, protection measures common to the countries and, where necessary, nation-specific measures, are applied to the data. Eurostat releases data from the EU-SILC statistics for research use upon application. Researchers handling the data sign a pledge of secrecy.

Statistical protection methods are described, for example, in the Handbook on Statistical Disclosure Control (2010).

Contact information

Service email

toimeentulo.tilastokeskus@stat.fi

Inquiries primarily

Veli-Matti Törmälehto

Senior Researcher

029 551 3680

Income distribution statistics: documentation of statistics

Quality report

Basic data of the statistics

Data description (SIMS 3.1)

Sector coverage (SIMS 3.3)

Statistical unit (SIMS 3.5)

Statistical population (SIMS 3.6)

Reference area (SIMS 3.7)

Time coverage (SIMS 3.8)

Base period (SIMS 3.9)

Unit of measure (SIMS 4)

Reference period (SIMS 5)

Classifications (SIMS 3.2)

Concepts and definitions (SIMS 3.4)

Consumption unit

Current transfers paid

Current transfers received

Disposable income

Disposable money income

Entrepreneurial income

Equivalent income

Factor income

GINI co-efficient

Gross income

Household

Housing expenditure

Income deciles

Income share of housing costs

Long-term low-income

Low income

Money income

Property income

Reference person

Reference person

Socio-economic group

Unemployed

Wages and salaries

Institutional mandate (SIMS 6)

Legal acts and other agreements (SIMS 6.1)

Data sharing (SIMS 6.2)

Cost and burden (SIMS 16)

Statistical process

Source data (SIMS 18.1)

Frequency of data collection (SIMS 18.2)

Data collection (SIMS 18.3)

Data validation (SIMS 18.4)

Data compilation (SIMS 18.5)

Relevance

User needs (SIMS 12.1)

Accuracy and reliability

Overall accuracy (SIMS 13.1)

Non-sampling error (SIMS 13.3)

Coverage error (SIMS 13.3.1)

Measurement error (SIMS 13.3.2)

Non-response error (SIMS 13.3.3)

Processing error (SIMS 13.3.4)

Model assumption error (SIMS 13.3.5)

Quality assurance (SIMS 11.1)

Quality assessment (SIMS 11.2)

Data revision - policy (SIMS 17.1)

Data revision - practice (SIMS 17.2)

Timeliness and punctuality

Timeliness (SIMS 14.1)

Punctuality (SIMS 14.2)

Coherence and comparability

Comparability - geographical (SIMS 15.1)

Comparability - over time (SIMS 15.2)

Coherence – cross domain (SIMS 15.3)

Coherence - sub-annual and annual statistics (SIMS 15.3.1)

Coherence - national accounts (SIMS 15.3.2)

Coherence - internal (SIMS 15.4)

Accessibility and clarity

Release calendar (SIMS 8.1)

Release calendar access (SIMS 8.2)

User access (SIMS 8.3)

Frequency of dissemination (SIMS 9)

News release (SIMS 10.1)

Publications (SIMS 10.2)

Online database (SIMS 10.3)

Micro-data access (SIMS 10.4)