Investigating the impact of different weighting methods on CPIH

1. Introduction

Consumer price indices estimate changes to the total cost of a “basket” of goods and services by calculating the average price change of items within the basket. As households spend more of their household budget on some goods and services than others, price indices are weighted using the amount that we spend on these items as consumers. This ensures that indices reflect the relative importance of the various items in the basket. For example, we would expect a 10% increase in the price of petrol to have a greater impact on the rate of inflation than a similar increase in the price of tea.

Three different approaches to measuring inflation are presented in the article Measuring changes in prices and costs for consumers and households. Three indices are presented that meet different user needs: the Consumer Prices Index including owner occupiers’ housing costs (CPIH), the Household Costs Indices (HCIs) and the Retail Prices Index (RPI).

The CPIH is a comprehensive measure of price change across the UK economy as a whole, and is the lead measure in our publications of consumer price inflation. The HCIs are a set of measures currently in development that aim to reflect changes in prices and costs as understood and experienced by households. The RPI is a “legacy” measure that is produced to meet ongoing requirements for index-linked long-term gilts and contracts.

While there are a number of similarities in the way that these measures are constructed, there are also notable differences that are necessary in meeting the indices’ required concepts. For example, the measures all have different items within scope of their respective baskets and all use (or are proposed to use in the case of the HCIs) different data sources and different methods of weighting the items within their baskets.

This article investigates the impact that using different data sources and different methods of weighting has on our lead measure of inflation, the CPIH. To investigate this impact it is necessary to keep all other aspects of index construction constant. Therefore this empirical analysis utilises CPIH methodology and references CPIH (as published) for the remainder of this article. This is a research article and we are not currently considering making any of these changes to CPIH, which is designated a National Statistic and uses methodology that is in line with international best practice.

Nôl i'r tabl cynnwys

2. Data sources

Background

There are two primary data sources that are currently used when constructing expenditure weights for consumer price inflation measures. The Consumer Prices Index including owner occupiers’ housing costs (CPIH) largely uses data from estimates of household final consumption expenditure (HHFCE), whereas the Retail Prices Index (RPI) uses data from the Living Costs and Food Survey (LCF). As the expenditure data from both sources is not timely enough for immediate use in price indices, the data are price updated to approximate, as far as possible, current patterns of expenditure. Further details regarding how these weights are calculated are provided in Consumer price inflation, updating weights: 2017.

These two sources and their limitations are discussed in this section along with the impact that using LCF, instead of HHFCE, as the primary data source for CPIH weights would have on the resulting index, while holding all other elements of construction constant.

The Living Costs and Food Survey

The most detailed level of household expenditure data currently available is from the Living Costs and Food Survey (LCF). The LCF is a continuous survey of the expenditure patterns of private households based on an achieved sample of around 6,000 households per year. Declining response rates in social surveys means the achieved sample is decreasing over time, which may have implications on the accuracy of estimates.

Each household’s expenditure within the sample is weighted using a “survey weight”, which reflects how many households within the population each sample household represents. More details on the LCF weighting can be found in the LCF technical report (PDF, 689KB).

As LCF expenditure data are available at the household level, a benefit of this data source is that, in addition to expenditure, it also collects a useful array of information on the demographics of households that can be used to analyse differences in spending patterns between households and household groups. However, it also has a number of limitations. For example, the LCF is believed to under-report expenditure for a number of items (such as alcohol and tobacco) and as the LCF only surveys private households, a small proportion of the population are missed (such as those in student halls and other communal establishments, such as nursing homes).

Household final consumption expenditure

Expenditure at the aggregate level can be obtained from HHFCE data. HHFCE includes data on consumption goods and services and is used by the national accounts to measure the contribution of household spending to economic growth. The LCF is one of the inputs for HHFCE, but HHFCE also uses data from a number of secondary sources, as displayed in Figure 1.

Figure 1: Sources of expenditure used in the CPIH weights (class-level and above)

A number of sources and processes are displayed that are used in the compilation of CPIH expenditure weights.

Source: Office for National Statistics, 2017

Notes:

Figure 1 shows a number of the sources and processes used in the compilation of the CPIH. LCF is the Living Costs and Food Survey, HMRC is Her Majesty’s Revenue and Customs, BEIS is the Department for Business, Energy and Industrial Strategy, OfWat is the water regulator, DCLG is the Department for Communities and Local Government, Int. Passenger Survey is the International Passenger Survey, VOA is the Valuation Office Agency.
The weight for a small number of classes (for example, package holidays) uses a different source of information (see the Consumer Price Indices Technical Manual for more information).

Download this image Figure 1: Sources of expenditure used in the CPIH weights (class-level and above)

.png (14.7 kB)

Alternative sources are used within HHFCE where the LCF is believed to under-report expenditure (including alcohol and tobacco) or where data quality is deemed to be stronger from administrative sources (including energy). Estimates also vary where the concepts captured in the national accounts differ from the pure expenditure estimates collected in the LCF. For example, the national accounts adjust the data to a domestic basis, while LCF only captures expenditure of UK private households (national basis). HHFCE is published quarterly in the Consumer trends release as part of the quarterly national accounts.

While the aggregate expenditure estimates for HHFCE may be considered more accurate than the unadjusted LCF data, the data lacks the low-level detail of the LCF. This means that HHFCE cannot be used to investigate spending patterns of households, or groups of households, without first being reconciled with another source.

Method for constructing weights using different data sources

Where CPIH is presented in this section it is the same as the CPIH as published in our consumer price inflation bulletins. The CPIH uses data primarily from the HHFCE because the expenditure information is comprehensive and balanced against data collected in other sectors of the economy to create the most accurate picture of consumer spending. However, there are a few exceptions where additional source data are used to supplement the HHFCE data and improve the coherence with the intended scope of the index. For example, when calculating the CPIH weights for insurance, an average of the most recent three years data is used in line with international best practice (details of this practice can be found in Consumer price inflation, updating weights: 2017).

As the CPIH expenditure deviates from HHFCE expenditure in a small number of areas, the remainder of this section refers to “CPIH expenditure”, although HHFCE remains the primary source.

The CPIH uses the Classification of Individual Consumption According to Purpose (COICOP) as its underlying aggregation structure. COICOP groups together similar goods and services to enable analysis and comparisons between categories of items and between years. As this is the aggregation structure used for consumer price indices internationally, it also enables comparisons between countries.

To construct expenditure weights from the raw LCF expenditure data that are consistent with the CPIH, the expenditure of each household within the LCF is weighted so that the total expenditure of LCF households is representative of total expenditure of the whole population. Weighted expenditure on each LCF variable is then mapped to the appropriate COICOP class. The total LCF expenditure on each COICOP class is calculated as a proportion of total expenditure on all classes and this proportion is expressed in parts per thousand.

To construct an aggregate price index consistent with the CPIH using this data, the class-level weights are then combined with the published CPIH class-level indices.

Results

Impact of different data sources on expenditure shares

The average weight of 12 divisions between 2005 and 2016 are presented in Table 1 for both CPIH (as published) and CPIH as it would look were it to be calculated based primarily on LCF data. For ease of interpretation these averages have been rounded to zero decimal places (dp). The percentage difference between these averages is also presented (rounded to one dp), a positive value shows that the CPIH using LCF as the primary data source to construct the weights is higher than the CPIH as published.

Table 1: Average CPIH weight compared to the average weight calculated using LCF data as the primary expenditure source, parts per thousand, UK, 2005 to 2016

Division	CPIH (as published)	CPIH (LCF weighted)	Difference (%)
1. Food and non-alcoholic beverages	87	118	35.6
2. Alcoholic beverages and tobacco	34	26	-23.5
3. Clothing and footwear	52	49	-5.8
4. Housing, water, electricity, gas and other fuels	300	241	-19.7
5. Furniture, household equipment and maintenance	50	70	40
6. Health	19	13	-31.6
7. Transport	122	141	15.6
8. Communication	21	29	38.1
9. Recreation and culture	117	126	7.7
10. Education	16	17	6.3
11. Restaurants and hotels	101	88	-12.9
12. Miscellaneous goods and services	79	82	3.8
Source: Office for National Statistics

Download this table Table 1: Average CPIH weight compared to the average weight calculated using LCF data as the primary expenditure source, parts per thousand, UK, 2005 to 2016

.xls (26.6 kB)

There are five divisions where there is a larger average weight in the CPIH (as published) than when the LCF is used as the primary data source for constructing weights. These are: alcoholic beverages and tobacco; clothing and footwear; housing, water, electricity, gas and other fuels; health; and restaurants and hotels¹.

This can be explained by the additional data sources used for the expenditure estimates used to calculate CPIH weights (primarily from HHFCE). For example, expenditure on alcoholic beverages and tobacco is believed to be largely underreported in the LCF; as HHFCE adjusts the data to account for this, the weight for this division within CPIH is larger than it would be if using the LCF data alone. This is one of the divisions, along with clothing and footwear, and health, where the CPIH data is largely obtained from other sources (PDF, 172KB).

Although the underlying expenditure total may be broadly similar for some divisions using the two data sources, the relative weight will not often be the same. For example, the underlying expenditure total for food and non-alcoholic beverages will be broadly similar for the CPIH and LCF-derived weights, as the HHFCE largely utilises data from the LCF to estimate household spending within this COICOP division. However, as expenditure within other divisions is greater in the CPIH, the food and non-alcoholic beverages division receives a smaller relative weight.

Presenting the division-level weights may obscure interesting differences at lower levels of aggregation. For example, within miscellaneous goods and services are three categories of insurance. Insurance premia can be broken into two components; some of the premium is paid into a “claims pool” that is redistributed back to the household sector, the rest of the premium is considered a service charge and is what households pay for the service. As the former component is returned to the household sector, it is not in scope of the CPIH. Therefore only the latter service charge is included in the construction of the weight.

Conversely, as LCF expenditure does not separate the service charge from the payment into the claims pool, the total expenditure on insurance premiums is included. This leads to a much greater weight for insurance classes when calculated using primarily LCF data than is used in the CPIH. On average, insurance weights are 34% lower in the CPIH than if they were to be calculated using primarily LCF expenditure data between the years 2005 and 2016.

Impact of using LCF expenditure data to construct weights on CPIH

Figure 2 shows the impact of using the LCF expenditure data to weight CPIH (holding everything else constant), compared with the CPIH as published. The CPIH that is constructed using data primarily from the LCF is referred to as CPIH (LCF-weighted), while CPIH is referred to CPIH (as published) for the remainder of this article.

Figure 2: Impact of using the LCF expenditure data to construct weights for CPIH, compared to CPIH (as published), cumulative price changes

UK, January 2005 to December 2016

Source: Office for National Statistics

Download this chart Figure 2: Impact of using the LCF expenditure data to construct weights for CPIH, compared to CPIH (as published), cumulative price changes

Image .csv .xls

CPIH (LCF-weighted) has grown at a faster rate to the CPIH (as published) over the period 2005 to 2016. To examine this in further detail, Figure 3 compares the 12-month growth rate for these indices.

Figure 3: Impact of using the LCF expenditure data to construct weights for CPIH, compared to CPIH (as published), 12-month growth rate (%)

UK, January 2006 to December 2016

Source: Office for National Statistics

Download this chart Figure 3: Impact of using the LCF expenditure data to construct weights for CPIH, compared to CPIH (as published), 12-month growth rate (%)

Image .csv .xls

While CPIH (LCF-weighted) follows the same trend as CPIH (as published), the movements appear more extreme. In earlier periods the CPIH (as published) shows slower growth than it would, were the weights constructed using primarily LCF expenditure data. Furthermore, CPIH (LCF-weighted) shows periods of negative growth in 2014 and 2015, while the CPIH (as published) shows slow, yet positive, growth.

To identify why the LCF data shows more extreme trends in a number of years, the contribution of different categories of product to the 12-month growth rate for the indices are examined in Figure 4. This demonstrates the categories of product that are driving the difference in growth rates between these indices.

Figure 4: Contributions to the difference in the 12-month growth rate: CPIH (LCF weighted) 12-month growth rate less CPIH (as published) 12-month growth rate, percentage points

UK, January 2005 to December 2016

Food and non-alcoholic beverages and miscellaneous goods and services contribute to CPIH (LCF weighted) experiencing stronger positive growth. This is offset in a number of periods mainly by transport and housing and housing services.

Source: Office for National Statistics

Notes:

Stacked bar charts reflect the difference in percentage point contributions of each of the 87 class level items (defined using COICOP categorisation) to the 12-month growth rate between CPIH (as published) and CPIH were the weights calculated primarily using LCF data as the underlying expenditure source. The contribution of each of the 87 class level items is estimated separately, before being aggregated to the categories above. Note that a reduction in the contribution to the 12-month growth rate need not imply falling prices; it could also reflect a lower rate of growth than observed in the previous year.
Other is comprised of the following divisions (as defined using COICOP categorisation): furniture, household equipment and maintenance; health; communication; and education. All other divisions are presented in their own right.
Contributions may not sum due to rounding.

Download this image Figure 4: Contributions to the difference in the 12-month growth rate: CPIH (LCF weighted) 12-month growth rate less CPIH (as published) 12-month growth rate, percentage points

.png (356.8 kB) .xls (49.7 kB)

The differences in the contributions between the two indices are naturally driven by divisions where the data sources display the greatest differences. This is further exaggerated when prices are rising or falling rapidly. For example, food and non-alcoholic beverages has a higher weight when constructed using LCF expenditure as the primary data source than in the published CPIH. This means that when prices for this division are rising (for example, between 2006 and 2010) the 12-month growth rate for CPIH (as published) is lower than the 12-month growth rate for CPIH (LCF-weighted). Between 2014 and 2016, prices for food and non-alcoholic beverages have had a negative contribution to the growth rate, which has contributed to CPIH (as published) rising faster than CPIH (LCF-weighted).

One of the main contributors to the difference in 12-month rates is miscellaneous goods and services. As already discussed, there are conceptual differences regarding the measurement of insurance within this division, using these two sources. This leads to insurance having a lower weight in CPIH (as published). Therefore, when insurance is experiencing price growth, the CPIH (LCF-weighted) grows at a faster rate than the published CPIH.

Notes for Data sources:

This was corrected from four to five divisions on 16 November 2017 to include clothing and footwear

Nôl i'r tabl cynnwys