The numbers are derived from the General Household Survey, a multi-purpose annual survey conducted by the national statistical agency, Statistics South Africa, to collect information on a range of topics from households in the country’s nine provinces.
The GHS uses a Master Sample frame which has been developed as a general-purpose household survey frame that can be used by all other Stats SA household-based surveys that have design requirements that are reasonably compatible with the GHS. The sample is drawn from Census enumeration areas using a stratified two-stage design with probability proportional to size sampling of PSUs in the first stage, and sampling of dwelling units with systematic sampling in the second stage. The resulting sample consists of just over 20,000 households with around 70,000 individuals, and should be representative of all households in South Africa. It is also designed to be representative at provincial level and within provinces at metro/non-metro levels and three geography types (urban areas, rural areas under traditional authority, and farms).
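The two-stage design described above can be illustrated with a minimal sketch. It is not Stats SA's sampling procedure: the strata, enumeration-area names, dwelling counts and sample sizes below are all hypothetical, and the size measure for each PSU is assumed to be its number of dwelling units. The first stage selects PSUs with probability proportional to size using a systematic pass through the cumulated sizes. The second stage selects dwelling units at a fixed interval from a random start.

```python
import random

def pps_systematic(sizes, n, rng):
    """Stage 1: pick n units with probability proportional to size.

    Lays n equally spaced points over the cumulated sizes; a unit is
    selected once for each point that falls inside its interval.
    (A unit larger than the sampling interval could be hit twice.)
    """
    step = sum(sizes) / n
    start = rng.random() * step
    points = [start + k * step for k in range(n)]
    chosen, cum, i = [], 0.0, 0
    for idx, size in enumerate(sizes):
        cum += size
        while i < n and points[i] < cum:
            chosen.append(idx)
            i += 1
    return chosen

def systematic(units, n, rng):
    """Stage 2: pick n units at a fixed interval from a random start."""
    step = len(units) / n
    start = rng.random() * step
    last = len(units) - 1
    return [units[min(int(start + k * step), last)] for k in range(n)]

def two_stage_sample(strata, psus_per_stratum, dwellings_per_psu, seed=0):
    """Draw PSUs by PPS within each stratum, then dwellings within each PSU."""
    rng = random.Random(seed)
    sample = []
    for stratum, psus in strata.items():
        names = list(psus)
        sizes = [len(psus[name]) for name in names]  # size = dwelling count
        for idx in pps_systematic(sizes, psus_per_stratum, rng):
            psu = names[idx]
            for dwelling in systematic(psus[psu], dwellings_per_psu, rng):
                sample.append((stratum, psu, dwelling))
    return sample

# Hypothetical frame: two strata, each a dict of PSU -> list of dwelling units.
strata = {
    "urban": {f"EA-U{i}": [f"U{i}-D{j}" for j in range(20 + 5 * i)]
              for i in range(6)},
    "farms": {f"EA-F{i}": [f"F{i}-D{j}" for j in range(10 + 2 * i)]
              for i in range(5)},
}
sample = two_stage_sample(strata, psus_per_stratum=2, dwellings_per_psu=5, seed=1)
```

In this toy frame no PSU is larger than the first-stage sampling interval, so each selected PSU appears only once. A real frame would need explicit handling of very large PSUs.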
The sample consists of households and does not cover other collective institutionalised living-quarters such as boarding schools, orphanages, students’ hostels, old-age homes, hospitals, prisons, military barracks and workers’ hostels. These exclusions probably do not have a noticeable impact on the findings in respect of children.
Changes in sample frame and stratification
Since 2014 the GHS has been based on the 2013 master sample that is, in turn, based on information collected during the 2011 Population Census. The previous master sample for the GHS was used for the first time in 2008, and the one before that in 2004. These again differed from the master sample used in the first two years of the GHS: 2002 and 2003. Thus there have been four different sampling frames during the history of the annual GHS, with the changes occurring in 2004, 2008 and 2013. In addition, there have been changes in the method of stratification over the years. These changes could compromise comparability across iterations of the survey to some extent, although it is common practice to use the GHS for longitudinal monitoring and many of the official trend analyses are drawn from this survey.
Weights
Person and household weights are provided by Stats SA and are applied in Children Count analyses to give population estimates on the indicators. The GHS weights are derived from Stats SA’s mid-year population estimates for the relevant year. The population estimates are based on a model that is revised from time to time when it is possible to calibrate the population model to Census data and larger population surveys such as the Community Survey.
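As an illustration of how person weights turn sample records into population estimates, the sketch below sums the weights of the records that meet an indicator and divides by the sum of all weights. The field names (`age`, `piped_water`, `person_weight`), the values, and the under-18 cut-off used to select children are assumptions made for the example and are not taken from the GHS data files.

```python
def weighted_estimate(records, indicator, weight_key="person_weight"):
    """Return (weighted number, weighted share) of records meeting indicator."""
    total = sum(r[weight_key] for r in records)
    count = sum(r[weight_key] for r in records if indicator(r))
    share = count / total if total else float("nan")
    return count, share

# Hypothetical person records; each weight is the number of people in the
# population that the sampled person is taken to represent.
records = [
    {"age": 3, "piped_water": True, "person_weight": 250.0},
    {"age": 9, "piped_water": False, "person_weight": 300.0},
    {"age": 15, "piped_water": True, "person_weight": 200.0},
    {"age": 34, "piped_water": True, "person_weight": 400.0},
]

# Assumption for this example: children are persons under 18.
children = [r for r in records if r["age"] < 18]
number, share = weighted_estimate(children, lambda r: r["piped_water"])
```

Here the three sampled children stand for 750 children in the population, of whom 450 (60%) are estimated to meet the indicator.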
In 2017, Stats SA revised its demographic model to produce a new series of mid-year population estimates and the GHS data were re-released with the revised population weights. All the Children Count indicators were re-analysed retrospectively, using the revised weights provided by Stats SA, based on the 2017 model. The estimates are therefore comparable over all years. The revised weights particularly affected estimates for the years 2002 – 2007.
The 2017 model drew on the 2011 census, along with vital registration, antenatal and other administrative data, but was a “smoothed” model that did not mimic the unusual shape of the age distribution found in the census. The results of the 2011 census were initially distrusted because it seemed to over-count children in the 0 – 4 age group and under-count children in the 4 – 14-year group. It is now thought that the fertility rates recorded in the 2011 population census may have been an accurate reflection of demographic trends, with an unexplained upswing in fertility around 2009, after which fertility rates declined again gradually. Similar patterns were found in the vital registration data as more births were reported retrospectively to the Department of Home Affairs, and in administrative data from schools, compiled by the Department of Basic Education. In effect, this means that there may be more children in South Africa than appear in the analyses presented here, where we have applied weights based on a model that is now known to be inaccurate.
Stats SA has subsequently developed a new population model, the 2022 series, which provides revised mid-year population estimates back to 2002 and projections to 2032. However, the GHS series has not yet been reweighted. The population estimates in Children Count are therefore based on weights derived from an outdated population model (2017). It is not yet clear when and how the population model will be revised again following the 2022 Census, as there are concerns about census under-count and the plausibility of its findings.
Disaggregation
Statistics South Africa suggests caution when attempting to interpret data generated at low levels of disaggregation. The population estimates are benchmarked at the national level in terms of age, sex and population group, while at provincial level benchmarking is by population group only. This could mean that estimates derived from any further disaggregation of the provincial data below the population group may not be robust enough.
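To show what benchmarking means in practice, the sketch below applies simple post-stratification: weights are scaled within each benchmark cell so that the weighted totals match known population totals for those cells. This is only one common way of benchmarking and is not presented as the calibration method Stats SA uses. The cell definitions, field names and benchmark totals are hypothetical.

```python
from collections import defaultdict

def poststratify(records, benchmarks, cell_of, weight_key="weight"):
    """Scale weights so each cell's weighted total equals its benchmark."""
    totals = defaultdict(float)
    for r in records:
        totals[cell_of(r)] += r[weight_key]
    adjusted = []
    for r in records:
        cell = cell_of(r)
        factor = benchmarks[cell] / totals[cell]
        adjusted.append({**r, weight_key: r[weight_key] * factor})
    return adjusted

def weighted_totals(records, key_of, weight_key="weight"):
    """Sum the weights within each group defined by key_of."""
    out = defaultdict(float)
    for r in records:
        out[key_of(r)] += r[weight_key]
    return dict(out)

# Hypothetical records and benchmark totals by sex.
records = [
    {"sex": "F", "area": "urban", "weight": 200.0},
    {"sex": "F", "area": "rural", "weight": 300.0},
    {"sex": "M", "area": "urban", "weight": 400.0},
    {"sex": "M", "area": "rural", "weight": 200.0},
]
benchmarks = {"F": 1000.0, "M": 1200.0}

adjusted = poststratify(records, benchmarks, cell_of=lambda r: r["sex"])
by_sex = weighted_totals(adjusted, lambda r: r["sex"])
by_area = weighted_totals(adjusted, lambda r: r["area"])
```

After adjustment the totals by sex match the benchmarks exactly. The totals by area are not constrained by any benchmark and depend entirely on the sample, which is why estimates for groups finer than the benchmark cells are less robust.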
Reporting error
Reporting error may arise from the survey methodology: the questionnaire is administered to only one respondent in the household, who is expected to provide information about all other members of the household. Not all respondents will have accurate information about all children in the household. In instances where the respondent did not or could not provide an answer, this was recorded as “unspecified” (no response) or “don’t know” (the respondent stated that they didn’t know the answer).
For more information on the methods of the General Household Survey, see the metadata for the respective survey years, available on Nesstar or DataFirst.