Skip past navigation to main part of page
 
HILDA Homepage
---

Current Data Issues

This page is updated as necessary with any known issues with the HILDA data.

Release 8.0 Data and Documentation Issues and Corrections

Update 27/04/2010

Following a crash, an update to the notes for the 5 personality scale variables in the wave 5 derived variable coding frameworks was lost. The notes should have been updated for:
epnextrv 'DV: SCQ:B19 Personality scale - Extroversion'
epnagree 'DV: SCQ:B19 Personality scale - Agreeableness '
epnconsc 'DV: SCQ:B19 Personality scale - Conscientiousness'
epnemote 'DV: SCQ:B19 Personality scale - Emotional stability'
epnopene 'DV: SCQ:B19 Personality scale - Openness to experience'.

The notes should say:
In wave 5 respondents were questioned on their personality character traits using a 36-item inventory. The approach used was based on the trait descriptive adjectives approach used by Saucier (1994), which in turn was based on the approach employed by Goldberg (1992), both of which assume a 5-factor structure (as is commonly assumed in the literature). Not all 36 items, however, are used in the five derived scales summarizing the 5 personality factors. First, the ex ante scales were tested for item reliability, with any items omitted if item total correlation was less than 0.3. Second, principal components analysis with a five factor solution was undertaken, with items only retained if their highest factor loading was on the expected factor, exceeded 0.4 and exceeded the second highest factor loading by at least 0.1. A slightly different approach to derivation of these scales, but which obtains identical conclusions, is provided in Losoncz (2009).

References:
Goldberg, L.R. (1992), ‘The Development of Markers for the Big-Five Factor Structure’, Psychological Assessment, vol. 4, pp. 26–42.
Losoncz, I. (2009), ‘Personality Traits in HILDA’, Australian Social Policy, pp. 169–198.
Saucier, G. (1994), ‘Mini-markers: A Brief Version of Goldberg’s Unipolar Big-five Markers’, Journal of Personality Assessment, vol. 63, pp. 506–516

Update 07/04/2010

Replacement General Release Wave 3 cover sheet (“Cover sheet c80c.pdf” on the HILDA DVD contained a copy of the HF).

Update 12/03/2010

There is an incorrect xwaveid in the Eperson and Combined files for wave 8. This was a child that fieldwork treated as a new entrant in wave 8, but she had been become part of the sample in wave 2.
The correct xwaveid was supplied in the master file but the correction was not applied to the Eperson file and, in consequence, the Combined file. The incorrect xwaveid will cause a mismatch when matching the master file to the Eperson or Combined files, or when creating longitudinal files.

SPSS code (apply to Eperson and Combined files):
if hhhrpid ='2328203' and xwaveid = '0800238' xwaveid = '0200519'.

SAS code (apply to Eperson and Combined files):
if hhhrpid ='2328203' and xwaveid = '0800238' then xwaveid = '0200519';

Stata code:
use Combined_h80c, clear
replace xwaveid = "0200519" if hhhrpid=="2328203" & xwaveid=="0800238"
save Combined_h80c, replace
use Eperson_h80c, clear replace xwaveid = "0200519" if hhhrpid=="2328203" & xwaveid=="0800238"
save Eperson_h80c, replace

Update 10/03/2010

The derived variable coding frameworks supplied on the DVD inappropriately excluded the derived History: variables.

A zip of replacement General Release frameworks for each of the 8 waves is available for download in Derived Variable Coding Framework 80 replacements.zip (3Mb).

A zip of replacement In-Confidence frameworks for each of the 8 waves is available for download in In-Confidence Derived Variable Coding Framework 80u replacements.zip (3Mb).

 

Release 7.0 Data and Documentation Issues and Corrections

Update 10/09/2009

Two wave 7 sample weights need to be overwritten. The supplied versions have a mean greater than 1 and will inflate the sample size.

The two weights are ghhwtes 'DV: Enumerated person sample weight', and ghhwths 'DV: Household sample weight'.

Corrections:
if ghhwte >= 0 ghhwtes = ghhwte * (17281 / 20427816.90).
if ghhwth >= 0 ghhwths = ghhwth * ( 7063 / 8147207.53 ).

Update 01/09/2009

For longitudinal consistency sex should be swapped for these two Wave 7 respondents:
Wave 7 household id 59111 ghgsex1 from 2 to 1 and ghgsex2 from 1 to 2;
xwaveid 0113924 from 2 to 1 and 0113925 from 1 to 2.

Wave last interviewed:
Due to a programming error, 223 Wave 7 Continuing Persons were set to not asked (-1) on wave last interviewed (ghgwli) and date last interviewed (ghgdli).
Replacement variables (by xwaveid) are provided in this spreadsheet.

Update 15/07/2009

The flag BHWLINK indicating if household wealth was longitudinally imputed (whether a wave 2 household could be linked to a wave 6 household) was not written into the wave 2 household file. A spreadsheet containing BHWLINK and the wave 2 randomised household id can be downloaded here.

Update 17/03/2009

The imputation flags for the household-level wealth variables for own credit card debt and joint credit card debt only contain missing values.

There was an error in the program which created them (the variable name specified for the unimputed variable was missing the final “t”).

You can create your own versions of these flags:
bhwoccdf =(bhwoccdi <> bhwoccdt).
bhwccdtf =(bhwccdti <> bhwccdt).
fhwoccdf =(fhwoccdi <> fhwoccdt).
fhwccdtf =(fhwccdti <> fhwccdt).

This says the flag (the first variable) equals the logical test of comparing the imputed and unimputed variables, giving a result of 1 if they are not equal and zero if they are equal.

Update 16/03/2009

Replacement marked up wave 1 Household Questionnaire, labelled 80, but no changes from release 70.

Update 23/02/2009

There is a data issue with the 21 wave 3 SCQ "Life Events occurs" variables (got married; separated from spouse etc).
The error was introduced post release 6.0, when, for consistency, a program applicable to later waves was brought back into the wave 3 SCQ cleaning program.

The problem only affects wave 3, the 6 other waves are unaffected.

The following variables need to be corrected:
clemar, clesep, clercl, cleprg, clebth, cleins, cleinf, cledsc, cledrl, cledfr, clevio, clepcm, clejls, clejlf, clertr, clefrd, clejob, cleprm, clefni, clefnw, clemvd

The problem can be corrected in SPSS, STATA or SAS using the code provided below.

These files require correction:
Combined c70c (or Combined c70u for in-confidence users); and Rperson c70c (or Rperson c70u for in-confidence users)

The time period variables for the quarter the wave 3 life event occurred are not affected.

Frequencies or tables should show the following numbers of "2 Yes"

  before fix after fix
CLEMAR SCQ:B16a Got married 46 275
CLESEP SCQ:B16b Separated from spouse 30 517
CLERCL SCQ:B16c Got back together with spouse 10 134
CLEPRG SCQ:B16d Pregnancy 25 569
CLEBTH SCQ:B16e Birth/adoption of new child 13 409
CLEINS SCQ:B16f Serious personal injury/illness 89 1049
CLEINF SCQ:B16g Serious injury/illness to family member 155 2074
CLEDSC SCQ:B16h Death of spouse or child 20 106
CLEDRL SCQ:B16i Death of close relative/family member 99 1251
CLEDFR SCQ:B16j Death of a close friend 161 1325
CLEVIO SCQ:B16k Victim of physical violence 18 236
CLEPCM SCQ:B16l Victim of a property crime 46 738
CLEJLS SCQ:B16m Detained in jail 4 26
CLEJLF SCQ:B16n Close family member detained in jail 13 142
CLERTR SCQ:B16o Retired from the workforce 106 306
CLEFRD SCQ:B16p Fired or made redundant 26 363
CLEJOB SCQ:B16q Changed jobs 86 1584
CLEPRM SCQ:B16r Promoted at work 75 721
CLEFNI SCQ:B16s Major improvement in finances 37 396
CLEFNW SCQ:B16t Major worsening in finances 53 385
CLEMVD SCQ:B16u Changed residence 103 2093

 

STATA, SPSS, SAS fixes
======================

STATA

replace clemar=2 if ((clemar==-4 | clemar==1) & (clemarq1==1 | clemarq2==1 | clemarq3==1 | clemarq4==1))
replace clesep=2 if ((clesep==-4 | clesep==1) & (clesepq1==1 | clesepq2==1 | clesepq3==1 | clesepq4==1))
replace clercl=2 if ((clercl==-4 | clercl==1) & (clerclq1==1 | clerclq2==1 | clerclq3==1 | clerclq4==1))
replace cleprg=2 if ((cleprg==-4 | cleprg==1) & (cleprgq1==1 | cleprgq2==1 | cleprgq3==1 | cleprgq4==1))
replace clebth=2 if ((clebth==-4 | clebth==1) & (clebthq1==1 | clebthq2==1 | clebthq3==1 | clebthq4==1))
replace cleins=2 if ((cleins==-4 | cleins==1) & (cleinsq1==1 | cleinsq2==1 | cleinsq3==1 | cleinsq4==1))
replace cleinf=2 if ((cleinf==-4 | cleinf==1) & (cleinfq1==1 | cleinfq2==1 | cleinfq3==1 | cleinfq4==1))
replace cledsc=2 if ((cledsc==-4 | cledsc==1) & (cledscq1==1 | cledscq2==1 | cledscq3==1 | cledscq4==1))
replace cledrl=2 if ((cledrl==-4 | cledrl==1) & (cledrlq1==1 | cledrlq2==1 | cledrlq3==1 | cledrlq4==1))
replace cledfr=2 if ((cledfr==-4 | cledfr==1) & (cledfrq1==1 | cledfrq2==1 | cledfrq3==1 | cledfrq4==1))
replace clevio=2 if ((clevio==-4 | clevio==1) & (clevioq1==1 | clevioq2==1 | clevioq3==1 | clevioq4==1))
replace clepcm=2 if ((clepcm==-4 | clepcm==1) & (clepcmq1==1 | clepcmq2==1 | clepcmq3==1 | clepcmq4==1))
replace clejlf=2 if ((clejlf==-4 | clejlf==1) & (clejlfq1==1 | clejlfq2==1 | clejlfq3==1 | clejlfq4==1))
replace clejls=2 if ((clejls==-4 | clejls==1) & (clejlsq1==1 | clejlsq2==1 | clejlsq3==1 | clejlsq4==1))
replace clertr=2 if ((clertr==-4 | clertr==1) & (clertrq1==1 | clertrq2==1 | clertrq3==1 | clertrq4==1))
replace clefrd=2 if ((clefrd==-4 | clefrd==1) & (clefrdq1==1 | clefrdq2==1 | clefrdq3==1 | clefrdq4==1))
replace clejob=2 if ((clejob==-4 | clejob==1) & (clejobq1==1 | clejobq2==1 | clejobq3==1 | clejobq4==1))
replace cleprm=2 if ((cleprm==-4 | cleprm==1) & (cleprmq1==1 | cleprmq2==1 | cleprmq3==1 | cleprmq4==1))
replace clemvd=2 if ((clemvd==-4 | clemvd==1) & (clemvdq1==1 | clemvdq2==1 | clemvdq3==1 | clemvdq4==1))
replace clefni=2 if ((clefni==-4 | clefni==1) & (clefniq1==1 | clefniq2==1 | clefniq3==1 | clefniq4==1))
replace clefnw=2 if ((clefnw==-4 | clefnw==1) & (clefnwq1==1 | clefnwq2==1 | clefnwq3==1 | clefnwq4==1))

SPSS

if any(clemar,-4,1) and any(1,clemarq1,clemarq2,clemarq3,clemarq4) clemar = 2.
if any(clesep,-4,1) and any(1,clesepq1,clesepq2,clesepq3,clesepq4) clesep = 2.
if any(clercl,-4,1) and any(1,clerclq1,clerclq2,clerclq3,clerclq4) clercl = 2.
if any(cleprg,-4,1) and any(1,cleprgq1,cleprgq2,cleprgq3,cleprgq4) cleprg = 2.
if any(clebth,-4,1) and any(1,clebthq1,clebthq2,clebthq3,clebthq4) clebth = 2.
if any(cleins,-4,1) and any(1,cleinsq1,cleinsq2,cleinsq3,cleinsq4) cleins = 2.
if any(cleinf,-4,1) and any(1,cleinfq1,cleinfq2,cleinfq3,cleinfq4) cleinf = 2.
if any(cledsc,-4,1) and any(1,cledscq1,cledscq2,cledscq3,cledscq4) cledsc = 2.
if any(cledrl,-4,1) and any(1,cledrlq1,cledrlq2,cledrlq3,cledrlq4) cledrl = 2.
if any(cledfr,-4,1) and any(1,cledfrq1,cledfrq2,cledfrq3,cledfrq4) cledfr = 2.
if any(clevio,-4,1) and any(1,clevioq1,clevioq2,clevioq3,clevioq4) clevio = 2.
if any(clepcm,-4,1) and any(1,clepcmq1,clepcmq2,clepcmq3,clepcmq4) clepcm = 2.
if any(clejls,-4,1) and any(1,clejlsq1,clejlsq2,clejlsq3,clejlsq4) clejls = 2.
if any(clejlf,-4,1) and any(1,clejlfq1,clejlfq2,clejlfq3,clejlfq4) clejlf = 2.
if any(clertr,-4,1) and any(1,clertrq1,clertrq2,clertrq3,clertrq4) clertr = 2.
if any(clefrd,-4,1) and any(1,clefrdq1,clefrdq2,clefrdq3,clefrdq4) clefrd = 2.
if any(clejob,-4,1) and any(1,clejobq1,clejobq2,clejobq3,clejobq4) clejob = 2.
if any(cleprm,-4,1) and any(1,cleprmq1,cleprmq2,cleprmq3,cleprmq4) cleprm = 2.
if any(clefni,-4,1) and any(1,clefniq1,clefniq2,clefniq3,clefniq4) clefni = 2.
if any(clefnw,-4,1) and any(1,clefnwq1,clefnwq2,clefnwq3,clefnwq4) clefnw = 2.
if any(clemvd,-4,1) and any(1,clemvdq1,clemvdq2,clemvdq3,clemvdq4) clemvd = 2.

SAS

if ((clemar=-4 or clemar=1) and (clemarq1=1 or clemarq2=1 or clemarq3=1 or clemarq4=1)) then clemar = 2;
if ((clesep=-4 or clesep=1) and (clesepq1=1 or clesepq2=1 or clesepq3=1 or clesepq4=1)) then clesep = 2;
if ((clercl=-4 or clercl=1) and (clerclq1=1 or clerclq2=1 or clerclq3=1 or clerclq4=1)) then clercl = 2;
if ((cleprg=-4 or cleprg=1) and (cleprgq1=1 or cleprgq2=1 or cleprgq3=1 or cleprgq4=1)) then cleprg = 2;
if ((clebth=-4 or clebth=1) and (clebthq1=1 or clebthq2=1 or clebthq3=1 or clebthq4=1)) then clebth = 2;
if ((cleins=-4 or cleins=1) and (cleinsq1=1 or cleinsq2=1 or cleinsq3=1 or cleinsq4=1)) then cleins = 2;
if ((cleinf=-4 or cleinf=1) and (cleinfq1=1 or cleinfq2=1 or cleinfq3=1 or cleinfq4=1)) then cleinf = 2;
if ((cledsc=-4 or cledsc=1) and (cledscq1=1 or cledscq2=1 or cledscq3=1 or cledscq4=1)) then cledsc = 2;
if ((cledrl=-4 or cledrl=1) and (cledrlq1=1 or cledrlq2=1 or cledrlq3=1 or cledrlq4=1)) then cledrl = 2;
if ((cledfr=-4 or cledfr=1) and (cledfrq1=1 or cledfrq2=1 or cledfrq3=1 or cledfrq4=1)) then cledfr = 2;
if ((clevio=-4 or clevio=1) and (clevioq1=1 or clevioq2=1 or clevioq3=1 or clevioq4=1)) then clevio = 2;
if ((clepcm=-4 or clepcm=1) and (clepcmq1=1 or clepcmq2=1 or clepcmq3=1 or clepcmq4=1)) then clepcm = 2;
if ((clejlf=-4 or clejlf=1) and (clejlfq1=1 or clejlfq2=1 or clejlfq3=1 or clejlfq4=1)) then clejls = 2;
if ((clejls=-4 or clejls=1) and (clejlsq1=1 or clejlsq2=1 or clejlsq3=1 or clejlsq4=1)) then clejlf = 2;
if ((clertr=-4 or clertr=1) and (clertrq1=1 or clertrq2=1 or clertrq3=1 or clertrq4=1)) then clertr = 2;
if ((clefrd=-4 or clefrd=1) and (clefrdq1=1 or clefrdq2=1 or clefrdq3=1 or clefrdq4=1)) then clefrd = 2;
if ((clejob=-4 or clejob=1) and (clejobq1=1 or clejobq2=1 or clejobq3=1 or clejobq4=1)) then clejob = 2;
if ((cleprm=-4 or cleprm=1) and (cleprmq1=1 or cleprmq2=1 or cleprmq3=1 or cleprmq4=1)) then cleprm = 2;
if ((clemvd=-4 or clemvd=1) and (clemvdq1=1 or clemvdq2=1 or clemvdq3=1 or clemvdq4=1)) then clefni = 2;
if ((clefni=-4 or clefni=1) and (clefniq1=1 or clefniq2=1 or clefniq3=1 or clefniq4=1)) then clefnw = 2;
if ((clefnw=-4 or clefnw=1) and (clefnwq1=1 or clefnwq2=1 or clefnwq3=1 or clefnwq4=1)) then clemvd = 2;

Update 06/02/2009

We have identified a problem with one case in the enumerated person file and combined file for waves 6 and 7. The cross-wave identifier for this case is incorrect so when you match files across waves or to the master file using xwaveid, this person will not be matched correctly. To fix the problem, the following code should be run on the enumerated file and the combined file for both wave 6 and wave 7:

SAS: if xwaveid='0600941' then xwaveid='0112961';

SPSS: if xwaveid='0600941' xwaveid='0112961'.

Stata: replace xwaveid = "0112961" if xwaveid == "0600941"

Data managers – please make the HILDA users in your organisation aware of this problem.

 

Release 6.0 Data and Documentation Issues and Corrections

Update 04/04/2008

Current data and documentation issues
Replacement Household Form e60c.pdf

top of pagetop of page

HILDA Contact us

Contact the University : Disclaimer & Copyright : Privacy : Accessibility