Notes for data users - Core dataset
ADA-Accessed (Core) data
A subset of the ALSWH survey data can be assessed through the Australian Data Archive, ADA. These datasets are known as the ADA-Accessed (Core) data. These datasets contain most but not all of the survey data that are assessable through the ALSWH EOI procedure.
Differences between the ALSWH data and the ADA-Accessed (Core) data
Some variables have been removed from the ADA-Accessed (Core) data. These variables were considered to be sensitive and therefore requiring ALSWH oversight. Also, some variables that are not needed for analysis were removed. These include the survey questions that are used to derive another variable or scale. Furthermore, a small number of variables were recoded to avoid sensitivities. These variables are outlined below. The ADA-Accessed (Core) data will be referred to as the Core data from here.
Questionnaire items that were components in deriving a scale variable were removed from the Core datasets and only the derived scale items were kept. Sensitive variables dealing with violence and drug use are not in the Core datasets.
The ALSWH Data Dictionary, available on the website, lists all the ASLWH variables with an indicator whether they are in the Core datasets or not.
The Data Dictionary can be downloaded from here:
The ‘age’ variable is age in integer years at time the survey was returned. Other age variables will be dropped, such as, age first starting smoking, came to Australia. Exceptions will be where the age range is narrow and there are many women in each category.
There are not be any exact dates on the Core datasets. The only date will be birth by year and response date by year. All other dates will be removed, such as date first came to Australia. Exceptions will be where the year range is narrow and there are many women in each category.
The geographic variables kept are State and ARIAPGP. The variables ARIAPLUS and MMM were dropped.
For ARIAPGP, the very remote and remote categories were collapsed into a single category.
State / Territories
ACT and NSW will be collapsed together and NT and SA will be collapsed together.
The exercise statistic was removed from Survey 1 because this was a different variable to subsequent surveys. The exercise variables begin from Survey 2.
Short Surveys records are kept within the main datasets, as they are with the main ALSWH datasets.
The variables survey, e.g., ‘m2survey’ has values 1 for full survey and 2 for short survey.
Where a categorical variable had a category less than 10 it was either collapsed into fewer categories or dropped. If the Variable was not commonly used in research it was dropped and if it was commonly used, such as Marital Status in certain Cohorts, then it was kept with collapsed categories.
The individual survey items for time use and labour force were removed from the Core datasets where they were used to derive other variables. The derived Labour Force Status variables are LABF, HRS, HRSWORK, and these are on the Core datasets.
The Child dataset variables will not be kept except for the ‘children’ variable. This is number of children (0, 1, 2, 3 or more). All other ‘Number of … ‘ reproductive variables are removed.
All the following variables and sets of variables are not in the Core datasets:
- Complete Food Frequency datasets
- All text variables
- The qualitative data
- All ATSI
- Domestic and child abuse questions
- Illicit drug questions
- Medications free text data
- Cause of death
- Linked data
- Six month follow up data from the 1921-26 cohort
There will be a participant file. This will have ID, Year of response for each survey. Otherwise for each survey: year of response, did not respond, dead.
There are a few variables in the Core dataset that have been recoded from the main ALSWH variables. These differences are described below.
There is an identifier for each woman in the Core datasets, IDcore. This is different from the IDalias.
State of residence
The ACT and NSW are combined together, as are the Northern Territory and South Australia
ARIA+ grouped, has only 4 categories, with very remote combined with remote.
Age completed survey and year of birth are available. Some ages that were outside the standard range for the cohort and had very small frequencies were recoded to the nearest age within the standard range.
Some cohorts and waves had their own recodes, particularly the 1946-51 cohort.
Cohort 1989 – 1995
- The number of children variable (children) is capped at 3 for all relevant surveys
- ‘Divorced’ and ‘widowed’ are collapsed with ‘separated’ for the marital status variable in all surveys
Cohort 1973 – 1978
- The number of children variable (children) is capped at 3 for all surveys
- Y1Q76, Age left school : ‘Never attended school’ recoded to missing
- y1q83, Speak English: “Speak English not at all” is collapsed with “Not well”
Surveys 2, 3, 4, 5, 6, 7
- There are no recodes beyond those mentioned above
From survey 3 onwards these two variable sets have been recoded
- ‘How many times have you consulted GP/hospital doctor/ specialist in the last 12 months’
The category ‘25+ times’ is collapsed with ‘13-24 times’
- ‘Number of people living with you’
Category ‘3 or more’ combined with ‘2’
- Nothing beyond mentioned above
- m2q71, ‘Number people dependent on income’, is capped at 7
- M3q49a – j , ‘How often drink cola, etc’
Category ‘3 or more times a day’ is collapsed with ‘2 times a day’
- M3q88, ‘How many dependent on household income’
Capped at 7
- Nothing beyond mentioned above
- m6q70, “How many slices of bread eat per day”
Category ‘8+ slices per day’ is collapsed with ‘5-7 slices per day’
- M7q77a – l , ‘Number of drinks’
The category ‘3 or more’ is collapsed with ‘2 times per day’
- M8q57a – l , ‘Number of drinks’
The category ‘3 or more’ is collapsed with ‘2 times per day’
- M8q82 ‘Which best describes your housing situation’
Category ‘Nursing home / residential aged care’ is collapsed with ‘Retirement village / self care unit’, and ‘Hostel / boarding house’ collapsed with ‘Other’
- M8q83 “How many bedrooms”
Capped at 8
- M8q85 “Years lived in current home ”
Capped at 50
- M8q87, ‘Where living in 10 years time’
Category ‘Hostel/ boarding house’ collapsed with ‘Have no idea’
- M8q76 a/b, Retirement of self/partner
Category ‘Never been in paid work’ is collapsed with ‘Other’
Cohort 1921 – 1926
Variable recodes for all surveys
- ‘De facto’ is collapsed with ‘married’ in the marital status variables
- ‘High risk drinker’ is collapsed with ‘Risky drinker’ in the Alcohol Status (NHMRC) variables
These datasets are designed to be used:
- for simple, descriptive analyses
- for longitudinal investigations
- as a first step to using and becoming familiar with the full survey datasets
- to test the feasibility of potential research questions
It includes total scale scores, with a reduced number of single survey items; sensitive variables have been omitted; and in some cases, response categories have been collapsed.
How to apply
- Data Book for CORE data, 1946-51 Survey 1
- Data Book for CORE data, 1946-51 Survey 2
- Data Book for CORE data, 1946-51 Survey 3
- Data Book for CORE data, 1946-51 Survey 4
- Data Book for CORE data, 1946-51 Survey 5
- Data Book for CORE data, 1946-51 Survey 6
- Data Book for CORE data, 1946-51 Survey 7
- Data Book for CORE data, 1946-51 Survey 8
Approved Projects using ALSWH Core Data
|Trajectories of physical activity across the lifespan in Australian women: correlates and health consequences||Yuta Nemoto|
|The Role of Stress in the Relationship Between Premenstrual Tension and Postpartum Depression||Sophia Bracken|
|The economic cost of violence, abuse, neglect and exploitation for people with disability||Dennis McCarthy|
|The harms and benefits of sun exposure: striking the right balance||Namal Nishantha Balasooriya Mudiyanselage|
|The influencing factors of premenstrual tension and its influence on other mental disorders||Lulu Hou|
|Pregnancy Intentions and Subsequent Fertility and Contraceptive Behaviors among Australian Women||Otobo Ujah|
|Partnering patterns associated with polycystic ovary syndrome (PCOS) in Australian women||Yoobin Park|
|Adverse Childhood Experiences and the Risk of Pregnancy Complications and Adverse Pregnancy Outcomes||Tuhin Biswas|
|Gender equality in Australia: An exploration of the impact on sleep and health||Anna Scovelle|
|International Study on the Positive Aspects of Caregiving||Emily Princehorn|
|The intergenerational transmission of violence and poverty: evidence from Australian women||Alice Campbell|
|Gender (in)equality in Australia - an exploration of the impact on key health outcomes across the life-course||Jennifer Ervin|
|Is Workplace stress killing women||Janine Fletcher-Taylor|
|Longitudinal association between educational mobility and social support||Farzaneh Zolala|
|An observational cross-sectional study to identify factors associated with dental service utilization by Australian women of age 65-70||Shilpy Vaid|
|Nature and Extent of Family Violence Amongst South-Asian Communities||Iswa Chaudry|
|Trajectories of physical activity, sitting time and falls from middle age to older age||Wing Kwok|
|Life events and loneliness among older Australians||Jack Lam|
|Economics of Improved Health associated with Interventions for Heart Valve Disease||Marie Ishida|
|Family dynamics over the life course: Foundations, turning points and outcomes. Chapter 12: LGBTIQ+ families||Alice Campbell|
|Consumption of diet soft drinks||Jo Zhou|
|The impact of recessions on women’s health outcomes||Li Ang|
|Evaluating the aged care system in Australia and its preparedness to offer quality care to the changing demographics||Alison Campbell|
|Health and health literacy of refugees in Australia||Prince Peprah|
|Misreporting of alcohol consumption in a cohort of Australian Women: Influences of Pregnancy on Reporting||Zachary Hayward|