CanDLe Data
The CanDLe project has received ethical approval and enables the Cancer Institute NSW to create two linked datasets (CanDLe 1 and CanDLe 2 Women Screen). These datasets bring together routinely collected health information to support high-quality cancer research in New South Wales (NSW) and the Australian Capital Territory (ACT).
Each dataset is based on a defined cohort of people in NSW and ACT and is created through a privacy-preserving data linkage process.
CanDLe 1
A primary cohort of people diagnosed with, or treated for, cancer in NSW and ACT. It includes information from the NSW Cancer Registry and ACT Cancer Registry, as well as a subset of the NSW Admitted Patient Data Collection (APDC) and ACT Admitted Patient Care data. The hospital data included relate specifically to admissions where a malignant condition is recorded in the diagnosis code.
CanDLe 2 Women Screen
All women in NSW who participated in breast and/or cervical cancer screening.
The population health and administrative data collections that are included are shown below:

Variable lists and data dictionaries
The CanDLe datasets include over 200 listed variables. For more information about these datasets, visit the Centre for Health Record Linkage (CHeReL) data dictionary page and the Cancer Institute NSW data dictionary page, and scroll down to the relevant dataset.
Where is the CanDLe data stored and accessed?
CanDLe datasets are stored, accessed and analysed in a secure environment, which enables appropriate monitoring and control of data access and use. At present, CanDLe data is available in two approved secure environments:
Secure Unified Research Environment (SURE)
SURE is a remote-access computing environment that allows researchers to access and analyse linked health-related data files for approved studies. SURE is provided by the Sax Institute. For more information, please visit the Sax Institute website or see the Introduction to SURE.
UNSW E-Research Institutional Cloud Architecture (ERICA)
ERICA is a secure cloud computing infrastructure for individuals working with sensitive data. The UNSW ERICA is approved for CanDLe. For more information, please visit the UNSW ERICA website.
Costs of the secure environment
Each research group will be required to fund the storage and access costs charged by the secure environment provider. The Cancer Institute NSW will fund the costs of the data linkage.
Managing CanDLe Datasets
The Cancer Institute NSW is responsible for managing the master dataset and allocating access to Lead Researchers via Project Folders specific to each approved sub-study protocol.
Confidentiality
All data linkage will be conducted by the Centre for Health Record Linkage and will adhere to strict guidelines that ensure that the privacy and security of data are maintained.
Only variables approved by data custodians will be included in CanDLe. To minimise potential confidentiality risks, sensitive or personally identifying information will not be included in datasets.
Lead Researchers will be responsible for ensuring that research findings are presented in aggregate form, with sufficiently large cell sizes (suppressing cell sizes <5), to prevent the identification of individuals in peer-reviewed publications, conference presentations and other public outputs. All draft reports, manuscripts and presentations must be reviewed by the CanDLe Co-ordinating Principal Investigator and submitted to cinsw-candleprogram@health.nsw.gov.au prior to submission for publication or public dissemination.
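As an illustration of the small-cell rule, the following is a minimal Python sketch of how an aggregate table might be checked before it leaves the secure environment. It is not an official CanDLe tool: the function name `suppress_small_cells`, the example category labels and the choice to display suppressed cells as "<5" are all assumptions made for this example. It also treats zero counts as suppressed and does not handle secondary suppression (where a hidden cell could be recovered from row or column totals), which a research group may need to consider separately.

```python
from typing import Dict, Union

# Threshold taken from the guidance above: cell sizes below 5 are suppressed.
SUPPRESSION_THRESHOLD = 5


def suppress_small_cells(
    counts: Dict[str, int],
    threshold: int = SUPPRESSION_THRESHOLD,
) -> Dict[str, Union[int, str]]:
    """Return a copy of `counts` with every cell below `threshold` masked.

    Hypothetical helper for illustration only. Masked cells are replaced
    with a marker string such as "<5" so the exact count is not released.
    """
    marker = f"<{threshold}"
    return {
        label: (count if count >= threshold else marker)
        for label, count in counts.items()
    }


# Example aggregate table with made-up category labels and counts.
example_counts = {"Group A": 12, "Group B": 3, "Group C": 5, "Group D": 0}
safe_counts = suppress_small_cells(example_counts)

for label, value in safe_counts.items():
    print(f"{label}: {value}")
```

Running a check like this on every table, rather than inspecting counts by eye, makes it less likely that a small cell slips into a draft manuscript or presentation.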