Canadians’ views on the use of routinely collected data in health research: a patient-oriented cross-sectional survey ======================================================================================================================= * Natalie McCormick * Clayon B. Hamilton * Cheryl L. Koehn * Kelly English * Allan Stordy * Linda C. Li ## Abstract **Background:** Little is known about Canadians’ knowledge of and level of support for using administrative and other large, routinely collected data for health research, despite the benefits of this type of research to patients, health care systems and society. We sought to benchmark the views of Canadian adults on this topic. **Methods:** Researchers and patient leaders of 3 joint and skin disease organizations codeveloped a cross-sectional online survey that was conducted between January and August 2017. The patient partners were engaged as full partners. Recruitment was mainly through the organizations’ websites, email and social media. The survey captured respondents’ initial perceptions, then (after background information on the topic was provided) elicited their views on the benefits of health research using routinely collected data, data access/privacy concerns, ongoing perceptions and educational needs. **Results:** Of the 230 people who consented, 183 (79.6%) started the survey, and 151 (65.6%) completed the survey. Of the 151, 117 (77.5%) were women, 84 (55.6%) were British Columbians, 87 (57.6%) were university graduates, and 101 (66.9%) had a chronic disease. At the beginning of the survey, 119 respondents (78.8%) felt positively about the use of routinely collected data for health research. Respondents identified the ability to study long-term treatment effects and rare events (114 [75.5%]) and large numbers of people (110 [72.8%]) as key benefits. Deidentification of personal information was the top privacy measure (135 [89.4%]), and 101 respondents (66.9%) wanted to learn more about data stewards’ granting access to data. On survey completion, more respondents (141 [93.4%]) felt positively about the use of routinely collected data, but only 87 (57.6%) were confident about data security and privacy. **Interpretation:** Respondents generally supported the use of deidentified routinely collected data for health research. Although further investigation is needed with more representative samples, our findings suggest that additional education, especially about access and privacy controls, may enhance public support for research endeavours using these data. **Plain language summary:** Every day, information is collected about how Canadians use publicly funded health care. The information collected is stored in large electronic data sets. Studying these data can teach us about the quality of our health care, help patients and improve government spending on health care. For example, some data sets can show long-term effects of a given type of treatment. This is not commonly known from single research studies involving small numbers of patients. It is not clear how much Canadians know about these data sets and how researchers can use them. This study surveyed adult Canadians on their views on using such data sets to do health research. The study was done in partnership between university researchers and 3 patients representing national arthritis and skin disease groups. Together, they wrote and tested an online survey, found participants, and interpreted and shared what they learned from the study. Most people who finished the survey agreed with using large data sets to do health research, especially after they read background information on their value. The respondents said the best uses were shown when researchers studied long-term effects in large numbers of people. They felt names and personal information should not be shown. They wished to learn more about how researchers are given access to data sets, and, at the end of survey, they still felt unsure about how privacy and security are ensured. In conclusion, providing information to the public on the use of large sets of health data may increase support for this type of health research. Analyses of data collected routinely by provincial health ministries and other public bodies1 can inform health policy-making2 and advance our knowledge of the burden3 and risks4 of diseases, the clinical effectiveness5 and cost-effectiveness6 of treatments, and drivers of health care costs.7 Health researchers in Canada can access a growing array of publicly collected data on the use of health care services, workplace safety, immigration and early childhood development, and link them with other health data including electronic medical records8 and cancer and perinatal registries. For example, by linking 17 sources of data, Ontario researchers assessed regional variation in cardiovascular event rates9 and the added value of high-intensity statin therapy for older patients.10 A crucial advantage of these “real-world” data is their coverage of Canadians of all ages and ethnicities.11 This makes findings more applicable to the entire Canadian population, including groups who tend not to participate in studies. Despite these benefits, an expert panel of researchers, data custodians, ethicists and managers of health research organizations convened by the Canadian Institutes of Health Research concluded that researchers face many barriers in accessing health and social data in a timely manner.12 Interestingly, the panel felt many of these barriers stemmed from institutions’ being “overly cautious” in their interpretations of privacy legislation and public attitudes about this use of health data.12 The panel suggested that these barriers could be reduced through greater public engagement. It may be that current data-access policies and practices are stricter than most people would prefer given the societal benefits of this type of research and deidentified nature of the data. Although a 2016 report by the Office of the Privacy Commissioner of Canada suggested that Canadians have concerns about organizations’ collecting and using their personal information,13 little is known regarding their privacy concerns with the use of publicly collected health data for health research. There is also a dearth of Canadian research on people’s understanding of what these data sets contain, what they are used for, who can access them and how they are protected. Prior Canadian studies focused on a single issue (namely, consent preferences for research use of medical records)14–16 or sampled patients with specific diagnoses.17–19 In the present study, researchers and patient leaders from 3 national joint and skin disease organizations developed and distributed an online survey to benchmark the views of Canadians about the use of large sets of routinely collected data in health research. ## Methods In this patient-initiated study, we collected data via an online survey among Canadians aged 18 years or more, in English and French, between Jan. 17 and Aug. 15, 2017. The survey focused on administrative data collected by provincial/federal ministries and other public bodies, with access controlled by data stewards,1 officials who adjudicate researchers’ requests for deidentified subsets of these data. We chose a Web-based format so responses could be collected from different areas of Canada and analyzed efficiently. The survey was hosted on the University of British Columbia’s FluidSurveys platform ([https://it.ubc.ca/services/teaching-learning-tools/survey-tool-fluidsurveys](https://it.ubc.ca/services/teaching-learning-tools/survey-tool-fluidsurveys)). All collected data remained in Canada, on a secured network server at the researchers’ offices at Arthritis Research Canada. ### Questionnaire design Development of the questionnaire was co-led by C.L.K., N.M. and C.B.H., with contributions from all authors. N.M. subsequently modified some questions and content based on her strong background in the acquisition, validation and use of administrative data in health research.20–25 The draft questionnaire was reviewed and tested by the other patient partners, who provided iterative feedback via email to ensure it could be completed within 15 minutes and its content would be understandable and interesting to lay audiences. Although the survey focused on administrative data, we used the more recognizable (although somewhat less suitable) term “big data” to enhance its accessibility. The questionnaire was formulated in English as a Word document, then translated into French by a professional translator and programmed online. Researchers and patient partners provided additional feedback on the online version, focusing on ease of navigation. The final questionnaire consisted of 6 sections along with a preamble that introduced respondents to the survey’s topic and purpose (Appendix 1, available at [www.cmajopen.ca/content/7/2/E203/suppl/DC1](http://www.cmajopen.ca/content/7/2/E203/suppl/DC1)). The first section asked about respondents’ familiarity with and initial perceptions about the use of big data in health research. In the next 4 sections, respondents were queried about their views on specific topics, including perceived benefits of using these data, types of health research questions that can be answered, and data access and privacy measures. In the final section, respondents were queried further about their perceptions and interest in learning more following completion of the survey. The questionnaire was designed primarily to ascertain respondents’ views, but a small amount of educational information was provided since we expected that most respondents would not be familiar with these data sets. For example, when asking which privacy measures were most important, a description was provided of each. Finally, the same question (“In general, how do you feel about the use of big data for health research?”) was posed at the start and end of the survey to explore whether respondents’ perceptions changed as they completed it. ### Recruitment Respondents were recruited online, mainly through the websites, email newsletters26 and social media channels of the patient partners’ affiliated organizations. These efforts were supplemented by direct emails, social media and word-of-mouth communication between the grant investigators and their colleagues at health research institutes and patient organizations throughout Canada. All of these recruitment strategies were in English and French. Recruitment notices contained a link to the consent page listing the principal investigator, title and purpose of the study, and how and where responses would be stored. Participants were required to review the page, confirm that they were at least 18 years of age and consent to participate before starting the survey. All questions were optional, no personal identifiers were collected, and no incentives were offered. ### Patient engagement The study was conducted at all stages through a partnership with researchers at Arthritis Research Canada; more details are provided in Table 1,26,28,29 which was guided by the Patient Engagement in Research Description Framework,29 as used previously.30 The patient partners, 2 women and 1 man, represent national organizations pertaining to their disease: Arthritis Consumer Experts (C.L.K.), the Arthritis Patient Advisory Board of Arthritis Research Canada (K.E.) and the Canadian Skin Patient Alliance (A.S.). Led by C.L.K., the patient partners are coinvestigators on a multidisciplinary team funded by the Canadian Institutes of Health Research27 aiming to advance knowledge about the prevention, burden and management of serious complications in chronic inflammatory diseases. Their role includes codeveloping projects to engage the public about the conduct of health research. Having successfully administered an online survey about Canadians’ research priorities31 that informed the grant’s development, they sought to increase people’s awareness and understanding about using routinely collected data in health research. This need arose through the frequent interactions they had, as leaders of their respective research and advocacy organizations, with patients, investigators, clinicians and the public at large about the Canadian health research landscape. View this table: [Table 1:](http://www.cmajopen.ca/content/7/2/E203/T1) Table 1: Outline of partnership between patient partners and researchers in the research process The patient partners teamed up with a doctoral candidate with expertise in using administrative data for health services research (N.M.),20–24 a postdoctoral fellow studying patient engagement in research (C.B.H.) and a senior researcher in knowledge translation and implementation science (L.C.L.). During the team’s initial meetings, C.L.K. explained how the patient partners aimed to create a series of communiqués to educate the public. As a first step, they wanted to conduct a survey to benchmark people’s current understanding, support, concerns and willingness to learn more about this type of research. ### Statistical analysis Only responses by respondents who formally submitted the survey were analyzed; missing responses were permitted for individual questions within these submissions. For each question, we calculated the proportion of respondents who selected each item. Since respondents were often asked to select multiple items from a list, the sum of percentages/frequencies could exceed 100%. Analysis was conducted with SAS software, version 9.5 (SAS Institute) and presented to the team at in-person meetings. The patient partners reviewed and affirmed the results, discussed their possible impact for policy-makers, researchers and the public, and developed a dissemination plan. ### Ethics approval The study received ethics approval from the University of British Columbia’s Behavioural Research Ethics Board. ## Results A total of 230 people consented to participate, of whom 151 (65.6%) submitted responses and were included in the analysis. Of the 79 responses not submitted, 47 were from people who consented to participate but never started the survey, and 32 were from people who answered at least the first question but did not complete and submit the entire survey. Most of the surveys submitted (140 [92.7%)] were completed in English. More than three-quarters of the respondents (117 [77.5%]) were women, 71 (47.0%) were aged 50–69 years, and 42 (27.8%) were aged 30–49 years (Table 2). Most lived in British Columbia (84 [55.6%]) or Ontario (39 [25.8%]), over half (87 [57.6%]) had a university degree, and 101 (66.9%) reported having a chronic disease. View this table: [Table 2:](http://www.cmajopen.ca/content/7/2/E203/T2) Table 2: Respondents’ characteristics ### Initial knowledge and perceptions A total of 119 respondents (78.8%) felt positively about the use of routinely collected data for health research, 30 (19.9%) did not know, 1 (0.7%) felt negatively, and 1 (0.7%) declined to answer. Almost all respondents (144 [95.4%]) had heard of the term “electronic health/medical record”; fewer (88 [58.3%]) had heard of the terms “administrative health database” or “administrative data.” ### Perceived uses and benefits The ability to study long-term effects and rare events (114 respondents [75.5%]) and large numbers of people (110 [72.8%]) were perceived to be the most important benefits of using routinely collected data (Table 3). Similarly, potentially harmful treatments (97 [64.2%]) and long-term effects/rare events (96 [63.6%]) were the top benefits they wanted to learn more about. Respondents selected long-term harms and benefits of a particular treatment and complications of a particular disease as the most important issues related to health policy and patient care (79–84 [52.3%–55.6%]). View this table: [Table 3:](http://www.cmajopen.ca/content/7/2/E203/T3) Table 3: Responses regarding reasons to use routinely collected data for health research ### Access and privacy The need to apply for the use of research data (94 [62.2%]) and the need to obtain approval from university research ethics boards (89 [58.9%]) were the top-ranked data access controls (Table 4). Two-thirds of respondents (101 [66.9%]) wanted to learn more about the role of data stewards in granting access. Deidentifying personal information was selected by the most respondents (135 [89.4%]) as an important measure to enhance information security and protect privacy, followed by mandating researchers to complete privacy training and sign confidentiality agreements (87 [57.6%]). View this table: [Table 4:](http://www.cmajopen.ca/content/7/2/E203/T4) Table 4: Responses regarding data access and privacy and security controls ### Next steps The vast majority of respondents thought the provinces should promote the use of routinely collected data for health research (137 [90.7%]) and were very or somewhat willing to have their deidentified data used by Canadian health researchers (140 [92.7%]). When asked how they felt about the use of routinely collected data for health research at the end of the survey, more felt positively (141 [93.4%]) than at the start. Even still, only 87 respondents (57.6%) were confident about the privacy and security measures in place. The top concern was data access by insurance companies (89 [58.9%]). The potential cost of collecting, storing and overseeing the data was not a major concern (15 [9.9%]). Web sites were the preferred mode for learning more about the use of routinely collected data in Canadian health research (133 [88.1%]). ## Interpretation By the end of the survey, more than 90% of our respondents felt positively about having publicly collected data available and used for health research, particularly for studying rare health conditions and the long-term consequences of treatments. Consistent with reports from other countries,32–36 respondents placed high importance on deidentification of data. However, although 93% were at least somewhat willing to have their deidentified information used by health researchers in Canada, far fewer (58%) were confident about the privacy and security procedures in place. A strength of this study was the active role the patient partners took in interpreting the results. They were particularly interested in how respondents’ views changed during the survey: 79% felt positively about the use of routinely collected data at the beginning, compared to 93% at the end. Response bias may have contributed if respondents thought they should feel more positively by the end. However, the change may also have been due to the small amount of educational information provided alongside the survey questions. Although this hypothesis should be tested in future surveys, it is supported by studies conducted in the United Kingdom37,38 and New Zealand39 in which participants reported being more comfortable about the use of health data for research after receiving more information. This potentially influential role of education is important given our respondents’ desire to learn more about specific topics and their lack of confidence in existing privacy and security procedures. To address these needs, we recommend showcasing Canadian studies in which administrative data were used to assess complications40–42 and long-term effects43–46 of medications in patients with chronic diseases. Furthermore, it is essential to provide more information about the role of data stewards in adjudicating data requests and imposing conditions to data access. Prior work suggests that members of the public benefit from hearing the views of the different parties involved and affected by this type of research, including researchers, public health leaders and patients.38,39,47 As such, the researchers and patients who partnered on this survey are codeveloping educational resources that incorporate patients’ perspectives. These should complement the lay summaries, videos and other educational materials available from organizations such as Population Data BC ([https://www.popdata.bc.ca/](https://www.popdata.bc.ca/)) and ICES that facilitate research with health data. Engaging the public should also help in reaching consensus about the terminology used to describe this type of research. As per the patient partners’ suggestion, we used the somewhat catchy term big data, although, strictly speaking, structured administrative data sets do not fulfill the characteristics of the big data used in health research,48 which are typically unstructured. These characteristics include high volume, variety (linking multiple sources such as administrative, survey and electronic medical record data)48 and velocity (data collected daily or in real time).48 However, as we believe that most people are not aware of these distinctions, we felt that big data, a mainstream term used to describe this type of health research in Canada49,50 and elsewhere,51–53 would resonate better than “administrative” or “publicly/routinely collected” data. ### Limitations Our convenience sample was small, with over half of the respondents residing in BC and only 7% responding in French. Over half were university graduates, although the impact of education level on privacy concerns about health information is unclear,54 with several studies indicating no differences. 16,18,19 We did not formally assess the face or content validity of our questionnaire, and, although it was translated from English to French by an experienced professional translator, no back-translation55 or further refinements56 were done to correct mistranslations. Although the survey was open to all Canadian adults, recruitment was carried out mainly through groups for patients with arthritis or skin disease, and our findings may not reflect the privacy and security concerns of people with other health conditions or the public at large. An earlier survey that the patient partners conducted to inform the grant application was much shorter and received 636 responses over 3 weeks.31 Although we could not calculate the recruitment/participation rate for the current survey, we suspect that many potential respondents, especially those who found the survey through social media, reviewed the consent page and perceived the survey as too long or too formal. As such, although we intended to recruit a larger, more generalizable sample than those in prior Canadian investigations that limited sampling to patients with specific diagnoses (182–235 respondents each),17–19 our findings cannot be generalized to the entire Canadian population and must be interpreted cautiously. ### Lessons learned from patient involvement The patient partners’ engagement added credibility to the study and its findings. It may have appeared self-serving for the researchers to conduct this survey on their own and report a favourable level of support among people in Canada for the type of research they are personally involved with. The research trainees gained valuable experience in collaborating with patient partners and developed a better understanding of partners’ roles, such as researchers’ role in maintaining scientific rigor and patient partners’ role in driving study accessibility. In addition, through the guidance of the patient partners and mentoring of an experienced researcher, the team negotiated strategies to facilitate participation of patient partners over the course of this study. Notably, the team gained experience in discussing authorship. In turn, the patient partners learned more about the research process, including how researchers go about accessing administrative data, and copresented the findings at the American College of Rheumatology annual meeting.28 ### Conclusion As new sources of publicly and patient-collected information, including electronic medical records, biospecimens/genomic data, wearable devices and mobile health apps, become available for linkage and analysis, it is important to engage with Canadians about health research using this type of data and their preferences for data-access and data-sharing policies. Although the vast majority of our respondents felt positively about the use of publicly collected data, many lacked confidence in the access and privacy controls. Our findings warrant further investigation with more population-based sampling strategies so that the views of people of different ages, races/ethnicities, socioeconomic backgrounds, languages and health status are better represented. The extent to which wording (i.e., “personal health information” v. “administrative data”) and level of detail in the survey (i.e., whether to mention that consent is not sought for this type of research) affect respondents’ attitudes should also be investigated, along with any changes in concern about insurance companies’ accessing health data since the passage of Canada’s Genetic Non-Discrimination Act,57 in mid-2017. For now, this dynamic partnership between researchers and patients was an important first step in understanding the views of Canadians on the use of large sets of routinely collected health information for health research. ## Acknowledgements The authors thank all those who participated in the survey. They also thank Lynn Nowoselski, who coordinated the project. ## Footnotes * **Competing interests:** None declared. * This article has been peer reviewed. * **Contributors:** Clayon Hamilton, Cheryl Koehn, Kelly English, Allan Stordy and Linda Li conceived the study. Clayon Hamilton analyzed the data. Natalie McCormick and Clayon Hamilton drafted the manuscript. All of the authors contributed substantially to the study design and interpretation of data, critically revised the manuscript for important intellectual content, gave final approval of the version to be published and agreed to be accountable for all aspects of the work. * **Funding:** This study was funded by Team Grant THC-316595 from the Canadian Institutes of Health Research (CIHR). Natalie McCormick was funded by a Doctoral Research Award from CIHR and now holds a CIHR Fellowship Award. Clayon Hamilton is funded by a Michael Smith Foundation for Health Research Trainee Award in Health Services Research and a CIHR Health System Impact Fellowship. Linda Li is supported by the Canada Research Chair Program, the Harold Robinson/Arthritis Society Chair in Arthritic Diseases and the Michael Smith Foundation for Health Research Scholar Award. * **Supplemental information:** For reviewer comments and the original submission of this manuscript, please see [www.cmajopen.ca/content/7/2/E203/suppl/DC1](http://www.cmajopen.ca/content/7/2/E203/suppl/DC1). ## References 1. PopulationData BC, Glossary. Available: [https://www.popdata.bc.ca/BCDataDeliberation/glossary](https://www.popdata.bc.ca/BCDataDeliberation/glossary). accessed 2018 June 12. 2. Lavergne MR, Law MR, Peterson S, et al. (2016) A population-based analysis of incentive payments to primary care physicians for the care of patients with complex disease. CMAJ 188:E375–83. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiY21haiI7czo1OiJyZXNpZCI7czoxMToiMTg4LzE1L0UzNzUiO3M6NDoiYXRvbSI7czoyMDoiL2NtYWpvLzcvMi9FMjAzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 3. Benchimol EI, Bernstein CN, Bitton A, et al. (2017) Trends in epidemiology of pediatric inflammatory bowel disease in Canada: distributed network analysis of multiple population-based provincial health administrative databases. Am J Gastroenterol 112:1120–34. 4. Lega IC, Wilton AS, Austin PC, et al. (2016) The temporal relationship between diabetes and cancer: a population-based study. Cancer 122:2731–8. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=27400035&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 5. Rawson NSB, Downey W, Maxwell CJ, et al. (2011) 25 years of pharmacoepidemiologic innovation: the Saskatchewan health administrative databases. J Popul Ther Clin Pharmacol 18:e245–9. 6. Khor S, Beca J, Krahn M, et al. (2014) Real-world costs and cost-effectiveness of rituximab for diffuse large B-cell lymphoma patients: a population-based analysis. BMC Cancer 14:586. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1186/1471-2407-14-586&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=25117912&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 7. Fitzpatrick T, Rosella LC, Calzavara A, et al. (2015) Looking beyond income and education: socioeconomic status gradients among future high-cost users of health care. Am J Prev Med 49:161–71. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1016/j.amepre.2015.02.018&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=25960393&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 8. Tu K, Mitiku TF, Ivers NM, et al. (2014) Evaluation of Electronic Medical Record Administrative data Linked Database (EMRALD). Am J Manag Care 20:e15–21. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=24669409&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) [Web of Science](http://www.cmajopen.ca/lookup/external-ref?access_num=000330599000010&link_type=ISI) 9. Tu JV, Chu A, Maclagan L, et al. (2017) Regional variations in ambulatory care and incidence of cardiovascular events. CMAJ 189:E494–501. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiY21haiI7czo1OiJyZXNpZCI7czoxMToiMTg5LzEzL0U0OTQiO3M6NDoiYXRvbSI7czoyMDoiL2NtYWpvLzcvMi9FMjAzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 10. Alter DA, Tu JV, Koh M, et al. (2018) Projected real-world effectiveness of using aggressive low-density lipoprotein cholesterol targets among elderly statin users following acute coronary syndromes in Canada. J Am Heart Assoc 7:e007535. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiYWhhb2EiO3M6NToicmVzaWQiO3M6MTI6IjcvMTAvZTAwNzUzNSI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 11. Lougheed T (2015) Rethinking barriers to big data. CMAJ 187:E1. [FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiY21haiI7czo1OiJyZXNpZCI7czo4OiIxODcvMS9FMSI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 12. (2015) Accessing health and health-related data in Canada (Expert Panel on Timely Access to Health and Social Data for Health Research and Health System Innovation, Council of Canadian Academies, Ottawa) Available: [www.scienceadvice.ca/reports/accessing-health-and-health-related-data-in-canada/](http://www.scienceadvice.ca/reports/accessing-health-and-health-related-data-in-canada/). accessed 2019 Mar 6. 13. (2016) 2016 survey of Canadians on privacy: final report (Office of the Privacy Commissioner of Canada, Gatineau (QC)) Available: [https://www.priv.gc.ca/en/opc-actions-and-decisions/research/explore-privacy-research/2016/por\_2016_12/](https://www.priv.gc.ca/en/opc-actions-and-decisions/research/explore-privacy-research/2016/por_2016_12/). accessed 2018 Sept 16. 14. Nair K, Willison D, Holbrook A, et al. (2004) Patients’ consent preferences regarding the use of their health information for research purposes: a qualitative study. J Health Serv Res Policy 9:22–7. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1258/135581904322716076&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=15006236&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 15. Willison DJ, Swinton M, Schwartz L, et al. (2008) Alternatives to project-specific consent for access to personal information for health research: insights from a public dialogue. BMC Med Ethics 9:18. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1186/1472-6939-9-18&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=19019239&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 16. Willison DJ, Schwartz L, Abelson J, et al. (2007) Alternatives to project-specific consent for access to personal information for health research: What is the opinion of the Canadian public? J Am Med Inform Assoc 14:706–12. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1197/jamia.M2457&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=17712084&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 17. Page SA, Manhas KP, Muruve DA (2016) A survey of patient perspectives on the research use of health information and biospecimens. BMC Med Ethics 17:48. 18. Terry AL, Chesworth BM, Bourne RB, et al. (2015) Joint replacement recipients’ views about health information privacy. Health Expect 18:1519–29. 19. Page SA, Mitchell I (2006) Patients’ opinions on privacy, consent and the disclosure of health information for medical research. Chronic Dis Can 27:60–7. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=16867240&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 20. McCormick N, Reimer K, Famouri A, et al. (2017) Filling the gaps in SARDs research: collection and linkage of administrative health data and self-reported survey data for a general population-based cohort of individuals with and without diagnoses of systemic autoimmune rheumatic disease (SARDs) from British Columbia, Canada. BMJ Open 7:e013977. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMToiNy82L2UwMTM5NzciO3M6NDoiYXRvbSI7czoyMDoiL2NtYWpvLzcvMi9FMjAzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 21. McCormick N, Marra CA, Sadatsafavi M, et al. (2018) Incremental direct medical costs of systemic lupus erythematosus patients in the years preceding diagnosis: a general population-based study. Lupus 27:1247–58. 22. McCormick N, Lacaille D, Bhole V, et al. (2014) Validity of myocardial infarction diagnoses in administrative databases: a systematic review. PLoS One 9:e92286. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1371/journal.pone.0092286&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=24682186&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 23. McCormick N, Lacaille D, Bhole V, et al. (2014) Validity of heart failure diagnoses in administrative databases: a systematic review and meta-analysis. PLoS One 9:e104519. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1371/journal.pone.0104519&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=25126761&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 24. McCormick N, Bhole V, Lacaille D, et al. (2015) Validity of diagnostic codes for acute stroke in administrative databases: a systematic review. PLoS One 10:e0135834. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1371/journal.pone.0135834&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=26292280&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 25. Rai SK, Aviña-Zubieta JA, McCormick N, et al. (2017) The rising prevalence and incidence of gout in British Columbia, Canada: population-based trends from 2000 to 2012. Semin Arthritis Rheum 46:451–6. 26. English K (2017) Big Data: accessing existing health databases to advance health research (Arthritis Research Canada, Richmond (BC)) Available: [https://www.arthritisresearch.ca/big-data-accessing-existing-health-databases/](https://www.arthritisresearch.ca/big-data-accessing-existing-health-databases/). accessed 2018 Dec 20. 27. Esdaile J, Abrahamowicz M, Aviña-Zubieta A, et al. PRECISION: Preventing Complications from Inflammatory Skin, Joint and Bowel Conditions (Arthritis Research Canada, Richmond (BC)) Available: [www.arthritisresearch.ca/precision](http://www.arthritisresearch.ca/precision). accessed 2018 Apr 21. 28. McCormick N, Hamilton C, Koehn C, et al. (2018) Canadians’ views about using big data in health research from a national online survey: a partnership of patient-consumers and researchers [abstract]. Arthritis Rheumatol 70(Suppl 10). 29. Hamilton CB, Leese JC, Hoens AM, et al. (2017) Framework for advancing the reporting of patient engagement in rheumatology research projects. Curr Rheumatol Rep 19:38. 30. Leese J, Macdonald G, Kerr S, et al. (2018) ‘Adding another spinning plate to an already busy life.’ Benefits and risks in patient partner–researcher relationships: a qualitative study of patient partners’ experiences in a Canadian health research setting. BMJ Open 8:e022154. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMToiOC84L2UwMjIxNTQiO3M6NDoiYXRvbSI7czoyMDoiL2NtYWpvLzcvMi9FMjAzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 31. Koehn C, Attara G, Stordy A, et al. (2014) Consumer informing research: a survey of Canadians’ views and research priorities in chronic inflammatory diseases [abstract]. J Rheumatol 41:1465. 32. Xafis V (2015) The acceptability of conducting data linkage research without obtaining consent: lay people’s views and justifications. BMC Med Ethics 16:79. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 33. Buckley BS, Murphy AW, MacFarlane AE (2011) Public attitudes to the use in research of personal health information from general practitioners’ records: a survey of the Irish general public. J Med Ethics 37:50–5. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToibWVkZXRoaWNzIjtzOjU6InJlc2lkIjtzOjc6IjM3LzEvNTAiO3M6NDoiYXRvbSI7czoyMDoiL2NtYWpvLzcvMi9FMjAzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 34. Luchenski SA, Reed JE, Marston C, et al. (2013) Patient and public views on electronic health records and their uses in the United Kingdom: cross-sectional survey. J Med Internet Res 15:e160. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.2196/jmir.2701&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=23975239&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 35. Zalin A, Papoutsi C, Shotliff K, et al. (2016) The use of information for diabetes research and care: patient views in West London. Pract Diabetes 33:81–86a. 36. Whiddett R, Hunter I, Engelbrecht J, et al. (2006) Patients’ attitudes towards sharing their health information. Int J Med Inform 75:530–41. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1016/j.ijmedinf.2005.08.009&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=16198142&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) [Web of Science](http://www.cmajopen.ca/lookup/external-ref?access_num=000238244900003&link_type=ISI) 37. Hill EM, Turner EL, Martin RM, et al. (2013) “Let’s get the best quality research we can”: public awareness and acceptance of consent to use existing data in health research: a systematic review and qualitative study” BMC Med Res 3:72. 38. Tully MP, Bozentko K, Clement S, et al. (2018) Investigating the extent to which patients should control access to patient records for research: a deliberative process using citizens’ juries. J Med Internet Res 20:e112. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 39. Parkin L, Paul C (2011) Public good, personal privacy: a citizens’ deliberation about using medical information for pharmacoepidemiological research. J Epidemiol Community Health 65:150–6. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czo4OiI2NS8yLzE1MCI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 40. Feldman DE, Vinet É, Bérard A, et al. (2017) Heart disease, hypertension, gestational diabetes mellitus, and preeclampsia/eclampsia in mothers with juvenile arthritis: a nested case–control study: maternal outcomes in juvenile arthritis. Arthritis Care Res (Hoboken) 69:306–9. 41. Kuenzig ME, Barnabe C, Seow CH, et al. (2017) Asthma is associated with subsequent development of inflammatory bowel disease: a population-based case–control study. Clin Gastroenterol Hepatol 15:1405–12.e3. [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 42. Solomon DH, Love TJ, Canning C, et al. (2010) Risk of diabetes among patients with rheumatoid arthritis, psoriatic arthritis and psoriasis. Ann Rheum Dis 69:2114–7. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTE6ImFubnJoZXVtZGlzIjtzOjU6InJlc2lkIjtzOjEwOiI2OS8xMi8yMTE0IjtzOjQ6ImF0b20iO3M6MjA6Ii9jbWFqby83LzIvRTIwMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 43. Raymakers AJN, Sadatsafavi M, Sin DD, et al. (2017) The impact of statin drug use on all-cause mortality in patients with COPD. Chest 152:486–93. 44. CAN-AIM, Moura CS, Abrahamowicz M, Beauchamp ME, et al. (2015) Early medication use in new-onset rheumatoid arthritis may delay joint replacement: results of a large population-based study. Arthritis Res Ther 17:197. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1186/s13075-015-0713-3&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=26235697&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 45. Dixon WG, Abrahamowicz M, Beauchamp ME, et al. (2012) Immediate and delayed impact of oral glucocorticoid therapy on risk of serious infection in older patients with rheumatoid arthritis: a nested case–control analysis. Ann Rheum Dis 71:1128–33. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTE6ImFubnJoZXVtZGlzIjtzOjU6InJlc2lkIjtzOjk6IjcxLzcvMTEyOCI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 46. Shah M, Avgil Tsadok M, Jackevicius CA, et al. (2014) Warfarin use and the risk for stroke and bleeding in patients with atrial fibrillation undergoing dialysis. Circulation 129:1196–203. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMjkvMTEvMTE5NiI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 47. Douglas A, Ward HJT, Bhopal R, et al. (2018) Is the linkage of census and health data justified? Views from a public panel of the Scottish Health and Ethnicity Linkage study. J Public Health (Oxf) 40:435–40. 48. Fuller D, Buote R, Stanley K (2017) A glossary for big data in population and public health: discussion and commentary on terminology and research methods. J Epidemiol Community Health 71:1113–17. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czoxMDoiNzEvMTEvMTExMyI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 49. Project Big Life [homepage], Available: [https://www.projectbiglife.ca/](https://www.projectbiglife.ca/). accessed 2019 Jan 3. 50. Tu JV, Chu A, Donovan LR, et al. (2015) The Cardiovascular Health in Ambulatory Care Research Team (CANHEART): using big data to measure and improve cardiovascular health and healthcare services. Circ Cardiovasc Qual Outcomes 8:204–12. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2lyY2N2b3EiO3M6NToicmVzaWQiO3M6NzoiOC8yLzIwNCI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 51. Kashef Z (Apr 12, 2018) YaleNews [New Haven (CT)], Big data analysis accurately predicts patient survival from heart failure. Available: [https://news.yale.edu/2018/04/12/big-data-analysis-accurately-predicts-patient-survival-heart-failure](https://news.yale.edu/2018/04/12/big-data-analysis-accurately-predicts-patient-survival-heart-failure). accessed 2019 Jan 3. 52. (Feb 29, 2016) Big data initiative set to make impact on health of society [news release] (Farr Institute of Health Informatic Research), Available: [http://farrinstitute.org/news/big-data-initiative-set-to-make-impact-on-health-of-socie](http://farrinstitute.org/news/big-data-initiative-set-to-make-impact-on-health-of-socie). accessed 2019 Jan. 3. 53. van Staa TP, Goldacre B, Buchan I, et al. (2016) Big health data: the need to earn public trust. BMJ 354:i3636. [FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNTQvanVsMTRfMi9pMzYzNiI7czo0OiJhdG9tIjtzOjIwOiIvY21ham8vNy8yL0UyMDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 54. Kho ME, Duffett M, Willison DJ, et al. (2009) Written informed consent and selection bias in observational studies using medical records: systematic review. BMJ 338:b866. [Abstract/FREE Full Text](http://www.cmajopen.ca/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE2OiIzMzgvbWFyMTJfMi9iODY2IjtzOjQ6ImF0b20iO3M6MjA6Ii9jbWFqby83LzIvRTIwMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 55. Brislin RW (1970) Back-translation for cross-cultural research. J Cross Cult Psychol 1:185–216. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1177/135910457000100301&link_type=DOI) [Web of Science](http://www.cmajopen.ca/lookup/external-ref?access_num=A1970Y151500001&link_type=ISI) 56. Santos HPO, Black AM, Sandelowski M (2015) Timing of translation in cross-language qualitative research. Qual Health Res 25:134–44. [CrossRef](http://www.cmajopen.ca/lookup/external-ref?access_num=10.1177/1049732314549603&link_type=DOI) [PubMed](http://www.cmajopen.ca/lookup/external-ref?access_num=25189538&link_type=MED&atom=%2Fcmajo%2F7%2F2%2FE203.atom) 57. *Genetic Non-Discrimination Act* (S.C. 2017, c. 3), Available: [http://laws.justice.gc.ca/eng/AnnualStatutes/2017_3/FullText.html](http://laws.justice.gc.ca/eng/AnnualStatutes/2017_3/FullText.html). accessed 2018 Sept. 14. * Copyright 2019, Joule Inc. or its licensors