Conducting high-value secondary dataset analysis: an introductory guide and resources

Alexander K Smith; John Z Ayanian; Kenneth E Covinsky; Bruce E Landon; Ellen P McCarthy; Christina C Wee; Michael A Steinman

doi:10.1007/s11606-010-1621-5

Conducting high-value secondary dataset analysis: an introductory guide and resources

J Gen Intern Med. 2011 Aug;26(8):920-9. doi: 10.1007/s11606-010-1621-5. Epub 2011 Feb 8.

Authors

Alexander K Smith¹, John Z Ayanian, Kenneth E Covinsky, Bruce E Landon, Ellen P McCarthy, Christina C Wee, Michael A Steinman

Affiliation

¹ Department of Medicine, University of California, San Francisco, CA, USA. aksmith@ucsf.edu

Abstract

Secondary analyses of large datasets provide a mechanism for researchers to address high impact questions that would otherwise be prohibitively expensive and time-consuming to study. This paper presents a guide to assist investigators interested in conducting secondary data analysis, including advice on the process of successful secondary data analysis as well as a brief summary of high-value datasets and online resources for researchers, including the SGIM dataset compendium ( www.sgim.org/go/datasets ). The same basic research principles that apply to primary data analysis apply to secondary data analysis, including the development of a clear and clinically relevant research question, study sample, appropriate measures, and a thoughtful analytic approach. A real-world case description illustrates key steps: (1) define your research topic and question; (2) select a dataset; (3) get to know your dataset; and (4) structure your analysis and presentation of findings in a way that is clinically meaningful. Secondary dataset analysis is a well-established methodology. Secondary analysis is particularly valuable for junior investigators, who have limited time and resources to demonstrate expertise and productivity.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Review

MeSH terms

Biomedical Research / methods
Biomedical Research / standards
Biomedical Research / statistics & numerical data
Databases, Factual / standards*
Databases, Factual / statistics & numerical data
Humans
Internet / standards
Practice Guidelines as Topic / standards*

Abstract

Publication types

MeSH terms

Grants and funding