metricas
Cirugía Española (English Edition) Statistical Analysis Plan (SAP): What is it and how to develop it?
Journal Information
Visits
1879
Vol. 103. Issue 1.
Pages 1-56 (January 2025)
Methodological letter
Full text access

Statistical Analysis Plan (SAP): What is it and how to develop it?

Plan de Análisis Estadístico: ¿qué es y cómo elaborarlo?
Visits
1879
Marc Fraderaa,
Corresponding author
mfradera@tauli.cat

Corresponding author.
, Xavier Serra-Aracilb
a Responsable de la Unitat de Suport a la Recerca (USR), Institut d'Investigació i Innovació Parc Taulí I3PT-CERCA, Sabadell, Barcelona, Spain
b Consorci Corporació Sanitària Parc Taulí CCSPT, Institut d’Investigació i Innovació Parc Taulí I3PT-CERCA, Departamento de Cirugía, Universitat Autònoma de Barcelona, Sabadell, Barcelona, Spain
This item has received
Article information
Full Text
Bibliography
Download PDF
Statistics
Tables (1)
Table 1. Sections and components that the Statistical Plan Analysis must contain for randomised clinical trials, prospective observational studies and retrospective observational studies.
Tables
Full Text

In the field of biomedical research, regardless of the study type and design, statistics plays an essential part. It provides us with methods to organise, summarise and analyse data in order to extract valid conclusions from it and facilitate decision-making. Due to the significance of statistics in ensuring accuracy and validity of conclusions derived from data analysis, good planning of the statistical methods and procedures to be employed in the different research project stages is vital. For some time now, the most prestigious scientific journals have requested the attachment of a Statistical Analysis Plan. This provides a detailed description of the proposed statistical methods for project data analysis and accompanies the manuscript for publication assessment.1,2 Quoting the North American writer Rican Alan Lakein, “Planning is bringing the future into the present so you can do something about it now.” Throughout this document we will explain what a Statistical Analysis Plan (known as SAP) is, why it is important, and how to prepare it.

What is an SAP?

SAP, or statistical analysis plan, is a document which is separate from the study protocol and which describes and details the intended statistical methods that will be used to analyse the data collected in a research study. To sum up, it describes what variables and results will be collected and what statistical methods will be used to analyse them.3

Generally, the study protocol specifies the study design; eligibility criteria; primary and secondary objectives; statistical methods to analyse the main variables; statistical power, and duly justified sample size. Although it is true that the study protocol already contains the main characteristics of the statistical analysis, the SAP is usually a much more complete document. It contains exhaustive technical details on the clinical analysis planned for the main variables; the management of secondary variables; control and/or confounding variables; the confidence intervals that will be used to present the results; the management of missing data, and other relevant specifications.3,4

Why is it important to prepare an SAP?

In modern research and open science, transparency and reproducibility are two basic concepts in good research practices to guarantee that the said statistical methods and procedures are accessible and reproducible for other researchers.5 Having an SAP, first of all, increases the transparency of the analysis. By establishing a detailed plan that describes how the data will be analysed before beginning the study, clarity is provided about the statistical methods and procedures that will be used. This allows researchers and the scientific community to understand in a transparent and concise manner how the results were obtained, facilitating the reproducibility of statistical analyses by personnel outside the research team. This is essential to validate and guarantee the reliability of the findings, contributing to the confidence in and credibility of biomedical research.6,7 Furthermore, another notable advantage in developing a statistical analysis plan is the efficiency resulting from the necessary communication between the statistician and the researcher when preparing the document.1 The fact that both the research team and the statistician actively participate in the preparation of the statistical planning that will be carried out will save time when making statistical and methodological decisions during the data analysis process. Although preparing the SAP does require considerable time, it will undoubtedly be a worthwhile investment.

To conclude, the development of an SAP in biomedical research projects is essential to guarantee the transparency, reproducibility, objectivity and validity of the analyses, while promoting effective communication between researchers and statisticians. This contributes significantly to the quality and credibility of the project in question.

When should the SAP be made?

The SAP must be prepared either at the same time or shortly after completing the protocol.8 In the case of experimental studies, if necessary, the SAP can be updated before unblinding the study to guarantee the transparency, accuracy and validity of the analyses.9 In prospective observational studies, the SAP should be completed before the inclusion of the first patient.10 When the study is retrospective observational, it is also advisable to have a SAP and, in this case, its version must be final before closing the database to begin analysis.3 One aspect to consider, whatever the type of study, is to detail each update with its dated version.

How is an SAP prepared?

As previously commented upon, given the level of detail and specificity required by the document, it is essential that the research staff and the person or persons who will be responsible for the statistical analysis collaborate in the preparation of the SAP. Several scientific publications serve as guides for the preparation of SAPs,3,9,10 and a recent publication even provides an extensive and complete template for the scientific community, with the different sections to include and how they should be completed.8

Sections to be included in an SAP

The Statistical Analysis Plan must address different sections to provide detail on the statistical management that will ensue and thus enable other researchers to replicate the analysis with similar data sets. Some consensus exists among the scientific community on the main sections and sections that the SAP should contain, thanks to the work of Gamble et al. in 20179:

  • 1

    Administrative information: This includes the title of the project, the SAP version and the protocol, the different revisions that have been made, the signatures of those people involved in the preparation of the document, and their roles.

  • 2

    Introduction: This section contextualises the project with its scientific justification and the research questions intended to be answered.

  • 3

    Design and methods: A detailed description of the study methodology. This section also includes the statistical justification for the sample size calculation, as well as the randomisation procedures to be carried out, if applicable. Additionally, this section specifies the proposed interim analyses and the criteria for stopping the study early based on the results obtained in said analyses.

  • 4

    Statistical assumptions: This section details both the confidence intervals and the level of statistical significance that will be assumed. Another part of this section is the definition of adherence, protocol deviations and the population that will be analysed (by intention to treat, by protocol, etc.)

  • 5

    Study population: Here the eligibility criteria of the sample are explained, as well as the follow-up time, management of loss to follow-up, etc. The baseline characteristics that will be collected for each of the study participants are also detailed.

  • 6

    Data analysis: This is the broadest and most detailed section, since it is where what is stated in the protocol must be expanded. All primary and secondary variables to be analysed must be well documented, together with the units of measurement for each of them. All statistical analyses planned for these variables, the management of missing data and the statistical software that will be used to develop the analysis are also detailed.

There are some small differences between the content of an SAP depending on the type of study. Table 1 specifies the different sections that should be included in randomised clinical trials, in prospective and retrospective observational trials.

Table 1.

Sections and components that the Statistical Plan Analysis must contain for randomised clinical trials, prospective observational studies and retrospective observational studies.

    Study type
  Sections and components that the SAP must contain  Randomised clinical trial  Prospective observational  Retrospective observational 
Project title 
Study registration number     
SAP version number and date 
Study protocol version 
SAP review history 
Reasons for SAP reviews 
Time of SAP reviews in relation to the interim analyses   
SAP collaborators, with responsibilities and roles 
Name of the person who wrote the SAP 
10  Name of the senior statistician 
11  Name of the principal researcher 
12  Study background and justification 
13  Hypothesis and objectives 
14  Study type 
15  Randomisation details     
16  Estimation and justification of sample size, if applicable 
17  Focus of the superiority hypothesis tests, equivalence or non inferiority     
18  Interim analysis, time of analysis and person performing the analysis, if applicable   
19  Adjustment of the significance level due to interim analysis   
20  Indications for early termination of the study   
21  Final analysis moment 
22  Schedule of visits and time interval to evaluate each result   
23  Statistical significance levels (p values) and whether they are unilateral or bilateral 
24  Plan and justification for multiplicity adjustment, if applicable, including how type 1 error is controlled 
25  Confidence intervals to be reported and whether they are unilateral or bilateral 
26  Definition of adherence to the intervention and how it will be presented     
27  Definition and summary of protocol deviations   
28  Definition of the population analysed 
29  Report screening data to describe representation of the study population, if applicable 
30  Inclusion and exclusion criteria 
31  Recruitment strategy   
32  Level and timing of early withdrawal of patients from the study   
33  Presentation of early withdrawal and follow-up data   
34  Baseline characteristics of the patients and how they will be descriptively summarised. 
Points 35−37 apply to each of the primary and secondary results.
35  Definitions of results and sequence of measurement 
36  Specific measurements and units for each variable 
37  Estimations and transformations used to obtain the result 
38  Methods of analysis used 
39  Presentation of treatment or intervention effects     
40  Covariates and adjustments 
41  Methods for confirming distribution assumptions 
42  Alternative methods used if distribution assumptions are not met 
43  Sensitivity analysis for each outcome, if applicable 
44  Subgroup definition and analysis, if applicable 
45  Methods for missing data management 
46  Additional statistical analysis, if applicable 
47  Safety data summary details   
48  Statistical packages used for analysis 
49  Reference to standard operating procedure or additional documents 

Adaptation of Yuan et al. (2019), under licence permits.

References
[1]
New England Journal of Medicine. New Manuscripts [Accessed 14 May 2024]. Available from: https://www.nejm.org/author-center/new-manuscripts.
[2]
JAMA Surgery. Instructions for Authors [Accessed 14 May 2024]. Available from: https://jamanetwork.com/journals/jamasurgery/pages/instructions-for-authors#SecProtocols.
[3]
I. Yuan, A.A. Topjian, C.D. Kurth, M.P. Kirschen, C.G. Ward, B. Zhang, et al.
Guide to the statistical analysis plan.
Paediatr Anaesth., 29 (2019), pp. 237-242
[4]
International Conference on Harmonisation of Technical Requirements for Registration of Pharmaceuticals for Human Use (ICH). Statistical Principles for Clinical Trials (E9). 1998 1998 [Accessed 14 May 2024]. Available from: https://www.ema.europa.eu/en/documents/scientific-guideline/ich-e-9-statistical-principles-clinical-trials-step-5_en.pdf.
[5]
B.A. Nosek, G. Alter, G.C. Banks, D. Borsboom, S.D. Bowman, S.J. Breckler, et al.
Promoting an open research culture.
Science., 348 (2015), pp. 1422-1425
[6]
M. Munafò, B. Nosek, D. Bishop, K.S. Button, C.D. Chambers, N. Percie du Sert, et al.
A manifesto for reproducible science.
[7]
S. Landis, S. Amara, K. Asadullah, C.P. Austin, R. Blumenstein, E.W. Bradley, et al.
A call for transparent reporting to optimize the predictive value of preclinical research.
Nature., 490 (2012), pp. 187-191
[8]
G. Stevens, S. Dolley, R. Mogg, J.T. Connor.
A template for the authoring of statistical analysis plans.
Contemp Clin Trials Commun., 34 (2023),
[9]
C. Gamble, A. Krishan, D. Stocken, S. Lewis, E. Juszczak, C. Doré, et al.
Guidelines for the content of statistical analysis plans in clinical trials.
JAMA., 318 (2017), pp. 2337-2343
[10]
B. Hiemstra, F. Keus, J. Wetterslev, C. Gluud, I.C.C. van der Horst.
DEBATE-statistical analysis plans for observational studies.
BMC Med Res Methodol., 19 (2019), pp. 233
Copyright © 2024. AEC
Article options
Tools