|
Home | Issues | eMJA shop | My account | Classifieds | Contact | More... | Topics | Search |
→ Contents list for this issue
→ More articles on Journalology and publishing
→ More articles on Statistics Epidemiology and Research Design
Introduction
—Elements of an interpretation
—Recommended structure of a Discussion
—Statement of the findings
—Strengths and weaknesses
—Mechanisms and explanations
—Relation to other studies
—Implications for clinical practice and future research
—Interpretations, not just interpretation
—Competing interests
—Author details
—References
In preparing an article reporting a clinical trial, the authors are expected to provide a reasoned interpretation of the results and place them into a broader clinical context. “Interpretation” (Item 20 of the CONSORT statement) refers to how the authors account for their results (Box 1).1
Although it has been suggested that authors often have a vested interest in their data and may be biased towards a positive result,2 they nevertheless have first-hand experience of the design and conduct of the trial, and can offer a unique insight into interpretation of the results.
Authors are generally encouraged to summarise the extent to which the results and findings are consistent with their original hypothesis,3 and to comment on the robustness of the results for drawing conclusions and making recommendations. This should involve discussion of:
the suitability of the study design to answer the questions examined;
the ultimate quality of the trial as conducted;
the extent to which missing follow-up or imputed data contributed to the reported results; and
the potential influence of any protocol violations or other biases that may have affected the clinical experiment.4
Beyond this, specifically addressing the key aspects of internal validity (such as the fairness of the comparison of treatment groups in the trial) will help the reader assess the results. For example, the reader’s confidence in the results is increased by reassurance about the adequacy of randomisation,5 consistency across treatment arms of the methods used to measure outcomes,6 and similar levels of background care in the treatment arms.
Additionally, if results are derived from multiple comparisons, the dangers of over-interpretation of a variety of endpoints should be acknowledged.7,8
Even if a study has strong internal validity, the results may be surprising or unexpected. Therefore, the plausibility of the results in relation to expectations, and speculation as to possible mechanisms of action of the intervention should be discussed.7 The sensitivity of the findings to any departures from the assumptions in the design (eg, compliance levels, unblinding, losses to follow-up, and missing data) should be mentioned.4 Any other limitations or drawbacks of the study design or conduct should be acknowledged, and their influence, or lack of influence, on the outcomes should be argued. This will help the reader to compare these results with other relevant findings.
The strength of the findings (shown by P values and confidence intervals around the estimates) signifies their robustness. The size of the estimates of effect and their plausible variability (such as the risk reduction or hazard ratio and confidence intervals) show the potential importance of the intervention in clinical use.
After validity has been discussed, the potential ramifications of the results are usually presented. These include the value of the intervention beyond the trial, including the likely generalisability of the findings,9 the balance of benefits and harms,10 and any potential changes to clinical practice that may be appropriate. How consistent the results are with other findings, and how biologically and clinically plausible the interpretation is, will influence this discussion.
The results may raise new questions directing further research. These might arise from the findings for the main outcome, from a subset of patients, or from ancillary analyses. As the authors have an intimate knowledge of the study and the data, their views on the direction of such research may carry weight.
We recommend an ordered structure for the Discussion section (Box 2).
A simple declaration of the meaning of the results should introduce the authors’ interpretation. For example, the following sentence introduces the Discussion section of the LIPID study report:11
Our results provide strong evidence that lowering cholesterol levels with pravastatin in patients with a broad range of initial cholesterol levels and a history of myocardial infarction or unstable angina reduces the risk of death from CHD, cardiovascular disease, and all causes combined.
What follows extends this discussion with a brief statement of which patient groups benefited, and the extent and nature of these benefits.
The strengths of the trial may include its representative sample, its rigorous design and its clinical relevance.
The account of the weaknesses should aim to explain any flaws in the study identified by the authors, and outline the attempts made to minimise and compensate for these limitations. Identifying weaknesses in the study and discussing their likely influences will help readers appreciate the limitations of interpreting the study results. Discussion of weaknesses should include methodological aspects (eg, possible biases, the meaning of imprecision in the findings, and the number of multiple comparisons) as well as clinical aspects, such as any problem in translating statistical results to clinical importance.
An example is the study comparing hot water immersion with ice packs to relieve the pain of bluebottle (Physalia jellyfish) stings in participants recruited from beach first aid facilities.12 Hot water immersion for 20 minutes, unlike ice, was highly effective. The Discussion described study weaknesses, such as bias: there was a possibility that, with simultaneous recruitment of family members, treatments could have been allocated after randomisation on the basis of severity.
We suspect in some cases when two or three patients were simultaneously recruited (often one parent consenting for multiple children), the research assistants may have allocated hot water treatment to the more severe stings once the envelopes were open. However, this was likely to be rare, and a post-hoc analysis using simulations of matched treatment subgroups still showed a highly significant outcome at 20 minutes.
The Discussion also dealt with the subjectivity of pain and the problems of choosing how to measure it.
The measurement of pain is problematic because it is subjective and is influenced by numerous factors. However, pain is the most important and distressing effect of bluebottle stings, so it was essential that we establish the effect of treatment on pain. The VAS has become a standard tool for the measurement of pain in research, and has been validated in numerous settings.
It is important that the authors consider all possible mechanisms underlying the results and explain how they might relate to the outcomes.
For example, in the bluebottle study:12
It might be argued that the hot water immersion may be a symptomatic treatment for jellyfish stings, rather than providing definitive treatment by inactivating venom . . . We demonstrated a time-dependent effect of hot water immersion, with a barely significant effect at 10 minutes and a highly significant effect at 20 minutes. In addition, pain did not recur. This leads us to suggest that the mechanism of reducing pain by heat treatment is inactivation of venom.
However, unlikely or implausible hypothetical mechanisms should not be proposed merely so that they can be disproven.
All clinical trials start from a background of previous work. Authors should indicate where results extend, agree with or differ from those of other studies (a forest plot may help readers with interpretation). If there are differences, are these related to differences in methods or the characteristics of participants? An important finding of the Women’s Contraceptive and Reproductive Experiences (CARE) Study was new evidence on breast cancer risk:13
In conclusion, high parity and early age at first birth were associated with a reduction in risk only for ER+PR+ tumours. Breastfeeding was associated with a reduction in risk for both ER+PR+ and ER–PR– tumours. Combined with previous research, this suggests that parity and age at first birth act through different mechanisms than breastfeeding. All reproductive factors showed similar associations with both ductal, ductolobular and lobular tumours, suggesting that these tumours have similar aetiologies.
Readers may not have the authors’ background and experience in the research area. Their own interpretations are aided by the authors’ commentary, which may include how far the results can be applied in different clinical situations. For example, in the Heart Protection Study:14
As people with blood creatinine concentrations above 200 μmol/L were excluded from the present study, further large trials are required to determine prospectively whether statin therapy can prevent clinically relevant changes in renal function among people at particular risk of developing end-stage kidney disease.
Most clinical trial reports for publication draw together the expertise and interests of several authors, and the Discussion section is where their views are most likely to diverge. Authors bring different perspectives to interpreting the results.15 The Discussion needs to reflect a consensus view of all the contributors.
1 CONSORT checklist of items to include when reporting a randomised trial1
Interpretation of the results, taking into account study hypotheses, sources of potential bias or imprecision, and the dangers associated with multiplicity of analyses and outcomes |
|||||||||||||||
2 Suggested framework for the Discussion section
A brief statement of the findings;
Strengths and weaknesses (limitations) of the study, and methods used to minimise and compensate for the limitations;
Possible mechanisms of action of the intervention, and explanations of these mechanisms;
Comparison with relevant findings from other published studies;
Clinical and research implications of the work, as appropriate.
NHMRC Clinical Trials Centre, University of Sydney, Sydney, NSW.
Correspondence: tonyATctc.usyd.edu.au
|
Home | Issues | eMJA shop | My account | Classifieds | More... | Contact | Topics | Search |
©The Medical Journal of Australia 2007 www.mja.com.au PRINT ISSN: 0025-729X ONLINE ISSN: 1326-5377