Objectives A How-to Guide for an Effective Journal Club Lisa Lundquist, PharmD, BCPS Clinical Assistant Professor Mercer University Atlanta, GA • • • • How to select an article to evaluate How to critically evaluate a study How to apply basic statistical concepts How to effectively deliver a journal club presentation Sabrina Cole, PharmD, BCPS Clinical Pharmacist Specialist, Drug Information Grady Health System Atlanta, GA Primary Literature • Original publications • Necessary for the development of secondary and tertiary literature resources • Study design, methodology, and scientific results included • Peer review process enhances validity of study and authors’ conclusions • Examples: research studies, case reports, editorials, letters-to-the-editor Major Goals of Interpretation • Establish the significance or importance of the trial • Relate the results to the original objectives of the trial • Compare data from the trial with data obtained from other trials 1 Common Problems Encountered in Literature • • • • • • • Flawed study design Invalid statistical analysis Fraud, deception, and misrepresentation Unintentional errors Poorly conducted research Poorly written manuscripts Data dredging Publication Process 1. 2. 3. 4. 5. Selection of journal Preparation for submission Review / peer review Revision Resubmission Peer Review Process Manuscript Editor Reviewer A Comments Reviewer B and suggestions Reviewer C Critical Literature Evaluation Accept Accept with revision Reject 2 Types of Studies • Prospective vs. retrospective – Prospective – followed forward into time – Retrospective – reviewed back in time Types of Studies • Descriptive vs Explanatory – Descriptive – serves to inform other healthcare professionals – Explanatory – determines if a difference exists between interventions • Observational • Experimental Types of Studies • Crossover – Experimental (interventional), prospective – Patient receives both the study and control medications separated during specified time periods A B • Parallel A B – Experimental (interventional), prospective – Patient receives either the study or control medication throughout the study A B Types of Studies • Cohort – Observational, prospective – Design may involve the evaluation of risk factors for disease development in a specified population • Case-control – Observational, retrospective – Patients with the disease (cases) and without the disease (controls) are compared to determine the exposure to the risk factor in question • Cross-sectional – study where measurements are taken at a single point in time 3 Types of Studies Types of Studies • Clinical – any experiment in which a drug is administered or dispensed to one or more human subjects • Quality of life – evaluation of a patient’s living situation based on the patient’s environment, family life, financial situation, educations, and health • N-of-1 – controlled study conducted in a single subject where periods of exposure to a treatment are compared to periods of exposure to placebo • Post-marketing surveillance – study designed to examine drug use and frequency of side effects following approval by FDA • Stability – study designed to determine the stability of drugs in various preparations • Meta-analysis – type of review where conclusions are based on the summarization of results obtained from combining and statistically evaluating data from previously conducted studies • Bioequivalence – research that evaluates whether products are similar in rate and extent of absorption Relative Strength of Causal Relationships Based on Study Design Increasing Strength • • • • Randomized clinical trials Meta-analyses Follow-up (cohort) study Case-control (trohoc) study • Case series • Case reports • Pharmacoeconomic – study of economic impact of drug therapies or services Controls • Controls – a treatment used for comparison in a study – placebo control – historical control – cross-over control – standard-treatment control – within-patient comparison control 4 Randomization • Any one individual has a predetermined probability of being assigned to each particular study and control group • Decreases but does not eliminate the possibility that study and control groups will differ according to factors that affect prognosis Randomization • All inclusion and exclusion criteria must be met before randomization occurs • Types - Simple Block (cluster) Stratified Non-randomization Blinding • Rationale - To prevent clinicians from assessing/treating one patient group differently from the other - To overcome the “placebo effect” - To ensure equal patient compliance • Limitations Blinding • Types - Single-blind Double-blind Triple-blind Double-dummy - May be difficult to blind a medication with a distinctive taste, physiologic effect, or continuous titration - Expensive and time consuming 5 Bias • Components of the trial design which can influence the outcome(s) studied • Controlling for Bias - Minimize confounders Proper selection of patients Objective method(s) of selecting data Blinding Use a control group Reliable data sources Population and Samples • Population – Every individual in the universe with the specific characteristics or disease states under study • Sample – Group or individual chosen as representatives from the population under study Variables • Dependent variable – outcome of interest within the study • Independent variable – the intervention or what is being manipulated • Confounding variables – affect the patients’ conditions and are associated statistically with the intervention being evaluated – Example: • In studying whether cigarette smoking causes lung cancer in a case-control study, drinking alcohol would be a confounding factor because people who smoke cigarettes are more likely to drink alcohol. Validity • Internal validity – Within the confines of the study, the methods and analysis used stand up to scrutiny, the investigators’ interpretation is supported, and the results appear accurate • External validity – Generalizability. The ability to apply the information to the reader’s practice setting • NO internal validity = NO external validity 6 Analysis • Intention-to-treat – Compares outcome based on the intended initial subjects’ assignments – Determines the effect of treatment under usual conditions (eg, gives a better idea of how the drug will do in the real world) – No data should be eliminated Analysis • As-treated – Analyzes subjects based on what intervention the subjects actually received – No data should be eliminated • Per-protocol – Analyzes those subjects who precisely followed the protocol – Problematic if compliance is related to prognosis Basic Study Design • Superiority – Trial with the primary objective of showing that the response to the investigational product is superior to a comparator • Equivalence Basic Statistical Concepts – A trial with the primary objective of showing that the response to 2 or more treatments differs by an amount that is clinically unimportant • Noninferiority – Trial with the primary objective of showing that the response to the investigational product is not clinically inferior to a comparative agent 7 Types of Data • Discrete – Nominal – Ordinal • Continuous – Interval – Ratio Types of Data: Ordinal • Responses scored on a continuum, but no consistent level of magnitude between ranks. • Order of numbers is meaningful • Ordinal scale data can be ranked in a specific order, be it low to high or high to low. – Questionnaires: "strongly agree" is scored as 5, "agree" is scored 4, "no opinion" is scored 3, "disagree" is scored 2, and "strongly disagree" is scored 1. Types of Data: Nominal • Most primitive scale and thus the weakest level of measurement • Items (subjects, patients) placed into groups or categories based on some mutual characteristics, which the entire group possesses. • Data is unordered (there is no ranking among the groups) • Examples: – Gender (male, female), outcome (lived or died, cured or not cured, infection or no infection), diagnosis (intracranial hemorrhage or thromboembolism), risk factors (smoking: yes or no), race, etc… Types of Data: Continuous • A predetermined order to the numbering of the scale is present, as is a consistent level of magnitude between each unit of measure. • Examples: heart rate, blood pressure, blood glucose, distance, time, and degrees Kelvin 8 Hypothesis Testing Step 1: Statement of the Null Hypothesis • Tests of a null and research hypothesis set using data obtained from a sample to make inferences about a population parameter • Statistical hypothesis • Process for answering questions • No difference between groups • Denoted by “H0” ¾ Group A – Group B = 0 ¾ Group A = Group B Step 2: Statement of the Research Hypothesis • Alternative hypothesis • Denoted by “H1” • Difference exists between groups ¾ Group A – Group B = 0 ¾ Group A = Group B Hypothesis Testing • Study example – A new weight-loss medication (Drug A) is compared to an existing one (Drug B) to determine if one agent is better at achieving goal BMI at the recommended starting dose • What is the H0? H1? 9 Step 3: Determination of the Significance Level • Alpha (α) • “Goal score” that, when achieved, allows H0 to be rejected and H1 accepted • Decided upon a priori, or before the fact • Generally, established as α < 0.05 Step 4: Evaluation of the Data Alpha (α) • Derived from the raw data and statistical calculations/tables • Statistical significance is generally accepted – Probability of making a type I error is < 0.05 – 1 out of 20 times a type I error is made (5%) • Alpha may be more stringent in some situations – p ≤ 0.01 – 1 out of 100 times a type I error is made (1%) P-Value • P-value • P-value tells us if there is or is not a difference between groups • Result of statistical testing and direct measure of the evidence supporting H0 • P-value does NOT tell anything about the magnitude of difference • Determined a posteriori, or after the fact • Compared directly to alpha (α) to make a decision about study results – Smaller p-values only mean it is less likely “chance” explains the observed differences • If p < α = statistical significance – Statistically significant does not always mean clinically significant 10 Step 5: Decision Regarding the Null Hypothesis • Accept the null hypothesis – No statistically significant difference – “Fail-to-reject” – P-value > alpha • Reject the null hypothesis – Statistically significant difference – “Fail-to-accept” – P-value < alpha Decision Errors • Type I error – α type error (alpha) – Reject H0 when H0 is true (false positive) – To falsely conclude that a significant difference exists between populations/ samples – Due to chance Decision Errors Decision Errors • Type II error – β type error (beta) – Accept H0 when H0 is false (false negative) – To falsely conclude that no significant difference exists between populations/ samples – Due to chance or small sample size “Truth” H0 is true H0 is false Accept H0 No error Type II error Reject H0 Type I error No error Your decision Error type 11 Decision Errors Decision Errors “Truth” Your decision “Truth” H0 is true H0 is false Accept H0 No error Type II error Reject H0 Type I error No error Your decision H0 is true H0 is false Accept H0 No error Type II error Reject H0 Type I error No error Error type “False-positive” result Beta (β) • The probability of making a type II error is defined as beta (β) • Beta (β) – More difficult to derive – Not one single probability value – Often ignored by researchers Error type “False-negative” result Beta (β) • Beta (β) < 0.2 acceptable • Beta (β) < 0.1 ideal • Beta (β) is most commonly used to calculate the number of subjects needed 12 Power • Power is defined by beta (β) – Indicates the probability of the statistical test detecting significant differences when they exist – Analogous to sensitivity • Defined as 1 - β Descriptive Statistics • Used to present, organize, and summarize data • Usually the basic presentation of data • Provide clues as to the appearance of the data – Power of 80% is minimal (1 - 0.2) – Power of ≥ 90% is ideal (1 - 0.1) Measures of Central Tendency • Mean – Arithmetic average of the data – May be computed for continuous data – Extremely sensitive to outliers • Median – – – – The 50th percentile Value above which or below which half of the data points lie Not sensitive to outliers Useful for continuous or ordinal data • Mode – Most commonly obtained value in the distribution – Useful to describe nominal, ordinal, and continuous data Measures of Variability • Percentile – Point on the distribution where a value is larger than x% of the other values in the group • Range – Difference between the largest and the smallest values in the distribution – Highly sensitive to outliers 13 Measures of Variability • Interquartile range – Measure of variability directly related to the median – Range described by the interval between the 25th and 75th percentile values – Clearly defines where the middle 50% of measures occurs and indicates the spread of data – Used to describe the variability for ordinal data Standard Deviation Measures of Variability • Standard deviation (SD) – Describes the variability of data about the sample mean – Meaningful only when it is calculated for normally distributed continuous data – About 68% of the data will fall within ±1 SD and about 95% of data will fall within ±2 SD Inferential Statistics • Used to determine the likelihood that a conclusion, based on the analysis of the data from a sample, is true and represents the population studied 2.5% 2.5% -3SD -2SD -SD X +SD +2SD +3SD • Used to make inferences about the larger population of interest based on the results from the sample 68% 95% 99.7% 14 Inferential Statistics Inferential Statistics • Standard error of the mean (SEM) – Measure of the precision with which a sample mean estimates the population mean – Statistic derived from the SD from a single sample • SEM=SD/√n – Always smaller than SD – Used to calculate confidence intervals Inferential Statistics: Parametric • Methods that use data extrapolated from a sample of the population studies to numerically describe some characteristic of a population • Valid only when the characteristic follows (or nearly follows) the normal distribution in the population studies • Valid only for continuous data • Confidence intervals (CI) – Measurement of the variability of study data – 95% CI is a numerical range that contains the true value for the population 95% of the time – Most commonly used to estimate the true, but unmeasured, population’s mean values for continuous data that are normally distributed Inferential Statistics: Parametric • t-test – Used for independent samples • Paired t-test – Used when 2 groups contain the same people in groups (ie, cross-over study design) • ANOVA (analysis of variance) – Used when comparing 3 or more groups • ANCOVA (analysis of covariance) – Method used for controlling for the effects of multiple confounding variables 15 ANOVA: post-hoc tests • Used to compare the means of the groups two at a time • Less error associated with the use of compared with separate t-tests • Examples Inferential Statistics: Nonparametric • Applied to non-normal distributions or to data that do not meet the criteria for using parametric tests (ie, ordinal and nominal data) – Bonferroni Correction – Scheffè’s method – Tukey’s least significant difference Inferential Statistics: Nonparametric • Mann-Whitney U test – Nonparametric equivalent to the t-test – Used when data are measured on an ordinal scale – Mann-Whitney U test = Wilcoxon Rank Sum • Wilcoxon Signed Rank test – Nonparametric equivalent of the paired t-test – Used for ordinal data Inferential Statistics: Nonparametric • Kruskal-Wallis – Nonparametric equivalent to ANOVA – Used for ordinal data • Friedman – Used for 3 or more groups with dependent samples – Used for ordinal data 16 Inferential Statistics: Nonparametric • Chi Square (X2) – Compares the percentages between 2 or more groups – Most useful for nominal data – Used to answer research questions about rates, proportions, or frequencies – Used with independent samples Inferential Statistics: Nonparametric • Fisher’s Exact test – Used instead of Chi Square if a cell in the matrix has an expected frequency of less than 5 or when you have a very small sample size (eg, 20 to 40) – Samples are independent • McNemar’s Test – Used to compare nominal data from paired samples • Mantel-Haenszel – Used to compare nominal data while controlling for the effects of a confounder Statistical Significance vs Clinical Significance A study comparing the mean INR in the 90 days before and after patients switched from brand name to generic name warfarin was reported in 2099 patients. Results showed that the mean INR before the switch was 2.45 + 0.02 compared to 2.51 + 0.04 after the switch (p<0.0001). Are these results statistically significant? Clinically significant? Format of Outcome Data Yes No Group 1 A B Group 2 C D 17 Relative Risk Relative Risk Reduction • The ratio of risk of an outcome event occurring in the experimental group compared to the risk of the same outcome event occurring in the control group. • Relative risk reduction is a complement to RR • Percent reduction in the experimental group rate compared with the control group event rate • RRR estimates the percentage of baseline risk that is removed as a result of the new therapy • RRR = 1 – RR • If the RRR is zero, there was no effect of the treatment compared with the control • (C/C+D)/(A/A+B) – RR < 1.0 indicates the therapy lessened the risk of developing the adverse outcome – RR = 1.0 denotes no difference between treatments – RR > 1.0 indicated the therapy increased the risk of developing the adverse outcome Absolute Risk Reduction Numbers Needed-to-Treat • This is sometimes called the risk difference • Difference in the event rate between the control group and the experimental group • ARR = (A/A+B) – (C/C+D) • An ARR of zero indicates no difference between comparison groups • Number of patients who require treatment to prevent one additional undesired event • NNT assumes that baseline risk is the same for all patients • Can not be extrapolated beyond study points in time • NNT = 1/ARR • NNT = 1/[A/(A+B) – C/(C+D)] 18 Title A Systematic Approach to Journal Club Presentation Investigators • Do the investigators appear to be qualified to conduct the trial? • Are the investigators affiliated with reputable institutions? • Is a statistician involved with the trial? • Is it descriptive? • Is it accurate? • Does it describe the design, therapy, route of administration, populations, and outcomes assessed? • Does it suggest that one treatment is superior to another? Funding • Is the funding source one that fosters independent study? 19 Journal • Was the trial published in a reputable journal? • Was the study peer reviewed? Abstract • • • • Does it state the hypothesis of the trial? Does it describe how the trial was undertaken? Does it highlight the results accurately? Does it put the essence of the trial into perspective for the reader? • Is it an unstructured, structured, or informational abstract? • Is it free of bias? Introduction • Is it written clearly? • Is it free of bias? • Does it establish the rationale for conducting the trial? • Is it free of current investigation’s results? Objectives • • • • Are the objectives stated? Are the objectives specific? How will the objectives be measured? When and by whom will the objectives be measured? • Are the objectives reasonable or within the scope of the trial? 20 Methods • Study Design – Is the design appropriate for the investigation? • Inclusion Criteria – Are the inclusion criteria explicitly stated? – Are the inclusion criteria appropriate? • Exclusion Criteria – – – – Are the exclusion criteria explicitly stated? Are the exclusion criteria appropriate? Do the exclusion criteria result in a biased sample? Do the exclusion criteria limit the external validity? Methods • Patient selection – Are the subjects healthy volunteers or subjects with the condition that the intervention is meant to improve? – How were the patients selected? – Do the study subjects fairly represent the larger population of interest? – Were the patients randomized? Methods • Study treatment – Single dose vs multiple doses – Fixed doses vs titrating to desired effect – Comparable dosages for different agents – Dosage form, administration schedule, and duration of treatment – Identical placebo – Setting Methods • Study treatment – – – – How was compliance defined and assessed? Were the subjects receiving any other therapy? What was the potential impact of diet? What was the potential impact of changes in lifestyle? 21 Methods • Measurement of drug effects – Are the measurements valid? – Were the measurements standardized? – Were the measurements evaluated by the same person or the same laboratory? – Were the number of measurements identical between groups? – Are the results reproducible? Methods • Data analysis – Intention-to-treat – As-treated – Per-protocol Methods • Terminology – Were important terms defined? • Safety – How were adverse effects monitored? – When were safety assessments conducted? Methods • Statistical analysis – Are the tests appropriate for the type of data? – Are enough data given to do the calculations? – Was power defined? 22 Results • Are the results presented clearly? • Are the results complete? • Are graphs, charts, and illustrations accurate? • Were the results analyzed according to the original objectives? • • • • Discussion • Do the authors explain the limitations of the trial? • Do the authors consider the work of others? • Do the authors draw valid conclusions based on the data obtained? • Do the authors suggest future directions for further research on the topic? References Student Critique Do the authors cite themselves repetitively? Are the hallmark articles included? Are the references up-to-date? Are the references cited representative of the literature available on the topic? • What limitations can be identified in the study? • What strengths can be identified in the study? • Do you agree with the authors’ conclusions? • Will the results of the study impact clinical practice? 23 Useful Resources • Gehlbach SH. Interpreting the medical literature. New York: McGraw-Hill; 2006. • Gaddis ML, Gaddis GM. Introduction to Biostatistics: part 1-6. Annals of Emergency Medicine, 1990 – part 1, basic concepts. Ann Emerg Med. 1990;19:86-9. – part 2, descriptive statistics. Ann Emerg Med. 1990;19:309-15. – part 3, sensitivity, specificity, predictive value, and hypothesis testing. Ann Emerg Med. 1990;19:591-7. – part 4, statistical inference techniques in hypothesis testing. Ann Emerg Med. 1990;19:820-5. – part 5, statistical inference techniques in hypothesis testing with nonparametric testing. Ann Emerg Med. 1990;19:1054-9. – part 6, correlation and regression. Ann Emerg Med. 1990;19:146268. 24
© Copyright 2025