Moving beyond P values: Everyday data analysis with estimation plots

Joses Ho; Tayfun Tumkaya; Sameer Aryal; Hyungwon Choi; Adam Claridge-Chang

doi:10.1101/377978

Abstract

Over the past 75 years, a number of statisticians have advised that the data-analysis method known as null-hypothesis significance testing (NHST) should be deprecated (Berkson, 1942; Halsey et al., 2015; Wasserstein et al., 2019). The limitations of NHST have been extensively discussed, with a broad consensus that current statistical practice in the biological sciences needs reform. However, there is less agreement on reform’s specific nature, with vigorous debate surrounding what would constitute a suitable alternative (Altman et al., 2000; Benjamin et al., 2017; Cumming and Calin-Jageman, 2016). An emerging view is that a more complete analytic technique would use statistical graphics to estimate effect sizes and evaluate their uncertainty (Cohen, 1994; Cumming and Calin-Jageman, 2016). As these estimation methods require only minimal statistical retraining, they have great potential to shift the current data-analysis culture away from dichotomous thinking towards quantitative reasoning (Claridge-Chang and Assam, 2016). The evolution of statistics has been inextricably linked to the development of quantitative displays that support complex visual reasoning (Tufte, 2001). We consider that the graphic we describe here as estimation plot is the most intuitive way to display the complete statistical information about experimental data sets. However, a major obstacle to adopting estimation plots is accessibility to suitable software. To lower this hurdle, we have developed free software that makes high-quality estimation plotting available to all. Here, we explain the rationale for estimation plots by contrasting them with conventional charts used to display data with NHST results, and describe how the use of these graphs affords five major analytical advantages.

Introduction

The two-groups design is fundamental

While NHST limits the analyst to the ill-conceived question of ‘Does it?’ (McCloskey, 2002), estimation instead draws the analyst’s attention to the question of ‘How much?’ — the very topic that defines quantitative research. A fundamental research tool is an experiment that uses control and intervention samples: the two-groups design. Two-groups data are traditionally analyzed by Student’s t-test and its variants. We will use a series of visualizations of two-groups data to illustrate the progression from NHST to estimation-oriented analysis.

Significance tests obscure two data aspects

The Student’s t-test makes the assumption that two groups have identical means (i.e. it proposes that the effect size is zero). It then challenges this null hypothesis with the observed data, by calculating the chance of seeing the observed effect size (or greater) within the hypothesized null distribution—this is the P value. If the probability falls below a certain threshold (typically P < 0.05), the null hypothesis is rejected. The analyst then plots the two groups’ means in a bar chart and denotes ‘significance’ by marking it with a star (Figure 1A). This visualization has two important deficiencies. First, by displaying only the means and width of their errors, a bar chart obscures the observed values (2014). Second, NHST plots show only the test result (as indicated by a star or a P value), while omitting a diagram of the null distribution itself. The omission of the full dataset and distributional information in t-tests is a reflection of how NHST—by focusing on an accept/reject dichotomy—diverts attention from effect quantification.

Figure 1. The evolution of two-groups data graphics

A. Two-groups data presented in a bar chart. Control (C) and test groups (T) are shown as blue and orange bars, respectively.

B. The same data presented as a box plot.

C. A scatter plot (with jitter) allows for all observed values to be visualized (alongside appropriately-offset crossbars to indicate the mean and standard deviation for each group), but does not illustrate the groups’ comparison (ie. the effect size).

D. A visualization of the two-groups comparison from the null-hypothesis significance testing perspective. The filled curve on the difference axis indicates the null-hypothesis distribution of the mean difference. By definition, this distribution has a mean difference of zero. The area of the red segment indicates the P value (for one-sided testing).

E. An estimation plot uses the difference axis to display on an effect size, here the mean difference (∆). The filled curve indicates the resampled ∆ distribution, given the observed data. Horizontally aligned with the mean of the test group, the ∆ is indicated by the red circle. The 95% confidence interval of ∆ is illustrated by the red vertical line.

The transparency of bar charts is only modestly improved with box plots (Figure 1B); although they do outline more distributional information, box plots do not display complex attributes (e.g. bimodality) or the individual values (Matejka and Fitzmaurice, 2017).

Data-transparency is facilitated with the use of dot plots that show every datum (Cleveland and McGill, 1984; 2017) (Figure 1C). Dot plots are best drawn as beeswarm plots, which convey histogram-like information about the distribution while still displaying every observation (Ecklund, 2015; Waskom et al., 2016; Wilkinson, 1999) (Figure 1D).

Even when fully visualized, significance tests are misleading

An NHST plot can be made more informative by including a second axis to the side of the observed values (Gardner and Altman, 1986). This difference axis appropriately draws the viewer’s attention to the magnitude and variability information in the two groups’ comparison (Cumming and Calin-Jageman, 2016). We can use this axis to diagram NHST (Figure 1D). This design has three main features. (1) The mean of the null is the difference-axis origin, zero. (2) The origin is flanked by a sampling-error distribution; this null distribution can constructed with permutation (Pitman, 1937). (3) The P value is visualized as the tail segment of the distribution that is more extreme than the observed effect size. If this tail segment is smaller than a predefined significance level, traditionally α = 0.05, an analyst will reject the null.

While visualizing the null distribution is an improvement, this picture nevertheless illustrates the flawed logic of NHST: in order to prove that the null hypothesis is false, the analyst must invoke the existence of something (the tail segment) that the hypothesis predicts (Berkson, 1942). NHST has been criticized for this fallacy, as well as its misleading dichotomization (McShane and Gal, 2017; Yildizoglu et al., 2015). Even the premise of NHST is incorrect: any intervention to any system will produce some (at least infinitesimal) effect, thus a hypothesis of a precisely zero effect size is inevitably false (Cohen, 1994).

Estimation plots combine transparency and insight

As interventions always have effects, the analyst’s appropriate task is to quantify the effect size and assess its precision. First introduced by two biostatisticians over 30 years ago (Gardner and Altman, 1986), the best design for the analysis of two groups is an estimation plot that visualizes observed values and the effect size side-by-side. In this graphic, the difference-axis origin is aligned with the mean of the test group, making it easy to relate observed values to the difference of means, ∆ (Figure 1E). Around ∆, the analyst plots an indicator of precision known as the 95% confidence interval (CI) (Altman et al., 2000). The current design updates the Gardner-Altman plot by diagramming the sampling-error distribution as a filled curve.

Five key advantages of estimation plots

Estimation plots possess five key advantages over conventional NHST plots. First, as mentioned above, the difference axis affords transparency of the comparison being made. Second, while P values conflate magnitude and precision in a single number, the relative size of a CI provides a specific measure of its precision. Third, plotting the full sampling-error curve of the effect size prevents dichotomous thinking and draws attention to the distribution’s graded nature. Fourth, deriving this sampling-error curve with bootstrapping makes the method robust and versatile. Fifth, and most importantly, by focusing attention on an effect size, the difference diagram encourages quantitative reasoning about the system under study. Such reasoning empowers scientists to make domain-specific judgements on whether an effect magnitude is noteworthy and relevant to their research question.

Estimation plots are accessible

As two-groups analysis is the most frequently used statistical method in experimental research (Chochlac, 2018), the broad adoption of estimation for this type of analysis would greatly advance its acceptance more generally. However, while every major data-analysis tool can perform a Student’s t-test and chart NHST plots, very few software packages offer estimation plots. To improve the accessibility of estimation plots, we developed Data Analysis with Bootstrap-coupled ESTimation (DABEST), available in three open-source libraries for Matlab, Python, and R. DABEST calculates the sampling distribution and the CI with bootstrapping: resampling with replacement from the observations several thousand times (Efron, 1979; Efron and Tibshirani, 1994). Compared to parametric methods, bootstrapping is more robust for data sets with non-normal distributions (Efron, 1981, 1987). We have also used DABEST to build a free, user-friendly web application: estimationstats.com. Data is input via a spreadsheet, summary statistics are downloadable as text tables, and plots can be saved in image formats suitable for publication (PNG and SVG). The default CI can be easily re-specified to accommodate other interval sizes (Benjamin et al., 2017). With the web application and open-source libraries, DABEST caters to both scripting and spreadsheet workflows, empowering all researchers to rapidly adopt better data-analysis practices.

Estimation plots are versatile

DABEST can be used to visualize large samples (Figure 2A), paired data (Figure 2B), multiple groups (Figure 2C), shared-control designs (Figure 2D), and to display standardized effect sizes such as Hedges’ g. More generally, estimation-focused plots can be used for linear regression (Figure 3), and meta-research (e.g. forest plots) (Borenstein et al., 2009). Thus, the estimation approach is broadly relevant (Claridge-Chang and Assam, 2016).

Figure 2. The estimation approach can accommodate a range of experimental designs

A. Estimation plots can effectively display datasets with large numbers of observations. Shown here is a plot of the mean differences between two groups, each with 1000 observations. The mean and standard deviation (SD) of each group is plotted as a gapped line (the ends of the vertical lines correspond to mean ± SD, while the mean itself is depicted as a gap in the line) alongside all data points. Even larger samples are easily handled with related designs: the violin plot (Hintze and Nelson, 1998) which shows the density of observed values; or the sinaplot (Sidiropoulos et al., 2017) which controls the jitter width of the data points according to each group’s density function.

B. An estimation plot with a slopegraph (Tufte, 2001) depicting the pairs of within-subject observations and the ∆.

C. A multi-group estimation plot with multiple two-group comparisons plotted together. The lower panel—which shows the effect sizes (Δs)—is analogous to a forest plot, summarising the results of several comparisons. We propose multi-group estimation plots be named ‘Cumming plots’ after their originator (Cumming, 2012).

D. A shared-control plot. This is analogous to an ANOVA with multiple comparisons, where several groups (in this case, three groups: Test 1, Test 2, and Test 3) are compared against a single common control or reference group.

Figure 3. The estimation approach to linear regression.

The principle of showing both observed values and effect size applies to other types of estimation plots. Shown here is the example of a linear regression plot. A best-practice visualization should include the following: (1) a scatter plot that shows all observations; (2) a fit line with its confidence-interval band; (3) the slope effect size (m in y = mx + c of the regression fit line) with its confidence interval; and (4) the coefficient of determination effect size (R²) with its confidence interval. These features of a regression plot are the counterparts to the key aspects of estimation graphics for grouped data: dot plot; difference axis; ∆ value; and standardized effect size (e.g. Cohen’s d or Hedges’ g) (Cohen, 1988; Hedges, 1981). Both m and ∆ express a change in the dependent variable in terms of the independent variable—continuous or categorical, respectively. Both R² and d are indicators of the effect size as a proportion of variance; indeed, there are formulas for the interconversion of R² and d-type effect sizes (Borenstein et al., 2009).

Conclusion

Estimation plots constitute an elegant, robust framework for presenting data. The three software packages and accompanying web application offer non-statisticians a way to analyze their data without recourse to NHST, which derails analysts from quantification and misleads them to settle for superficial dichotomies. By visualizing effect sizes and their precision, estimation plots help analysts focus on quantitative thinking, enabling better scientific practice.

Author Contributions

Conceptualization: JH, ACC; Methodology: JH, ACC; Software: JH (Python, R), TT (Matlab, R), SA (Matlab); Writing: Original Draft: JH, Revision: JH, HC, ACC; Visualization: JH, ACC; Supervision: HC, ACC; Project Administration: ACC; Funding Acquisition: HC, ACC.

Sources of Funding

JH was supported by the A*STAR Scientific Scholars Fund. TT was supported by a Singapore International Graduate Award from the A*STAR Graduate Academy. SA was supported by a Singapore International Pre-Graduate Award. HC was supported by grants MOE-2016-T2-1-001 from the Singapore Ministry of Education and NMRC-CG-M009 from the National Medical Research Council. ACC was supported by grants MOE-2013-T2-2-054 and MOE2017-T2-1-089 from the Singapore Ministry of Education, grants 1231AFG030 and 1431AFG120 from the A*STAR Joint Council Office, and Duke-NUS Medical School. The authors received additional support from a Biomedical Research Council block grant to the Institute of Molecular and Cell Biology.

Data Availability

The Matlab, Python, and R packages are all available on Github, and are licensed under the BSD 3-Clause Clear License.

Guide to using DABEST

There are five ways to use DABEST.

No installation or download is required for the web application or Google Colab; either requires only an internet connection. The other methods require you to install Python, Matlab, or R on your personal computer.

Web application

Access estimationstats.com.
Choose one of the functions, e.g. two groups.
Use the preloaded data or enter your own data.

Google Colaboratory

Open an window in any modern browser (Chrome, Firefox, or Safari). Use incognito or private mode if you wish to remain anonymous.
Access this online example notebook to view the code that generated the Figure. You can view or download the notebook, but cannot run it without signing in.
If you would like to run the code in Colaboratory, you will need an Google account with which to sign in.

Matlab

Download DABEST-Matlab from Mathworks File Exchange or the Github repo.
Follow the tutorial on Github.

Python

Install the Anaconda distribution of Python 3.6 and Jupyter.
Download the example notebook from Colaboratory (see above).
Run the example notebook to install and test DABEST-Python.
Or, install DABEST with this line in the terminal: pip install dabest
A tutorial on DABEST-Python can be found here.

R

Run this line in the R console: install.packages(“dabestr”)
Note that a version of R > 3.5.0 is required.

Acknowledgements

The authors are grateful to Mashiur Rahman for help and advice, and to Hung Nguyen for developing the web app front end.

References

↵
Altman, D., Machin, D., Bryant, T., and Gardner, S. (2000). Statistics with confidence: confidence interval and statistical guidelines. Bristol: BMJ Books.
↵
Benjamin, D.J., Berger, J.O., Johannesson, M., Nosek, B.A., -J. Wagenmakers, E., Berk, R., Bollen, K.A., Brembs, B., Brown, L., Camerer, C., et al. (2017). Redefine statistical significance. Nature Human Behaviour 2, 6–10.
OpenUrl
↵
Berkson, J. (1942). Tests of Significance Considered as Evidence. J. Am. Stat. Assoc. 37, 325–335.
OpenUrl CrossRef Web of Science
↵
Borenstein, M., Hedges, L.V., Higgins, J.P.T., and Rothstein, H.R. (2009). Introduction to meta-analysis (Chichester, West Sussex, U.K; Hoboken: John Wiley & Sons).
↵
Chochlac, R. (2018). What are the most popular GraphPad QuickCalcs?
↵
Claridge-Chang, A., and Assam, P.N. (2016). Estimation statistics should replace significance testing. Nat. Methods 13, 108–109.
OpenUrl CrossRef
↵
Cleveland, W.S., and McGill, R. (1984). The Many Faces of a Scatterplot. Journal of the American Statistics Association 79.
↵
Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences (Lawrence Erlbaum Associates).
↵
Cohen, J. (1994). The earth is round (p < .05). American Psychologist 49, 997–1003.
OpenUrl CrossRef
↵
Cumming, G. (2012). Understanding the new statistics effect sizes, confidence intervals, and meta-analysis (New York: Routledge).
↵
Cumming, G., and Calin-Jageman, R. (2016). Introduction to the New Statistics: Estimation, Open Science, and Beyond (Routledge).
↵
Ecklund, A.C. (2015). beeswarm: an R package.
↵
Efron, B. (1979). Bootstrap Methods: Another Look at the Jackknife. Ann. Stat. 7, 1–26.
OpenUrl CrossRef Web of Science
↵
Efron, B. (1981). Nonparametric standard errors and confidence intervals. Can. J. Stat. 9, 139–158.
OpenUrl CrossRef
↵
Efron, B. (1987). Better Bootstrap Confidence Intervals. J. Am. Stat. Assoc. 82, 171–185.
OpenUrl CrossRef Web of Science
↵
Efron, B., and Tibshirani, R.J. (1994). An Introduction to the Bootstrap (CRC Press).
↵
Gardner, M.J., and Altman, D.G. (1986). Confidence intervals rather than P values: estimation rather than hypothesis testing. Br. Med. J. 292, 746–750.
OpenUrl Abstract/FREE Full Text
↵
Halsey, L.G., Curran-Everett, D., Vowler, S.L., and Drummond, G.B. (2015). The fickle P value generates irreproducible results. Nat. Methods 12, 179–185.
OpenUrl CrossRef PubMed
↵
Hedges, L.V. (1981). Distribution Theory for Glass’s Estimator of Effect size and Related Estimators. J. Educ. Behav. Stat. 6, 107–128.
OpenUrl CrossRef
↵
Hintze, J.L., and Nelson, R.D. (1998). Violin Plots: A Box Plot-Density Trace Synergism. Am. Stat. 52, 181–184.
OpenUrl CrossRef Web of Science
↵
Matejka, J., and Fitzmaurice, G. (2017). Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, (ACM), pp. 1290–1294.
↵
McCloskey, D. (2002). The Secret Sins of Economics (Prickly Paradigm Press).
↵
McShane, B.B., and Gal, D. (2017). Statistical Significance and the Dichotomization of Evidence. J. Am. Stat. Assoc. 112, 885–895.
OpenUrl
↵
Pitman, E.J.G. (1937). Significance Tests Which May be Applied to Samples From any Populations. Supplement to the Journal of the Royal Statistical Society 4, 119–130.
OpenUrl CrossRef
↵
Sidiropoulos, N., Sohi, S.H., Pedersen, T.L., Porse, B.T., Winther, O., Rapin, N., and Bagger, F.O. (2017). SinaPlot: An Enhanced Chart for Simple and Truthful Representation of Single Observations Over Multiple Classes. J. Comput. Graph. Stat. 1–4.
↵
Tufte, E.R. (2001). The Visual Display of Quantitative Information (Graphics Press).
↵
Waskom, M., Botvinnik, O., drewokane, Hobson, P., Halchenko, Y., Lukauskas, S., Warmenhoven, J., Cole, J.B., Hoyer, S., Vanderplas, J., et al. (2016). seaborn: v0.7.0 (January 2016) (Zenodo).
↵
Wasserstein, R.L., Schirm, A.L., and Lazar, N.A. (2019). Moving to a World Beyond “p < 0.05.” Am. Stat. 73, 1–19.
OpenUrl CrossRef
↵
Wilkinson, L. (1999). Dot Plots. Am. Stat. 53, 276–281.
OpenUrl
↵
Yildizoglu, T., Weislogel, J.-M., Mohammad, F., Chan, E.S.-Y., Assam, P.N., and Claridge-Chang, A. (2015). Estimating Information Processing in a Memory System: The Utility of Meta-analytic Methods for Genetics. PLoS Genet. 11, e1005718.
OpenUrl CrossRef
(2014). Kick the bar chart habit (Springer Nature).
(2017). Show the dots in plots. Nature Biomedical Engineering 1, s41551-017 – 0079.

View the discussion thread.

Posted April 06, 2019.

Download PDF

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11740)
Bioengineering (8750)
Bioinformatics (29189)
Biophysics (14967)
Cancer Biology (12093)
Cell Biology (17410)
Clinical Trials (138)
Developmental Biology (9420)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18301)
Genetics (12239)
Genomics (16797)
Immunology (11865)
Microbiology (28070)
Molecular Biology (11583)
Neuroscience (60953)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10425)
Scientific Communication and Education (1683)
Synthetic Biology (2884)
Systems Biology (7338)
Zoology (1651)

[1] ↵
Altman, D., Machin, D., Bryant, T., and Gardner, S. (2000). Statistics with confidence: confidence interval and statistical guidelines. Bristol: BMJ Books.

[2] ↵
Benjamin, D.J., Berger, J.O., Johannesson, M., Nosek, B.A., -J. Wagenmakers, E., Berk, R., Bollen, K.A., Brembs, B., Brown, L., Camerer, C., et al. (2017). Redefine statistical significance. Nature Human Behaviour 2, 6–10.
OpenUrl

[3] ↵
Berkson, J. (1942). Tests of Significance Considered as Evidence. J. Am. Stat. Assoc. 37, 325–335.
OpenUrl CrossRef Web of Science

[4] ↵
Borenstein, M., Hedges, L.V., Higgins, J.P.T., and Rothstein, H.R. (2009). Introduction to meta-analysis (Chichester, West Sussex, U.K; Hoboken: John Wiley & Sons).

[5] ↵
Chochlac, R. (2018). What are the most popular GraphPad QuickCalcs?

[6] ↵
Claridge-Chang, A., and Assam, P.N. (2016). Estimation statistics should replace significance testing. Nat. Methods 13, 108–109.
OpenUrl CrossRef

[7] ↵
Cleveland, W.S., and McGill, R. (1984). The Many Faces of a Scatterplot. Journal of the American Statistics Association 79.

[8] ↵
Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences (Lawrence Erlbaum Associates).

[9] ↵
Cohen, J. (1994). The earth is round (p < .05). American Psychologist 49, 997–1003.
OpenUrl CrossRef

[10] ↵
Cumming, G. (2012). Understanding the new statistics effect sizes, confidence intervals, and meta-analysis (New York: Routledge).

[11] ↵
Cumming, G., and Calin-Jageman, R. (2016). Introduction to the New Statistics: Estimation, Open Science, and Beyond (Routledge).

[12] ↵
Ecklund, A.C. (2015). beeswarm: an R package.

[13] ↵
Efron, B. (1979). Bootstrap Methods: Another Look at the Jackknife. Ann. Stat. 7, 1–26.
OpenUrl CrossRef Web of Science

[14] ↵
Efron, B. (1981). Nonparametric standard errors and confidence intervals. Can. J. Stat. 9, 139–158.
OpenUrl CrossRef

[15] ↵
Efron, B. (1987). Better Bootstrap Confidence Intervals. J. Am. Stat. Assoc. 82, 171–185.
OpenUrl CrossRef Web of Science

[16] ↵
Efron, B., and Tibshirani, R.J. (1994). An Introduction to the Bootstrap (CRC Press).

[17] ↵
Gardner, M.J., and Altman, D.G. (1986). Confidence intervals rather than P values: estimation rather than hypothesis testing. Br. Med. J. 292, 746–750.
OpenUrl Abstract/FREE Full Text

[18] ↵
Halsey, L.G., Curran-Everett, D., Vowler, S.L., and Drummond, G.B. (2015). The fickle P value generates irreproducible results. Nat. Methods 12, 179–185.
OpenUrl CrossRef PubMed

[19] ↵
Hedges, L.V. (1981). Distribution Theory for Glass’s Estimator of Effect size and Related Estimators. J. Educ. Behav. Stat. 6, 107–128.
OpenUrl CrossRef

[20] ↵
Hintze, J.L., and Nelson, R.D. (1998). Violin Plots: A Box Plot-Density Trace Synergism. Am. Stat. 52, 181–184.
OpenUrl CrossRef Web of Science

[21] ↵
Matejka, J., and Fitzmaurice, G. (2017). Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, (ACM), pp. 1290–1294.

[22] ↵
McCloskey, D. (2002). The Secret Sins of Economics (Prickly Paradigm Press).

[23] ↵
McShane, B.B., and Gal, D. (2017). Statistical Significance and the Dichotomization of Evidence. J. Am. Stat. Assoc. 112, 885–895.
OpenUrl

[24] ↵
Pitman, E.J.G. (1937). Significance Tests Which May be Applied to Samples From any Populations. Supplement to the Journal of the Royal Statistical Society 4, 119–130.
OpenUrl CrossRef

[25] ↵
Sidiropoulos, N., Sohi, S.H., Pedersen, T.L., Porse, B.T., Winther, O., Rapin, N., and Bagger, F.O. (2017). SinaPlot: An Enhanced Chart for Simple and Truthful Representation of Single Observations Over Multiple Classes. J. Comput. Graph. Stat. 1–4.

[26] ↵
Tufte, E.R. (2001). The Visual Display of Quantitative Information (Graphics Press).

[27] ↵
Waskom, M., Botvinnik, O., drewokane, Hobson, P., Halchenko, Y., Lukauskas, S., Warmenhoven, J., Cole, J.B., Hoyer, S., Vanderplas, J., et al. (2016). seaborn: v0.7.0 (January 2016) (Zenodo).

[28] ↵
Wasserstein, R.L., Schirm, A.L., and Lazar, N.A. (2019). Moving to a World Beyond “p < 0.05.” Am. Stat. 73, 1–19.
OpenUrl CrossRef

[29] ↵
Wilkinson, L. (1999). Dot Plots. Am. Stat. 53, 276–281.
OpenUrl

[30] ↵
Yildizoglu, T., Weislogel, J.-M., Mohammad, F., Chan, E.S.-Y., Assam, P.N., and Claridge-Chang, A. (2015). Estimating Information Processing in a Memory System: The Utility of Meta-analytic Methods for Genetics. PLoS Genet. 11, e1005718.
OpenUrl CrossRef

[31] (2014). Kick the bar chart habit (Springer Nature).

[32] (2017). Show the dots in plots. Nature Biomedical Engineering 1, s41551-017 – 0079.