This website serves as an open-access, “video textbook” for experimental design and data analysis. We use examples drawn from biology to illustrate and apply concepts but the principles apply generally (e.g., social sciences). As in a book, this website organizes topics by “chapter”, but this website replaces most text with videos. All videos offer transcripts to facilitate access with a poor internet connection; slides are also available, where applicable, as are practice problems with answers. Unless specified, all videos and downloadable materials, (except for datasets that come from elsewhere; noted within the website), are published under Creative Commons licensing (Attribution – No Derivatives). We hope educators and researchers use and share these materials, following Creative Commons licensing rules. If you use these materials for teaching purposes we ask you to please let us know (see Crispin’s contact details)! If you have an example (or a request for subject matter) that you believe would improve this resource, please contact Crispin Jordan: View Crispin Jordan's staff profileAims of the website'Experiments and what to do with them' aims to promote Research Reproducibility across biological disciplines. In particular this website:provides a firm introduction to essential aspects of experimental design emphasizes the need for appropriate experimental Power demonstrates power analysis for every type of analysis explored here (work in progress);discusses Questionable Research Practices and how to avoid themintroduces Open Research practices that increase transparency and work towards reproducibility follows the American Statistical Association's advice to move away from the concept of 'statistical significance'. The ASA advises that the term 'statistically significant' (and all variants thereof) be dropped entirely from statistical discourse, and this website largely adopts this advice. Specifically, in line with recommendations, we (i) continue to use p-values for inference but without reference to the arbitrary threshold of 0.05 and (ii) emphasize 'effect size' to understand effects. However, given that students continue to interact with a large body of pre-existing literature that adheres to the concept of 'statistical significance' this course cannot simply ignore this historical perspective. Therefore, we occasionally mention 'statistical significance' but also demonstrate alternative approaches that improve interpretation.Current analysesWe focus on General Linear Models (GLMs) to implement a broad range of analyses. At the moment, this website presents:An introduction to randomization tests, hypotheses, null distributions and p-valuesT-testsData transformation1-Factor GLM (i.e. very loosely speaking, “1-factor ANOVA”)Multi-factor GLM (e.g., again, very loosely speaking, “2-factor ANOVA”)GLM with continuous independent variables (covariates; i.e., “regression”)GLM combining factors and covariates (i.e., very loosely speaking, “ANCOVA”)An introduction of mixed effects modelsSystematic ReviewsIn the future, we will also deal with:Different ways to calculate p-values (e.g. type 1, 2 and 3 sum of squares, AIC, etc.)Models with multiple covariatesModel selectionMore on mixed effects modelsComputational methods; bootstrapping and randomization testsGeneralized Linear (Mixed) ModelsMultivariate analysesAnd more – please make a request!Please see the General Introduction for further perspective on this website.Further readingThe materials presented here draw heavily from a variety of excellent books. Several of these aim to be highly accessible to undergraduate students but have helped biologists at all career stages; I highlight the following as additional learning materials:The Analysis of Biological Data, by Whitlock and Schluter [A general treatment of data analysis];Experimental Design for the Life Sciences, by Ruxton and Colegrave [A fantastic introduction to experimental design];Modern Statistics for the Life Sciences, by Grafen and Hails [A great introduction to general linear models];Power Analysis: an introduction for the life sciences, by Colegrave and Ruxton [A thorough introduction to power analysis that remains approachable to undergraduate and more experienced scientists, alike.]Mixed effects models and extensions in ecology in R, by Zuur et al. [An excellent treatment of mixed effects models. The examples draw from ecology, but the principles and applications are general.]Nobody's perfectLike any standard textbook, this website is not perfect. I record my videos 'live' (little to no editing) to maintain an approachable feel; however, as can happen in a live lecture, I occasionally misspeak. Alternatively, a video may need minor updating to accommodate new information. I have added notes to each video where I am aware that such errors occur. If you find an error, please contact me and I will address it and I will thank you for your help!Contact Dr Crispin Jordan If you have an example (or a request for subject matter) that you believe would improve this resource, please contact Crispin: Dr Crispin Jordan Lecturer Biomedical Teaching Organisation Edinburgh Medical School: Biomedical Sciences University of Edinburgh Contact details Website: Ask me a question Website: Crispin's personal website Website: Crispin's staff profile The materials presented here draw from many scientists over many years; please see Acknowledgements, below. Chapter 1. General Introduction Practicing biologists need a strong foundation in experimental design and data analysis; these skills allow biologists to transform an idea or hypothesis into a conclusion. Interleaf 1- Biologists who found careers using statistics This page will share experiences of biologists who used their knowledge of experimental design or data analysis to find a career. We will continue to add stories over time. Chapter 2. An introduction to R This chapter provides an introduction to basic skills in R. Chapter 3. Using R to introduce basic concepts of hypothesis testing This chapter explains the logic used to test hypotheses from a frequentist perspective (i.e., the perspective most commonly taught for data analysis). Chapter 4. Plotting data In this chapter we discuss best practice for plotting data for common experimental designs in biomedical science. Chapter 5. Variance Variability is what makes biology (and life) so interesting. Chapter 6. Measuring an average with uncertainty We cannot measure anything perfectly: our measurements always include some degree of uncertainty. This chapter explains how we can describe this uncertainty when reporting and interpreting results. Specifically, we introduce the idea of ‘standard error’ and ‘confidence intervals’. Chapter 7. Comparing averages with two (or one) groups This chapter explores how to compare the average of a group to something else for simple experimental designs. Chapter 8. Abandon statistical significance This chapter explores the arguments to abandon the concept of statistical significance, and recommends alternative approaches to interpret results. Chapter 9. Experimental design ‘Experimental design’ is a huge topic, with many books devoted to the topic. The vast majority of experiments in the biological sciences, however, are based on a few foundational principles. We focus on these principles in this (and following) chapter(s) to provide the resources to design reliable, replicable and powerful experiments. Chapter 10. More experimental design: independence and pseudo-replication This chapter first describes the evidence for pseudo-replication in animal experiments. We then introduce the concepts to understand when pseudo-replication arises, why it matters, and provide advice to avoid pseudo-replication and practice to spot it in published studies. Chapter 11. Power analysis This chapter explains what power analysis is and why it is essential to design a study. Chapter 12. Questionable research practices This chapter highlights the prevalence of Questionable Research Practices for some disciplines of Biology and Psychology, explains how they lead to non-reproducible research, and discusses solutions to begin to deal with them. Chapter 14. Comparing averages between more than two groups: 1-factor models This chapter deals with the design and analysis of such 1-Factor experiments; the analyses are termed, 1-Factor General Linear Models (glm). Chapter 15. Dealing with violated assumptions This chapter focuses on data transformation. Chapter 16. Analysing experiments with multiple factors This chapter focuses on designing and analysing multi-factor experiments. Chapter 17. Understanding covariates: simple regression and analyses that combine covariates and factors This chapter introduces approaches to model continuous data as an independent variable. We refer to continuous independent variables as ‘covariates’. Interleaf 2. Practice with general linear models This chapter provides further opportunity to practice working general linear models. Chapter 18. Mixed effects models This current chapter introduces another type of effect: ‘random effects’. Mixed effects models, the subject of this chapter, combine ‘fixed’ and ‘random’ effects. Additional learning materials Other online courses Acknowledgements Crispin Jordan thanks many people who contributed to his understanding of experimental design and data analysis, challenged his thinking on the topic, and provided inspiration: This article was published on 2024-08-05