Experiments and what to do with them

This website serves as an open-access, “video textbook” for experimental design and data analysis.

We use examples drawn from biology to illustrate and apply concepts but the principles apply generally (e.g., social sciences).

As in a book, this website organizes topics by “chapter”, but this website replaces most text with videos. All videos offer transcripts to facilitate access with a poor internet connection; slides are also available, where applicable, as are practice problems with answers.

Unless specified, all videos and downloadable materials, (except for datasets that come from elsewhere; noted within the website), are published under Creative Commons licensing (Attribution – No Derivatives). We hope educators and researchers use and share these materials, following Creative Commons licensing rules . If you use these materials for teaching purposes we ask you to please let us know (see Crispin’s contact details)!

If you have an example (or a request for subject matter) that you believe would improve this resource, please contact Crispin Jordan:

View Crispin Jordan's staff profile

Aims of the website

'Experiments and what to do with them' aims to promote Research Reproducibility across biological disciplines. In particular this website:

provides a firm introduction to essential aspects of experimental design
emphasizes the need for appropriate experimental Power
demonstrates power analysis for every type of analysis explored here (work in progress);
discusses Questionable Research Practices and how to avoid them
introduces Open Research practices that increase transparency and work towards reproducibility
follows the American Statistical Association's advice to move away from the concept of 'statistical significance'. The ASA advises that the term 'statistically significant' (and all variants thereof) be dropped entirely from statistical discourse, and this website largely adopts this advice. Specifically, in line with recommendations, we (i) continue to use p-values for inference but without reference to the arbitrary threshold of 0.05 and (ii) emphasize 'effect size' to understand effects. However, given that students continue to interact with a large body of pre-existing literature that adheres to the concept of 'statistical significance' this course cannot simply ignore this historical perspective. Therefore, we occasionally mention 'statistical significance' but also demonstrate alternative approaches that improve interpretation.

Current analyses

We focus on General Linear Models (GLMs) to implement a broad range of analyses. At the moment, this website presents:

An introduction to randomization tests, hypotheses, null distributions and p-values
T-tests
Data transformation
1-Factor GLM (i.e. very loosely speaking, “1-factor ANOVA”)
Multi-factor GLM (e.g., again, very loosely speaking, “2-factor ANOVA”)
GLM with continuous independent variables (covariates; i.e., “regression”)
GLM combining factors and covariates (i.e., very loosely speaking, “ANCOVA”)
An introduction of mixed effects models
Systematic Reviews

In the future, we will also deal with:

Different ways to calculate p-values (e.g. type 1, 2 and 3 sum of squares, AIC, etc.)
Models with multiple covariates
Model selection
More on mixed effects models
Computational methods; bootstrapping and randomization tests
Generalized Linear (Mixed) Models
Multivariate analyses
And more – please make a request!

Please see the General Introduction for further perspective on this website.

Nobody's perfect

Like any standard textbook, this website is not perfect. I record my videos 'live' (little to no editing) to maintain an approachable feel; however, as can happen in a live lecture, I occasionally misspeak. Alternatively, a video may need minor updating to accommodate new information. I have added notes to each video where I am aware that such errors occur. If you find an error, please contact me and I will address it and I will thank you for your help!

Contact Dr Crispin Jordan

If you have an example (or a request for subject matter) that you believe would improve this resource, please contact Crispin:

Dr Crispin Jordan

Lecturer

Biomedical Teaching Organisation
Edinburgh Medical School: Biomedical Sciences
University of Edinburgh

Contact details

The materials presented here draw from many scientists over many years; please see Acknowledgements, below.

Chapter 1. General Introduction

Practicing biologists need a strong foundation in experimental design and data analysis; these skills allow biologists to transform an idea or hypothesis into a conclusion.

Interleaf 1- Biologists who found careers using statistics

This page will share experiences of biologists who used their knowledge of experimental design or data analysis to find a career. We will continue to add stories over time.

Chapter 2. An introduction to R

This chapter provides an introduction to basic skills in R.

Chapter 3. Using R to introduce basic concepts of hypothesis testing

This chapter explains the logic used to test hypotheses from a frequentist perspective (i.e., the perspective most commonly taught for data analysis).

Chapter 4. Plotting data

In this chapter we discuss best practice for plotting data for common experimental designs in biomedical science.

Chapter 5. Variance

Variability is what makes biology (and life) so interesting.

Chapter 6. Measuring an average with uncertainty

We cannot measure anything perfectly: our measurements always include some degree of uncertainty. This chapter explains how we can describe this uncertainty when reporting and interpreting results. Specifically, we introduce the idea of ‘standard error’ and ‘confidence intervals’.

Chapter 7. Comparing averages with two (or one) groups

This chapter explores how to compare the average of a group to something else for simple experimental designs.

Chapter 8. Abandon statistical significance

This chapter explores the arguments to abandon the concept of statistical significance, and recommends alternative approaches to interpret results.

Chapter 9. Experimental design

‘Experimental design’ is a huge topic, with many books devoted to the topic. The vast majority of experiments in the biological sciences, however, are based on a few foundational principles. We focus on these principles in this (and following) chapter(s) to provide the resources to design reliable, replicable and powerful experiments.

Chapter 10. More experimental design: independence and pseudo-replication

This chapter first describes the evidence for pseudo-replication in animal experiments. We then introduce the concepts to understand when pseudo-replication arises, why it matters, and provide advice to avoid pseudo-replication and practice to spot it in published studies.

Aims of the website

Current analyses

Further reading

Nobody's perfect

Contact Dr Crispin Jordan

Dr Crispin Jordan

Contact details

Chapter 1. General Introduction

Interleaf 1- Biologists who found careers using statistics

Chapter 2. An introduction to R

Chapter 3. Using R to introduce basic concepts of hypothesis testing

Chapter 4. Plotting data

Chapter 5. Variance

Chapter 6. Measuring an average with uncertainty

Chapter 7. Comparing averages with two (or one) groups

Chapter 8. Abandon statistical significance

Chapter 9. Experimental design

Chapter 10. More experimental design: independence and pseudo-replication

Chapter 11. Power analysis

Chapter 12. Questionable research practices

Chapter 14. Comparing averages between more than two groups: 1-factor models

Chapter 15. Dealing with violated assumptions

Chapter 16. Analysing experiments with multiple factors

Chapter 17. Understanding covariates: simple regression and analyses that combine covariates and factors

Interleaf 2. Practice with general linear models

Chapter 18. Mixed effects models

Additional learning materials

Acknowledgements