Course Content
Critical Review
0/54
MRCPsych Paper B 2.0 Made Easy
Research Methods:

Research Methods:

1.1 The Two Foundational Questions That Define Every Study

Before we can even name a study design like "cohort" or "case-control," we must understand that these designs are simply the logical outcomes of answering two fundamental questions. Every single study, no matter how complex, is built upon the answers to these two questions.

Think of a psychiatrist trying to understand the origins of a complex disorder. The "outcome" is the diagnosis (e.g., schizophrenia). The "exposure" is a potential contributing factor (e.g., cannabis use). The psychiatrist's entire investigation will be shaped by how they answer these two questions:

  1. Where do I start my investigation? (The Grouping Strategy)
  2. Which direction in time do I look? (The Arrow of Time)

Question 1: Where Do You Start? (The Grouping Strategy)

This is the most important strategic decision a researcher makes. It dictates the entire structure of the study. The psychiatrist has two possible starting points for their investigation.

Option A: The Outcome-First Approach

Imagine a psychiatrist working in a clinic for patients with established schizophrenia. The diagnosis (the "outcome") is already known. The psychiatrist's job is to work backwards from this known outcome to understand potential causes. They start by identifying the patients with the diagnosis (the "cases"). Then, to understand what might be different about them, they need a comparison. They need to know what the past looks like for people who did not develop schizophrenia. So, they find a similar group of people from the community who are mentally well (the "controls").

The psychiatrist then investigates the past of both the cases and the controls, looking for potential risk factors (exposures). Did the patient with schizophrenia use cannabis heavily in their teens? Did they have birth complications? Did the healthy control person also have these experiences? The psychiatrist is comparing the frequency of past exposures between the "cases" and the "controls."

This is the absolute core logic of a Case-Control Study.

Case-Control Logic

  • How it works: You begin by identifying a group of people who already have the disorder you're interested in. These are your Cases. For example, you recruit 100 patients from a clinic who have been diagnosed with Panic Disorder.
  • The Crucial Comparison: You then find another group of people who are as similar as possible to your cases (in age, gender, location, etc.) but who do not have Panic Disorder. These are your Controls.
  • The Investigation: You then look backwards in time for both groups to investigate a specific exposure. For instance, you might ask everyone, "In the five years before your diagnosis (or for the controls, the last five years), did you experience a severe life event like a bereavement or job loss?"
  • The Hypothesis: If the exposure (severe life event) is a genuine risk factor for Panic Disorder, you would logically expect to find that a significantly higher percentage of the Cases had that exposure compared to the Controls.
Clinical Example: BPD & Abuse

A researcher wants to know if experiencing severe childhood abuse is a risk factor for developing Borderline Personality Disorder (BPD) in adulthood.
1. Start with the Outcome: The researcher goes to a mental health trust and identifies 100 patients with a confirmed diagnosis of BPD. These are the Cases.
2. Find a Comparison Group: They then recruit 100 people from the local community of the same age and gender who do not have BPD. These are the Controls.
3. Look Backwards: They use structured interviews and review medical records to carefully assess the history of childhood abuse for every participant in both groups.
4. The Result: They find that 75 of the 100 BPD patients (75%) have a history of severe abuse, while only 15 of the 100 controls (15%) have such a history. This strong association suggests that childhood abuse is a major risk factor for BPD.

Option B: The Exposure-First Approach

Now, imagine a different clinical question. A psychiatrist is interested in the long-term mental health effects of a specific genetic marker. They don't start with people who are already ill. Instead, they start with a large group of healthy young people. They identify those who carry the genetic marker (the "exposed" group) and those who do not (the "unexposed" group). The psychiatrist's job is to follow both groups forward in time, perhaps for decades, to see who goes on to develop a psychiatric disorder.

The psychiatrist isn't starting with a disorder that has already happened. They are starting with a group defined by their biological characteristics ("exposed" vs. "unexposed") and waiting to see what outcome develops.

This is the absolute core logic of a Cohort Study.

Cohort Logic

  • How it works: You begin by identifying a large group of people (a "cohort") who are all free of the disease at the start of the study.
  • The Crucial Grouping: You then divide this cohort into at least two groups based on whether they have the exposure you're interested in. For example, you might divide the cohort into heavy smokers (the Exposed Group) and non-smokers (the Unexposed Group).
  • The Investigation: You then follow both of these groups forward in time—for years, or even decades—to see who develops the outcome, for example, lung cancer.
  • The Hypothesis: If the exposure (smoking) is a genuine risk factor for the outcome (lung cancer), you would logically expect to see a higher rate of new lung cancer cases develop in the exposed group compared to the unexposed group over the follow-up period.
Clinical Example: Antidepressants & Bipolar

A researcher wants to know if long-term use of a specific antidepressant during adolescence (the exposure) increases the risk of developing Bipolar I Disorder in later life (the outcome).
1. Start with the Exposure: The researcher identifies a large cohort of 2,000 adolescents who are being treated for depression.
2. Form the Groups: Using pharmacy records, they divide the cohort into two groups: 1,000 adolescents who were treated with Antidepressant X for over two years (the Exposed Group) and 1,000 adolescents who were treated with other methods like therapy alone (the Unexposed Group). Crucially, no one in either group has Bipolar Disorder at the start.
3. Look Forwards: The researcher follows all 2,000 participants for 15 years, tracking their medical records to see who is eventually diagnosed with Bipolar I Disorder.
4. The Result: At the end of the 15 years, they find that 50 of the 1,000 participants in the Antidepressant X group (5%) developed Bipolar I Disorder, compared to only 10 of the 1,000 in the therapy-only group (1%). This suggests an association between the long-term use of this specific drug in adolescence and a later bipolar diagnosis.

Question 2: Which Way Do You Look?

This second question is inextricably linked to the first. Once you've chosen your starting point, it largely determines the direction of your investigation.

PAST PRESENT FUTURE Retrospective (Case-Control) Prospective (Cohort) Cross-Sectional (Snapshot)
  • Retrospective (Looking Backwards): If you choose the "Outcome-First" approach (Case-Control), you have no choice but to look backwards in time. The outcome has already occurred. Your entire investigation is about reconstructing the past. The arrow of time for your data collection points from the present to the past.
    • Strength: Fast and cheap.
    • Weakness: You are at the mercy of memory and old records. This is called recall bias. A person with a serious illness (a case) might search their memory much harder for a cause and be more likely to remember an exposure than a healthy person (a control).
  • Prospective (Looking Forwards): If you choose the "Exposure-First" approach (Cohort), your investigation is designed to move forwards. You establish your groups in the present and follow them into the future. The arrow of time for your data collection points from the present to the future.
    • Strength: This is scientifically more powerful. You measure the exposure before the outcome happens. This eliminates recall bias and establishes a clear temporal relationship (the cause came before the effect), which is a vital piece of evidence for causation.
    • Weakness: Slow and very expensive.
  • Cross-Sectional (A Snapshot in Time): There is a third option. What if a psychiatrist wants to understand the relationship between burnout and working hours among trainees? They send out a single survey. The exposure (working hours) and the outcome (burnout score) are measured at the exact same time. The psychiatrist is measuring everything in a single snapshot.
    • How it works: You take a defined population and measure both the exposure and the outcome simultaneously.
    • Clinical Example: You survey a group of 500 junior doctors. In the same questionnaire, you ask them about their current work hours (exposure) and their current score on a burnout scale (outcome). You might find that those with longer hours have higher burnout scores.
    • The Critical Weakness: You cannot tell which came first. This is the problem of temporality. Did the long hours cause the burnout? Or did the doctors who were already burning out become less efficient and therefore have to work longer hours? A cross-sectional study can show an association is present, but it is very weak at explaining why it is present.

1.2 The Three Families of Study Design

Now that we understand the two foundational questions—Where do you start? and Which way do you look?—we can see how the answers logically combine to create three distinct "families" of research. Each family has a different purpose, a different level of strength, and answers a different kind of clinical question.

The Hierarchy of Design
  1. The Descriptive Family: The goal is to describe. These studies are like a clinical observation or a preliminary report. They map the landscape of a disorder: Who gets it? What does it look like? When does it appear? They generate questions and hypotheses but do not test them.
  2. The Analytical (or Observational) Family: The goal is to find associations. These studies are like a clinical epidemiologist who observes different groups of people to see if there's a statistical link between a risk factor and a later illness. The researcher never intervenes; they only observe and compare. They answer "Why?".
  3. The Experimental Family: The goal is to determine causation. This is the most powerful family. Here, the researcher acts like a clinical trialist. They deliberately give an intervention (like a new medication) to one group but not another to prove that the intervention causes a specific change in symptoms.

1.3 The Descriptive Family

Descriptive studies are the bedrock of medical inquiry. They provide the initial observations that spark more complex research. Their function is to describe the characteristics of a sample or population, and they do not have a comparison or control group in the formal sense.

1.3.1 Case Report and Case Series

This is the simplest and most traditional form of medical writing.

  • Case Report: A detailed, narrative account of a single, interesting patient case. It might describe an unusual presentation of a common illness, a rare side effect of a medication, or a novel treatment approach for one person.
  • Case Series: A collection of individual case reports involving several patients who share similar characteristics.
  • Purpose: To be a "red flag" or an "idea generator." They alert the medical community to a potential new phenomenon.
  • Psychiatric Example: A psychiatrist working in an early intervention service notices that five young men have presented in the last month with their first episode of psychosis. During history taking, she discovers that all five had been heavily using a new, illegally-marketed vaping liquid. She writes up a paper describing the common features of these five cases: the specific psychotic symptoms, the temporal link to the vaping, and their response to antipsychotics. This is a case series.
  • Strengths:
    • Excellent for identifying rare conditions or novel adverse effects.
    • Can generate hypotheses for future, more rigorous studies (e.g., "Does this vaping liquid cause psychosis?").
  • Weaknesses:
    • There is no comparison group. We don't know how many people used the liquid and didn't get psychosis.
    • It is essentially a collection of anecdotes and cannot prove association, let alone causation. It is highly susceptible to bias and coincidence.

1.3.2 Cross-Sectional Studies (Prevalence Studies)

This is a more structured descriptive design that provides a "snapshot" of a population at a single point in time.

  • How it works: The researcher defines a population of interest and measures both the exposure(s) and the outcome(s) simultaneously. Think of it as taking a single photograph of a crowd; you can see who is tall (exposure) and who is wearing a hat (outcome) at that exact moment, but you don't know if they were born tall or if wearing a hat made them tall.
  • Purpose: Its primary purpose is to measure the prevalence of a condition. This is why they are often called prevalence studies.
  • Psychiatric Example: A public health team wants to determine the prevalence of Post-Traumatic Stress Disorder (PTSD) among firefighters in a large city. They send a one-time survey to all 2,000 firefighters. The survey contains a validated questionnaire to screen for current PTSD symptoms (the outcome) and also asks about their years of service (an exposure). The results show that 300 firefighters (15%) currently meet the criteria for PTSD. This 15% is the point prevalence. The study also shows that firefighters with more years of service have higher rates of PTSD, suggesting an association.
  • Strengths:
    • Relatively quick and inexpensive to conduct.
    • The best design for determining the prevalence of a condition.
    • Good for generating hypotheses about associations.
  • Weaknesses:
    • The cardinal weakness is the inability to determine temporality (the "chicken and egg" problem). In our example, did more years of service (exposure) lead to PTSD (outcome)? Or did firefighters who were predisposed to PTSD stay in the job longer, while others left? The snapshot can't tell us the direction of the relationship.
    • Because of this, it is very weak for testing hypotheses about causation.

1.4 The Analytical Family

This is where true hypothesis testing begins. Analytical studies always involve a comparison between groups to quantify the statistical relationship between an exposure and an outcome. The researcher is still an observer—they do not assign the exposure.

1.4.1 Case-Control Studies (The Retrospective Detective)

As we discussed, this design starts with the "disorder" (the outcome) and works backwards to find the "clues" (the exposure).

  • Design Recap:
    1. Start with the Outcome: Identify a group with the disease (Cases) and a group without the disease (Controls).
    2. Look Backwards: Compare the past exposure rates between these two groups.
  • Psychiatric Example: Researchers want to test the hypothesis that prenatal maternal stress is a risk factor for Autism Spectrum Disorder (ASD) in children.
    • Cases: They recruit 100 children with a confirmed diagnosis of ASD.
    • Controls: They recruit 100 neurotypical children, matched for age and socioeconomic status.
    • Look Backwards: They conduct detailed interviews with all the mothers, using a validated scale to quantify the level of psychosocial stress (e.g., bereavement, financial hardship, relationship distress) they experienced during their pregnancy.
    • Result: If the average stress score is significantly higher in the mothers of the ASD children compared to the mothers of the control children, it supports the hypothesis of an association.
  • Strengths:
    • The best design for studying rare diseases like ASD or schizophrenia.
    • Relatively quick and inexpensive.
  • Weaknesses:
    • Recall Bias: This is a major flaw. A mother of a child with ASD (a case) may have spent years searching for a reason and might remember the stress of her pregnancy more vividly and report it more thoroughly than the mother of a neurotypical child (a control).
    • Selection Bias: Finding an appropriate control group is extremely difficult and can heavily bias the results.
    • Cannot calculate the incidence or absolute risk of the disease.

1.4.2 Cohort Studies (The Prospective Observer)

This is the most powerful type of observational study because it moves forward in time, mirroring the natural course of events.

  • Design Recap:
    1. Start with the Exposure: Identify a group of people free from the outcome and classify them as Exposed or Unexposed to a risk factor.
    2. Look Forwards: Follow both groups over time to see who develops the outcome.
  • Psychiatric Example: A research team wants to investigate if social isolation in older adults (exposure) is a risk factor for developing dementia (outcome).
    • The Cohort: They recruit 5,000 people aged 65 and over who are all dementia-free at the start of the study.
    • Grouping by Exposure: Using a social network index, they classify the participants into two groups: 1,000 who are socially isolated (the Exposed group) and 4,000 who are socially active (the Unexposed group).
    • Follow Forwards: They follow the entire cohort for 10 years, conducting cognitive assessments every two years to identify any new (incident) cases of dementia.
    • Result: If, after 10 years, 15% of the isolated group has developed dementia compared to only 5% of the socially active group, this provides strong evidence that social isolation is a risk factor.
  • Strengths:
    • Establishes temporality—the exposure is measured before the outcome develops. This is a critical step towards inferring causality.
    • Can measure the incidence (rate of new cases) of the disease in both groups.
    • Minimizes recall bias.
  • Weaknesses:
    • Very expensive and time-consuming (can take decades).
    • Prone to attrition bias (participants dropping out over time), which can skew results.
    • Inefficient for studying rare diseases—you would need an enormous cohort to get enough cases.
The Examiner's Focus

The MRCPsych exam will frequently test your ability to distinguish between these observational designs. The key is always to ask: Where did the study start?

  • If it started by grouping people based on a disease they already had (e.g., schizophrenia vs. healthy), it's a Case-Control study.
  • If it started by grouping healthy people based on a risk factor (e.g., smokers vs. non-smokers) and then followed them, it's a Cohort study.
  • A study with two groups followed forward without randomization is a cohort study.

Q1. A two-group study was conducted to assess the efficacy of Donepezil compared to placebo for agitation in Alzheimer’s disease. The study did not mention randomisation or blinding in the methodology section. What is the most appropriate classification for this type of study?

Correct Answer (b): Two groups + Followed forward + No randomization = Cohort Study. It started by creating two groups based on their exposure (Donepezil vs Placebo) and looked forward. The lack of randomization means it is an observational cohort, not an experimental one.

1.5 The Experimental Family

In the hierarchy of evidence, experimental studies sit at the very top for questions of causation. Unlike descriptive studies that merely paint a picture, or analytical studies that observe associations, experimental studies are designed to prove that an intervention or exposure causes a specific outcome.

The defining characteristic of an experimental study is intervention. The researcher is no longer a passive observer; they actively manipulate one variable (the "exposure" or "intervention") and then measure its effect on another variable (the "outcome"). This active manipulation allows for the strongest inferences about cause and effect.

1.5.1 Randomized Controlled Trials (RCTs)

The Randomized Controlled Trial (RCT) is the cornerstone of evidence-based medicine and the most robust design for evaluating the efficacy and safety of new treatments. It is essentially a prospective cohort study where the researcher controls the exposure by randomly assigning participants to different groups.

Core Principle: Randomization
  • What it is: Randomization is the process by which participants are assigned to either the intervention group (receiving the new treatment) or the control group (receiving a placebo or standard treatment) purely by chance.
  • Why it's crucial: This is the single most important feature of an RCT. Its purpose is to create groups that are, on average, identical in every way except for the intervention they receive. By distributing both known and unknown confounding factors evenly between the groups, randomization minimizes selection bias.
Core Principle: Control Group
  • What it is: The control group serves as the baseline for comparison. Without it, you cannot know if the observed changes in the intervention group are due to the treatment itself, the natural course of the illness, or other factors (like the placebo effect).
  • Types of Control:
    • Placebo Control: An inert substance (e.g., a sugar pill).
    • Active Control: The current standard treatment. Used when it would be unethical to withhold treatment.
    • Wait-list Control: Participants are told they will receive the treatment after the study period.
Core Principle: Blinding (Masking)
  • What it is: Blinding refers to keeping participants, researchers, and/or outcome assessors unaware of which treatment arm a participant is in.
  • Levels of Blinding:
    • Single-blind: Only the participants are unaware.
    • Double-blind: Both the participants and the researchers/clinicians are unaware. This is the most common and preferred method.
    • Triple-blind: Participants, researchers, and the data analysts are all unaware.
  • Why it's crucial: Blinding prevents observer bias and participant bias.

1.5.2 Crossover Trials

A crossover trial is a specialized type of RCT that is particularly efficient for certain types of clinical questions, especially when comparing two active treatments for chronic, stable conditions.

  • Core Principle: Within-Subject Comparison: In a crossover trial, each participant receives all the interventions being tested, but in a randomized sequence. For example, in a two-treatment trial (Treatment A and Treatment B), half the participants might receive A then B, and the other half receive B then A.
  • Washout Period: Crucially, there is usually a "washout period" between the two treatments. This prevents carryover effects, where the residual effect of the first treatment influences the response to the second.
  • Strengths: High Statistical Power/Efficiency (each participant serves as their own control).
  • Weaknesses: Only suitable for chronic, stable conditions. Risk of carryover effects.

Q2. A researcher wants to compare treatment effects in the same group of individuals before and after an intervention. What study design is best?

Correct Answer (c): The phrase "in the same group of individuals before and after an intervention" directly points to the core mechanism of a crossover trial. In this design, each participant acts as their own control. Exam Hook: Same patient + Different treatments/times = Crossover Trial.

1.5.3 Uncontrolled Trials

While technically "experimental" because an intervention is applied, uncontrolled trials lack a comparison group. Often used in early-phase drug development (Phase I or II) to assess safety.

2.0 Sampling and Randomization

2.0 Introduction

A study's credibility rests on two pillars: who you study, and how you compare them. These are governed by the distinct processes of sampling and randomization. It is absolutely critical to understand that these are not the same thing.

The Soup Analogy

Think of it this way: Sampling is like inviting guests to a party from the entire city. Randomization is like flipping a coin to decide which guests sit at the red table and which sit at the blue table.

  • Sampling: Selecting a representative group (sample) from the population. Goal: Generalizability (External Validity).
  • Randomization: Allocating participants to groups within the study. Goal: Comparability (Internal Validity).

Sampling vs Randomization

2.1 Sampling

In an ideal world, to find out the prevalence of depression in the UK, we would assess every single person in the UK (a census). This is impossible. Therefore, we study a smaller, manageable group—a sample.

2.1.1 Probability (Random) Sampling

In probability sampling, every single individual in the population has a known, non-zero chance of being selected. This is the "gold standard."

  • Simple Random Sampling: Every person has an equal chance of being picked (names in a hat).
  • Systematic Sampling: You select individuals at regular intervals from a list (e.g., every 10th person).
  • Stratified Sampling: You first divide the population into "strata" (subgroups) based on a key characteristic (e.g., age, gender) and then perform simple random sampling within each stratum. This guarantees specific subgroups are properly represented.
  • Cluster Sampling: The population is divided into pre-existing "clusters" (e.g., different hospitals), and a random sample of the clusters is chosen. Then, every individual within the selected clusters is studied.
2.1.2 Non-Probability Sampling

Selection is not random. Easier and cheaper, but higher risk of bias.

  • Convenience Sampling: Selecting whoever is easiest to reach.
  • Quota Sampling: Aiming to include a certain number of participants with specific characteristics, but selecting them non-randomly.
  • Snowball Sampling (Chain-Referral): Used when the population is hard to reach, hidden, or rare. The researcher starts with one individual (the "seed") and asks them to refer others.

Snowball Sampling

The Examiner's Focus: The MRCPSYCH exam frequently tests non-probability sampling techniques, especially snowball sampling. The key trigger for this is any mention of a rare disorder or a hard-to-reach population where participants are recruited via referrals from other participants.

Q3. In a study of a rare disorder, the researcher asks participants to suggest other individuals with the same condition to include in the sample. Which technique is being used?

Correct Answer (c): The scenario describes the exact definition of snowball sampling. The key elements are a "rare disorder" and the recruitment method of "asking participants to suggest other individuals."

Q4. A 28-year-old woman with a rare autoimmune disorder is recruited for a clinical trial. Her participation involves referring other individuals with the same condition. Which recruitment strategy is MOST likely being used?

Correct Answer (b): Exam Hook: Rare disease + Participant referrals = Snowball Sampling.

2.2 Randomization

Once you have your sample, if you are conducting an RCT, you must allocate them to the treatment and control groups. Randomization ensures this allocation is free from bias.

2.2.1 Simple Randomization

Like flipping a coin. Weakness: Can lead to unequal group sizes in smaller studies.

2.2.2 Block Randomization

Participants are randomized in small "blocks" (e.g., groups of 6). This guarantees that after every block, the groups are perfectly balanced. Crucial if a trial is stopped early.

Q6. Participants are randomly assigned to groups of six during sampling. Which randomization method is described?

Correct Answer (b): The phrase "assigned to groups of six" is the textbook description of block randomization. Exam Hook: "Groups of fixed size" or "Blocks" = Block Randomization.

2.2.3 Stratified Randomization

Used when there are specific baseline characteristics (prognostic factors) that are known to strongly influence the outcome. The goal is to ensure these crucial factors are perfectly balanced.

  • How it works: First, divide participants into strata (e.g., "Mild" vs "Severe"). Then, perform randomization within each stratum.
  • Benefit: Guarantees that the number of participants with that key characteristic is equal in both groups.

Q5. A researcher ensures equal distribution of male/female participants between treatment and control groups. This method is called:

Correct Answer (c): The researcher is actively ensuring that a key characteristic—gender—is equally distributed. Exam Hook: Balancing specific covariates (Sex, Age, Severity) = Stratification.

2.3 Ecological Studies and the Ecological Fallacy

Within the family of analytical studies, there is a unique design that is important to understand, primarily because of the specific and common error in interpretation associated with it.

  • What is an Ecological Study? An observational study where the unit of analysis is a population or a group, not an individual. It compares groups with different levels of exposure using aggregated data (e.g., comparing suicide rates in different cities).
  • The Critical Flaw: The Ecological Fallacy. The logical error of assuming that an association observed at the group level must also be true at the individual level.
The Ecological Fallacy Example

A study finds that high-unemployment wards have high admission rates. The ecological fallacy would be to conclude that the individuals who are admitted are themselves unemployed. This may not be true; the stress of living in a high-unemployment area might affect everyone, regardless of their personal job status.

2.4 Matching the Clinical Question to the Best Study Design

Clinical QuestionBest Study DesignExplanation
Treatment EfficacyExperimental RCT"Can this work under ideal conditions?" Best for internal validity.
Treatment EffectivenessPragmatic RCT"Does this work in the real world?" Broader inclusion criteria.
Causation (Aetiology)Cohort or Case-ControlEthical constraints prevent randomizing to harmful exposures.
PrognosisCohort StudyFollows a group forward from a uniform point in illness.
Diagnostic AssessmentCross-sectionalCompare new test to gold standard simultaneously.
Health EconomicsCost-effectiveness studyCompares clinical outcomes and costs (QALYs).
'Meaning' / ExperienceQualitative StudiesExplores thoughts, feelings, and lived experience.

Efficacy vs. Effectiveness

Efficacy (Ideal World) vs. Effectiveness (Real World).

0% Complete