Tutorials for MRCP candidates
Multiple Choice Questions
- Introduction
- Probability and research planning
- Clinical trials
- Prediction, diagnosis, and tests
- Miscellaneous statistical problems
- Questions raised but not answerable by the MRSC
The Mater research Support Centre provides a statistics tutorial for MRCP candidates around November of each
year, to prepare these candidates for the statistical part of the multiple choice questions examination the following March
Over the years, a number of questions have been recalled by candidates, and sample answers prepared by the
MRSC. This
web page reproduces them so they can be consulted by those preparing for the examinations
Please be reminded that the questions are as recalled by candidates, and may contain errors. Also staff of the MRSC
are not examiners, and not privileged to know the examiners' version of the correct answers. The questions and answers presented in these
pages therefore represent the best efforts to date, but cannot be taken as authoritative
nor can correctness be garanteed.
The staff of the MRSC (and future candidates of exams) would be most grateful if Anyone that can see errors in these pages
are to point these out so that they can be corrected. Corrections and feedbacks can be sent to
Allan Chang
The format used will be the question followed by the multiple choice. At the end of the
question there will be Answers . Click on this will reveal
the answer, explanations and comments
Q. What is the best way to reduce Type 2 Errors?
Answers
The suggested answer is A. Increase sample size
Type II error is defined as the probability of falsely accepting the null
hypothesis. Conceptually, it means how likely to miss a significant difference
when it is really there. Both type I and type II errors depend on the
relationship between the effect size and the sample size. For any required
effect size, Type II error decreases as sample size increases.
A. Increase sample size
B. Randomisation
C. Correction for confounders
D. Intention to treat analysis
E. blinding
Q. The 95% confidence interval of a mean is best defined as the interval:
Answers
The suggested answer is E
Standard Deviation measures where the observations will lie. Standard Error
estimates where the means of repeated sampling using the same sample size on a
similar population will lie. 95% Confidence interval of the mean therefore
represent where 95% of the mean values will lie in repeated samplings of a
similar population using the same sample size.
A. within which 95% of the means established from different samples drawn from the same population will lie
B. within which the sample mean will lie
C. within which 95% of observation will lie
D. containing 95% of the observations on the original population
E. containing 95% of the means estimated from samples drawn from the corresponding standard normal distribution
Q. Which is the role of an ethics committee in a research project?
Answers
The suggested answer is D.
The problem is that the Ethics Committee has a role in ensuring good ethics in research, and bad research is unethical research,
as it wastes resources and everyone's time, as well as putting research subjects at
discomfort and risk for no good reason. Ethics
committee therefore do also have a role to ensure feasibility and validity of research. The suggested answer is
therefore only correct if the question pertains to the most important role, or principle role, or something like that.
A. Feasibility of the study
B. Assessing scientific validity
C. Provide information to research subjects
D. Ensuring safety of research participants
E. Prevent bias
Q. Which one of the following steps in the design of a clinical trial
of a new therapeutic agent most reduces the chance of Type II error
Answers
The suggested answer is C
A, D, and E control bias. B ensures the result represents effectiveness rather than efficacy.
Type II error depends on the relationship between effect and
Standard Error, and Standard Error decreases with increasing sample size. Type II error therefore decreases with increasing sample size.
A. Random Allocation.
B. Intention to treat analysis.
C. Sample size calculation
D. Stratification of major risk factors.
E. Blinded assessment of outcomes.
Q. Which of the following best describes the reasons for obtaining informed
consent in patients entering clinical trials?
Answers
The suggested answer is E
Obtaining informed consent recognises the participant's right to make an
informed decision whether to participate or not. The best description is
therefore autonomy, although obtaining informed consent also reduces risks of
complaint and litigation.
A. beneficence
B. non malfeasance
C. justice
D. risk management
E. autonomy
Q. A new anti-HT agent is being trialed. It is being tested in 1000 patients in
50 centres for a three month period and is being compared to the standard anti-HT
agent. Which of the following best describes the nature of the trial?
Answers
The suggested answer is C, it is a phase III trial.
Phase I is the initial testing for toxicity and
dosages in the human. Phase II is a single group trial to evaluate whether the
drug is sufficiently efficacious to be worth a controlled trial. Phase III is
the definitive controlled trial against an existing treatment or a placebo.
Phase IV is sometime called the post marketing trial, and is mostly for
evaluating rare side effects.
A. It is a pre-marketing assessment
B. It is a phase IV trial
C. It is a phase III trial
D. It is a phase II trial
E. It is a phase I trial
Q. A trial was performed to determine benefits of smoking cessation, within a
group of patients with COAD who attend a particular hospital outpatient clinic.
'Intention to treat' analysis is most likely to result in:-
Answers
The suggested answer is B
Use of "intention to treat" means that all the cases recruited,
regardless of whether they actually completed the intervention, will be included
in the assigned group for analysis. The main reason is that it evaluates
"effectiveness", or how it may work in real life situation, rather
than "efficacy", how it works if the treatment is actually received in
full. It therefore underestimates efficacy because the treatment group will
contain some cases that did not complete the treatment.
A. Overestimation of efficacy of the trial
B. Underestimation of efficacy of the trial
C. Selection bias
D. Inaccuracy of the measures of outcome
Q. In a trial comparing treatment groups in asthma management, there is
found to be a 50% reduction in hospitalisation. 10% of the control group are
admitted to hospital, and 5% of the treatment group are admitted to hospital.
What is the number needed to treat to avoid one admission?
Answers
The suggested answer is D, 20
The number needed to treat is the inverse of risk difference. The risk of the
two groups are 0.1 (10%) and 0.05 (5%). The difference is risk is 0.05, and the
NNT = 1/0.05 = 20.
Please note that the 50% reduction is Relative Risk, where RR
= R1/R2 = 0.05/0.1 = 0.5 = 50%. Relative Risk is usually used in epidemiological
studies to evaluate how one parameter may affect another. NNT are used in
controlled trials of two groups, and is derived from the risk difference.
Q. Case control studies:
Answers
The suggested answer is B
Case controlled studies identify cases with an outcome of interest, then
match this case with one or more cases similar in every respect except the
causation of interest, then evaluate whether the incidence of the causation is
different in those with or without the outcome. This is a strategy to examine
uncommon outcomes
A. Are especially useful for studying increases in risk with odds rations of 1 to 2
B. Are most appropriate for uncommon events (population prevalence <1%)
C. should use community controls
D. Are the type of study most susceptible to recall bias
E. Are optimal designs for confirmatory testing of well- supported hypotheses
Q. A study of the intellectually handicapped was performed. The 112 subjects put
through program A showed an increase in their mean IQ score of 6 points. The 115
subjects put through program B, showed an increase in their mean IQ score of 4.
The p value was >0.05. Which of the following is true?
Answers
We are not sure, but D is the only one that is not incorrect
The results may or may not have demonstrated the usefulness of program A, but
it failed to demonstrate that the results are different to that from program B,
as this difference may arise by chance in more than 1 in 20 (p>0.05).
Student's t test must be used with caution in small sample size, but the sample
is never too large for the t test if the underlying distribution is normal, so
the distribution of individual values is important.
Confidence Intervals is an alternative presentation of statistical results,
and its appropriateness is not dependent on the outcome of a t test. All except
D are not correct.
By the way, this is a very badly written question, as what the p value is is
undefined (?price, protein concentration?). The guess is that it
refers to the probability of Type I Error, and should be stated as
such. At least it should be stated as the p value in testing the
null hypothesis. somebody ought to complain to the College.
A. the numbers are too large for a student t test
B. the study demonstrates the usefulness of program A
C. the distribution of the individual values is not important
D. even though the difference between the means is not significant it would be.appropriate to calculate confidence intervals
E. the above results would be found by chance in less than 1 in 20
Q. The principle aim of a Phase I trial of a cytotoxic drug is
Answers
The suggested answer is E
The maintenance of hope is not by drug trials. Define tumour response rate is
usually carried out in phase II studies. Measure survival rate is a particular
research model, usually carried out in phase III studies. Establishing the
maximum tolerated dose is a function of Phase I study. I am not sure about best
schedule of administration, but I think this is in another form of study.
A. Maintain the patient's hope that treatment is possible
B. Define the best schedule of administration
C. Define tumour response rate
D. Measure progress-free survival
E. Establish the maximum tolerable dose
Q. Below are the tabulated results of the five clinical trials for
different drugs using death as the primary outcome. Follow up time
is five years in all trials.
| Trial for | n(%) survival for drug | n(%) survival for placebo |
| Drug A (n=200) | 30 (15%) | 20 (10%) |
| Drug B (n=600) | 12 (2%) | 3 (0.5%) |
| Drug C (n=400) | 80 (20%) | 64 (16%) |
| Drug D (n=500) | 75 (15%) | 55 (11%) |
| Drug E (n=300) | 30 (10%) | 18(6%) |
The trial for which drug shows the lowest number needed to treat (NNT)?
Answers
The suggested answer is A
NNT is inverse of risk difference. The lowest NNT is therefore the highest risk
difference. Drug A is 10%-5%=5%, drug B is 2%-0.5%= 1.5%, drug C is 20%-16%=4%,
drug D=15%-11%=4%, and drug E is 10%-6%= 4%. Drug A therefore has the highest
difference in risk, therefore the lowest NNT.
A. Drug A
B. Drug B
C. Drug C
D. Drug D
E. Drug E

Answers
The suggested answer is C
The A line is to the left of B at 50% survival, so it has a shorter median survival time.
At 19 months the percentage of survival is better
Q. The following are results of assessment of a new test (B) against a definitive reference test (A)
| Test A positive | Test A negative |
| Test B positive | 9 | 9 |
| Test B negative | 1 | 91 |
|---|
Which one of the following statements regarding test B is correct?
Answers
The suggested answer is A
Sensitivity = true pos / (true pos + false neg) = 9/(9+1) = 1/10 = 0.9 = 90%
Specificity = true neg / (true neg + false Pos) = 91/(91+9) = 91/100 = 0.91 = 91%
A. The sensitivity is 90% and the specificty is 91%
B. The sensitivity is 50% and the specificty is 90%
C. The sensitivity is 10% and the specificty is 9%
D. The sensitivity is 18% and the specificty is 92%
E. The sensitivity is 90% and the specificty is 50%
Q. If pre-test probability is given, what else would u need to predict post test probability?
Answers
The suggested answer is A
Post test probability is a function of pre test probability and Likelihood
Ratio, as follows.
Pre test Odds = Pre test probability / (1-Pre test probability)
Post test Odds = Pre test Odds x Likelihood Ratio
Post test probability = Post test Odds / (1 + Post test Odds)
A. likelihood ratio
B. odds
C. sensitivity
D. specificity
E. accuracy
Q. A new test has 90% sensitivity and 80% specificity. The disorder has a
prevalence of 1 in 100. What is the positive predictive value?
Answers
The suggested answer is E
There are two methods to work this out
Method 1 : Reorganisation of the formulae
Prevalence is the proportion of the population that is outcome positive
Sensitivity is the proportion of those outcome positive that are true positives
True positive TP = Sensitivity x Prevalence
Those that are outcome negative are those not outcome positive
Outcome negative = (1-Prevalence)
Specificity is the proportion of those outcome negative that are true negatives
True negative TN = Specificity (1-Prevalence)
False Positive is the proportion of those outcome negative that are not true negatives
False Positive FP = (1-Specificity)(1-Prevalence)
PPV = TP / (TP + FN) = Sen x Prev / (Sen x Prev + (1-Spec)(1-Prev))
= 0.95 x 0.01 / (0.95 x 0.01 + 0.2 x 0.99)
= 0.0095 / (0.0095 + 0.198) = 0.0095 / 0.2075 = 0.046 or 4.6%
Method 2 : uses the known formulae associated with Bayesian probability
Likelihood Ratio for positive test = Sensitivity / (1-Specificity) = 0.95 / 0.2 = 4.74
Pre-test Odd = Pre-Test Probability / (1 - Pre-Test Probability) = 0.01 / 0.99 = 0.0101
Post-test Odd = Pre-test Odds x LR = 0.0101 x 4.74 = 0.048
Post-Test Probability = Post-Test Odds / (1 + Post-test Odds) = 0.048 / 1.048 = 0.046
or 4.6%
A. 0.2%
B. 0.5%
C. 1.0%
D. 2.0%
E. 5.0%
Q. A new diagnostoc test for a certain desease has been
evaluated. Compared with a definitive diagnostic standard, this test
has a sensitivity of 100% and specificity of 95%. The prevalence of the
disorder in the population to be tested is 0.1%. What is the best estimate
of the positive predictive value of the new test
Answers
The suggested answer is B
The same approach to the last question is used, the following is a truncated calculation.
Sensitivity = 100% = 1
Specificity = 95% = 0.95
Prevalence = 0.1% = 0.001
PPV = Sen x Prev / (Sen x Prev + (1-Spc)(1-Prev))
= 0.001 / (0.001 + 0.05 x 0.999)
= 0.001 / (0.001 + 0.04995) = 0.0196 or 2% the nearest
A. <1%
B. 2%
C. 5%
D. 10%
E. 25%
Q. In a population, those who are well at 70yrs have a 5% chance of developing
dementia over 5 years. 10% of this population is ApoE4 homogeneous or
heterogeneous, leading to a 3x increase risk of dementia. What is the proportion
of people developing Alzheimer's who do not have ApoE4?
Answers
None of the multiple choice answers is correct, but D is the nearest
This is a question on prediction. At the technical level the givens are as follows
1. prevalence or pre-test probability is 5%, in other words True Positive (TP) + False Negative (FN) = 0.05
2. Test Positive is 10%, in other words TP + false Positive (FP) = 0.1
3. The proportion of true positives in those test positive is 3 times those with false negatives in those test negative,
in other words TP / (TP+FP) = 3FN / (FN+TN)
Step 1. From the given, 5 simultaneous equations can be defined
1. TP + FN = 0.05; from the first given equation
2. FP + TN = 0.95; logically from 1
3. TP + FP = 0.1; from the second given equation
4. FN + TN = 0.9; logically from 3
5. TP / 0.1 = 3FN / 0.9; from the third given equation, 3, and 4
Step 2 progressive solving of the simultaneous equation
6. TP = 0.05 - FN; from 1
7. TP = 0.3 FN / 0.9 = FN/3; from 5
8. FN/3 = 0.05 - FN; from 6 and 7
9. FN/3 + FN = 0.05, 4FN/3=0.05, FN = 0.05 x 3 / 4 = 0.0375; from 8, in other words 3.75% of the population is false negative
10. TP = 0.0125; from 6, in other words, 1.25% of the population is true positive
11. FP = 0.0875; from 3 and 10; in other words 8.75% of the population is false positive
12. TN = 0.8625; from 2 and 11, in other words 86.25% of the population is true negative
Step 3. The question is the proportion of those with Alzheimer that are ApoE4 negative
13. Proportion of the population with Alzheimer = 0.05; from the first given equation
14. proportion of population that is test negative and with Alzheimer = 0.0375; from 9
15. Answer = 0.0375 / 0.05 = 0.75, in other words, 75%
A. <5%
B. 50%
C. 90%
D. 70%
E. 95%
Q. In a study to find out if concentration of drug X is related to weight,
subjects were given 500mg of the drug and serum levels were measured 2 hrs
later. Which of the following is the best statistical test to evaluate the results?
Answers
The suggested answer is E. However this is correct only if E refers to Pearson's Correlation Coefficient. Although
Pearson is most famous for this coefficient, he was the Professor of Statistics at Cambridge and had many things named after him.
Somebody should ask the College to be more specific.
Student's paired t test requires paired measurements of the same thing, so
cannot be done if drug concentration was not measured before administration.
Chi
Squares test usually concern proportions, so is not appropriate for
measurements.
Unpaired t test is for comparison of two groups which is absent in
this model.
Log regression (I assume this means logistic regression) is a
multivariate modelling of a binary outcome which drug concentration is not.
Pearson's correlation coefficient measures correlation between two measurements that are normally
distributed, so would be appropriate in this case if one assumes that body
weight and drug concentrations are both normally distributed.
A. Students paired t test
B. Chi squared test
C. Students unpaired t test
D. Log regression analaysis
E. Pearsons coefficient
[This is the answer to the question.]
Q. Which of the following is/are true regarding the elderly population of Australia and NZ?
Comments
A. The fastest growing segment of the population is that over 80 years of age
B. More than 15% of over 65years old live in continuing institutional care
C. Average life expectancy for an 80 year old woman is less than 3 years
D. The residual lifetime risk of proximal femur fracture in a 50 year old woman is greater than 10%
E) The ration of persons over 65 to those in the workforce is predicted to increase after 2010
Q. Which of the following statements is correct regarding SEM and SD?
Comments
None of the answers are truly correct, but B is the nearest to it. We
suspect that there is an error in transcribing the question
SEM = SD/sqrt(n), so A and C are not correct. In fact SEM decreases
with sample size. SEM=SD when n=1, and SEM=0 when n is infinity.
SD = sqrt(sum of squares / (n-1)). , therefore SD decreases with n. B is correct
up to a point as the sums of squares also increases. After a large number of
cases are collected however the SD usually stabalises..
The mean is central tendency and SD is the spread. They are different things, so
D is not correct.
Student t assumes an underlying normal distribution, so it is a parametric test.
A. SEM is calculated by taking the square root of the SD of the sample means
B. SD invariably falls with increasing sample size
C. SEM increases with sample size
D. If SD is greater than the mean the distribution is negative
E. Students t test is a non parametric test
Q. Which of the following best describes specificity?
Comments
There is possibly a transcrption error here, as none of the answers seem correct. The combinations presented refer
to True and False Positives and Negatives. A is true positive, B is false negative, C and E are true negative, and D false positive.
Specificity is the proportion of negative tests amongst those that are disease negative
a. Disease +ve, Test +ve
b. Disease +ve, Test -ve
c. Disease -ve, Test -ve
d. Test +ve, Disease -ve
e. Test -ve, Disease -ve
Q. What is the definition of the p-value?
Comments
There may have been multiple transcription errors, as neither the question nor answers make sense
p value as such remains undefined. By common usage, the p value when used
statistically in a test of the null hypothesis is an informal short hand way of saying the probability of Type I error, Fisher's p,
or the probability of error in rejecting the null hypothesis.
If I am correct in guessing what the examiner call p value, then the answer is the probability the result
being null, the smaller the p the less likely
the result to be null, the less likely the difference found arise out of chance, so the more likely a significant difference
exists.
Therefore, its not so much whether the result is correct, but whether the null hypothesis is correct.
Whether the result is correct depends on the diligence with which observations are made and recorded, and has nothing to do with statistics.
A. Probability that the result is correct
B. Probability that the result is incorrect
C. Probability that the result occurred by chance
Q. A disease has an annual incidence of 15 cases per 100,000. The
mean survival after diagnosis is five years. What is the best
estimate of prevalence of this disease
Comments
When MRSC can find the right formula the correct answer will be posted. In the mean time, the following is our best try, but may not be correct
In any year, in a population of 100,000, and assuming that there is no death
from any other causes, there are 15 cases that are new from the current year,
and at least 7.5 cases from each of the previous 5 years, so the minimum number
of cases must be 15 + 5 x 7.5 = 52.5. The prevalence must exceed this value, as
the survival rate from all the years other than the earliest of the 5 must be in
excess of 7.5. Base on this logic, 75 in E is the only possible answer.
The only worry I have with this answer is that it looks too much like a
secondary school arithmetic exercise and not a postgraduate epidemiological
problem, so I am not certain whether I am correct. However, I cannot think of
any other way of calculating the prevalence, as the shape of the survival curve
is not stated, and even assuming the survival time to be normally distributed
there is no Standard Deviation available from which to calculate probabilities.
A. 3 per 100,000.
B. 10 per 100,000.
C. 20 per 100,000.
D. 45 per 100,000.
E. 75 per 100,000.
Q. A large study was carried out comparing aspirin with placebo in
primary prevention of CHD in women (actual NEJM article), the results of which
are summarised in the graph below (this is the actual graph) It has cumulative
incidence on the Y-axis and time on the X-axis. Which statement is most correct?
Comments
The p=0.13 is not accompanied by explanation, so it is difficult to know
exactly what it is referred to. By common usage this refers to Type I error, and
means that there is a 13% chance of error in rejecting the null hypothesis. In
other words, there is a 13% chance that the difference observed was by chance
and not real. In other words, there is no significant difference between the groups, so A and D are therefore incorrect
Cumulative incidence is the proportion of positive cases amongst the
population of interest at any particular time. The cumulative incidence was 2.5%
in 10 years, so it cannot be 5% per year. E is therefore incorrect
I am uncertain of the term relative risk reduction. At end point, the two
incidences are roughly 2.2% and 2.5% The risk reduction is about 0.3%, relative
to the higher incidence 0.3/2.5 is roughly 0.12, so although I am not sure, I do
not think B is correct
By 10 years, both groups have cumulative incidences of over 2% so less than
98% are symptom free. So C is also not correct.
I do not understand this question at all, unless there are transcription errors.
The best interpretation of this data is:
A. There was a 13% chance that there was a benefit in taking aspirin to placebo.
B. The was a 23% relative risk reduction of the primary endpoint with taking
aspirin.
C. Regardless of the treatment group there was a 98% chance that the patients
would remain event free for the 10 year period of follow up
D. There was a 13% absolute risk reduction in events in the group taking aspirin
rather than placebo
E. Regardless of the treatment group there was a 5% rate of events per year