Course description
Block 1 | Block 2 | Block 3 | Block 4 |
Block 5 |
Stata 1 | Stata 2 | Stata 3
Block 1
Biostatistics I -
M. Pagano (Harvard University)
Introduces the fundamental principles of statistics applied to biomedicine.
The topics to be covered include: descriptive statistics, measures of central tendency,
probability, diagnostic testing, population and sample, comparison of proportions.
At the end of the course, students will be able to understand the descriptive statistical
methodologies which are used in clinical and epidemiological studies and to utilize the
estimates obtained from suitably selected samples, in order to draw statistical inferences.
Applied Linear Regression -
R. Bellocco (Karolinska Institutet and University of Milano-Bicocca)
The course introduces students to the practice and application of regression modeling.
Through the use of Stata®, students will learn how to fit a regression, estimate, and test regression coefficients.
Particular emphasis will be placed on the interpretation of the regression coefficients of continuous and categorical predictors.
Analysis of variance models and their correspondence with regression models will also be covered together with procedures
and issues in model selection, including confounding and interaction. Model building, goodness of fit, residual analysis,
and appropriate regression diagnostics will be discussed.
Applied Quantile Regression -
M. Bottai (Karolinska Institutet)
Quantile regression is a statistical method for the analysis of the median and other percentiles of an outcome of interest.
This increasingly popular method provides great insight in medical research and an alternative approach when other traditional methods may not be valid.
The course offers an introduction to quantile regression and its extensions through a series of real-life examples from clinical and epidemiological studies.
The focus is on interpretation and practical relevance. Based on lectures and computer activities, the course enables the participants to utilize
quantile regression as a simple tool for the
analysis of outcomes of interest with emphasis on its flexibility in the presence of non-normal outcomes, outlying values, survival and longitudinal data.
Block 2
Principles of Epidemiology -
J. Buring (Harvard University)
This course provides an introduction to the skills needed by public health professionals and clinicians to
critically interpret the epidemiologic literature. It will provide participants with the basic principles and practical
experience needed to develop these skills. This will be accomplished by covering the basic principles and methods of the design,
conduct and interpretation of epidemiologic studies, including descriptive studies, observational analytic studies (case-control and cohort),
and randomized clinical trials. In addition, the course will address the calculation and interpretation of measures of disease frequency and association;
the assessment of association versus causation in the interpretation of study results; and issues related to the evaluation of chance, bias, confounding,
and effect modification. Lectures will be complemented by seminars devoted to case studies, exercises, or critiques of relevant examples of epidemiologic studies.
Applied Logistic Regression -
D. Wypij (Harvard University)
This course introduces students to the practice and application of logistic regression modeling for binary outcomes.
Students will fit, evaluate, and interpret binary data models arising from epidemiological studies, clinical trials,
or other application areas. Topics include assessment of confounding and effect modification, use of indicator variables,
model building methods, goodness-of-fit assessment, presentation of logistic regression models for reports and publications,
and an introduction to conditional and ordinal logistic regression. Data sets from the medical and public health literature
will be used as case studies to be analyzed using the Stata® statistical package.
Missing data
in observational and randomized studies -
N.J. Horton (Smith College)
Missing data arise in most real-world situations, and can cause
bias or lead to inefficient analyses. The development of statistical
methods to address missingness has been actively pursued in recent years,
and sophisticated software to appropriately account for it
is available within Stata. In this course, participants will learn
ways to minimize missingness, the nomenclature for missing data
methods, appropriate ways to describe patterns of missing data as
well as how to account for incomplete observations using multiple
imputation and sensitivity analysis. The course will emphasize
practical skills with biomedical examples of observational studies
and randomized trials.
Block 3
Modern Epidemiology -
J. Kasperzyk (Harvard University)
This course will explore in greater depth the fundamental epidemiologic concepts introduced in Principles of Epidemiology (Week 1).
Topics will include disease surveillance, nutritional epidemiology, and a special emphasis on genetic and molecular epidemiology.
Epidemiologic examples across major chronic diseases/conditions (e.g. heart disease, metabolic syndrome, cancer) will be discussed.
Students will revisit the issues of confounding, selection bias, effect modification, and generalizability in the context of these topics.
Lectures will be augmented by workshops to illustrate practical examples in the epidemiologic literature.
The material covered in Principles of Epidemiology will be assumed of the students entering this course.
Applied Longitudinal Analysis -
G. Fitzmaurice (Harvard University)
This course focuses on methods for analyzing longitudinal and repeated measures data.
The defining feature of longitudinal studies is that measurements of the same individuals
are taken repeatedly through
time, thereby allowing the direct study of change over time. This type
of study design encompasses epidemiological follow-up studies as well
as clinical trials. The course covers many well-established methods for
the analysis of longitudinal data when the response variable is continuous.
Methods for discrete response variables (e.g., repeated binary responses
and counts) are introduced, but not emphasized.
An introductory course in biostatistics and a good background in linear
regression analysis are prerequisites for this course.
Monitoring and Evaluation of Health Programs -
M. Pagano (Harvard University)
This course covers the basic statistical tools necessary for monitoring and evaluation of health programs.
The range of topics includes assessing modern and rapid testing methods; an overview of various methodologies
and designs for estimating coverage and changes for a region; methods for evaluating subregional performance
(ie the health districts of a region); and comprehensive M&E approaches that allow for both local and regional assessment.
Emphasis will be on the practical aspects of design, analysis and presentation (including mapping), with a large number
of examples presented, and training on the use of software.
Block 4
Biostatistics II -
M. Bonetti (Bocconi University)
This course is designed to provide the student with an understanding of the foundations of biostatistics and of the various statistical
techniques that have been developed to answer research questions in the health sciences. Students will be introduced to methods for the
comparison of outcome between two groups (t-test and non parametric tests), as well as the extension to the comparison of outcome across
several groups (ANOVA); methods for the study of association between two continuous variables (correlation and linear regression);
the analysis of contingency tables; the study of survival (time-to-event) data. The afternoon sessions are devoted to discussion and
learning to use Stata® to implement materials covered in the morning lectures.
Survival Analysis -
P. Dickman (Karolinska Institutet)
The course aim is to introduce statistical methods for survival analysis, that is,
the analysis of studies where the outcome is a time-to-event. We will study methods
for estimating patient survival (life table and Kaplan-Meier methods), comparing
survival between patient subgroups (log-rank test), and modelling survival
(primarily Poisson regression and the Cox proportional hazards model).
Emphasis will be placed on describing how epidemiological cohort studies can be
analysed in the framework of survival analysis. We will study the concept of 'time'
as a potential confounder or effect modifier and discuss various approaches to defining 'time'
(e.g., time since entry, attained age, calendar time). The course will emphasise the basic
concepts of statistical modelling in epidemiology, such as controlling for confounding and
assessing effect modification. The course will consist of lectures, classroom
exercises, and computing exercises (using Stata®).
Evidence Based Public Health -
E. Savoia (Harvard University)
This course will present case studies aimed to understand how EBPH methods can be applied in the public health decision-making process.
Students will role play leadership positions during the presentation of case studies and discuss as a group the consequences of the decisions they make.
Topics will include: search for scientific evidence in public health, translation of research findings in public health practice,
methods for evaluating public health programs and policies. During the course emphasis will be given to the use of questionnaires
as tools designed to change policies and programs. Students will learn how to assess the validity and reliability of such questionnaires
using a variety of statistical methods. Knowledge of basic concepts and methods in biostatistics is required for this course.
Block 5
Causal Inference -
Andrea Rotnitzky (Harvard University and Universidad Di Tella)
Students will learn to critically evaluate the pitfalls of observational studies and of imperfect experimental studies, in particular all possible biases that can arise.
Students will learn analytic tools, based on causal diagrams and new statistical models, that can help squeeze as much evidence as these imperfect studies
carry about the causal effects of interventions, treatments and/or exposures of interest.
Randomized Clinical Trials -
D. Harrington (Harvard University)
Randomized clinical trials play a central role in the advancement
of knowledge about new and existing interventions for the prevention and treatment of disease,
and the interpretation of the results of most clinical trials requires a careful balance of
the apparent results and the limitations of the underlying study. With an aim of improving
one's ability to critically evaluate published results about clinical trials, this course
will provide an introduction to the methods used in the design, interim monitoring, and
analysis of clinical trials, including the impact of patient exclusions and other causes of incomplete data.
The course will emphasize concepts and principles, and the main focus will be on randomized Phase III (comparative) trials.
The morning sessions will consist of lectures, while the afternoon sessions will discuss case studies from recently published trials,
as well as practical experience in study design.
Statistical methods for population-based cancer survival analysis -
P. Dickman (Karolinska Institutet) &
P. Lambert (University of Leicester and Karolinska Institutet)
The course will cover central concepts, such as how to estimate and model relative survival, as well as recent methodological
developments including cure models, flexible parametric models, estimation in the presence of competing risks, and methods for
analysing data with missing covariates. Comparison of alternative methodological approaches (e.g., to estimating relative survival
and to modelling relative survival) will will be a focus of the course and participants will get the opportunity to apply and
contrast a range of methods to real data. The course will consist primarily of lectures and hands-on computing sessions with a
focus on individual instruction and discussion. Click here for further details of the course content.
Stata 1
Introduction to Stata® -
S. Venturini (Bocconi University)
This course is designed to introduce students to the basics of Stata.
It will focus on the minimum set of commands everyone should know to organize their own work.
Specific topics include data-management, data-reporting, graphics and basic use of do-files.
By the end of this one-day course, the student should be capable of using Stata independently.
Meta-analysis using Stata® -
R. D'Amico (University of Modena and Reggio Emilia)
The aim of this course is to provide an overview of methods to perform meta-analysis using Stata.
We will cover Stata commands for a variety of tasks: data preparation and input, fixed and random-effect models,
forest plots (publication quality graph), heterogeneity across studies, publications bias, sensitivity analysis,
and meta-regression models.
Tables for epidemiologists using Stata® -
N. Orsini (Karolinska Institutet)
This course is designed to introduce students to basic Stata commands useful in epidemiological research:
descriptive statistics to estimate the incidence of a binary response and to characterize the demographic
information supplied by study participants; statistical tests to identify univariate predictors associated with the binary response;
graph the incidence of a binary response as a function of a predictor; and table of standardized means and proportions.
Stata 2
Introduction to Stata® -
D. Rizzuto (Karolinska Institutet)
This course is designed to introduce students to the basics of Stata.
It will focus on the minimum set of commands everyone should know to organize their own work.
Specific topics include data-management, data-reporting, graphics and basic use of do-files.
By the end of this one-day course, the student should be capable of using Stata independently.
Flexible dose-response analysis with Stata® -
N. Orsini (Karolinska Institutet)
This course is designed to introduce students to flexible modeling of a quantitative covariate using different approaches.
Emphasis is placed on communicating results in a powerful manner using graphs. Motivating examples from top medical journals will be used during lectures and exercises.
By the end of this one-day course, the students will be capable to perform and present a dose-response analysis.
Multiple imputation of missing data with Stata® -
R. Bellocco (Karolinska Institutet and University of Milano-Bicocca)
To provide a practical overview of methods to estimate missing data. The course will introduce the basics of multiple imputation, in particular
imputation by chained equations. By the end of this one day course,
participants should be capable to analyse data by multiple imputation in Stata.
Students should have a background in linear regression methods prior to taking this course.
Stata 3
Introduction to Stata® for survival analysis -
S. Eloranta & T. Andersson
This course is designed to introduce students to basic survival analysis using Stata.
We will describe how Stata can be used to analyse time-to-event data, including an introduction to the
powerful built-in stset command. In addition, this short course will cover the topics such as the Kaplan-Meier
estimator of survival, log-rank tests, hazard functions as well as a short introduction to the Cox regression model.
Prior experience with at least one other statistical software will be helpful, but is not mandatory.
postmaster@biostatepi.org