Statistics Bootcamp

    Three days intensive training learn how to interpret and explain statistics.

    Room A218, College Building, City University London, EC1V 4PB
    26 April 2013 - 28 April 2013
    This three-day workshop is for those that already have a good understanding of data journalism (Excel) and want to learn more about understanding and interpreting statistics. This will cover linear regression, frequencies, correlation and working with continuous data.
     

    The last day of the Data Journalism Bootcamp will be the first day of the statistics course. Attendees who have not attended the previous days of the datajournalism bootcamp must have the basic Excel/spreadsheet skills before taking the course. If you have already completed a bootcamp (including current attendees) you will have the required level of skill to attend the statistics course.

    Click here to book your place
     
    Course fees:
    Full price: £400

    Course Outline

    The last day of the data journalism bootcamp will be the first day of introduction to statistical analysis for reporters.
    The course will emphasise the use of statistical reasoning and procedures to produce better in-depth, investigative journalism. As in introduction, it will assume very little, if any, statistics background.

    FRIDAY, 26 April
    9 - 10 am                Recent stories using advanced techniques – a data analysis perspective
    10 - 11 am              Interviewing your data for statistical analysis
    11 - 12:15 pm        Working with categorical variables: Frequencies, crosstabs and significance tests
    12:15 - 1:30 pm      Lunch
    1:30 – 3:00 pm       Making and mushing categorical variables: Recoding for frequencies and crosstabs
    3:00 – 4:30 pm       Moving up the ladder to continuous variables: Ranges, descriptives and pictures
    4:30 – 5 pm            Day one wrap up: Q&A and translating results for readers and viewers

    SATURDAY, 27 April
    9 – 10 am          Review/Practice
    10 – 11 am        Working with continuous variables: Correlations and scatter plots
    11 – Noon          The basics of linear regression
    Noon – 1:15 pm        Lunch
    1:15 – 3 pm        Diagnosing your regressions
    3 – 4:30 pm        Mixing it up: Categorical and continuous variables together; Analysis of Variance (ANOVA)
    4:30 – 5 pm        Day two wrap up: Q&A and translating results for readers and viewers

    SUNDAY, 28 April
    9 – 10:30 am         Going further with predictive models: Dummy variables, dichotomous variables and logistic regression
    10:30 – Noon        Combining many measures: Indexes
    Noon – 1:15pm     Lunch
    1:15 - 3 pm           Weird science: Monte Carlo simulations, log linear functions and other fun stuff
    3 – 4:30 pm           Problem solving using the Internet, spreadsheets, statistics, and SQL – the real world    
    4:30 – 5 pm           Review, Q&A and taking it back to the newsroom

    Some of the topics that will be included:
    •    Why statistics in the newsroom
    •    Types of data and variables -- The Data Ladder
    •    Descriptive statistics, frequencies, z scores and weighting
    •    Crosstabs, Chi Square and statistical significance
    •    Recoding values (rank and quartiles, creating categories)
    •    From categorical to continuous variables -- ranges and graphs
    •    Mixing categorical and continuous -- ANOVA
    •    Working with continuous variables -- correlations, scatterplots
    •    Linear regression
    •    Data transformations and issues of bad data
    •    Logistic regression (tentative, depending on Excel's capabilities)
    •    Polling and sampling
    •    Predictive models

    David Donald is the data editor at the Center for Public Integrity in Washington. His interests include financial, housing and healthcare analysis and new tools for data analysis. Prior to joining the Center in 2008, he served as training director at Investigative Reporters and Editors and the National Institute for Computer Assisted Reporting for five years. He uses data analysis to uncover fraud and governmental abuse. His stories have won three Batten Awards and two Hammet Awards for ethical and courageous journalism.

     
    But Don’t Just Take Our Word For It
    “David has been teaching data journalism in the UK since before any newspaper had a single data specialist – let alone data teams. His expertise is matched by his methodial, laid-back and humorous teaching style, which five years ago helped me grasp the basics of what has since become my trade. Now David is over in the UK trying – for what I think is the first time – the multi-day training previously only on offer in the US, I think that’s a great opportunity for anyone keen on this fast-developing field.”
    James Ball data journalist the Guardian
     
    “David Donald single-handedly opened my eyes to the power of computer-assisted reporting. His CAR training is the reason many of us do the work we do and find stories that otherwise would go unreported. From the first “aha-moment” in Excel years ago, David has been an expert guide and a patient teacher. A true master!”
    Cynthia O’Murchu, investigative reporter at the FT
     
    Read Cynthia's stories for the FT found using data journalism techniquest (no charge but registration required):
    Britain’s Private Care Faces Crisis
    BP Failures Heighten Fears at North Sea Risk
     
    Please note: All our courses are scheduled to go ahead subject to a minimum number of attendees signing up. The minimum number for each course differs and this is just to enable us to cover costs of the training venue and trainer fees where applicable.