Updated: May 23, 2017
Semester/code: Spring 2017, JOUR 72312
Duration 15 Weeks. February 1 to May 24. No classes on February 15 and April 12.
Room/Meeting: Wednesday 5:30 pm to 8:20 pm. Room 430.
Index
- Data Journalism. Spring 2017, CUNY J-School
- Course Description
- Objectives and outcomes
- Class rules
- Diversity
- Instructors and office hours
- Assignments and Due Dates
- Grading Rubric
- Capstone and extra credit
- Readings
- Class Schedule
- Week 1 (Feb. 1)
- Week 2 (Feb. 8)
- NO class, Monday schedule (Feb. 15)
- Week 3 (Feb, 22)
- Week 4 (March 1)
- Week 5 (March 8)
- Week 6 (March 15)
- Week 7 (March 22)
- Week 8 (March 29)
- Week 9 (April 5)
- SPRINGBREAK. NO CLASS. April 12.
- Week 10 (April 19)
- Week 11 (April 26)
- Week 12 (May 3)
- Week 13 (May 10)
- Week 14 (May 17)
- Week 15 (May 24)
- Coaches
Journalists live and get paid to find the truth, serve their communities and hold the powerful accountable. Today, fortunately, journalists have access to more data than ever before, as well as better tools to understand that data, expose the stories buried in the numbers, deal with troves of documents and combine this techniques with traditional serious reporting. From Phillip Meyer’s exposes in Detroit using statistical and surveying techniques to the global story of the Panama Papers, Data Journalism has proven it’s value as a powerful way to foster positive change in modern society.
Data are everywhere. In public sources, like election results, budgets and census reports: semi public and private data sets like hidden company information; in cross referencing documents and databases to discover conflicts of interest and corruption, to social media like Facebook, Twitter updates, and image and video uploads. Journalists need to know how to find stories in data and shape them in compelling ways.
In this hands-on course you will learn to gather, clean, structure and analyze data, use statistical methods to find relevant insights and bullet proof your work to produce two data-driven stories that may be of relevance to our community.
While you learn to use different software and improve your data skills in our lab sessions, the driving theme of this course will be to do Journalism in the public interest. Students will have individual lab assignments and will work in teams of two for the two semester stories, focused on original reporting about a public policy, an agency or government authority. Each team will pitch four stories and the instructors will choose two that they will report on during the entire semester. You will choose a Topic Beat. And you will OWN it for the entire semester. In the same way ProPublica defined its strategic reporting beats for the years to come.
This three-credit course explores complex storytelling using data. Students will pitch, report, conceptualize and produce informative compelling data-driven pieces focused on current affairs, public policies or government agencies to hold power accountable. The course emphasizes:
-
Learning basic descriptive and inferential statistical methods and concepts, the foundation of solid data reporting
-
Conceptualizing clear ways to illustrate trends as part of your analysis and stories
-
Data research, collection, organization and edition while maintaining data integrity
-
Understanding the development process for creating data stories
-
Applying strong traditional reporting techniques with data reporting in a weekly basis to plan and produce high quality data journalism
-
Understanding technologies available to work with data to create online, data-driven stories and to a lesser degree data-driven interactive stories
-
Using design basics, effective visual communication, and simple data visualization tools for analysis and creation of graphics
-
Critical evaluation of professional data stories (what makes a particular project successful?)
-
Seeking out innovative uses of data
Attendance is mandatory. Data Journalism meets 15 times during the spring semester. It is important that you attend each class. If you’re sick, please let us know via email, Slack or text. If you don’t show up, you will hear from us. More than one excused absence will hurt your grade.
Communicate. If you have a problem or if you have difficulties, tell us right away, not after is too late. In journalism that’s what we do. When we have a problem we immediately call our editor.
Punctuality matters. Be on time. It shows you are a professional journalist.
Be accurate. Always double check everything. Proofread. Ask at least two classmates to review your story before handing it in. When you make mistakes, it hurts your credibility and affects your grade.
Keep up with the news. Consuming information on a daily basis leads to a healthy diet of background, helps you connect the dots and discover story ideas to work on, If you care about a topic or your subject concentration you have to stay in top of the game.
Be a pro. Honesty, courtesy, curiosity and professionalism are the core values of a journalist. Behave like one because you are a journalist. When classmates are presenting or we have guests or we are working in teams don’t multitask, focus.
Code of honor. This class follows the guidelines of the student handbook of our school. More so, in journalism plagiarism or falsification of data, sources and facts are serious crimes that can lead to failing this class. You may also be the subject of suspension, probation or expulsion, pending the decision of the School administration.
It's critical that students learn to include a diverse set of voices in their stories, something that is often glossed over when finding stories in spreadsheets and online sources. You are encouraged and expected to look for stories about and voices from communities that are underrepresented. This also applies to our classroom. Diversity requires us all to discuss differences with respect and empathy, regarding race, gender, age, religion, gender, sexual preference, disability, language, origin or political beliefs.
-
Miguel Paz — miguel.paz@journalism.cuny.edu
- Office 421 A. Thursday 11:30 am - 1:00 p.m. and 2:30 pm - 5:30 pm. By appointment: https://calendly.com/miguelpaz/officehours. Teams are encouraged to book a weekly time slot to work with with me in their projects. If you have conflicting schedule issues let me know and I will try to accommodate you.
-
Ryan McNeill — ryan.mcneill@journalism.cuny.edu
- Meeting by appointment, via email.
Communication channels: class Slack group (#general, #leads, #recommended, direct messages for private questions). SMS for urgent stuff, email for longer things.
Note: For the first 3 weeks you will file your assignments via email to miguel.paz@journalism.cuny.edu and ryan.mcneill@journalism.cuny.edu.
After Week 3 you will file your assignments using the Class Github repository. The link and instructions will be informed to you in Week 3.
The class assignments are composed by: topic beat pre-reporting and pitch of 4 story ideas for 2 story projects (5%), class participation, class work and readings (15%), homework data exercises (15%), story project 1 (30%) and story project 2 (35%).
In the first class we will organize in teams of two people (5 teams). Each team will choose a topic beat and hand in a reporting memo via email about it before the 2nd class. Include as much relevant information you can gather about the topic. The idea is that you become as close to an expert as you can. In the 2nd class each team summarizes findings.
In 3rd class each team pitches four story ideas. The Instructors will choose two story ideas per team. You will work on them for the entire semester and we will pick together which one you will do first and which second (by the time you hand in the 1st story we revisit the value of the 2nd story and see if it’s still feasible or you can propose a new one).
Pitch requirements:
You must pitch 4 story ideas in 1 single document. Each story pitch should be 1 page long, 2 pages long max.
For each story describe it in detail (1 page, 2 max. per story) by answering the following questions:
-
Team byline and beat.
-
What is your story about? Tell us in 1 headline and 1 lead paragraph.
-
What is the single question your story tries to answer?
-
Why this story is relevant ("So what?) and why now?
-
Why will this story resonate with our audience?
-
What else has been done on this topic? (Provide links) and how is your angle different or fresh?
-
List in bullets the unique investigative findings your story will reveal
-
What are the potential main characters and scenes?
-
What documents/data/audio have you already gathered?
-
List of human sources and if you have approached them already
-
Maximum/minimum.
-
What is the maximum (best) story possible?
-
What’s the minimum (fallback) story if your hypothesis doesn’t prove out?
-
-
Extra: Other questions that you can answer in a sentence: What other question is your story trying to answer? How will this story hold someone accountable? Does this story have potential to drive change?
Repeat the process for each story pitch. Keep it all in 1 document.
Deadline: Wednesday February 22 at 12 pm. You will file the pitches by sharing the link to your Pitches Google Doc via email to your Instructors. Make sure you share the document with permission to edit so we can add comments and suggest changes.
Your participation and professionalism includes attending all classes, arriving on time, returning from breaks on time, being active in discussions about readings (see list of Readings), giving feedback to your peers, and participating in all in-class hands-on activities. Repeated absences or tardiness will result in a reduction in your grade.
There will be 8 7 individual homework data exercises detailed the week before deadline, here.
- Statistics exercise 1. Deadline: Wed. March 1, noon
- Statistics exercise 2. Deadline: Wed. March 8, noon
- SQL exercise 1. Deadline: Wed. March 15, noon
- SQL exercise 2. Deadline: Wed. March 22, noon
- Statistics exercise 3. Deadline: Wed. March 29, noon
Statistics exercise 4. Deadline: Wed. April 5, noon- Cleaning and extracting data. Deadline: Wed. April 26, noon
- Documents extraction exercise. Deadline: Wed. May 3, noon
Create charts or a simple graph database or network graph visualization with data of your Story 2. Deadline: Wed. May 3, noonGeoSpatial Analysis Exercise. Deadline: Wed. May 10, noon
The main assignments for this course are two data journalism stories. For each story you will hand in reporting memos, drafts and final pieces. You will do this by working in your team stories repository in Github. If you get stuck, you can send it via email to the instructors.
Reporting memos: These are very simple documents addressed to your instructors. Their goal is to let them know what you did the past week, what problems you came across, what you need help with and what are your next reporting steps.
Include:
- Story number and reporting memo number.
- Byline
- Date
- What you did this week
- List of reporting activities done (sources contacted, data reviewed)
- Major findings
- Problems
- What you need help with
- Next reporting steps
Draft and Final Stories: Your draft and final stories must include all the traditional elements of a story, a data folder and a methodology document.
Elements of story:
- Headline
- Lead or nut graph
- Authors
- Text
- Any necessary credits, captions, etc.
- Data folder: Where you place the data you used, including clean data in spreadsheets, already normalized, sorted, manipulated.
- Methodology: Add in methodology.md file. Include list of human sources, links to every dataset and document you used, a data diary or data log detailing the steps you took while working with the data.
Story Project 1 (30%)
Breakdown:
- Reporting memo 1 (10 points)
- Reporting memo 2 (15 points)
- One draft story (25 points)
- Final story (50 points)
Requirements:
- Write a data-driven story.
- 500 – 800 words.
- Uses two or more datasets and at least 3 human sources that YOU have interviewed. This sources do not necessarily need to be directly referenced in the piece. If he or she is not referenced, you must explain in a note how this source helped you.
Deadlines:
- Reporting memo 1, Week 4. Wed. March 1, noon
- Reporting memo 2, Week 5. Wed. March 8, noon
- Draft, Week 6. Wed. March 15, noon
- Story 1 final, Week 7. Wed. March 22, noon
Story Project 2 (35%)
Breakdown:
- Reporting memo 1 (5 points)
- Reporting memo 2 (10 points)
- Reporting memo 3 (10 points)
- Story draft 1 (12 points)
- Story draft 2 (13 points)
- Final story and web presentation (50 points)
Requirements:
- A data-driven story that in its final version will be presented in a web page that will include visual or graphic elements that you propose. .
- 700 – 1000 words.
- Uses three or more datasets and at least 6 human sources that YOU have interviewed. This sources do not necessarily need to be directly referenced in the piece. If he or she is not referenced, you must explain in a note how this source helped you.
- Propose and author a web page for your story
Deadlines:
- Reporting memo 1, Week 8. Wed. March 29, noon (or pitch new story 2 if your first idea does not work anymore)
- Reporting memo 2, Week 9. Wed. April 5, noon
- Reporting memo 3, Week 10. Wed. April 19, noon
- Draft 1, Week 11. Wed. April 26, noon
- Draft 2, Week 12. Wed. May 3, noon
- Final story 2 and web presentation, Week 13. Wed. May 10, noon.
- Hand in edits of final story 2 and web page, Week 14. Wed. May 17, noon. (Class dedicated to work on web page details).
Deadlines matter. Please carefully note these rule: for every 24 hours that passes after a deadline in which you still have not turned in part of a project, an additional 10 percent deduction applies to it.
There will be no extensions except in cases of medical emergency, family emergency, or zombie invasion. Make-up work will not be offered except in extenuating circumstances.
Final grades in the course correspond to the grading scale used in the CUNY Graduate School of Journalism. Here is the minimum percentage of each letter grade:
- 97: A+ Stellar work. Ready to be published by a professional news organization with minimal changes.
- 93: A Excellent work. It is ready to be published professionally with some changes.
- 90: A- Good quality work, although it needs a slightly more significant revision to be able to be published.
- 87: B+ Solid work that shows some deficiencies that need to be solved.
- 83: B Meets certain requirements, but lacks several important elements.
- 80: B- Below average and needs strong overall improvements.
- 77: C+ Poor job. It presents many problems of structure, reporting and storytelling
- 73: C Almost unacceptable because of major overall problems.
- 70: C- Unacceptable. Does not meet the minimum requirements of a graduate level journalism project.
- Anything below a 70 is an F. Work has failed at every level. There is no D in CUNY’s grading scale.
Each assignment represents a percentage of your final grade and has points that adds up to 100 points, which equals the highest score for the assignment. The points you earn, for the different elements of the assignment, directly results in your assignment grade.
Example: Story project 1 is 30% of your final grade. On a point scale your reporting memo 1 is 10 points, the reporting memo 2 is 15 points, your story draft is 25 points and your final story is 50 points. That’s a total of 100 points. If you get 97 to 100 points in total you will have an A+ for that assignment. To determine your assignment grade, simply add up all the points you have earned for each sub-assignment.
| Assignment and percentage of final grade | Assignment units breakdown | Points |
| Topic beat pre-reporting and pitch of 4 story ideas for 2 story projects (5%) | Pre-reporting memo, written pitch of 4 story ideas | 100 |
| Class participation, class work and readings (15%) | Engaging in class and reading debates, asking questions, giving feedback to your peers, doing class exercises | 100 |
| Homework data exercises (15%) | 7 data exercises. - Statistics exercise 1 - Statistics exercise 2 - SQL exercise 1 - SQL exercise 2 - Statistics exercise 3 - Cleaning and extracting data - Documents extraction exercise | 10 points for each completed exercise. 5 points for incomplete exercise. 0 points if you don’t hand it in. |
| Story project 1 (30%). | Story 1: Reporting memo 1 | 10 |
| Story 1: Reporting memo 2 | 15 | |
| Story 1: Draft of story | 25 | |
| Story 1: Final story | 50 | |
| Story project 2 (35%) | Story 2: Reporting memo 1 | 5 |
| Story 2: Reporting memo 2 | 10 | |
| Story 2: Reporting memo 3 | 10 | |
| Story 2: Story draft 1 | 12 | |
| Story 2: Story draft 2 | 13 | |
| Story 2: Final story and web presentation | 50 |
In assessing students' work, the instructor will focus on the following factors applicable to all assignments (specific criteria for each assignment will be detailed later):
-
Process: Were the drafts iterated and improved based on faculty feedback?
-
Quality and Shine: Is it executed with skill and subtlety, and has it been edited well and polished?
-
Organization and Presentation: Is it presented clearly and in a professional manner suitable for an audience?
-
Effort and Application: Has the work been prepared with careful thought and attention to detail, and does it take appropriate advantage of the relevant tools?
-
Punctuality and Completeness: Is it on time and complete, and does it fulfill the assignment?
-
Preparedness: Has the student completed the work necessary in preparation for the discussion (viewing assigned video, completing assigned reading or tutorial)?
-
Participation: Was the student engaged in the discussion (both paying attention and participating)?
-
Effort: Did the student try to complete the exercise to better understand the lesson at hand?
-
Participation: If the exercise involves collaboration, did the student contribute?
Capstone: Students who wish to incorporate material from the class into a capstone project may be able to make special arrangements.
Extra credit: One of the instructors (Miguel) maintains a Medium publication called Notes from the Classroom, where he and interested students write about tools, lessons, short tutorials and class work. If you write one or more posts about how you used data, specific tools and methodologies to produce one of your class stories you get extra credit. This extra credit will increase by 10% to 20% your A) class participation, class work and readings grade; or B) homework assignments grade, depending on the quality and detail of the post. If you and your team members write a post, you must tell me what each one did in the project and in the written post.
Students will read selected chapters of the following books (available at the Research Center for this course).
-
"Computer-Assisted Reporting: A practical guide", 4th Edition. By Brant Houston.
-
"Numbers in the Newsroom: Using math and statistics in News", 2nd Edition. By Sarah Cohen.
-
"The investigative reporter's handbook: a guide to documents, databases, and techniques". 4th Edition. Edited by Brant Houston et al.
-
"Precision Journalism: a Reporter’s Introduction to Social Science Methods", 4th Edition. By Philip Meyer.
-
"The Functional Art: An introduction to information graphics and visualization". By Alberto Cairo.
-
"The Curious Journalist Guide to Data" (online). By Jonathan Stray.
Other recommended books available at the Research Center:
-
"Storytelling with Data". By Cole Nussbaumer Knaflic
-
"Computer-Assisted Research: Information Strategies and Tools for Journalists". By Nora Paul and Kathleen A. Hansen
-
"Mapping for Stories: A Computer-Assisted Reporting Guide". By Jennifer LaFleur and Andy Lehren
-
"The Visual Display of Quantitative Information". By Edward R. Tufte
-
"Data Points: Visualization That Means Something". By Nathan Yau
-
"Design for Information". By Isabel Meirelles
Instructors will also share tip sheets, stories and tutorials for specific classes.
What follows is a tentative schedule. This may change depending on how well you are progressing. If we need more time to work on something or change the order of a lesson plan we will do it and we will update the syllabus. So please be aware of updates from instructors.
Introduction. Researching and finding data.
Class
- Introductions
- What is data & interactive journalism?
- Syllabus review
- Topics discussion and teams
- Finding data (web and database research) and creating original datasets, with Barbara Gray. http://bit.ly/research1_gray
Homework:
Background report on assigned Topic Beat of the Trump Administration and write a Reporting Memo
Read Chapter 1 "Data Journalism: What Computer-Assisted Reporting is and Why Journalists Use it". Pages 3 – 18. And chapter 2 “Online Resources: Researching and Finding Data Online”. Pages 19 – 44. From “Computer-Assisted Reporting: A Practical Guide” by Brant Houston.
Backgrounding people and organizations, planning and pitching investigations.
Class
- Backgrounding people and organizations. Lesson
- Review of team research, group feedback (5 minutes each).
- The Pitch (examples and requirements)
- How to plan and do a Data Driven Investigation. Lesson
Homework
- Each team prepares their Pitch of 4 Story Ideas and hands it in before next class.
- Read Chapter 3 "Spreadsheets, Part 1: Basic Math for Journalists". Pages 46 – 67. Read Chapter 1: “What’s a Number?” Pages 1 - 10 from “Numbers in the Newsroom: Using math and statistics in News”, by Sarah Cohen.
Math and Statistical Methods for Reporting 1.
Class
- Pitch presentations (10 minutes each). Editors Ryan and Miguel give feedback and choose the 2 stories to work on during the semester.
- Basic numeracy: Concepts like mean and median (and proper use of each).
- Basic roadmap for evaluating statistics in reports or scientific studies.
- Math for Journalists using Excel: Sums, differences, percent change, sorting. Hands on class work.
Homework:
- Statistics Exercise 1
- Story 1 reporting memo 1
- Read Chapter 4 "Spreadsheets, Part 2: More Math that Matters". Pages 69 – 94. From “Computer-Assisted Reporting: A Practical Guide” by Brant Houston. Read Chapter 2: “A Newsroom math guide”. Pages 11 – 60 from “Numbers in the Newsroom: Using math and statistics in News”, by Sarah Cohen.
Math and Statistical Methods for Reporting 2.
Class
- Brief reporting memo feedback and homework debate
- Math for Journalists using Excel: Review last week, Rates, Ranking, Ratios, Filtering, Pivot Tables, Adjusting for Inflation, String functions.
Homework
- Story 1 reporting memo 2
- Statistics Exercise 2
- Install SQLite and read Part 1 of this tutorial
Data and Database Managers 1: Interviewing Data.
Class
- Brief reporting memo feedback and homework debate
- What’s a Database Manager
- Introduction to SQL and the use of SQLite for data analysis and reporting: Queries, Sorting, Grouping, Filter.
Homework:
- Story 1 Draft
- SQL exercise 1 (Useful tutorials: SQLite Wiki,SQL Intro, Six min. Video tutorial to create a SQLite database(don't pay attention to the last part)
Data and Database Managers 2: Interviewing and creating Databases
Class
- Interviewing Data with SQL & SQLite
- Creating your database and reporting on it to find newsworthy threads to follow.
- Newsroom work: review each team’s Story 1 Draft and require improvements to be done for handing in the Final version.
Homework
- Story 1 Final Version
- SQL exercise 2
Math and Statistical Methods for Reporting 3: SPSS
Class
- Math and Statistical Methods for Reporting using SPSS.
- Why use statistical software? The data ladder. Frequencies. Ranks and quartiles. Manipulating variables.
- Newsroom work: review of each teams Story 1 Final. Teams decide if they go with Story 2 or will propose another Story idea.
Homework:
- Story 2 Reporting Memo 1 (or Pitch for New Story 2 if it’s the case)
- Statistics Exercise 3
Math and Statistical Methods for Reporting 4: SPSS.
Class Math and Statistical Methods for Reporting 4. SPSS.
- Crosstabs, ANOVA. Indexes (USA Today diversity index). Intro to linear regression.
- Brief reporting memo feedback
Homework:
- Story 2 Reporting Memo 2
- Statistics exercise 4
- Download Open Refine, Tabula, Scraper, sign up for Import.io
Math and Statistical Methods for Reporting 5: SPSS.
Class (reinforcement of previous two sessions)
Homework
- Story 2 Reporting Memo 3
Extracting and Cleaning Data: Open Refine, Tabula & PDF extractors, Basic Scraping.
Class
- Brief reporting memo feedback
- Extracting and Cleaning Data: Open Refine. Lesson
Homework
- Story 2 Draft 1
- Extracting and cleaning data exercise
Extracting and Cleaning Data: Open Refine, Tabula & PDF extractors, Basic Scraping.
Class
- Brief reporting memo feedback
- Extracting and Cleaning Data: Tabula & PDF extractors, Basic Scraping. Lesson
Homework
- Story 2 Draft 2
- Documents extraction exercise
- Download and install Qgis
GeoSpatial Analysis. Principles and practices 1. QGis.
Class
- GeoSpatial Analysis. Principles and practices. QGis. Lesson
- Newsroom work: review each team’s Story 2 Draft and require improvements to be done for handing in the Final version.
Homework
- Story 2 Final and web presentation
GeoSpatial Analysis Exercise
GeoSpatial Analysis. Principles and practices 2. QGis.
Class
- GeoSpatial Analysis. Principles and practices. QGis. Lesson
- Newsroom work: review of each teams Story 2 Final.
- Teams brainstorm and plan how to present and publish the final story 2 in the web.
Homework
- Author web pages of stories and include interactive elements. Requirements
Class
- Hands on web page work for stories.
Demo day
Class
- Demos
- Course evals
- Wrap up
Check coaching hours in the School website.. Related to our class.
| Name | Coaching areas | Hours | Office Location | |
| Kirsti Itameri | Interactive Journalism: Design, WordPress, Illustrator, Photoshop, Social Media | Tuesdays 6:30-8:30 pm or by appointment | Newsroom | kirsti.itameri@journalism.cuny.edu |
| TC McCarthy | Interactive Journalism: Coding | Thursday 6-8 pm | Newsroom | tc.mccarthy@journalism.cuny.edu |
| Malik Singleton | Interactive Journalism: Data Storytelling, WordPress, HTML, CSS | Mondays and Thursdays, 5:30-7:30 pm | Newsroom | malik.singleton@journalism.cuny.edu |
| Nicholas Wells | Interactive Journalism: Data Storytelling, HTML, CSS, R | Tuesdays 6:00 - 8:30 pm | Newsroom | Nicholasbwells@gmail.com |