A Million News Headlines

News headlines published over a period of 17 years


This dataset contains news headlines published over a period of seventeen years.

Sourced from the reputable Australian news source ABC (Australian Broadcasting Corporation).

Agency Site: (http://www.abc.net.au)


Format: CSV ; Single File

  1. publish_date : Publication date of the article in yyyyMMdd format
  2. headline_text : Text of the headline in ASCII, English, lowercase

Start Date: 2003-02-19 ; End Date: 2019-12-31
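
As a minimal sketch of how the two columns can be read, the snippet below parses CSV text with Python's standard library and converts publish_date from yyyyMMdd into a date object. It assumes the file starts with a header row naming the two columns, which is not stated above. The function name parse_headlines and the sample rows are invented for illustration and are not taken from the dataset.

```python
import csv
import io
from datetime import date, datetime


def parse_headlines(lines):
    """Yield (publish_date, headline_text) pairs from CSV lines.

    Assumes a header row with the column names publish_date and
    headline_text (an assumption, not confirmed by the description).
    """
    reader = csv.DictReader(lines)
    for row in reader:
        # publish_date is documented as yyyyMMdd, e.g. 20030219
        published = datetime.strptime(row["publish_date"], "%Y%m%d").date()
        yield published, row["headline_text"]


# Invented sample rows, for illustration only.
sample = io.StringIO(
    "publish_date,headline_text\n"
    "20030219,example headline one\n"
    "20191231,example headline two\n"
)
rows = list(parse_headlines(sample))
```

For the real file, an open file handle can be passed in place of the in-memory sample, for instance one obtained with open(path, newline="").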


The dataset provides a summarised historical record of noteworthy events around the globe from early 2003 to the end of 2019, with a more granular focus on Australia.

This includes the entire corpus of articles published by the ABC News website in the given time range.
With a volume of about two hundred articles per day and a good focus on international news, we can be fairly certain that every significant event has been captured here.

Digging into the keywords, one can see the important episodes that shaped these years and how they evolved over time.
Examples: the Afghanistan war, the financial crisis, multiple elections, ecological disasters, terrorism, famous people, criminal activity, and so on.
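
One hedged way to look at how a keyword evolves over time is to count matching headlines per publication year. The sketch below uses plain substring matching on the lowercase headline text. The function name keyword_counts_by_year and the sample records are invented for illustration and are not real headlines from the dataset.

```python
from collections import Counter


def keyword_counts_by_year(records, keyword):
    """Count headlines containing `keyword`, grouped by publication year.

    `records` is an iterable of (publish_date, headline_text) string
    pairs, with publish_date in yyyyMMdd format.
    """
    counts = Counter()
    needle = keyword.lower()
    for publish_date, headline in records:
        if needle in headline.lower():
            # The first four characters of yyyyMMdd are the year.
            counts[int(publish_date[:4])] += 1
    return dict(sorted(counts.items()))


# Invented placeholder records, for illustration only.
sample = [
    ("20080915", "placeholder headline about financial crisis"),
    ("20081001", "financial crisis placeholder two"),
    ("20090301", "another financial crisis placeholder"),
    ("20090302", "unrelated placeholder headline"),
]
result = keyword_counts_by_year(sample, "financial crisis")
```

Substring matching is deliberately simple. It will also match the keyword inside longer words, so a tokenised or regular-expression match may suit some keywords better.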
