Dataset Collection

Please find a dataset collection below

General Instructions

The datasets below are available for download. They are available in raw format, or are semantically annotated using the citypulse information model. For a description of the information model, please refer to the Model Primer page. The social media datasets have not been published due to privacy policy of Twitter.

Overview

The table below shows the available datasets in the repository.

Description Duration Location (Provider) Type
Road Traffic Data 2/2014 - 6/2014
8/2014 - 9/2014
10/2014 - 11/2014
07/2015 - 10/2015
Aarhus, Denmark (Open Data Aarhus) Real
Pollution Data 8/2014 - 10/2014
2/2014 - 6/2014
8/2014 - 9/2014
Aarhus, Denmark (Open Data Aarhus)
Brasov, Romania
Brasov, Romania
Generated
Weather Data 2/2014 - 6/2014
8/2014 - 9/2014
2/2014 - 6/2014
8/2014 - 9/2014
Aarhus, Denmark (Open Data Aarhus)
Aarhus, Denmark (Open Data Aarhus)
Brasov, Romania
Brasov, Romania
Real
Cultural Event Data 5/2014 - 1/2015 Aarhus, Denmark (Open Data Aarhus) Real
Social Event Data 6/2012 - 6/2014 Surrey, UK (Municipality RSS) Real
Library Event Data 10/2013 - 6/2015 Aarhus, Denmark Real
Parking Data 5/2014 - 11/2014 Aarhus, Denmark Real

The next table allows for a quick download of all Aarhus datasets (raw format only) and can be used for temporal correlation. Clicking on the description provides information on how to retrieve the annotated datasets as well as more information on the datasets themselves.

Description Duration
2013 2014 2015 2016
9101112 123456789101112 123456789101112 123456789101112
Road Traffic Data Road Traffic Dataset-1 Road Traffic Dataset-2 Road Traffic Dataset-3 Road Traffic Dataset-4
Pollution Data Pollution Dataset-1
Weather Data Weather Dataset-1 Weather Dataset-2
Parking Data Parking Dataset-1 Parking Dataset-2
Cultural Event Data Cultural Event Dataset-1
Library Event Data Library Event Dataset-1

The next table allows for a quick download of all Brasov datasets (raw format only) and can be used for temporal correlation. Clicking on the description provides information on how to retrieve the annotated datasets as well as more information on the datasets themselves.

Description Duration
2013 2014 2015
9101112 123456789101112 123456789101112
Pollution Data Pollution Dataset-1
Weather Data Weather Dataset-1 Weather Dataset-2

Dataset List

Real-World or Generated Datasets

Vehicle Traffic, Provided by City of Aarhus in Denmark
Description A collection of datasets of vehicle traffic, observed between two points for a set duration of time over a period of 6 months (449 observation points in total). The data is available in raw (CSV) and semantically annotated format using the citypulse information model.
Metadata Download traffic metadata, which show information about the datastreams (position of each of the two sensors in the dataset, distance in meters, type of road, etc.) (click here)
Duration Dataset-1: February 2014 - June 2014
Download Options All Annotated Datasets of dataset-1 as a gzipped file [3,043,086,932 bytes]
All Raw Datasets (CSV) of dataset-1 as a gzipped file [118,994,945 bytes]
Choose individual datasets from dataset-1 from a list - generated using the metadata above
Duration Dataset-2: August 2014 - September 2014
Download Options All Annotated Datasets of dataset-2 as a gzipped file [2,589,767,939 bytes]
All Raw Datasets (CSV) of dataset-2 as a gzipped file [62,977,348 bytes]
Choose individual datasets from batch 2 from a list - generated using the metadata above
Duration Dataset-3: October 2014 - November 2014
Download Options All Annotated Datasets of dataset-3 as a gzipped file [976,925,134 bytes]
All Raw Datasets (CSV) of dataset-3 as a gzipped file [38,520,470 bytes]
Choose individual datasets from batch 3 from a list - generated using the metadata above
Duration Dataset-4: July 2015 - October 2015
Download Options All Annotated Datasets of dataset-4 as a gzipped file [976,925,134 bytes]
All Raw Datasets (CSV) of batch 3 as a gzipped file [38,520,470 bytes]
Choose individual datasets from batch 3 from a list - generated using the metadata above

Pollution Measurements (generated data)
Description A collection of pollution measurements designed to complement the vehicle traffic dataset above. For this pollution mockup stream we decided to simulate one sensor for each of the traffic sensor at the exact location of this traffic sensor. For more information on how the data was generated, please click here. The data is measured using Air Quality Index metric (449 observation points in total). The data is available in raw (CSV) and semantically annotated format using the citypulse information model.
Duration August 2014 - October 2014
Download Options All Annotated Datasets as a gzipped file [3,368,454,165 bytes]
All Raw Datasets (CSV) as a gzipped file [81,012,652 bytes]
Choose individual datasets from a map/list
Pollution Measurements for the City of Brasov in Romania
Description A collection of pollution measurements designed to complement the vehicle traffic dataset above. For this pollution mockup stream we decided to simulate one sensor for each of the traffic sensor at the exact location of this traffic sensor. For more information on how the data was generated, please click here. The data is measured using Air Quality Index metric (449 observation points in total). The data is available in raw (CSV) and semantically annotated format using the citypulse information model.
Duration August 2014 - October 2014
Download Options All Annotated Datasets as a gzipped file [3,368,454,165 bytes]
All Raw Datasets (CSV) as a gzipped file [81,012,652 bytes]
Choose individual datasets from a map/list


Parking Data Stream, Provided by City of Aarhus in Denmark
Description A datastream with parking data provided from the city of Aarhus. There are a total of 8 parking lots providing information over a period of 6 months (55.264 data points in total).
Duration May 22nd 2014 - November 4th 2014
Download Options Metadata file of location of the parking lots in the raw dataset below [521 bytes]
Raw Dataset in CSV format [3,734,561 bytes]
Annotated Dataset in TTL format [39,036,662 bytes]
Duration February 2015 - October 2015
Download Options Metadata file of location of the parking lots in the raw dataset below [521 bytes]
Raw Dataset in CSV format [3,734,561 bytes]
Annotated Dataset in TTL format [39,036,662 bytes]



Weather Data for the City of Aarhus in Denmark
Description A collection of datasets of weather observations from the city of Aarhus.
Duration February 2014 - June 2014 and August 2014 - September 2014
Download Options

Download All Datasets

Download Individual Datasets (annotated)

Description February-June 2014 August-September 2014
Dew point in degrees Celsius download download
Humidity (percentage) download download
Pressure in mBar download download
Temperature in degrees Celsius download download
Wind direction in degrees download download
Wind speed in kilometers per hour (kph) download download
Weather Data for the City of Brasov in Romania
Description A collection of datasets of weather observations from the city of Brasov.
Duration February 2014 - June 2014 and August 2014 - September 2014
Download Options

Download All Datasets

Download Individual Datasets (annotated)

Description February-June 2014 August-September 2014
Dew point in degrees Celsius download download
Humidity (percentage) download download
Pressure in mBar download download
Temperature in degrees Celsius download download
Wind direction in degrees download download
Wind speed in kilometers per hour (kph) download download
Pollution Measurements for the City of Brasov in Romania
Description A collection of pollution measurements designed to complement the vehicle traffic dataset above. For this pollution mockup stream we decided to simulate one sensor for each of the traffic sensor at the exact location of this traffic sensor. For more information on how the data was generated, please click here. The data is measured using Air Quality Index metric (449 observation points in total). The data is available in raw (CSV) and semantically annotated format using the citypulse information model.
Duration August 2014 - October 2014
Download Options All Annotated Datasets as a gzipped file [3,368,454,165 bytes]
All Raw Datasets (CSV) as a gzipped file [81,012,652 bytes]
Choose individual datasets from a map/list



Webcasted Events Dataset, Provided by City of Surrey in the United Kingdom
Description A set of webcasted social events from the municipality of Surrey, provided here as a datastream from the city of Surrey, the website is www.surreycc.public-i.tv/core/portal.
Duration June 20th 2012 - July 30th 2014
Download Options Raw Dataset in CSV format [8,783 bytes]
Annotated Dataset in TTL format [23,658 bytes]

Cultural Events Dataset, Provided by City of Aarhus in Denmark
Description A set of cultural event announcements provided as a datastream from the municipality of Aarhus.
Duration Events cover period May 5th 2014 - January 25th 2015
Scan perfomed during July 2014
Download Options Raw Dataset in CSV format [331,792 bytes]
Annotated Dataset in TTL format [100,732 bytes]

Library events, provided from the city of Aarhus in Denmark
Description A set of events hosted by libraries in Denmark (1548 events in total).
Duration October 10th 2013 - June 6th 2015
Download Options Raw Dataset in CSV format [1,762,666 bytes]
Annotated Dataset in TTL format [1,363,824 bytes]

Valid HTML 4.01 Transitional