New high-resolution traffic flow dataset from UBDC

Traffic flow data has a wide variety of research applications, but the quality of existing datasets is often inconsistent due to the differing ways the data has been collected and cleaned.  

A new paper from UBDC, “High-resolution traffic flow data from the urban traffic control system in Glasgow”, co-authored by Yue Li, Mingshu Wang and Qunshan Zhao, introduces a long-term traffic flow dataset for a compact urban area, with highly detailed spatial and temporal information.

The dataset, comprising 470 files of hourly traffic flow data and a single file of geographical information for over 400 sensors, covers the Glasgow City Council area for four consecutive years spanning the COVID-19 pandemic, from October 2019 to September 2023.

Co-author Qunshan Zhao, Senior Lecturer in Urban Analytics, said: “The high-resolution traffic flow data from SCOOT (Split Cycle Offset Optimisation Technique) road sensors in Glasgow provides an excellent example of converting auto generated but messy raw traffic flow data into open and research ready data, with high spatial and temporal granularity.

“Similar traffic control systems or road sensors exist in many other cities but there are still very few equivalent datasets that have been cleaned and generated for public use. We would hope more cities can follow Glasgow City Council’s example and provide similar API access to these types of data to help better understand our living cities.

“Potential applications include traffic dynamic analysis and prediction, traffic management, infrastructure planning, and urban environment improvement. This data can also be used as a traffic flow validation source to cross-validate with other similar datasets.”

The raw traffic flow data were collected through the Glasgow open data portal and refined by a two-fold filtration process based on spatial and temporal constraints. The remaining sensors were then examined by numerical filtering.

The cleaned traffic flow data were reconstructed and aggregated into hourly intervals. Data at this level of detail can support a wide variety of research. It also helps us understand how traffic changed during a once-in-a-lifetime pandemic event.
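For readers who want a feel for the hourly aggregation step, here is a minimal sketch in Python. It is not the authors' code (that is in the repository linked below). The record layout, the sensor IDs and the sub-hourly readings are all made up for illustration, and it simply sums vehicle counts per sensor per clock hour.

```python
from collections import defaultdict
from datetime import datetime


def aggregate_hourly(readings):
    """Sum vehicle counts per sensor per clock hour.

    `readings` is an iterable of (sensor_id, iso_timestamp, count) tuples.
    This layout is hypothetical, not the published file format.
    """
    totals = defaultdict(int)
    for sensor_id, timestamp, count in readings:
        # Truncate the timestamp to the start of its hour.
        hour = datetime.fromisoformat(timestamp).replace(
            minute=0, second=0, microsecond=0
        )
        totals[(sensor_id, hour)] += count
    return dict(totals)


# Made-up readings for two hypothetical sensors.
sample = [
    ("S001", "2019-10-01T08:00:00", 12),
    ("S001", "2019-10-01T08:15:00", 15),
    ("S001", "2019-10-01T09:00:00", 9),
    ("S002", "2019-10-01T08:30:00", 4),
]

hourly = aggregate_hourly(sample)
for (sensor_id, hour), flow in sorted(hourly.items()):
    print(sensor_id, hour.isoformat(), flow)
```

A real pipeline would also need to handle missing intervals and faulty sensors, which is what the filtering steps described above are for. This sketch leaves those out.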

Preprint paper: https://osf.io/preprints/osf/qgf2j

Data: https://zenodo.org/records/12100278

Code: https://github.com/YueLi-0816/TrafficFlowData?tab=readme-ov-file
