Nice, thank you! I’m taking a brief look over the data, but I was wondering what all the headers mean exactly, and what the uncertainties on the various measurements are.
- SITE_NUMBER: ?
- SITE_ID: reference ID for the different counters. Embankment is 4632, Blackfriars is
- SERIAL_NUMBER: ? I thought this was the total cumulative count (equivalent to “cyclists this year”), but on closer inspection it doesn’t line up; it’s off by about -40000.
- DATE: midnight on the day
- TIME: time of day the cyclist went past, stored as an offset from 30/12/1899 (i.e. time past midnight on the day). So the exact timestamp would be DATE + TIME. I don’t see why these aren’t just stored as one field?
- TimeString: Time past midnight formatted as
- LANE: ? Seems only available for Embankment, at Blackfriars lane is just 1?
- DIRECTION: north/south/east/west + bound. See below for issues
- SPEED: speed in km/h (uncertainty?)
- SPEED_MPH: speed in mph (uncertainty?)
- CLASS_INDEX: identifier for class
- CLASS: ? (what is 2N)
- LENGTH: bicycle length (uncertainty?)
- WHEELBASE: distance between the center of the front and the center of the back wheel? (uncertainty?)
- VALIDITY: ?
- STRADDLE: ?
- OVERLOADED: ?
- GROSS: ?
- HEADWAY: ?
- GAP: ?
- TIME_GAP: time since the last bike in milliseconds, maximum 10 minutes
- LEGAL_STATUS: ?
- CHASSIS_CODE: ?
- TEMP: Outdoor temperature, degrees Celsius (uncertainty?)
- AX_WT1: ?
- AX_WT2: ?
- AXLE_TYPE_1: ?
- AXLE_TYPE_2: ?
- AX_SP1: ?
- FRONT_MIN_CHASSIS: ?
- FRONT_MAX_CHASSIS: ?
- MIDDLE_MIN_CHASSIS: ?
- MIDDLE_MAX_CHASSIS: ?
- REAR_MIN_CHASSIS: ?
- REAR_MAX_CHASSIS: ?
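On the DATE + TIME point in the list above, here is a minimal Python sketch of how the two fields could be merged into one timestamp. It assumes both have already been parsed into `datetime` objects and that TIME carries the 30/12/1899 placeholder date; the function name `combine_timestamp` and the example values are my own, not from the data.

```python
from datetime import datetime

# Placeholder date seen in the TIME column (assumed to be the spreadsheet epoch).
TIME_EPOCH = datetime(1899, 12, 30)


def combine_timestamp(date_value: datetime, time_value: datetime) -> datetime:
    """Add the time-of-day offset held in TIME to the midnight value in DATE."""
    return date_value + (time_value - TIME_EPOCH)


# Made-up example values, not taken from the published files.
print(combine_timestamp(datetime(2018, 7, 3), datetime(1899, 12, 30, 8, 15, 30)))
```

If TIME turns out to be stored differently (e.g. as a fraction of a day), the offset calculation would need to change accordingly.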
Also I spotted in
Tuesday, Jul 3 2018.xls which should be moved to
EDIT: Also realised Embankment and Blackfriars are the wrong way round and need to be swapped (i.e. all the Blackfriars files are in the Embankment folder and vice versa).
EDIT: The Embankment data seems heavily skewed towards eastbound. Having processed a few files, I can’t find one with more than 3% of journeys westbound, and I find it hard to believe that for every cyclist going west there are 35 going east, unless, like birds, the cyclists are migrating in the summer… The Blackfriars data also seems skewed southbound, but not as much: 70-80% of journeys are southbound.
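For anyone wanting to check the direction split themselves, this is roughly the kind of calculation I mean, as a small Python sketch. It assumes the file parses as CSV with a `DIRECTION` column; the exact spelling of the values ("Eastbound" etc.) and the sample rows are made up for illustration, and `direction_shares` is just a helper name I picked.

```python
import csv
import io
from collections import Counter


def direction_shares(rows):
    """Return {direction: fraction of rows} for an iterable of dict-like rows."""
    counts = Counter(row["DIRECTION"].strip() for row in rows)
    total = sum(counts.values())
    return {d: n / total for d, n in counts.items()} if total else {}


# Made-up sample rows standing in for one day's file (not real counter data).
sample = io.StringIO(
    "SITE_ID,DIRECTION\n"
    "4632,Eastbound\n"
    "4632,Eastbound\n"
    "4632,Eastbound\n"
    "4632,Westbound\n"
)
shares = direction_shares(csv.DictReader(sample))
for direction, share in sorted(shares.items()):
    print(f"{direction}: {share:.1%}")
```

With a real file, the `io.StringIO` sample would be replaced by an opened file handle.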
Also, I think it would be helpful if the files were all CSV (as opposed to XLS), as it’s an open format and easier for applications to parse, and if they were named with an ISO 8601 date at the beginning, in a standardised format across the different counters.
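As a rough illustration of the ISO 8601 part, the current-style names could be converted with something like the Python sketch below. It assumes every file follows the same "Weekday, Mon D YYYY" pattern as the one name I quoted and that English day/month names are in effect; `iso_date_from_name` is a hypothetical helper, and it only derives the date portion of the name.

```python
from datetime import datetime


def iso_date_from_name(filename: str) -> str:
    """Parse a name like 'Tuesday, Jul 3 2018.xls' into an ISO 8601 date string."""
    stem = filename.rsplit(".", 1)[0]  # drop the extension
    parsed = datetime.strptime(stem, "%A, %b %d %Y")
    return parsed.date().isoformat()


print(iso_date_from_name("Tuesday, Jul 3 2018.xls"))
```

Names that don’t match the pattern raise `ValueError`, which is probably what you want when checking a whole folder for stragglers.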
EDIT: been thinking about this, think a sensible naming format would be:
SSSS is the SITE_ID.
EDIT: the files are actually CSV, but I think they have been renamed to .xls, which results in Excel mangling them. Renaming them back to .csv fixes the issue.
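In case it’s useful, the renaming can be done in bulk with a few lines of Python. This is only a sketch: it assumes every `.xls` file in the folder is really CSV, and the function name and the folder path in the usage comment are hypothetical.

```python
from pathlib import Path


def restore_csv_suffix(folder: Path) -> list:
    """Rename every *.xls file in `folder` to *.csv and return the new paths."""
    renamed = []
    for path in sorted(folder.glob("*.xls")):
        target = path.with_suffix(".csv")
        if target.exists():
            continue  # never overwrite an existing .csv
        path.rename(target)
        renamed.append(target)
    return renamed


# Usage (hypothetical folder name):
# restore_csv_suffix(Path("data/Embankment"))
```

Only the extension changes; the file contents are left untouched.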
Once again thanks so much for publishing this data!