(denominator=157)
*The denominator excludes studies that explicitly state they do not report inclusion criteria due to exploratory, feasibility or other study-specific reasons.
†The denominator excludes previously excluded studies from P8, or cannot define a set number of days to be included in analysis (P8b).
GPS, Global Positioning System; PALMS, personal activity location measurement system.
In evaluating practices reported for GPS processing, only 52.9% of studies reported identifying noise and more specifically their method for noise detection (most commonly speed, elevation and satellite accuracy). Of those that include noise identification, most reported the threshold for noise detection (75.9%) and the tool used for detection (83.1%, most commonly personal activity location measurement system (PALMS)). The noise threshold was most commonly speed >130 km/hour or delta elevation >1000 m (PALMS default parameters). Only 15.7% of these studies reported the per cent of GPS points considered to be noise (median=0.4%). About 31% of studies reported whether they employed imputation. Of those who reported imputing missing GPS data, all of them reported the imputation method, and 81.6% reported the tool. Almost 70% reported the imputation threshold, but only 15.7% reported the number of imputed points (median 15.5%). Of studies that conducted data linkage with GPS data (n=132), all reported which data were linked. However, only 87.9% reported the linkage epoch (median=60 s), 61.4% reported the tool for linkage (most commonly PALMS) and 20.5% reported the data loss through the linkage process (median=11%). About 68% of studies reported the criteria for GPS data inclusion. Yet only 47.7% reported the minutes of data required to be a valid day (median=480) and 58.1% reported the number of valid days required (median=2). Over 90% reported participants lost by compliance criteria (median=3.6%). Of the included studies in this review, 81.5% reported all four GPS usage practices while <5% reported all general GPS processing practices (not including subthemes a–c/d—which were rarely reported).
When evaluating trends in reporting of GPS usage practices over time, there does not appear to be a clear pattern ( figure 2A ). However, all studies conducted prior to 2015, and in 2020 reported all practices. There does not appear to be a trend towards increased reporting over time. Likewise, temporal trends in reporting of GPS processing practices did not show a clear trend ( figure 2B ). But, overall, average reporting scores for GPS processing practices were much lower than those for GPS usage practices.
Average scores and 95% CIs for Global Positioning System (GPS) usage (high=4, low=0; A) and processing (high=5, low=0; B) practices reported, by year.
The aim of this review was to identify best practices of GPS data usage, processing and linkage in spatial health and exposure research, and assess the current state of reporting those practices. We explored the recommendations for reporting methods from best practice literature and then quantified reporting of GPS data best practice elements in published studies. To our knowledge, this is the first systematic review focused on the current state of GPS data usage, processing and linkage reporting, mirroring efforts in allied sciences to promote scientific transparency and replicability.
The themes identified in best practice manuscripts included the model/brand of GPS device used, sampling frequency, wear time, GPS data missingness, noise considerations, imputation, linkage of GPS data to a variety of other data and data inclusion criteria. These themes were each identified in at least 50% of the best practice manuscripts and were then used to assess reporting practices in our systematic review manuscripts. Of all papers included in the review, 81.5% reported GPS usage practices (P1–P3), however, only five papers (3.2%) reported on all GPS processing practices (P4–P6, not including subcomponents).
For our first practice—reporting GPS brand and model—8% of the studies in this review did not disclose the GPS device model used, while 6% did not report brand. This limits understanding of the capability or comparability of devices across studies, as devices have different levels of locational precision and varying lengths of time to acquire a signal. 20 Still, most research-related devices yield similar accuracy when unobstructed. 12 If researchers deploy previously unvalidated devices, this might be an important limitation or weakness of the study’s ability to measure relationships between location and health outcomes. Similarly, reporting GPS device wear protocols is important, as they may affect the reliability and generalisability of findings. We found that all but one study reported the amount of requested wear time, but only 68.2% reported the inclusion criteria for their data, whether that be at the data point, day or person level. Because so much processing must occur to raw GPS data, specificity in what is considered a ‘valid’ point can clarify the quality of the data, as well as assist the researcher in reporting thresholds and other aspects of data cleaning within a manuscript. While some studies operated on a subday level, understanding the parameters for inclusion of observation days is relevant because a study that requires 3 days of GPS wear time may find stronger or weaker associations between exposures and health outcomes than a study that requires 7 days. The 3-day study may overestimate or underestimate a given exposure if, for example, the observation period does not include a representative sample of participants’ extent of activity spaces (eg, weekdays only). Previous research has found that at least 14 days of valid GPS data are required to obtain a representative sample of participants’ activity spaces. 14 However, this may differ depending on the risks or outcomes of interest or the population under study. Future research is needed into GPS wear protocols and processing steps which may affect associations with specific health outcomes or exposures.
Assessment, processing and reporting of missing data via signal loss, noise or linkage was highly inconsistent among studies. Review and reporting of missing data are important aspects of assessing possible bias in a study, especially if GPS or linked data missingness leads to removal of participants from a study. Additional sources of missing data can occur when using GPS models that automatically turn off when they do not sense activity or lose satellite signal, or when a researcher decides to exclude data outside of a specific study area. We found that only 12.1% of studies reported the per cent of GPS data lost by signal loss, only 15.7% reported the per cent of GPS data considered to be noise and none reported how much data were removed when rectifying data to a specific study area (not tallied or shown in tables). Of studies reporting amount of data loss from missing GPS data, numbers ranged from 0.1% to 70% of data missing. Larger amounts of missing data may indicate a poorer estimate of GPS-derived metrics, effecting quality of a study and ability to compare results to other studies. Delineation of GPS errors due to device or satellite reception issues and methods for either removing for correcting such errors are important to report in studies because they may affect the strength of associations of environmental exposures or behavioural contexts with health outcomes. For example, noisy GPS data or missing GPS data may occur in dense urban areas, where heat island effects or air pollution may be the strongest. Further, specific participant characteristics may more often take them outside of a study area. Missing GPS data may underestimate participants’ environmental exposures and may bias the associations with the health outcome of interest. Further research is needed into the spatial variation in GPS positional errors and how they relate to specific exposures, 199 which was beyond the scope of this review. Closely related to missing data is the decision of whether to impute missing or noisy data or not. While many studies chose to ignore missing data, some research has found that this can bias results particularly when linking GPS to other data resources like accelerometers. At the minimum, reporting if imputation was performed or not (only 30.6% of studies reported this) will help in identifying if a study may be prone to potential biases. 16
Additionally, we noted that a fair amount of GPS data was lost (median 11%) through linkage to other devices or GIS data. The potential implications of these lost data varied by study type. For example, some studies were only interested in physical activity monitored by accelerometers and therefore used the accelerometer epoch as the standard for linkage. The lost GPS data (which were not matched to accelerometer data) were then likely to have minimal impact on the quantification of physical activity and potentially minimal impact on study results. However, if the focus was to assess where physical activity or in what types of environments it was occurring, the data lost due to linkage could bias the results. In an existing review of studies using GPS units with accelerometers or travel diaries, 17 had missing or unusable data ranging from 2.5% to 95% after linkage. 15 We also noted that very few studies specified how data were kept after linkage, for example, if only data that included both GPS and the linked data resource as kept, or if all GPS data were kept no matter if it had a linkage. Although beyond the scope of this review, we further note that simplistic linkage of GPS data to other sources may lead to uncertainty in estimates of exposure. For example, some studies used a simple intersection between GPS and GIS layers to determine exposure, which assumes that there is no positional error in the GPS or GIS data. This may lead to misclassification of exposure. For example, in a study evaluating time spent in a park, if the GPS points fall outside of the GIS park perimeter, linkage may misrepresent exposure time. Sensitivity analyses based on varying distance thresholds may help determine how variations in distance between GPS data and GIS layers may bias exposure estimates. 200 201 It is possible that missing data which results in removal or participants may bias the results of studies (although this was not formally assessed here). Very few manuscripts performed analysis comparing characteristics of their retained sample compared with participants who had to be removed due to either GPS missing/noisy data or linkage issues (eg, missing at random analysis), with notable exceptions (eg, ref 9 ). In fact, few manuscripts reported how many participants were lost due to GPS data issues, instead amalgamating all lost participants together, regardless of reason for exclusion.
To promote reporting of practices and methods in this research area, we created an example table for best practice reporting ( online supplemental table S1 ). This table provides examples for ways to report practices, reviewers to evaluate and readers to identify GPS data considerations and potential biases. The table was designed based on real examples from the reviewed literature and makes use of the reporting themes identified in this review.
While attempting to carry out an exhaustive review, there were certain aspects of GPS reporting and processing which we were not able to evaluate. For example, we did not consider the necessity of differential reporting for certain health outcomes. Future research may wish to provide guidance on an outcome-by-outcome basis (eg, for physical activity, depression, asthma). Moreover, our findings were not separated by subpopulations being studied (eg, child vs adult), though we do understand the need to modify methods should they not be appropriate for the population of interest. Thus, future research may wish to review best practices for each subpopulation and provide relevant guidance. Another potential limitation to our review was the omission of studies using mobile phone apps and smartwatches to collect GPS data. Although these devices are becoming commonplace, the decision was made to focus on ‘research grade’ GPS devices due to mobile phone apps often unknown calibration, source(s) of locational data and lack of homogeneity among these apps. However, by identifying best practices among research-related GPS devices, these practices can be transferred as applicable to mobile phone or smartwatch data collection and processing. Last, our restriction to evaluating studies published in English only is a limitation of this review and future studies pulling from non-English literature would be valuable.
Though this review was focused on identifying best practices and assessing the current state of reporting on those practices, several ancillary areas of future research remain. For example, it is unknown how much uncertainty not correcting or removing locational noise may be introduced to the estimation of exposure or how GPS wear protocols and processing steps could affect the detected associations with health outcomes. Future research may usefully estimate the magnitude of each of these practices and/or data loss on overall uncertainty or bias using a meta-analysis or similar approach. Perhaps the most evident need in future research, based on our findings, is a consensus on which practices should be reported, regardless of study design or research focus, and which practices may be optional. As mentioned in the Introduction section, this second step in our research process will make use of the themes identified in the current systematic review in order to build consensus among experts. With such a consensus, future geospatial health and exposure research will be more comparable, reliable and reproducible.
Because studies using GPS data may be used to quantify harmful exposures, and thus inform policies aimed at protecting the public from those exposures, the designation of minimum reporting for comparisons across studies would allow us to ensure that policies are based on the best available science. Furthermore, enabling meta-analyses to pool findings and create best guidance for policy could be afforded by efforts to standardise reporting.
In summary, because there is currently no consensus for the optimal use or reporting of GPS data in spatial health and exposure research, studies tend to report what they feel is essential, yielding such variety that comparisons across studies are challenging. Throughout this review process, we found a lack of consistency in both reporting and methods. Some manuscripts were meticulous in identifying and reporting their process and procedures, either in the main text or appendix. For other manuscripts, we had considerable difficulty finding processing decisions, criteria or other critical information. This review underscores that the current state of GPS usage and processing practices reporting has significant room for improvement. Details pertaining to acquiring and processing of GPS data are vital so that future studies can fully assess the methods used, identify quality of data inclusion, compile findings in a meta-analysis or draw comparisons across studies.
Acknowledgments.
The authors thank Dr David Berrigan, Phil Hurvitz and Steve Mooney for help in conceptualising this research and providing critical feedback.
Correction notice: This article has been corrected since it was published. The funding grant number has been corrected.
Contributors: ALP: conceptualisation, data curation, investigation, methodology, formal analysis, resources, visualisation, writing—original draft, writing—review and editing. CDB, CT, J-AY: data curation, writing—review and editing. KP: conceptualisation, writing—review and editing. MMJ: conceptualisation, data curation, investigation, methodology, resources, visualisation, writing—original draft, writing—review and editing, guaranteed the content of the manuscript.
Funding: AP is funded by the National Cancer Institute (NCI) of the NationalInstitutes of Health (NIH) R01 CA239187.
Competing interests: None declared.
Patient and public involvement: Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Provenance and peer review: Not commissioned; externally peer reviewed.
Supplemental material: This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
Ethics statements, patient consent for publication.
Not applicable.
A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.
Chapter: chapter 2 - literature review.
Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
5 Probe and Cellular GPS Data Collection Methods Vehicle location information can be collected or extracted in several different ways: ⢠In-vehicle embedded devices. ⢠GPS devices embedded in cell phones. ⢠Cell phone service providersâ networks. ⢠Cell phone applications. In-Vehicle Embedded Devices In-vehicle embedded devices are part of vehicle manufacturer-built systems that collect vehicle telematics data including vehicle location, vehicle performance, and driver behavior. Unlike handheld devices, vehicles are not restrained by the small form factor and therefore necessary components can be distributed throughout the vehicle, while leveraging a vehicleâs power system for operation and vehicle velocity information for enhancing location information (Jagoe 2003). Location information is captured by fusing in-vehicle GPS receiver information and Dead Reckoning (DR) generated information. The DR method uses a previously determined position and the vehicleâs velocity to approximate the vehicleâs location (Cho and Choi 2006). GPS Devices Embedded in Cell Phones Most cell phone devices are equipped with a GPS receiver that can obtain navigation messages from at least three satellites to determine the deviceâs latitude and longitude (and altitude if using a fourth satellite signal). In addition to receiving a GPS signal, many devices utilize a method called assisted-GPS where wireless network information is used to supplement GPS information. This approach reduces the power consumed by the headset, optimizing start-up and acquisition time, and increasing the sensitivity of the GPS device (Barnes 2003). Cell Phone Service Providersâ Networks Cell phone service providers have the capability of tracking the location of individual devices as they move through the network. There are several network-level tracking methods generally used to locate a cell phone device. The Cell of Origin method approximates a userâs location based on the location of the base station serving the mobile device. This method has low accuracy of 150 to 10,000 meters based on individual cell size in the network (Barnes 2003). This method is supplemented by timing C H A P T E R 2 Literature Review
6 Use of Vehicle Probe and Cellular GPS Data by State Departments of Transportation advance information which calculates the time difference between the start of a radio frame and a data burst, which is used to further approximate a userâs location within a cell (Jagoe 2003). The Estimated Time of Arrival and Angle of Arrival methods use the time necessary for a deviceâs signal to reach multiple base stations and the direction from which the signal arrives, respectively. This information is then sent to a mobile switch where it is analyzed to generate the approximate location of the mobile device (Barnes 2003, Jagoe 2003). Cell Phone Applications Cell phone applications or apps often utilize device location (collected using a deviceâs GPS receiver and network information) to enhance user experience. When users install social media, weather, or similar apps that use location information for other purposes, their location informa- tion is captured and used to provide additional services and often resold to third parties (Huang et al. 2018). History of Probe Data for Transportation Applications Probe-Based Speed Data In the early 2000s, transportation agencies identified a need to supplement data from the traditional static sensor technology to better evaluate real-time and historical performance of the transportation system. Early efforts included generating floating car data using a single vehicle equipped with a GPS unit traversing corridors at specific times. The resulting GPS points were snapped to an underlying map to generate metrics such as average speed, travel time, and congestion index (Tong et al. 2005). The use of this floating car method was intermittent, tedious, and time-consuming, and the method proved to be only marginally useful (Pack et al. 2019). Radio camera positioning of cellular phones was also being developed at this same time. This technology used the signal strength of the cell phone at the antenna site and various forms of triangulation to identify the position and speeds of vehicles for use in traveler information and traffic control (Smith et al. 2001). Other methods relied on the cooperation of cellular carriers to mine the signal timing and handoff data emanating from a cellular tower-switching network (Myr 2003). Some agencies have been generating their own probe vehicle data using toll tags. For example, the Florida DOT (FDOT), the Texas DOT, and toll authorities around the country use toll tags to identify and re-identify vehicles as they traverse the toll facility and use that information to calculate speed and travel time between re-identification points (Pack et al. 2019). In 2006, new vehicle probe technology was emerging as a means of continuously monitoring traffic as commercial firms were offering traffic data services based on a variety of methods, the most common of which were based on either cellular geo-location or fleet GPS-based telematics reporting technologies. (Center for Advanced Transportation Technology, University of Maryland and KMJ Consulting Inc. 2011). The use of probe vehicle data for real time operations and planning among agencies was expanded to a larger number of agencies when The Eastern Transportation Coalition (formerly the I-95 Corridor Coalition) created a probe vehicle data marketplace in 2007 (University of Maryland 2007). This request for proposal (RFP) outlined desired requirements for both data and quality that propelled use of probe vehicle data among coalition member agencies and later other agencies across the country. It also resulted in a model data use agreement that has been cited in many publications and RFPs in the years following this initial RFP.
Literature Review 7 Commercial speed data providers have moved away from using a single method, technology, or source to generate probe data, to combining probe data from multiple sources and technolo- gies to create a comprehensive traffic information service (Center for Advanced Transportation Technology, University of Maryland and KMJ Consulting Inc. 2011). These offerings have largely replaced the need for fixed roadside infrastructure to support speed data collection. Origin-Destination Data O-D studies have been a core activity in the world of transportation to better understand travel patterns, identify traffic congestion sources and potential traffic control strategies, better plan transit services, and plan urban development for almost 100 years (McClintock 1927, Mickle 1944, Braff 1948, Blucher 1950). Traditionally, these studies relied on data collected through spotter observations, as well as paper-based and web surveys of travelers. As a result, they repre- sented a small sample of the population and required significant time and effort to administer and process, often producing results many years after commencing the study (Williams 1986, Hartgen 1992, Richardson 2003). For example, even the surveys themselves could be time-consuming because to accurately capture demand variability, multi-day surveys and, more specifically, a 2-week survey duration are necessary (Senbil and Kitamura 2009). Vehicle Telematics-Based Origin-Destination Data With the emergence of probe speed data in the mid-2000s, commercial firms such as AirSage, HERE, and INRIX began capturing O-D data and delivering that data as a product to agencies (Allos et al. 2014). The cost and latency of these products were lower than those of traditional surveys, and sampling rates were higher. Location-Based Services Data In the late 1990s and early 2000s, telecommunications companies, application developers, and content providers were looking to leverage the emerging LBS capabilities aggregated from multiple sensors in mobile devices (Myllymaki and Edlund 2002). Additional technology and algorithm advancement led to more accurate and easier ways to locate mobile devices and track device paths (Barbeau et al. 2008). These developments led to further refinements of the LBS, and produced data to not only identify locations and paths of users and their devices, but also to infer the mode of transportation to differentiate between cars, buses, and other modes (Byon, Abdulhai, and Shalaby 2009). In recent years, the proliferation of mobile apps (like social media apps and weather apps) that capture device location information enhanced the volume of available location data. For exam- ple, location-based social networks enable uploading geotagged content (e.g., photos, videos, and recorded routes), sharing the present location (e.g., âcheck-inâ at Foursquare), commenting on an event at the place where it is happening (e.g., via Twitter), or leaving ratings/tips/reviews for a location (e.g., a restaurant) (Huang et al. 2018). Today, location information extracted from social media and other mobile device applications allows agencies and the private sector to analyze a variety of transportation problems, such as parking supply and demand implications (Mondschein et al. 2020), trip purpose, the reliability of bike share programs (Svartzman et al. 2020), communications during natural disasters affecting the transportation system (Lovari and Bowen 2020, Roy et al. 2020), the analysis of transit services and ride hailing platforms (Kim et al. 2019), safety and mobility performance in smart cities (Oh et al. 2019), and others. Agencies began getting access to O-D and trajectory data based on LBS. Third-party data providers such as StreetLight collect and aggregate LBS data and package it for use by the agencies (Lee and Sener 2017).
8 Use of Vehicle Probe and Cellular GPS Data by State Departments of Transportation Trajectory Data In the last several years, some of the vehicle telematics-based O-D data providers began adding value to the traditional O-D data sets by including associated trajectory data. The new products consist of origins and destinations, as well as waypoints captured between those origins and destinations (Petrone and Franz 2018, Fan et al. 2019). Combining O-D and trajec- tory information provides a comprehensive view of âtripsâ taken by individual vehicles. These trips provide valuable information previously unavailable, such as routes taken, time to traverse those routes, and variation in travel time and speed along the route. Most of these O-D and trajectory data offerings are sourced directly from vehicle telematics available from integrated sensors and devices from vehicle original equipment manufacturers and vehicle fleets (Sharma et al. 2017). Types and Properties of Current Probe and Cellular GPS Data Available The literature review covered three main classes of probe and cellular GPS data: speed, O-D, and trajectory (sourced from telematics and LBS data). Speed Data Probe-based speed data are derived from vehicles and mobile devices equipped with embedded GPS devices. Speeds and travel times provide real-time or historic average speeds on one or more road segments. To be useful, probe vehicle data must be conflated to a roadway network. Initial commercial offerings relied on Traffic Message Channel (TMC) codes, a reference system designed to give a unique alpha-numeric code to each road segment for purpose of assigning traffic information to that segment (HERE 2015). These TMC codes are typically assigned at significant decision points, interchanges or intersections in an unambiguous format, independent of map vendor. The North America Location Code Alliance created, owns, maintains, and expands the U.S./Canada TMC location code table that adheres to the international standard on location referencing (INRIX 2018). While being the only standardized coding method to uniquely identify roadway segments for the purpose of conveying traffic and other information, the TMC code segmentation reached its limitations in road coverage, the ability to cover new roads more quickly, segment overlap and gapping, and segment resolution, as probe data have become denser and more granular. INRIX introduced XD segments in 2013, a proprietary segmentation that addresses some of the TMC code limitations (Young et al. 2015, INRIX 2018). Similarly, other data providers are offering sub-TMC products to address TMC segmentation limitations (HERE 2015, TomTom 2015). The National Performance Management Research Data Set In July 2013, the Federal Highway Administration (FHWA) procured the NPMRDS to support the Freight Performance Measures and Urban Congestion Report programs, as well as the Moving Ahead for Progress in the 21st Century Act (MAP-21) performance management activ- ities. This data set consists of actual observed average travel times every 5 minutes, 24 hours a day, 7 days a week covering the National Highway System (NHS) as defined by MAP-21 and delivered monthly starting with October 2011. The average travel time data are sepa- rated for freight, passenger, and all traffic. The first version of the NPMRDS was provided by HERE Inc. and sourced from mobile phones, vehicles, and portable navigation devices for passenger vehicles, and from the American Transportation Research Instituteâs leveraging of embedded fleet systems for freight vehicles (FHWA Office of Operations and Resource Center).
Literature Review 9 Since February 2017, data are provided by a team led by the University of Maryland Center for Advanced Transportation Technology (Center for Advanced Transportation Technology). The speed and travel time data are provided by INRIX, which leverages its existing source data, including millions of connected vehicles, trucks, and mobile devices that anonymously supply location and movement data. The data cover more than 400,000 segments provided in 5-minute intervals, 24 hours a day (NPMRDS FAQ 2020). Origin-Destination Data Probe-based O-D data contain basic information about trips between two geographic points. It does not contain information about specific routes between those two geographic endpoints. The geographic endpoints are reported as latitude/longitude pairs, but often generalized to zones. The O-D data set provides counts of trips between the origin zone and destination zone for a selected time period, such as workday a.m. peak period. The O-D data may also be broken down by travel mode and vehicle class type as well, such as passenger vehicles, heavy vehicles, medium vehicles, bikes, or pedestrians. (Pack et al. 2019, Southwest Washington Regional Transportation Council 2019, StreetLight 2020). Probe-based O-D data can be combined with traditionally available geodemographic infor- mation for origin and destination zones (Martin et al. 2018). Data may include standard census statistics as well as expanded data elements, such as population size and density, employment statistics, average income, and age, gender, race, and occupation statistics for a zone (Kim et al. 2012, Lawson 2018, StreetLight 2020). Trajectory Data Whereas O-D data provide two data points per trip, trajectory data provide information about the origin, destination, and routes taken between those endpoints. Trajectory data are timestamped location data from vehicles, cell phones, and other GPS-enabled devices throughout the network, often referred as âbread-crumb trailâ data. Other data elements can include (Pack et al. 2019): ⢠Unique device/vehicle ID, ⢠Unique trip ID, ⢠Departure time and location (trip origin), ⢠Arrival time and location (trip destination), ⢠Periodic âwaypointsâ during the trip, including latitude/longitude, where the period can be multiple times per second or once every X minutes, ⢠Instantaneous speed/heading, ⢠ID of road segment for the waypoint, and ⢠Data that describe the path and routes of a trip from an origin to a destination. Trajectory data are available everywhere that probe speed data are collected, which means that it has ubiquitous coverage not otherwise available using traditional O-D surveying and floating car studies. The capture rate for trajectory data varies by region. For example, in Utah INRIX provides trajectory data that cover 3% of all trips in the state (Markovic et al. 2020). Trajectory data can be processed and made available within hours of collection at a higher price than that of data that are delivered weekly or monthly. Location-Based Services Data LBS use smartphonesâ GPS technology (or control plane locating for older devices) to geolocate the device and integrate this location information with relevant services and content databases. The content databases provide supporting information such as the road network (digital maps),
10 Use of Vehicle Probe and Cellular GPS Data by State Departments of Transportation business and landmark information (points of interest), and dynamic data such as traffic and weather reports. The userâs past, present, or future location is joined with other context infor- mation to deliver a service (Schiller and Voisard 2004). For use in transportation, LBS data are collected when smartphone apps include code that collects location information from users that opt-in to use the app and report location informa- tion (Cuebiq 2019). The collected data are then packaged as a product that provides anonymous timestamped location information, including dwell times associated with points of interest (Cuebiq 2020). LBS data can then be further enhanced using inferred information such as income, race, education, and family status at the neighborhood level (StreetLight 2020). Current Primary Uses of Probe and Cellular GPS Data According to Pack et al. 2019, probe vehicle data conflated to underlying roadway network and in combination with other data sources can be used to support many different activities including but not limited to the following: Probe-Based Speed Data ⢠Monitoring real-time congestion â Detecting and identifying incidents. â Issuing traveler information. â Conducting work zone monitoring and impact analysis activities. â Detecting the end of the queue. â Comparing real-time speed information to historical trends. â Identifying recurring and non-recurring bottlenecks. â Evaluating regional operations and situational awareness, for example, Metropolitan Area Transportation Operations Coordination (MATOC) (Harrison et al. 2019). ⢠Performance management â Evaluating performance metrics over time: travel time, buffer time, reliability, planning time, and associated indices. â Incorporating data into dynamic performance management dashboards. â Investigating user delay cost. â Meeting legislative requirements, for example, MAP-21 and FAST Act target setting (Pu and Meese 2013, Vandervalk 2018). â Evaluating the worst bottlenecks in a region for a period of time. â Studying trends, including special event, holiday, and seasonal movements. â Exploring the impacts of capital investments prior to, during, and after completion of the project. â Conducting after action reviews, for example, Maryland State Highway Administration (Harrison et al. 2019). â Evaluating winter performance management, for example, the Ohio DOT (Harrison et al. 2019). â Measuring freight performance (Habtemichael et al. 2015). â Measuring truck and auto performance (Eshragh et al. 2015). ⢠Planning and research â Identifying problems. â Prioritizing projects. â Performing safety analyses. â Implementing public participation/information campaigns. â Conducting before-and-after studies.
Literature Review 11 â Reporting project assessments, for example, the New Jersey DOT Tactical-level Asset Management Plan (Harrison et al. 2019). â Evaluating at-grade railroad crossings (Hafeez and Kasemsarn 2017). ⢠Traveler information â Providing real-time travel time information on dynamic message signs (DMS). â Delivering network performance information. â Distributing special event and holiday guidance. Origin-Destination and/or Trajectory Data ⢠Real-time traffic pattern analysis â Evaluating corridor demands based on observed vehicle trips. â Evaluating the effectiveness of implemented detours around incidents and congestion and observing self-detouring patterns. â Identifying temporary but significant changes in origin-destination patterns and route utilization resulting from special events or closures. ⢠Signals performance and turning movement analysis (Center for Advanced Transportation Technology, INRIX 2020) â Observed distribution of speeds through intersections. â Arrival on green metrics. â Turning movement metrics. â Intersection travel times. â Approach speeds. â Intersection delays. ⢠Multi-modal system utilization â Discovering mode transitions and influence traveler decisions based on network utilization patterns. â Analyzing freight patterns, for example, âQuantifying Long-Haul Trucks on Floridaâs Highwaysâ (StreetLight 2020). â Assessing the economic impact of mode utilization, for example, âVirginia Bike Tourism: Measuring Cyclingâs Economic Impactâ (StreetLight 2020). ⢠Planning and research â Traditional origin-destination analysis to identify trip origins and destinations, work versus leisure travel, and many other data points. â Waypoint analysis to determine if traffic in a specific area (e.g., state, county, traffic analysis zone, or business center) originated in the same area, neighboring area, or another more distant location. This can identify whether certain corridors are mainly local travel or pass- through corridors. â Trip cluster analysis to evaluate the effectiveness of existing transit service or identifying areas in need of new transit service. â Analysis of trip patterns that may affect critical freight corridors or ports. â Infrastructure investment decisions, for example, analyzing metrics to know where to place electric vehicle charging station locations (StreetLight 2020). Data Quality, Accuracy, and Reliability Probe vehicle data are available from only a sample of total vehicles flowing through the network but are generally considered sufficient enough to estimate travel time distributions through the network. Early studies have shown that even a small percentage (1%-2%) can be adequate for producing data appropriate for certain congestion-related performance measures (Hunter et al. 2009).
12 Use of Vehicle Probe and Cellular GPS Data by State Departments of Transportation Early studies attempted to quantify the quality of probe-based speed data and its ability to be used for certain applications on various classes of roadways based on AADTs or signal spacing. However, the density, granularity, coverage, and quality of data has improved to where it has become a key source of information to support agency operations and planning efforts on most roads for most use cases (Pack et al. 2019). Extensive validation efforts have shown that probe vehicle data are accurate and sufficient enough to support most traffic operations and planning efforts (Vander Laan and Zahedian 2019-2020), as well as policy research (van der Loop et al. 2019). The U.S. DOT and many agencies have performed their own analysis of the NPMRDS, which is based on probe speed data, to support target setting and performance management (Turner and Koeneman 2018, Refai et al. 2017). Validation efforts for LBS and trajectory-related data are still in the early stages, but initial independent validation efforts have shown that the data are extremely representative of actual conditions. For example, multiple aerial photo studies have validated INRIX Trips data (Jordan et al. 2016a, 2016b, 2017), and the Maryland Transportation Instituteâs methodologies for computing trips by mode from LBS data have also been validated (Zhang et al. 2020). Gaps in Data Usage Information Many agencies consider travel time and probe-generated speed data as the top three data elements they would be interested in acquiring if available (Pack and Ivanov 2014). However, prior research has shown that agencies struggle to work with the data due to the size of the data. Data for even a small state like Rhode Island can include hundreds of millions of trajectory records and billions of waypoints. It is for these reasons that third-party analytics packages are starting to be adopted by agencies.
Over the last decade, state departments of transportation (DOTs) have begun to use vehicle probe and cellular GPS data for a variety of purposes, including real-time traffic and incident monitoring, highway condition, and travel demand management. DOTs are also using vehicle probe and cellular GPS data to inform system planning and investment decisions.
The TRB National Cooperative Highway Research Program's NCHRP Synthesis 561: Use of Vehicle Probe and Cellular GPS Data by State Departments of Transportation documents how DOTs are applying vehicle probe and cellular GPS data for planning and real-time traffic and incident monitoring and communication.
In December 2021, an erratum was issued.
READ FREE ONLINE
You're looking at OpenBook, NAP.edu's online reading room since 1999. Based on feedback from you, our users, we've made some improvements that make it easier than ever to read thousands of publications on our website.
Do you want to take a quick tour of the OpenBook's features?
Show this book's table of contents , where you can jump to any chapter by name.
...or use these buttons to go back to the previous chapter or skip to the next one.
Jump up to the previous page or down to the next one. Also, you can type in a page number and press Enter to go directly to that page in the book.
To search the entire text of this book, type in your search term here and press Enter .
Share a link to this book page on your preferred social network or via email.
View our suggested citation for this chapter.
Ready to take your reading offline? Click here to buy this book in print or download it as a free PDF, if available.
Do you enjoy reading reports from the Academies online for free ? Sign up for email notifications and we'll let you know about new publications in your areas of interest when they're released.
International Journal of Research and Innovation in Applied Science (IJRIAS)
Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 754))
Included in the following conference series:
239 Accesses
1 Citations
Real-time information regarding buses is made available to passengers via the GPS and GSM-based Real-Time Bus Monitoring system, which aims to enhance public transportation. The system uses GPS and GSM technologies to track the location and estimated arrival time of buses and transmit this information to commuters’ mobile devices. This technology addresses the common problem of long waiting times at bus stops and the uncertainty of bus arrival times. The system's benefits are numerous, including the ability to reduce waiting times, optimize bus routes, and reduce traffic congestion. Commuters can track the arrival times of their buses and make informed decisions about their travel plans, which can save them time and effort. The system can also assist transportation authorities in making decisions about bus routes and improving the overall efficiency of public transportation. Moreover, the system's effectiveness can be enhanced through the use of data analytics and artificial intelligence techniques. By analyzing data on bus ridership and travel patterns, transportation authorities can identify areas of high demand and adjust bus schedules and routes accordingly. This approach can help reduce overcrowding on buses, improve travel times, and enhance the overall user experience.
This is a preview of subscription content, log in via an institution to check access.
Subscribe and save.
Tax calculation will be finalised at checkout
Purchases are for personal use only
Institutional subscriptions
Lin W-H, Zeng J (1999) Experimental study on real-time bus arrival time prediction with GPS data. Transp Res Rec J Transp Res Board (1666):1019
Google Scholar
Lau EC-W (2013) Simple bus tracking system. J Adv Comput Sci Technol Res 3(1)
Shende P, Bhosale P, Khan S, Patil P (2016) Bus tracking and transportation safety using internet of things. Int Res J Eng Technol (IRJET) 03(02)
Selvapriya PR, Mundada MR (2015) IOT based bus transport system in Bangalore. Int J Eng Tech Res (IJETR) 3(2). ISSN: 2321-0869
Kumbhar M, Survase M, Mastud P, Salunke A (2016) Real time web based bus tracking system. Int Res J Eng Technol (IRJET) 03(02)
Nair V, Pawar A, Tidke DL, Pagar V, Wani N (2018) Online bus tracking and ticketing system. MVP J Eng Sci 1(1). http://doi.org/10.18311/mvpjes/2018/v1i1/18297
Chandurkar S, Mugade S, Sinha S, Borkar P (2013) Implementation of real-time bus monitoring and passenger information system. Int J Sci Res Publ 3(5)
Download references
We express our gratitude to our supervisor, Prof. Kapil Mundada, for providing us with his invaluable guidance and critical evaluation throughout the duration of this project. Additionally, we extend our appreciation to the Department of Instrumentation Engineering at VIT Pune, for their generous support and provision of essential resources that enabled us to successfully undertake this endeavor.
Authors and affiliations.
Department of Instrumentation and Control Engineering, Vishwakarma Institute of Technology, Pune, India
Kapil Mundada, Sumedh Patti, Tejas Rajguru, Puskraj Savji & Sayali Shambharkar
You can also search for this author in PubMed Google Scholar
Correspondence to Kapil Mundada .
Editors and affiliations.
Singidunum University, Belgrade, Serbia
ITM University, Gwalior, Madhya Pradesh, India
Shyam Akashe
Global Knowledge Research Foundation, Ahmedabad, Gujarat, India
Reprints and permissions
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper.
Mundada, K., Patti, S., Rajguru, T., Savji, P., Shambharkar, S. (2023). Smart Bus Real-Time Tracking System Using GSM and GPS Module. In: Tuba, M., Akashe, S., Joshi, A. (eds) ICT Infrastructure and Computing. ICT4SD 2023. Lecture Notes in Networks and Systems, vol 754. Springer, Singapore. https://doi.org/10.1007/978-981-99-4932-8_46
DOI : https://doi.org/10.1007/978-981-99-4932-8_46
Published : 26 September 2023
Publisher Name : Springer, Singapore
Print ISBN : 978-981-99-4931-1
Online ISBN : 978-981-99-4932-8
eBook Packages : Intelligent Technologies and Robotics Intelligent Technologies and Robotics (R0)
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Policies and ethics
Vehicle safety and monitoring system using arm processor, automotive electronic safety for intelligent transportation system, multi sensor fusion based vehicle tracking system, investigation of car surveillance using gps and gsm system based on google maps, an arduino based safety alerts for vehicle and accident detection system, determining criteria weights for vehicle tracking system selection using piprecia-s, crowdsource based vehicle tracking system, a survey on vehicle monitoring system, design and implementation of an easy-to-use tracking device for logistic applications, advanced vehicle monitoring system, 37 references, ubiquitous gps vehicle tracking and management system, development of vehicle tracking system using gps and gsm modem, vehicle tracking and monitoring by arm7, a review on vehicular monitoring and tracking, design and development of gps-gsm based tracking system with google map based monitoring, intelligent bus monitoring and management system, real time bus monitoring system using gps, rfid and communication technologies for an intelligent bus monitoring and management system, implementation of real time bus monitoring and passenger information system, gps real time vehicle alarm monitoring system base on gprs/csd using the embedded system, related papers.
Showing 1 through 3 of 0 Related Papers