Algorithmic Analysis of GTFS-RT vehicle position accuracy

AI-generated keywords: Geodesic intersections, Ellipsoid, Real-time transit data, Data anomalies, GTFS FeedMessages

AI-generated Key Points

  • Three novel algorithms for calculating geodesic intersections on an ellipsoid
  • Analysis of real-time transit data in California to assess vehicle position drift
  • Identification of key dataset issues, including missing GTFS FeedMessages and various types of missing data points
  • Around 30% of the dataset rendered unusable for analysis due to errors
  • Observation of a nightly pattern in the percentage of vehicles within 35 meters of their scheduled route, indicating potential errors such as vehicles not being unlinked from trips while in storage or transponders not being disabled during maintenance
  • High standard deviation in vehicle distance from the scheduled route, possibly caused by errors within the GTFS dataset such as stops located too far from the route shape
  • Distribution map showing most information originating from San Francisco Bay Area and Los Angeles County
  • Alignment of GTFS data with geographical features, despite some inaccuracies compared to OpenStreetMap
  • Proposal of practical solutions to improve positional accuracy for both data producers and consumers
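
The 35-meter check mentioned in the key points can be illustrated with a minimal sketch. It is not one of the paper's three ellipsoidal algorithms: it assumes a spherical Earth and a local flat projection, which is only reasonable over short distances. The function names, the `shape` layout (a list of `(lat, lon)` pairs such as the points of one GTFS shape), and the default threshold parameter are assumptions made for illustration.

```python
import math

EARTH_RADIUS_M = 6_371_008.8  # mean Earth radius; a spherical simplification


def _to_local_xy(lat, lon, lat0, lon0):
    """Project (lat, lon) to metres on a plane centred at (lat0, lon0)."""
    x = math.radians(lon - lon0) * math.cos(math.radians(lat0)) * EARTH_RADIUS_M
    y = math.radians(lat - lat0) * EARTH_RADIUS_M
    return x, y


def distance_to_shape_m(lat, lon, shape):
    """Approximate distance in metres from a point to a polyline of (lat, lon) pairs."""
    best = math.inf
    for (lat1, lon1), (lat2, lon2) in zip(shape, shape[1:]):
        # The vehicle position is the projection origin, so it sits at (0, 0).
        ax, ay = _to_local_xy(lat1, lon1, lat, lon)
        bx, by = _to_local_xy(lat2, lon2, lat, lon)
        dx, dy = bx - ax, by - ay
        seg_len_sq = dx * dx + dy * dy
        if seg_len_sq == 0:
            t = 0.0  # degenerate segment: both endpoints coincide
        else:
            # Parameter of the closest point on the segment, clamped to [0, 1].
            t = max(0.0, min(1.0, -(ax * dx + ay * dy) / seg_len_sq))
        best = min(best, math.hypot(ax + t * dx, ay + t * dy))
    return best


def is_on_route(lat, lon, shape, threshold_m=35.0):
    """True if the point lies within threshold_m of the polyline."""
    return distance_to_shape_m(lat, lon, shape) <= threshold_m
```

A consumer could call `is_on_route(vehicle_lat, vehicle_lon, shape_points)` per vehicle position. For long segments or high accuracy requirements, a proper ellipsoidal geodesic computation would be needed instead of this planar approximation.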

Authors: Joshua Wong

arXiv: 2506.06479v1 (physics.geo-ph)
License: CC BY-NC-SA 4.0

Abstract: This paper presents three novel algorithms for calculating geodesic intersections on an ellipsoid. These algorithms are applied in a case study analyzing real-time transit data in California to assess vehicle position drift. The analysis reveals that while certain data anomalies can be corrected, large-scale discrepancies persist. The paper concludes by proposing a set of practical solutions that can be implemented by either data producers or consumers to significantly improve positional accuracy.
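
To give a feel for the geodesic-intersection problem, the sketch below solves the much simpler spherical case, where every geodesic is a great circle lying in a plane through the Earth's center. This is explicitly not the paper's method: on an ellipsoid, geodesics are generally not planar, so this cross-product construction does not carry over directly. All function names are hypothetical.

```python
import math


def _unit_vector(lat, lon):
    """Convert degrees latitude/longitude to a 3D unit vector."""
    lat, lon = math.radians(lat), math.radians(lon)
    return (math.cos(lat) * math.cos(lon), math.cos(lat) * math.sin(lon), math.sin(lat))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _norm(v):
    return math.sqrt(sum(c * c for c in v))


def _to_lat_lon(v):
    """Convert a non-zero 3D vector back to degrees latitude/longitude."""
    n = _norm(v)
    x, y, z = (c / n for c in v)
    z = max(-1.0, min(1.0, z))  # guard against rounding just outside [-1, 1]
    return math.degrees(math.asin(z)), math.degrees(math.atan2(y, x))


def great_circle_intersections(p1, p2, q1, q2):
    """Return the two antipodal points where the great circle through p1, p2
    meets the great circle through q1, q2, or None if they coincide.

    Each argument is a (lat, lon) pair in degrees. The result describes the
    full great circles, not just the arcs between the given points.
    """
    n1 = _cross(_unit_vector(*p1), _unit_vector(*p2))  # normal of first plane
    n2 = _cross(_unit_vector(*q1), _unit_vector(*q2))  # normal of second plane
    line = _cross(n1, n2)  # direction shared by both planes
    if _norm(line) < 1e-12:
        return None  # circles coincide, or an input pair is degenerate
    return _to_lat_lon(line), _to_lat_lon(tuple(-c for c in line))
```

For example, intersecting the equator with the 30°E meridian returns a point at latitude 0, longitude 30 and its antipode. Deciding whether an intersection lies on the arcs between the input points requires an additional check that is omitted here.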

Submitted to arXiv on 06 Jun. 2025

Results of the summarizing process for the arXiv paper: 2506.06479v1

This paper by Joshua Wong presents three novel algorithms for calculating geodesic intersections on an ellipsoid and applies them in a case study of real-time transit data in California to assess vehicle position drift. The analysis finds that certain data anomalies can be corrected, but large-scale discrepancies persist.

The study highlights key issues within the dataset, including missing GTFS FeedMessages and various types of missing data points. These errors render around 30% of the dataset unusable for analysis and raise concerns about the accuracy of the data.

The paper also discusses a nightly pattern in the percentage of vehicles within 35 meters of their scheduled route. The pattern suggests errors such as vehicles not being unlinked from trips while in storage, or transponders not being disabled during maintenance. The distribution of vehicle distance from the scheduled route also shows a high standard deviation, possibly caused by errors within the GTFS dataset such as stops located too far from the route shape.

A map of California's GTFS and GTFS-RT data shows that most information originates from the San Francisco Bay Area and Los Angeles County. Despite some inaccuracies relative to OpenStreetMap, the GTFS data aligns well with geographical features overall.

The paper closes by proposing practical solutions that either data producers or data consumers can implement to improve positional accuracy and make transit data analysis more reliable.
Created on 03 Jul. 2025
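
The summary mentions two consumer-side steps that lend themselves to a short sketch: discarding records with missing data points, and tracking the share of vehicles within 35 meters of their route by hour. The example below is a hedged illustration, not the paper's pipeline. The flat dictionary layout, the `REQUIRED_FIELDS` list, and the precomputed `distance_m` value are assumptions; the field names loosely mirror the GTFS-RT VehiclePosition message.

```python
from collections import defaultdict
from datetime import datetime, timezone

# Assumed minimum set of fields a record needs to be analysable.
REQUIRED_FIELDS = ("trip_id", "latitude", "longitude", "timestamp")


def is_usable(record):
    """True if every required field is present and not None."""
    return all(record.get(field) is not None for field in REQUIRED_FIELDS)


def hourly_on_route_share(records, threshold_m=35.0):
    """Return {hour_utc: fraction of usable records within threshold_m}.

    Each record is a dict with the required fields plus a precomputed
    `distance_m` (hypothetical: distance from the scheduled route in metres).
    """
    within = defaultdict(int)
    total = defaultdict(int)
    for record in records:
        if not is_usable(record) or record.get("distance_m") is None:
            continue  # skip records with missing data points
        # POSIX timestamp interpreted in UTC.
        hour = datetime.fromtimestamp(record["timestamp"], tz=timezone.utc).hour
        total[hour] += 1
        within[hour] += record["distance_m"] <= threshold_m
    return {hour: within[hour] / total[hour] for hour in sorted(total)}
```

Hours are grouped in UTC here for simplicity. Spotting a nightly pattern for California agencies would more naturally use local time, which would require converting the timestamp to the agency's time zone first.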
