Mining large-scale human mobility data for long-term crime prediction

AI-generated keywords: Crime Prediction, Human Mobility Data, Foursquare Venues, Subway Rides, Taxi Rides

AI-generated Key Points

  • Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity
  • Ubiquitous computing provides an opportunity to improve these models by incorporating data that represent human presence in cities
  • The paper titled "Mining large-scale human mobility data for long-term crime prediction" by Cristina Kadar and Irena Pletikosa uses large human mobility data to develop features for crime prediction
  • Features are informed by theories in criminology and urban studies
  • Spatial and spatio-temporal features derived from Foursquare venues, check-ins, subway rides, and taxi rides improve baseline models that rely solely on census and point-of-interest (POI) data
  • Proposed models achieve absolute R^2 values of up to 65% on a geographical out-of-sample test set and up to 89% on a temporal out-of-sample test set
  • Ambient population strongly predicts crime levels in addition to residential population
  • Predictive gain of human dynamics features varies across different types of crimes, with the greatest boost seen in predicting grand larcenies
  • Top predictive features for each main crime category are identified and discussed
  • Geo-tagged human dynamics data can be leveraged to measure aspects of criminological theories at a large scale
  • Using large-scale human mobility data alongside traditional census data can enhance understanding of crime patterns and help develop more effective strategies for public safety

Authors: Cristina Kadar, Irena Pletikosa

License: CC BY 4.0

Abstract: Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity. With the rise of ubiquitous computing, there is the opportunity to improve such models with data that make for better proxies of human presence in cities. In this paper, we leverage large human mobility data to craft an extensive set of features for crime prediction, as informed by theories in criminology and urban studies. We employ averaging and boosting ensemble techniques from machine learning, to investigate their power in predicting yearly counts for different types of crimes occurring in New York City at census tract level. Our study shows that spatial and spatio-temporal features derived from Foursquare venues and checkins, subway rides, and taxi rides, improve the baseline models relying on census and POI data. The proposed models achieve absolute R^2 metrics of up to 65% (on a geographical out-of-sample test set) and up to 89% (on a temporal out-of-sample test set). This proves that, next to the residential population of an area, the ambient population there is strongly predictive of the area's crime levels. We deep-dive into the main crime categories, and find that the predictive gain of the human dynamics features varies across crime types: such features bring the biggest boost in case of grand larcenies, whereas assaults are already well predicted by the census features. Furthermore, we identify and discuss top predictive features for the main crime categories. These results offer valuable insights for those responsible for urban policy or law enforcement.

Submitted to arXiv on 04 Jun. 2018


Results of the summarizing process for the arXiv paper: 1806.01400v1

Traditional crime prediction models based on census data are limited because they fail to capture the complexity and dynamics of human activity. With the advent of ubiquitous computing, there is an opportunity to improve these models by incorporating data that better represent human presence in cities. In their paper titled "Mining large-scale human mobility data for long-term crime prediction," Cristina Kadar and Irena Pletikosa leverage large human mobility data to develop a comprehensive set of features for crime prediction. These features are informed by theories in criminology and urban studies. The authors employ averaging and boosting ensemble techniques from machine learning to investigate how well these features predict yearly counts of different types of crimes at the census tract level in New York City. They find that spatial and spatio-temporal features derived from Foursquare venues and check-ins, subway rides, and taxi rides improve baseline models that rely solely on census and point-of-interest (POI) data. The proposed models achieve absolute R^2 values of up to 65% on a geographical out-of-sample test set and up to 89% on a temporal out-of-sample test set. This indicates that, in addition to the residential population of an area, the ambient population strongly predicts crime levels in that area. The authors look more closely at the main crime categories and observe that the predictive gain of human dynamics features varies across crime types. For instance, these features provide the greatest boost in predicting grand larcenies, while assaults are already well predicted by census features alone. Furthermore, Kadar and Pletikosa identify and discuss the top predictive features for each main crime category. These findings offer valuable insights for those responsible for urban policy or law enforcement.
By leveraging geo-tagged human dynamics data, such as public transportation or taxi usage patterns, researchers can now empirically measure aspects of criminological theories that were previously difficult to study at a large scale. Overall, this research highlights the potential of using large-scale human mobility data for long-term crime prediction models. By incorporating dynamic data sources such as Foursquare venues and check-ins, subway rides, and taxi rides alongside traditional census data, policymakers and law enforcement agencies can enhance their understanding of crime patterns and develop more effective strategies to ensure public safety and security.
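
The two evaluation settings mentioned in the summary (a geographical and a temporal out-of-sample test set, both scored with R^2) can be illustrated with a small sketch. Everything here is an assumption made for illustration: the records are invented, the tract labels and years are hypothetical, and a one-variable least-squares line stands in for the averaging and boosting ensembles used in the paper. The sketch only shows how the two kinds of hold-out differ and how R^2 is computed.

```python
from statistics import mean


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Invented records: (tract_id, year, ambient_population, yearly_crime_count).
# None of these values come from the paper.
records = [
    ("A", 2014, 10, 21), ("A", 2015, 12, 25),
    ("B", 2014, 20, 41), ("B", 2015, 22, 44),
    ("C", 2014, 30, 62), ("C", 2015, 31, 63),
    ("D", 2014, 40, 79), ("D", 2015, 42, 85),
]


def fit_line(rows):
    """Least-squares fit of crime_count ~ a + b * ambient_population.

    A deliberately simple stand-in model, not the paper's ensembles.
    """
    xs = [r[2] for r in rows]
    ys = [r[3] for r in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    b = s_xy / s_xx
    a = y_bar - b * x_bar
    return lambda x: a + b * x


def evaluate(train, test):
    """Fit on the training rows and return R^2 on the held-out rows."""
    model = fit_line(train)
    return r_squared([r[3] for r in test], [model(r[2]) for r in test])


# Geographical out-of-sample: entire tracts are held out, all years included.
geo_test_tracts = {"B", "D"}
geo_r2 = evaluate(
    [r for r in records if r[0] not in geo_test_tracts],
    [r for r in records if r[0] in geo_test_tracts],
)

# Temporal out-of-sample: train on the earlier year, test on the later year.
temporal_r2 = evaluate(
    [r for r in records if r[1] == 2014],
    [r for r in records if r[1] == 2015],
)

print(f"geographical R^2: {geo_r2:.3f}")
print(f"temporal R^2:     {temporal_r2:.3f}")
```

In the geographical split the model never sees the held-out tracts, while in the temporal split it sees every tract but only in an earlier year. That difference is one plausible reading of why the reported temporal R^2 (up to 89%) is higher than the geographical one (up to 65%).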
Created on 23 Aug. 2023

