Mining large-scale human mobility data for long-term crime prediction
AI-generated Key Points
- Traditional crime prediction models based on census data are limited because they fail to capture the complexity and dynamics of human activity
- Ubiquitous computing provides an opportunity to improve these models by incorporating data that represent human presence in cities
- The paper titled "Mining large-scale human mobility data for long-term crime prediction" by Cristina Kadar and Irena Pletikosa uses large human mobility data to develop features for crime prediction
- Features are informed by theories in criminology and urban studies
- Spatial and spatio-temporal features derived from Foursquare venues, check-ins, subway rides, and taxi rides improve on baseline models that rely solely on census and point-of-interest (POI) data
- Proposed models achieve absolute R^2 metrics of up to 65% on a geographical out-of-sample test set and up to 89% on a temporal out-of-sample test set
- Ambient population strongly predicts crime levels in addition to residential population
- Predictive gain of human dynamics features varies across different types of crimes, with the greatest boost seen in predicting grand larcenies
- Top predictive features for each main crime category are identified and discussed
- Geo-tagged human dynamics data can be leveraged to measure aspects of criminological theories at a large scale
- Using large-scale human mobility data alongside traditional census data can enhance understanding of crime patterns and support more effective strategies for public safety
Authors: Cristina Kadar, Irena Pletikosa
Abstract: Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity. With the rise of ubiquitous computing, there is the opportunity to improve such models with data that make for better proxies of human presence in cities. In this paper, we leverage large human mobility data to craft an extensive set of features for crime prediction, as informed by theories in criminology and urban studies. We employ averaging and boosting ensemble techniques from machine learning, to investigate their power in predicting yearly counts for different types of crimes occurring in New York City at census tract level. Our study shows that spatial and spatio-temporal features derived from Foursquare venues and checkins, subway rides, and taxi rides, improve the baseline models relying on census and POI data. The proposed models achieve absolute R^2 metrics of up to 65% (on a geographical out-of-sample test set) and up to 89% (on a temporal out-of-sample test set). This proves that, next to the residential population of an area, the ambient population there is strongly predictive of the area's crime levels. We deep-dive into the main crime categories, and find that the predictive gain of the human dynamics features varies across crime types: such features bring the biggest boost in case of grand larcenies, whereas assaults are already well predicted by the census features. Furthermore, we identify and discuss top predictive features for the main crime categories. These results offer valuable insights for those responsible for urban policy or law enforcement.
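The abstract reports R^2 on two kinds of held-out data: a geographical out-of-sample set and a temporal out-of-sample set. The following is a minimal sketch of how such an evaluation could be set up, not the authors' code. It assumes R^2 is the standard coefficient of determination (so 65% would correspond to 0.65), that a geographical split holds out whole census tracts, and that a temporal split holds out a later year. The row layout and the function names `r_squared`, `geographic_split`, and `temporal_split` are hypothetical, and the toy rows are made-up numbers for illustration only.

```python
from statistics import mean


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all true values are equal")
    return 1.0 - ss_res / ss_tot


def geographic_split(rows, test_tracts):
    """Assumed geographical split: hold out entire census tracts."""
    train = [r for r in rows if r["tract"] not in test_tracts]
    test = [r for r in rows if r["tract"] in test_tracts]
    return train, test


def temporal_split(rows, test_year):
    """Assumed temporal split: train on earlier years, test on one later year."""
    train = [r for r in rows if r["year"] < test_year]
    test = [r for r in rows if r["year"] == test_year]
    return train, test


# Toy rows (hypothetical values): one yearly crime count per tract and year.
rows = [
    {"tract": "A", "year": 2014, "crimes": 10},
    {"tract": "A", "year": 2015, "crimes": 12},
    {"tract": "B", "year": 2014, "crimes": 30},
    {"tract": "B", "year": 2015, "crimes": 28},
    {"tract": "C", "year": 2014, "crimes": 5},
    {"tract": "C", "year": 2015, "crimes": 7},
]

geo_train, geo_test = geographic_split(rows, test_tracts={"C"})
time_train, time_test = temporal_split(rows, test_year=2015)
```

In this setup a model would be fit on the training rows only, and `r_squared` would be computed from the true and predicted counts of the held-out rows. A temporal test set still contains tracts the model has already seen in earlier years, which is one plausible reason the two scores can differ.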