Data Utilization and Preparation

Demasi, Marguerite edited this page Aug 19, 2021 · 6 revisions

Description of All Data Used in StudySafe

User Predictions

StudySafe uses this data for the Map and Find a Place to Study. For each building and each day of the week, it contains the hourly median and mean user count, calculated from the last three weeks of data. Only the median user count is used in the app. The data also contains each building's latitude, longitude, and building type, which the app needs for display.

The data is read in from an rds file created by a script in the data GitHub repository. For a specific building and weekday, the median and mean user counts are calculated as follows. On a given date, the maximum user count per hour is taken for each unique devname, and these maximums are summed to give the total hourly users for that date. This is repeated for the other dates of that weekday from the last three weeks. The median and mean of these total hourly users are then found, and the median is used for the predictions. This is completed for every building on every day of the week.
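
The calculation above can be illustrated with a minimal sketch. It is written in Python using only the standard library and is not the project's actual script. The function name `hourly_predictions`, the record layout `(date, hour, devname, user_count)`, the devname values, and the sample numbers are all assumptions made up for illustration. The sketch also assumes the records have already been filtered to one building and one weekday, and it does not count an hour with no records on a date as zero.

```python
from collections import defaultdict
from statistics import mean, median


def hourly_predictions(records):
    """Hypothetical helper: records are (date, hour, devname, user_count)
    tuples for ONE building and ONE weekday over the last three weeks."""
    # Step 1: maximum user count per hour for each devname on each date.
    peak = defaultdict(int)
    for date, hour, devname, count in records:
        key = (date, hour, devname)
        peak[key] = max(peak[key], count)

    # Step 2: sum the per-devname maximums to get total hourly users per date.
    totals = defaultdict(int)
    for (date, hour, _devname), count in peak.items():
        totals[(date, hour)] += count

    # Step 3: group the per-date totals by hour.
    by_hour = defaultdict(list)
    for (_date, hour), total in totals.items():
        by_hour[hour].append(total)

    # Step 4: median and mean across dates; only the median feeds predictions.
    return {
        hour: {"median": median(values), "mean": mean(values)}
        for hour, values in sorted(by_hour.items())
    }


# Made-up sample: three Mondays, the 9:00 hour, two devnames.
records = [
    ("2021-08-02", 9, "ap-1", 4),
    ("2021-08-02", 9, "ap-1", 7),
    ("2021-08-02", 9, "ap-2", 3),
    ("2021-08-09", 9, "ap-1", 5),
    ("2021-08-09", 9, "ap-2", 9),
    ("2021-08-16", 9, "ap-1", 6),
    ("2021-08-16", 9, "ap-2", 9),
    ("2021-08-16", 9, "ap-2", 2),
]

result = hourly_predictions(records)
print(result[9]["median"])
```

In this sample the total hourly users for the three dates are 10, 14, and 15, so the median and mean differ. The median is less sensitive to a single unusually busy or quiet date, which is one plausible reason for preferring it when only three dates are available.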