Merge pull request #2 from RensselaerIDEA/readme
Add links to papers from where these data files are used
erickj4 committed Mar 8, 2022
2 parents 100a220 + 0d19ecc commit c254fbdac8fc26c94ae35a25d72dae69f8948eea
Showing 1 changed file with 2 additions and 0 deletions.
@@ -3,6 +3,8 @@ The repository includes the code for metrics designed for evaluating fairness of

## Repository structure
* **data:** This folder contains data for two datasets. *Atus* is the American Time Use Survey dataset and includes both the derived real and synthetic data files. *Mimic* is the MIMIC-III dataset, based on a past study identifying the impact of race on mortality, and includes only the synthetic dataset. Note that the synthetic datasets are generated using a Generative Adversarial Network (GAN) model called [HealthGAN](https://github.com/TheRensselaerIDEA/synthetic_data) and are intended not to release any private information from the real datasets.
- *ATUS:* The real and synthetic data were used in the previously published paper [Medical Time-Series Data Generation Using Generative Adversarial Networks](https://link.springer.com/chapter/10.1007/978-3-030-59137-3_34).
- *MIMIC:* The synthetic data was used in the previously published paper [Generation and evaluation of privacy preserving synthetic health data](https://www.sciencedirect.com/science/article/pii/S0925231220305117).
* **scripts:** The scripts contain code snippets that are used in multiple other files or notebooks and have therefore been designed to be imported as functions.
* **notebooks:** The notebooks include code for plotting figures and calculating metrics on the datasets.
* **results:** The results for the log disparity metric on the synthetic datasets are compiled into CSV files included in this folder.
