diff --git a/StudentNotebooks/Assignment02/dar-f24-assignment2-template.Rmd b/StudentNotebooks/Assignment02/dar-f24-assignment2-template.Rmd index bdf8f9c..3a06ea3 100755 --- a/StudentNotebooks/Assignment02/dar-f24-assignment2-template.Rmd +++ b/StudentNotebooks/Assignment02/dar-f24-assignment2-template.Rmd @@ -62,16 +62,18 @@ This notebook is broken into two main parts: * **Part 2:** Loading and some analysis of the Mars 2020 (M20) Datasets * Lithology: _Summarizes the mineral characteristics of samples collected at certain sample locations._ * PIXL: Planetary Instrument for X-ray Lithochemistry. _Measures elemental chemistry of samples at sub-millimeter scales of samples._ - * SHERLOC: Scanning Habitable Environments with Raman and Luminescence for Organics and Chemicals. _Uses cameras, a spectrometer, and a laser of samples to search for organic compounds and minerals that have been altered in watery environments and may be signs of past microbial life._ + * SHERLOC: Scanning Habitable Environments with Raman and Luminescence for Organics and Chemicals. _Uses cameras, a spectrometer, and a laser of samples to search for organic compounds and minerals that have been altered in watery environments and may be signs of past microbial life._ * LIBS: Laser-induced breakdown spectroscopy. _Uses a laser beam to help identify minerals in samples and other areas that are beyond the reach of the rover's robotic arm or in areas too steep for the rover to travel._ * **Part 3:** Individual analysis of your team's dataset +* **Part 4:** Preparation of Team Presentation + + **NOTE:** The RPI github repository for all the code and data required for this notebook may be found at: * https://github.rpi.edu/DataINCITE/DAR-Mars-F24 -* **Part 4:** Preparation of Team Presentation # DAR ASSIGNMENT 2 (Part 1): Preparing your local repo for Assignment 2 @@ -156,8 +158,7 @@ NOTES: * Also included is `rover.waypoints.Rds` that provides detailed location information (lat/lon) for the Perseverance rover throughout its journey, up to the present. This can be updated when necessary using the included `roverStatus-f24.R` script. * A general guide to the available Mars 2020 data is available here: https://pds-geosciences.wustl.edu/missions/mars2020/index.htm * Other useful MARS 2020 sites - https://science.nasa.gov/mission/mars-2020-perseverance/mars-rock-samples/ - https://an.rsl.wustl.edu/m20/AN/an3.aspx?AspxAutoDetectCookieSupport=1 + https://science.nasa.gov/mission/mars-2020-perseverance/mars-rock-samples/ and https://an.rsl.wustl.edu/m20/AN/an3.aspx?AspxAutoDetectCookieSupport=1 * Note that PIXL, SHERLOC, and Lithology describe 16 sample that were physically collected. There will eventually be 38 samples. These datasets can be merged by sample. The LIBS data includes observations collected at many more locations so how to combine the LIBS data with the other datasets is an open research question. ## Data Set A: Load the Lithology Data @@ -178,7 +179,8 @@ lithology.df<- readRDS("/academics/MATP-4910-F24/DAR-Mars-F24/Data/mineral_data_ lithology.df$sample <- as.numeric(lithology.df$sample) # Convert rest into factors -lithology.df[sapply(lithology.df, is.character)] <- lapply(lithology.df[sapply(lithology.df, is.character)], +lithology.df[sapply(lithology.df, is.character)] <- + lapply(lithology.df[sapply(lithology.df, is.character)], as.factor) # Keep only first 16 samples because the data for the rest of the samples is not available yet @@ -240,7 +242,7 @@ summary(libs.df) libs.matrix <- as.matrix(libs.df[,6:13]) # Check to see scaling -summary(libs.matrix) +str(libs.matrix) ``` @@ -284,7 +286,7 @@ sherloc.df <- cbind(pixl.df[,c("sample","type","campaign","abrasion")],sherloc.m summary(sherloc.df) # Measurements are everything except first column -sherloc.matrix<-sherloc.matrix[,-1] +sherloc.matrix<-as.matrix(sherloc.matrix[,-1]) # Sherlock measurement matrix # Review the structure @@ -367,7 +369,7 @@ str(sherloc_lithology_pixl.matrix) Each team has been assigned one of six datasets: -1. Dataset B: PIXL: The PIXL team's goal is to understand and explain how scaling improves results from Assignment 1. The matrix version was scaled above. +1. Dataset B: PIXL: The PIXL team's goal is to understand and explain how scaling changes results from Assignment 1. The matrix version was scaled above but not in Assignment 1. 2. Dataset C: LIBS (with appropriate scaling as necessary. Not scaled yet.) @@ -385,11 +387,11 @@ Each team has been assigned one of six datasets: 1. _Describe the data set contained in the data frame and matrix:_ How many rows does it have and how many features? Which features are measurements and which features are metadata about the samples? (3 pts) -2. _Scale this data appropriately (you can choose the scaling method):_ Explain why you chose that scaling method. (3 pts) +2. _Scale this data appropriately (you can choose the scaling method or decide to not scale data):_ Explain why you chose a scaling method or to not scale. (3 pts) 3. _Cluster the data using k-means or your favorite clustering method (like hierarchical clustering):_ Describe how you picked the best number of clusters. Indicate the number of points in each clusters. Coordinate with your team so you try different approaches. If you want to share results with your team mates, make sure to use the same random seeds. (6 pts) -4. _Perform a **creative analysis** that provides insights into what one or more of the clusters are and what they tell you about the MARS data:_ +4. _Perform a **creative analysis** that provides insights into what one or more of the clusters are and what they tell you about the MARS data: Alternatively do another creative analysis of your datasets that leads to one of more findings. Make sure to explain what your analysis and discuss your the results. # Preparation of Team Presentation (Part 4) diff --git a/StudentNotebooks/Assignment02/dar-f24-assignment2-template.html b/StudentNotebooks/Assignment02/dar-f24-assignment2-template.html index b1022eb..b42d711 100644 --- a/StudentNotebooks/Assignment02/dar-f24-assignment2-template.html +++ b/StudentNotebooks/Assignment02/dar-f24-assignment2-template.html @@ -11,7 +11,7 @@ - +
This notebook is broken into two main parts:
Part 1: Preparing your local repo for +DAR Assignment 2
Part 2: Loading and some analysis of the Mars +2020 (M20) Datasets
Part 3: Individual analysis of your team’s +dataset
Part 4: Preparation of Team +Presentation
NOTE: The RPI github repository for all the code and data required for this notebook may be found at:
Part 4: Preparation of Team -Presentation
## SiO2 TiO2 Al2O3 FeOT
-## Min. : 0.00 Min. :0.0000 Min. : 0.000 Min. : 0.29
-## 1st Qu.:42.04 1st Qu.:0.0300 1st Qu.: 3.080 1st Qu.:13.27
-## Median :45.80 Median :0.3200 Median : 4.925 Median :20.21
-## Mean :43.47 Mean :0.3719 Mean : 6.246 Mean :20.07
-## 3rd Qu.:49.23 3rd Qu.:0.6400 3rd Qu.: 8.533 3rd Qu.:25.45
-## Max. :76.12 Max. :2.4000 Max. :38.350 Max. :82.68
-## MgO CaO Na2O K2O
-## Min. : 0.29 Min. : 0.080 Min. :0.0000 Min. : 0.0000
-## 1st Qu.: 5.72 1st Qu.: 1.830 1st Qu.:0.9775 1st Qu.: 0.0000
-## Median :12.78 Median : 3.625 Median :1.5200 Median : 0.3000
-## Mean :16.47 Mean : 4.726 Mean :1.7600 Mean : 0.5909
-## 3rd Qu.:27.83 3rd Qu.: 4.622 3rd Qu.:2.4000 3rd Qu.: 0.7800
-## Max. :45.21 Max. :52.130 Max. :7.5200 Max. :34.8700
+str(libs.matrix)
+## num [1:1932, 1:8] 49.7 55.8 61.2 51 48 ...
+## - attr(*, "dimnames")=List of 2
+## ..$ : NULL
+## ..$ : chr [1:8] "SiO2" "TiO2" "Al2O3" "FeOT" ...
# Measurements are everything except first column
-sherloc.matrix<-sherloc.matrix[,-1]
+sherloc.matrix<-as.matrix(sherloc.matrix[,-1])
# Sherlock measurement matrix
# Review the structure
str(sherloc.matrix)
-## tibble [16 × 35] (S3: tbl_df/tbl/data.frame)
-## $ Plagioclase : num [1:16] 1 1 1 0 0 0 0 0 0 0 ...
-## $ Sulfate : num [1:16] 1 1 1 1 1 1 1 0 0 0 ...
-## $ Ca-sulfate : num [1:16] 1 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated Ca-sulfate : num [1:16] 0 1 1 0 0 0 0 0 0 0 ...
-## $ Mg-sulfate : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Hydrated Sulfates : num [1:16] 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated Mg-Fe sulfate : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Perchlorates : num [1:16] 1 0 0 0 0 0 0 0 0 0 ...
-## $ Na-perchlorate : num [1:16] 0.5 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous Silicate : num [1:16] 0.25 0.25 0.25 0.5 0.5 0.25 0.25 0 0 0 ...
-## $ Phosphate : num [1:16] 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Pyroxene : num [1:16] 1 1 1 1 1 1 1 1 1 1 ...
-## $ Olivine : num [1:16] 0 0 0 1 1 1 1 0.25 0.25 1 ...
-## $ Carbonate : num [1:16] 0 1 1 1 1 1 1 0.5 0.5 1 ...
-## $ Fe-Mg carbonate : num [1:16] 0 0 0 0 0 0 0 0 0 1 ...
-## $ Hydrated Carbonates : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered Silicates : num [1:16] 0 0 0 0 0 0 0 1 1 0 ...
-## $ Feldspar : num [1:16] 0 0 0 0 0 0 0 1 1 0 ...
-## $ Quartz : num [1:16] 0 0 0 0 0 0 0 0.25 0.25 0 ...
-## $ Apatite : num [1:16] 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi oxides : num [1:16] 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Halite : num [1:16] 0.25 0 0 0 0 0 0 0 0 0.25 ...
-## $ Iron oxide : num [1:16] 1 1 1 0 0 0 0 0.5 0.5 0.25 ...
-## $ Hydrated Iron oxide : num [1:16] 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ Organic matter : num [1:16] 0 0 0 1 1 1 1 1 1 0 ...
-## $ Sulfate+Organic matter : num [1:16] 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other hydrated phases : num [1:16] 0 0 0 1 1 1 1 0.5 0.5 1 ...
-## $ Phyllosilicates : num [1:16] 0 0 0 0 0 0 0 0.5 0.5 0.25 ...
-## $ Chlorite : num [1:16] 0 0 0 0 0 0 0 0.5 0.5 0 ...
-## $ Kaolinite (hydrous Al-clay): num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe-Mg-clay minerals : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num [1:16] 0 0 0 0 0 0 0 0 0 0 ...
+## num [1:16, 1:35] 1 1 1 0 0 0 0 0 0 0 ...
+## - attr(*, "dimnames")=List of 2
+## ..$ : NULL
+## ..$ : chr [1:35] "Plagioclase" "Sulfate" "Ca-sulfate" "Hydrated Ca-sulfate" ...
## 'data.frame': 16 obs. of 48 variables:
-## $ Na20 : num 1.928 1.338 -0.498 -0.538 1.225 ...
-## $ Mgo : num -1.14 -1.19 0.95 0.14 -1.38 ...
-## $ Al203 : num 0.663 0.506 -0.707 -0.723 1.741 ...
-## $ Si02 : num -0.0239 0.4749 0.0786 0.1584 1.682 ...
-## $ P205 : num 1.439 3.039 -0.247 -0.535 0.272 ...
-## $ S03 : num -0.384 -0.314 -0.639 -0.521 -0.609 ...
-## $ Cl : num 1.194 -0.281 -0.912 -0.696 0.18 ...
-## $ K20 : num 0.24 0.678 -0.565 -0.537 1.864 ...
-## $ Cao : num 2.101 2.023 -0.385 -0.385 0.32 ...
-## $ Ti02 : num 1.189 3.053 -0.821 0.312 -0.419 ...
-## $ Cr203 : num -0.53 -0.563 -0.155 -0.106 -0.579 ...
-## $ Mno : num 0.442 0.329 1.731 1.115 -0.568 ...
-## $ FeO-T : num -0.46 0.301 1.451 0.717 -1.385 ...
-## $ Plagioclase : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Sulfate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Ca-sulfate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated Ca-sulfate : num 0 1 1 0 0 0 0 0 0 0 ...
-## $ Mg-sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Hydrated Sulfates : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated Mg-Fe sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Perchlorates : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Na-perchlorate : num 0.5 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous Silicate : num 0.25 0.25 0.25 0.5 0.5 0.25 0.25 0 0 0 ...
-## $ Phosphate : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Pyroxene : num 1 1 1 1 1 1 1 1 1 1 ...
-## $ Olivine : num 0 0 0 1 1 1 1 0.25 0.25 1 ...
-## $ Carbonate : num 0 1 1 1 1 1 1 0.5 0.5 1 ...
-## $ Fe-Mg carbonate : num 0 0 0 0 0 0 0 0 0 1 ...
-## $ Hydrated Carbonates : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered Silicates : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Feldspar : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Quartz : num 0 0 0 0 0 0 0 0.25 0.25 0 ...
-## $ Apatite : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi oxides : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Halite : num 0.25 0 0 0 0 0 0 0 0 0.25 ...
-## $ Iron oxide : num 1 1 1 0 0 0 0 0.5 0.5 0.25 ...
-## $ Hydrated Iron oxide : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ Organic matter : num 0 0 0 1 1 1 1 1 1 0 ...
-## $ Sulfate+Organic matter : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other hydrated phases : num 0 0 0 1 1 1 1 0.5 0.5 1 ...
-## $ Phyllosilicates : num 0 0 0 0 0 0 0 0.5 0.5 0.25 ...
-## $ Chlorite : num 0 0 0 0 0 0 0 0.5 0.5 0 ...
-## $ Kaolinite (hydrous Al-clay): num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe-Mg-clay minerals : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num 0 0 0 0 0 0 0 0 0 0 ...
+## num [1:16, 1:48] 1.928 1.338 -0.498 -0.538 1.225 ...
+## - attr(*, "dimnames")=List of 2
+## ..$ : NULL
+## ..$ : chr [1:48] "Na20" "Mgo" "Al203" "Si02" ...
## 'data.frame': 16 obs. of 70 variables:
-## $ Plagioclase : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Sulfate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Ca-sulfate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated Ca-sulfate : num 0 1 1 0 0 0 0 0 0 0 ...
-## $ Mg-sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Hydrated Sulfates : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated Mg-Fe sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Perchlorates : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Na-perchlorate : num 0.5 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous Silicate : num 0.25 0.25 0.25 0.5 0.5 0.25 0.25 0 0 0 ...
-## $ Phosphate : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Pyroxene : num 1 1 1 1 1 1 1 1 1 1 ...
-## $ Olivine : num 0 0 0 1 1 1 1 0.25 0.25 1 ...
-## $ Carbonate : num 0 1 1 1 1 1 1 0.5 0.5 1 ...
-## $ Fe-Mg carbonate : num 0 0 0 0 0 0 0 0 0 1 ...
-## $ Hydrated Carbonates : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered Silicates : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Feldspar : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Quartz : num 0 0 0 0 0 0 0 0.25 0.25 0 ...
-## $ Apatite : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi oxides : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Halite : num 0.25 0 0 0 0 0 0 0 0 0.25 ...
-## $ Iron oxide : num 1 1 1 0 0 0 0 0.5 0.5 0.25 ...
-## $ Hydrated Iron oxide : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ Organic matter : num 0 0 0 1 1 1 1 1 1 0 ...
-## $ Sulfate+Organic matter : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other hydrated phases : num 0 0 0 1 1 1 1 0.5 0.5 1 ...
-## $ Phyllosilicates : num 0 0 0 0 0 0 0 0.5 0.5 0.25 ...
-## $ Chlorite : num 0 0 0 0 0 0 0 0.5 0.5 0 ...
-## $ Kaolinite (hydrous Al-clay): num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe-Mg-clay minerals : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ feldspar : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ plagioclase : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ pyroxene : num 1 1 1 1 1 1 1 1 1 1 ...
-## $ olivine : num 0 0 0 1 1 1 1 1 1 1 ...
-## $ quartz : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ apatite : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi_Oxides : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Iron_Oxide : num 1 1 1 0 0 0 0 1 1 1 ...
-## $ Sulfate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Perchlorates : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Phosphate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Ca_Sulfate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Carbonate : num 0 1 1 1 1 1 1 1 1 1 ...
-## $ Fe_Mg_clay : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe_Mg_carbonate : num 0 0 0 0 0 0 0 0 0 1 ...
-## $ Mg_sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Phyllosilicates : num 0 0 0 0 0 0 0 1 1 1 ...
-## $ Chlorite : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Halite : num 1 0 0 0 0 0 0 0 0 1 ...
-## $ Organic_matter : num 0 0 0 1 1 1 1 1 1 0 ...
-## $ Hydrated_Ca_Sulfate : num 0 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated_Sulfates : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated_Mg_Fe_Sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Na_Perchlorate : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous_Silicate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Hydrated_Carbonates : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered_Silicates : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Hydrated_Iron_Oxide : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Sulfate+Organic_Matter : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other_hydrated_phases : num 0 0 0 1 1 1 1 1 1 1 ...
-## $ Kaolinite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num 0 0 0 0 0 0 0 0 0 0 ...
+## num [1:16, 1:70] 1 1 1 0 0 0 0 0 0 0 ...
+## - attr(*, "dimnames")=List of 2
+## ..$ : NULL
+## ..$ : chr [1:70] "Plagioclase" "Sulfate" "Ca-sulfate" "Hydrated Ca-sulfate" ...
## 'data.frame': 16 obs. of 83 variables:
-## $ Plagioclase : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Sulfate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Ca-sulfate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated Ca-sulfate : num 0 1 1 0 0 0 0 0 0 0 ...
-## $ Mg-sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Hydrated Sulfates : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated Mg-Fe sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Perchlorates : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Na-perchlorate : num 0.5 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous Silicate : num 0.25 0.25 0.25 0.5 0.5 0.25 0.25 0 0 0 ...
-## $ Phosphate : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Pyroxene : num 1 1 1 1 1 1 1 1 1 1 ...
-## $ Olivine : num 0 0 0 1 1 1 1 0.25 0.25 1 ...
-## $ Carbonate : num 0 1 1 1 1 1 1 0.5 0.5 1 ...
-## $ Fe-Mg carbonate : num 0 0 0 0 0 0 0 0 0 1 ...
-## $ Hydrated Carbonates : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered Silicates : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Feldspar : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Quartz : num 0 0 0 0 0 0 0 0.25 0.25 0 ...
-## $ Apatite : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi oxides : num 0.25 1 1 0 0 0 0 0 0 0 ...
-## $ Halite : num 0.25 0 0 0 0 0 0 0 0 0.25 ...
-## $ Iron oxide : num 1 1 1 0 0 0 0 0.5 0.5 0.25 ...
-## $ Hydrated Iron oxide : num 0.25 0 0 0 0 0 0 0 0 0 ...
-## $ Organic matter : num 0 0 0 1 1 1 1 1 1 0 ...
-## $ Sulfate+Organic matter : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other hydrated phases : num 0 0 0 1 1 1 1 0.5 0.5 1 ...
-## $ Phyllosilicates : num 0 0 0 0 0 0 0 0.5 0.5 0.25 ...
-## $ Chlorite : num 0 0 0 0 0 0 0 0.5 0.5 0 ...
-## $ Kaolinite (hydrous Al-clay): num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe-Mg-clay minerals : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ feldspar : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ plagioclase : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ pyroxene : num 1 1 1 1 1 1 1 1 1 1 ...
-## $ olivine : num 0 0 0 1 1 1 1 1 1 1 ...
-## $ quartz : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ apatite : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ FeTi_Oxides : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Iron_Oxide : num 1 1 1 0 0 0 0 1 1 1 ...
-## $ Sulfate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Perchlorates : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Phosphate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Ca_Sulfate : num 1 1 1 0 0 0 0 0 0 0 ...
-## $ Carbonate : num 0 1 1 1 1 1 1 1 1 1 ...
-## $ Fe_Mg_clay : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Fe_Mg_carbonate : num 0 0 0 0 0 0 0 0 0 1 ...
-## $ Mg_sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Phyllosilicates : num 0 0 0 0 0 0 0 1 1 1 ...
-## $ Chlorite : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Halite : num 1 0 0 0 0 0 0 0 0 1 ...
-## $ Organic_matter : num 0 0 0 1 1 1 1 1 1 0 ...
-## $ Hydrated_Ca_Sulfate : num 0 1 1 0 0 0 0 0 0 0 ...
-## $ Hydrated_Sulfates : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Hydrated_Mg_Fe_Sulfate : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Na_Perchlorate : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Amorphous_Silicate : num 1 1 1 1 1 1 1 0 0 0 ...
-## $ Hydrated_Carbonates : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Disordered_Silicates : num 0 0 0 0 0 0 0 1 1 0 ...
-## $ Hydrated_Iron_Oxide : num 1 0 0 0 0 0 0 0 0 0 ...
-## $ Sulfate+Organic_Matter : num 0 0 0 0 0 1 1 0 0 0 ...
-## $ Other_hydrated_phases : num 0 0 0 1 1 1 1 1 1 1 ...
-## $ Kaolinite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Chromite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Ilmenite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Zircon/Baddeleyite : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Spinels : num 0 0 0 0 0 0 0 0 0 0 ...
-## $ Na20 : num 1.928 1.338 -0.498 -0.538 1.225 ...
-## $ Mgo : num -1.14 -1.19 0.95 0.14 -1.38 ...
-## $ Al203 : num 0.663 0.506 -0.707 -0.723 1.741 ...
-## $ Si02 : num -0.0239 0.4749 0.0786 0.1584 1.682 ...
-## $ P205 : num 1.439 3.039 -0.247 -0.535 0.272 ...
-## $ S03 : num -0.384 -0.314 -0.639 -0.521 -0.609 ...
-## $ Cl : num 1.194 -0.281 -0.912 -0.696 0.18 ...
-## $ K20 : num 0.24 0.678 -0.565 -0.537 1.864 ...
-## $ Cao : num 2.101 2.023 -0.385 -0.385 0.32 ...
-## $ Ti02 : num 1.189 3.053 -0.821 0.312 -0.419 ...
-## $ Cr203 : num -0.53 -0.563 -0.155 -0.106 -0.579 ...
-## $ Mno : num 0.442 0.329 1.731 1.115 -0.568 ...
-## $ FeO-T : num -0.46 0.301 1.451 0.717 -1.385 ...
+## num [1:16, 1:83] 1 1 1 0 0 0 0 0 0 0 ...
+## - attr(*, "dimnames")=List of 2
+## ..$ : NULL
+## ..$ : chr [1:83] "Plagioclase" "Sulfate" "Ca-sulfate" "Hydrated Ca-sulfate" ...
Each team has been assigned one of six datasets:
Dataset B: PIXL: The PIXL team’s goal is to understand and -explain how scaling improves results from Assignment 1. The matrix -version was scaled above.
Dataset C: LIBS (with appropriate scaling as necessary. Not scaled yet.)
Dataset D: Sherloc (with appropriate scaling as necessary. Not @@ -3160,17 +2926,19 @@
Scale this data appropriately (you can choose the scaling -method): Explain why you chose that scaling method. (3 -pts)
Cluster the data using k-means or your favorite clustering method (like hierarchical clustering): Describe how you picked the best number of clusters. Indicate the number of points in each clusters. Coordinate with your team so you try different approaches. If you want to share results with your team mates, make sure to use the same random seeds. (6 pts)
Perform a creative analysis that provides
+ _Perform a creative analysis that provides
insights into what one or more of the clusters are and what they tell
-you about the MARS data: